Google has published a dataset of short film clips containing “atomic visual actions” (AVA), which are distinct human actions such as walking or drinking. The AVA dataset consists of links to 57,600 three-second YouTube video clips covering 80 actions performed from different angles, along with annotations describing the actions performed and the number of human actors in each clip. This dataset could help researchers develop computer vision systems capable of recognizing actions in video, rather than just classifying the contents of individual frames.
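To make the structure of such a dataset concrete, here is a minimal sketch of how one might summarize per-clip annotations. It assumes the annotations are distributed as CSV rows containing a video ID, a timestamp, a person bounding box, an action label, and a person ID; the file name and exact column layout here are illustrative assumptions, not the official release format.

```python
import csv
from collections import defaultdict

# Hypothetical file name; the actual AVA release may package and name
# its annotation files differently.
ANNOTATION_FILE = "ava_annotations.csv"

# Assumed row layout: video_id, timestamp, box coordinates (normalized),
# action_id, person_id. The real columns may differ.
COLUMNS = ["video_id", "timestamp", "x1", "y1", "x2", "y2",
           "action_id", "person_id"]

def summarize_clips(path):
    """Group annotation rows by (video_id, timestamp) and collect, for each
    clip, the set of action labels and the number of distinct actors."""
    clips = defaultdict(lambda: {"actions": set(), "actors": set()})
    with open(path, newline="") as f:
        for row in csv.DictReader(f, fieldnames=COLUMNS):
            key = (row["video_id"], row["timestamp"])
            clips[key]["actions"].add(row["action_id"])
            clips[key]["actors"].add(row["person_id"])
    return clips

if __name__ == "__main__":
    for (video_id, timestamp), info in summarize_clips(ANNOTATION_FILE).items():
        print(f"{video_id} @ {timestamp}: {len(info['actors'])} actor(s), "
              f"actions {sorted(info['actions'])}")
```

Because each row describes one actor-action pair within a clip, grouping by clip recovers both the action labels and the actor count that the annotations provide.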