Teaching AI to Recognize Speech

by Joshua New August 31, 2017

written by Joshua New August 31, 2017

Google has published its Speech Commands Dataset consisting of 65,000 one-second long clips of thousands of people speaking 30 short words, such as “yes,” “on,” and “stop.” Google developed the Speech Commands Dataset to serve as training data for speech recognition albums powering basic voice interfaces. Google also made the application it used to create this dataset available as open source to encourage developers to create similar systems in other languages and with bigger vocabularies.

Get the data.

Image: OpenClipart-Vectors.

Joshua New

Joshua New was a senior policy analyst at the Center for Data Innovation. He has a background in government affairs, policy, and communication. Prior to joining the Center for Data Innovation, Joshua graduated from American University with degrees in C.L.E.G. (Communication, Legal Institutions, Economics, and Government) and Public Communication. His research focuses on methods of promoting innovative and emerging technologies as a means of improving the economy and quality of life.

Teaching AI to Recognize Speech

Visualizing Dowry Harassment in Delhi

Response to the Call for Evidence by the House of Lords Select Committee on Artificial Intelligence

You may also like