Teaching Machines How People Talk

by Michael McLaughlin March 1, 2019


Mozilla has published the latest dataset from its Common Voice project, which aims to spur the development of voice-enabled technologies. The dataset consists of nearly 1,400 hours of recordings from 42,000 individuals speaking a total of 18 different languages. In addition, the dataset includes labels such as the age, sex, and accent of contributors who opted in to provide the metadata.

Get the data.


Michael McLaughlin

Michael McLaughlin is a research analyst at the Center for Data Innovation. He researches and writes about a variety of issues related to information technology and Internet policy, including digital platforms, e-government, and artificial intelligence. Michael graduated from Wake Forest University, where he majored in Communication with minors in Politics and International Affairs and in Journalism. He received his master’s in Communication from Stanford University, specializing in data journalism.
