Training Speech-Enabled Applications

by Morgan Stevens August 25, 2021

written by Morgan Stevens August 25, 2021

Nvidia and Mozilla have updated a dataset of crowdsourced speech data. The dataset now contains 13,905 hours of speech in 76 languages. The newest version of the dataset features 182,000 unique voices, demographic information of the speaker like age, gender, and accent, and adds 16 new languages: Basaa, Slovak, Northern Kurdish, Bulgarian, Kazakh, Bashkir, Galician, Uyghur, Armenian, Belarusian, Urdu, Guarani, Serbian, Uzbek, Azerbaijani, and Hausa.

Get the data.

Image credit: Flickr user Drestwn

Morgan Stevens

Morgan Stevens is a Research Assistant at the Center for Data Innovation. She holds a J.D. from the Sandra Day O'Connor College of Law at Arizona State University and a B.A. in Economics and Government from the University of Texas at Austin.

Training Speech-Enabled Applications

Digital Vaccine Passports Only Real Solution to Fake Vaccination Cards

Visualizing Final Rankings for the Tokyo 2020 Olympics

You may also like