Building a Public Database for Voices

by Joshua New

Mozilla has published data from Common Voice, its project to build a public repository of voice recordings and spur the development of speech-recognition machine learning systems. The Common Voice data consists of nearly 400,000 recordings from 20,000 people. Members of the public are invited to contribute recordings of themselves speaking, and to help validate existing recordings to ensure their accuracy.

Get the data.

Image: GDJ
