Improving Speech Recognition Systems

by Morgan Stevens July 21, 2023

Meta has created a dataset to improve the accuracy of speech recognition AI systems. The dataset contains audio recordings and transcriptions of over 27,000 verbal commands and messages from around 600 people, as well as self-reported demographic information. The commands and messages followed prompts relevant to voice assistant tools, including notification controls, dictation, calling, messaging, music, photo or video capture, and utilities. Meta organized the dataset by clustering commands or messages with similar content, which enables researchers to improve the accuracy of speech recognition systems across different demographics.

Get the data.

Image credit: Flickr user drestwn

Morgan Stevens

Morgan Stevens is a Research Assistant at the Center for Data Innovation. She holds a J.D. from the Sandra Day O'Connor College of Law at Arizona State University and a B.A. in Economics and Government from the University of Texas at Austin.
