Diversifying Speech Audio Data

by Mitalee Pasricha, June 6, 2025

Researchers at the University of Essex have released a new speech dataset to study how people try to sound trustworthy. It includes approximately 1,000 audio clips from around 100 speakers, each recording sentences in both a neutral tone and a deliberately trustworthy tone. The clips are labeled with speaker demographics and vocal features, such as pitch and vocal clarity. Unlike existing datasets that rely on mostly white, younger speakers, this collection captures individuals of different ages and ethnic backgrounds, helping fill a key gap in voice perception research and enabling more inclusive speech-based AI models.

Get the data.

Image Credits: Volodymyr Hryshchenko

Mitalee Pasricha

Mitalee Pasricha is a Google Public Policy Fellow with the Information Technology and Innovation Foundation. She is focused on the intersection of technology, sustainability, and policy. Pasricha is currently pursuing Bachelor of Science degrees in Environmental Sciences and Environmental Economics and Policy at the University of California, Berkeley.
