Home BlogDataset Diversifying Speech Audio Data

Diversifying Speech Audio Data

by Mitalee Pasricha
by

Researchers at the University of Essex have released a new speech dataset to study how people try to sound trustworthy. It includes approximately 1,000 audio clips from around 100 speakers, each recording sentences in both a neutral tone and a deliberately trustworthy tone. The clips are labeled with speaker demographics and vocal features, such as pitch and vocal clarity. Unlike existing datasets that rely on mostly white, younger speakers, this collection captures individuals of different ages and ethnic backgrounds, helping fill a key gap in voice perception research and enabling more inclusive speech-based AI models.

Get the data.

Image Credits: Volodymyr Hryshchenko 

You may also like

Show Buttons
Hide Buttons