Improving Virtual Assistants

by Morgan Stevens, March 31, 2023

Google has created a dataset to better train virtual assistants. The dataset contains over 550,000 conversations between humans and virtual assistants in English, French, German, Hindi, Japanese, and Spanish. It includes conversations in which users had to revise their instructions after the assistant misunderstood the initial request, conversations containing repeated or filler words, and conversations in which the user switches languages partway through an instruction. It also provides users’ lists, notes, and contacts as context for each conversation.

Get the data.

Image credit: Flickr user Marc Wathieu

Morgan Stevens

Morgan Stevens is a Research Assistant at the Center for Data Innovation. She holds a J.D. from the Sandra Day O'Connor College of Law at Arizona State University and a B.A. in Economics and Government from the University of Texas at Austin.
