Expanding Translation Systems’ Capabilities

by Morgan Stevens June 9, 2021

written by Morgan Stevens June 9, 2021

A spherical art installation displaying the word welcome in several languages

Facebook has released a dataset designed to test multilingual translation models. The dataset, known as Flores-101, contains 3,001 English sentences taken from Wikipedia, and their translated counterparts, in 101 human languages. Researchers can use the data to evaluate translation systems’ performance and advance natural language processing projects.

Get the data.

Image credit: Flickr user Mike

Morgan Stevens

Morgan Stevens is a Research Assistant at the Center for Data Innovation. She holds a J.D. from the Sandra Day O'Connor College of Law at Arizona State University and a B.A. in Economics and Government from the University of Texas at Austin.

Expanding Translation Systems’ Capabilities

The Case for a National Quantum Computing Research Task Force in the United States

Event Recap: What’s Next on the EU’s Proposed AI Law?

You may also like