Evaluating Language Models

by Morgan Stevens September 15, 2023

written by Morgan Stevens September 15, 2023

Meta and U.S.-based AI companies Abridge AI and Reka AI have created a dataset to improve multilingual language models. The dataset contains 900 multiple choice questions designed to test reading comprehension for 488 passages spanning 122 languages. Researchers can use the dataset to evaluate high-, medium-, and low-resource language models’ understanding of text.

Get the data.

Image credit: Flickr user Abhi Sharma

Morgan Stevens

Morgan Stevens is a Research Assistant at the Center for Data Innovation. She holds a J.D. from the Sandra Day O'Connor College of Law at Arizona State University and a B.A. in Economics and Government from the University of Texas at Austin.

Evaluating Language Models

Visualizing Cloud Heights in the United States

10 Bits: The Data News Hotlist

You may also like