Evaluating Language Models

by Morgan Stevens

Meta and U.S.-based AI companies Abridge AI and Reka AI have created a dataset to improve multilingual language models. The dataset contains 900 multiple-choice questions that test reading comprehension across 488 passages in 122 languages. Researchers can use it to evaluate how well language models understand text in high-, medium-, and low-resource languages.

Image credit: Flickr user Abhi Sharma

