Measuring Bias in Natural Language Models

by Michael McLaughlin May 29, 2020

written by Michael McLaughlin May 29, 2020

A building with large columns and a dome on MIT's campus.

Researchers from MIT, Facebook, Intel, and McGill University in Canada have released Stereoset, a dataset of 17,000 sentences that researchers can use to measure a natural language processing model’s bias towards stereotypes. The dataset tasks models to choose from options to fill in a blank for a sentence or to provide additional information after receiving an input sentence. The options include stereotypes, anti-stereotypes, and unrelated information. To score well, a model should prefer options that provide relevant info but not prefer options conveying a stereotype over those that do not.

Get the data.

Image: Jiaqian AirplaneFan

Michael McLaughlin

Michael McLaughlin is a research analyst at the Center for Data Innovation. He researches and writes about a variety of issues related to information technology and Internet policy, including digital platforms, e-government, and artificial intelligence. Michael graduated from Wake Forest University, where he majored in Communication with Minors in Politics and International Affairs and Journalism. He received his Master’s in Communication at Stanford University, specializing in Data Journalism.

Measuring Bias in Natural Language Models

Visualizing Germany’s Intensive Care Capacity

10 Bits: the Data News Hotlist

You may also like