Improving Large Language Models

by Morgan Stevens November 10, 2023

written by Morgan Stevens November 10, 2023

Researchers at the University of California, Berkeley, the Georgia Institute of Technology, and Harvard University have created a dataset of prompt injection attacks, which are third party attempts to maliciously manipulate the results of a large language model. It contains over 126,000 prompt injection attacks as well as over 46,000 defenses against attacks. Researchers can use the dataset to strengthen large language models’ defenses.

Get the data.

Image credit: Flickr user Christiaan Colen

Morgan Stevens

Morgan Stevens is a Research Assistant at the Center for Data Innovation. She holds a J.D. from the Sandra Day O'Connor College of Law at Arizona State University and a B.A. in Economics and Government from the University of Texas at Austin.

Improving Large Language Models

Visualizing Tomato Production in Europe

10 Bits: The Data News Hotlist

You may also like