Home BlogDataset Every Reddit Comment from the Last Eight Years

Every Reddit Comment from the Last Eight Years

by Joshua New
Reddit

Reddit user Stuck_In_The_Matrix has compiled a dataset of every comment made on the popular forum from October 2007 through May 2015—all 1.65 billion of them. Data on each comment includes the comment’s score, author, timestamp, location on the site, and other information available through Reddit’s application program interface. Fellow Reddit users say this dataset is valuable fodder for a wide variety of research, such as analyzing how the site’s users discuss topics over time and modeling the conversations.

Get the data.

You may also like

Show Buttons
Hide Buttons