Reddit user Stuck_In_The_Matrix has compiled a dataset of every comment made on the popular forum from October 2007 through May 2015—all 1.65 billion of them. Data on each comment includes the comment’s score, author, timestamp, location on the site, and other information available through Reddit’s application program interface. Fellow Reddit users say this dataset is valuable fodder for a wide variety of research, such as analyzing how the site’s users discuss topics over time and modeling the conversations.
Every Reddit Comment from the Last Eight Years
previous post