Yahoo has released a massive data set of Flickr images and videos that are free to share under their copyright licenses. Yahoo believes the data set, which comprises 99.3 million images and 0.7 million videos, is one of the largest public multimedia data sets ever released. The data set, which promises to be a boon to computer vision researchers, contains metadata including title, description, camera type, and tags. About 49 million of the images are also geotagged. Yahoo is collaborating with the Lawrence Livermore National Laboratory to provide compute resources through the lab’s supercomputer to help researchers analyze the data.
Travis Korte
Travis Korte is a research analyst at the Center for Data Innovation specializing in data science applications and open data. He has a background in journalism, computer science and statistics. Prior to joining the Center for Data Innovation, he launched the Science vertical of The Huffington Post and served as its Associate Editor, covering a wide range of science and technology topics. He has worked on data science projects with HuffPost and other organizations. Before this, he graduated with highest honors from the University of California, Berkeley, having studied critical theory and completed coursework in computer science and economics. His research interests are in computational social science and using data to engage with complex social systems. You can follow him on Twitter @traviskorte.
