Social20

Social20 is a groundtruth set for tag-based social image retrieval. It was created as follows.
  • Queries: We select 20 diverse visual concepts as queries.

  • Labeling criterion: We consider an image relevant to a concept if the concept is clearly visible in the image and can be related to the visual content easily and consistently with common knowledge. Accordingly, toys, cartoons, paintings, and statues of the concept are treated as irrelevant.

  • Data set: For each concept, we randomly select 1000 examples from images labeled with the concept in the Flickr-3.5M collection, and relabel them according to our labeling criterion.
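
The sampling step above can be sketched as follows. This is only an illustration, not the procedure actually used to build Social20: the function names, the `tagged_images` mapping (image id to a set of tags), the fixed seed, and the `is_relevant` callback standing in for a human annotator are all assumptions.

```python
import random


def sample_candidates(tagged_images, concept, k=1000, seed=0):
    """Randomly pick up to k image ids whose user tags include the concept.

    tagged_images: hypothetical dict mapping image id -> set of tags.
    """
    # Sort the pool so the draw is reproducible for a given seed.
    pool = sorted(img_id for img_id, tags in tagged_images.items() if concept in tags)
    rng = random.Random(seed)
    # Sample without replacement; take the whole pool if it has fewer than k items.
    return rng.sample(pool, min(k, len(pool)))


def relabel(candidates, is_relevant):
    """Record a relevance judgment for each sampled image.

    is_relevant: hypothetical callback standing in for a human annotator
    who applies the labeling criterion.
    """
    return {img_id: bool(is_relevant(img_id)) for img_id in candidates}
```

Calling `sample_candidates(tagged_images, "dog")` would return at most 1000 distinct image ids tagged with "dog", which are then passed to `relabel` together with the annotator's judgments.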

Downloads

Acknowledgements

We thank Arjan Setz for his contributions in creating the dataset.