Perceptual Reasoning and Interaction Research

About imSitu

imSitu is a dataset supporting situation recognition, the problem of producing a concise summary of the situation an image depicts including: (1) the main activity, (2) the participating actors, objects, substances, and locations and most importantly (3) the roles these participants play in the activity. The role set used by imSitu is derived from the linguistic resource FrameNet and the entities are derived from ImageNet. The data in imSitu can be used to create robust algorithms for situation recognition.

Press

On September 19, 2016, the New York Times published an article about imSitu called Computer Vision: On the Way to Seeing More.

Dataset Statistics

Verbs	504
Images	126,102
Situations per Image	3
Total Annotations	1,481,851
Unique Entity Types (>3)	11,538 (6,794)
Unique Roles (role types)	1,788 (190)
Images per Verb (range)	250.2 (200 - 400)
Unique Situations (>3)	205,095 (21,505)

imSitu Paper

Situation Recognition: Visual Semantic Role Labeling for Image Understanding

Mark Yatskar, Luke Zettlemoyer, and Ali Farhadi • CVPR • 2016

PDF View PDF
Semantic Scholar View and cite on Semantic Scholar

AI2 Works Building on imSitu

Commonly Uncommon: Semantic Sparsity in Situation Recognition

Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, and Ali Farhadi • CVPR • 2017

PDF View PDF
Semantic Scholar View and cite on Semantic Scholar

Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints

Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, and Kai-Wei Chang • EMNLP • 2017

Browse imSitu

On the Browse page, you can search for images that match by verbs, nouns or both. As you select words, the dropdowns will update based on the number of remaining matching images. You can also browse using the convenient example links.

Live Demo

The imSitu demo will predict situations for images of your choice. You can start by clicking on the example images. The demo provides the nearest neighbors in the imSitu training set and a list of predicted situations and associated probability.

Data and Models

The annotations, supporting metadata and python evaluation scripts are hosted on GitHub. To download images you can follow these direct links:

imSitu

Dataset and methods supporting Situation Recognition