Datasets

This page links to stimulus databases that may be relevant to members of the lab.

Scenes

Objects

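  • ImageNet - Does it even need a description? Roughly 14 million images organized according to the WordNet noun hierarchy; the widely used ILSVRC subset covers 1,000 categories.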
    • EcoSet - 1.5 million images from 565 basic-level categories, chosen to be both (i) frequent in linguistic usage and (ii) rated by human observers as concrete. A large subset of the images comes from ImageNet.
  • THINGS Dataset - A freely available database of 26,107 high-quality, manually curated images of 1,854 diverse object concepts.
  • COCO Dataset - Common Objects in Context: a large-scale object detection, segmentation, and captioning dataset.
  • Google’s Line Drawings Dataset
  • MNIST dataset - Do we need any explanation? 70,000 grayscale 28×28 images of handwritten digits (60,000 training, 10,000 test).

Faces

Language

  • Mother of Unification Studies - A 204-subject multimodal neuroimaging dataset for studying language processing. All subjects performed a language task in which they processed linguistic utterances consisting of either normal or scrambled sentences. Half of the subjects read the stimuli; the other half listened to them.
  • Narratives Dataset - fMRI recordings of 345 individuals listening to 27 spoken stories in English, each lasting from 7 to 56 min (4.6 h of unique stimuli in total).

Dynamic Stimuli

  • Ingmar’s Dataset of Ballet Dancers (ask him for it) - The stimulus set consists of videos of 14 unique ballet dancing sequences, each made up of four smoothly connected ballet figures selected from a pool of five unique figures.