1 Introduction

week 6.

1. How many visual object categories are there?

1500-3000 basic-level nouns, ~10 types per basic-level category

Untitled

Alternative explanation (Perona): ~1000 names per domain (broad scene category), 20-30 domains

Untitled

Scene categorization or classification
1. outdoor/indoor
2. city/forest/factory/etc.
Image annotation / tagging / attributes
1. street, people, building, mountain, tourism, cloudy, brick, …
Scene understanding?

Story: caption -> short description
Image parsing / semantic segmentation
Object detection

Untitled

Category:

–Find all the people

–Find all the buildings

–Often within a single image

–Often ‘sliding window’

Untitled

Instance:

–Is this face James?

–Find this specific famous building

–Often within a database of images

Untitled

Variability:

Untitled

                                                            High-dimensional space

Untitled

Recognition as an alignment problem:
1. Alignment: fitting a model to a transformation between pairs of features (matches) in two images
2. Representing and recognizing object categories is harder...
  1. ACRONYM (Brooks and Binford, 1981), Binford (1971), Nevatia & Binford (1972), Marr & Nishihara (1978)