CS 7636
Computational Perception

Spring 2003
Howey-Physics S107
MWF 11:00 - 12:00 noon

Problem Sets        Syllabus        Projects


This course will cover statistical and algorithmic methods for sensing people using cameras and microphones. We will develop video and audio models and explore their application to complex sensing tasks. The bulk of the syllabus is devoted to vision-based human sensing, a branch of computer vision concerned with "looking at people". We will also cover topics in speech recognition and multi-modal sensing. We will emphasize unifying statistical models and techniques, primarily graphical models such as Bayesian networks.

Instructor

Jim Rehg
Email: rehg@cc.gatech.edu
Office: CoC Bldg (CCB) 253
Office hours: 11-12 Tues and Thurs
Phone: 404-894-9105 (email preferred)

Prerequisites

Some previous experience or coursework in computer vision, image processing, or computer graphics (such as CS 4495/7495) and pattern recognition or machine learning (such as CS 4803 or 4640). Familiarity with Matlab and basic linear algebra and statistics. Permission of the instructor.


Organization

Grades will be assessed as follows:

Problem Sets 55%
Final Project 40%
Participation 5%

Problem sets will be largely based on Matlab and will be distributed weekly or bi-weekly. Collaboration on problem sets is encouraged at the "white board interaction" level. That is, share ideas and technical conversation, but write your own code. A few problem sets may require you to work in teams of 2-3. I plan to grade and return problem sets promptly. As a result, I will require all problem sets to be turned in on time.

No late submissions will be accepted without prior permission of the instructor.

Undergrads and grads will be graded on separate curves; more is expected from a graduate project than an undergraduate project.


Text

There is no required text. The following supplemental texts may be helpful:

Background texts:

Help with Matlab

Related courses:


Problem Sets

PS 1 [ps] [pdf]: Out Jan 14; Due Jan 20 (Review of background material)


Syllabus

  1. Skin color modeling and detection [1/6-17/03]
  2. Modeling facial appearance
  3. Face recognition, tracking, and synthesis
  4. Speech recognition
  5. Facial Expressions and Gesture Recognition
  6. Modeling Human Motion
  7. Head and hand tracking
  8. Figure tracking
  9. Action recognition
  10. Multi-modal sensing

Final Projects

Project Ideas