CS 7635
Computational Perception

Spring 2002
College of Computing 102
MWF 11:00 - 12:00 noon

Problem Sets        Syllabus        Projects


This course will cover statistical and algorithmic methods for sensing people using cameras and microphones. We will develop video and audio models and explore their application to complex sensing tasks. The bulk of the syllabus is devoted to vision-based human sensing, a branch of computer vision concerned with "looking at people". We will also cover topics in speech recognition and multi-modal sensing. We will emphasize unifying statistical models and techniques.

Instructor

Jim Rehg
Email: rehg@cc.gatech.edu
Office: CoC Bldg (CCB) 253
Office hours: 11-12 Tues and Thurs
Phone: 404-894-9105 (email preferred)

Teaching Assistant

Hao Wang
Email: wanghao@cc.gatech.edu

Prerequisites

Some previous experience or coursework in computer vision, image processing, or computer graphics. Familiarity with Matlab and basic linear algebra, statistics, and pattern recognition techniques. Permission of the instructor.


Organization

Grades will be assessed as follows:

Problem Sets 40%
Midterm Exam 15%
Final Project 40%
Participation 5%

There will be approximately 5 problem sets based on Matlab. Collaboration on problem sets is encouraged at the "white board interaction" level. That is, share ideas and technical conversation, but write your own code. All problem sets should be in on time. One late problem set is accepted late (but before the next one is due) without excuse. After that, get prior permission.

There will be a take-home mid-term exam.

Undergrads and grads will be graded on separate curves; more is expected from a graduate project than an undergraduate project.


Text

There is no required text. The following supplemental texts may be helpful:

Background texts:

Help with Matlab

Related courses:


Problem Sets

PS 1 [ps] [pdf]: Out Jan 07; Due Jan 14: Solutions [ps] [pdf] [tex] (Review of background material)   (2% of grade)
PS 2: [ps] [pdf] [empca, facedata] Out Feb 5; Due Feb 25: (Face detection and recognition using PPCA)    (23% of grade)
Midterm: [ps] [pdf] Out Mar 13; Due Mar 14: Solutions [ps] [pdf] [tex] (15% of grade)
PS 3: [ps] [pdf] [digits, hmm, hmm2, epmt] Out Mar 15; Due Mar 29: (Isolated digit recognition using HMM)    (15% of grade)
Final Project: [swiki] In-class presentations 8:00-10:50 am on Wed. May 1. Reports due by midnight on Fri. May 3.


Syllabus

  1. Introduction and overview of course contents. Review of basic material. [1/4/02]
  2. Skin Color Modeling [1/7,9,11/02]
  3. Introduction to face analysis and applications [1/14/02]
  4. Modeling facial appearance 
  5. Face recognition, tracking, and synthesis
  6. Speech recognition
  7. Facial Expressions and Gesture Recognition
  8. Modeling Human Motion
  9. Head and hand tracking
  10. Figure tracking
  11. Action recognition
  12. Multi-modal sensing

Final Projects

Project Ideas