CS 4803B/8803B
Pattern Recongition
Spring 2001
College of Computing, CCB 102
Tues. and Thurs. 3:00-4:30
Description
This course introduces techniques for Pattern Recognition. The course
presumes a reasonable background in probablility and linear algebra.
The syllabus includes basic PR including Bayesian decision and
estimation, non-parametric methods, multi-class classifiers,
eigenvector and other feature selection methods, and EM
techniques. Time permitting we will also cover additional topics of
interest including sequence analysis via HMMs and support vector machines.
- Instructor
-
Aaron Bobick
- afb@cc.gatech.edu
- 253 CCB (temporary)
- (404)894-8591 (never pick it up - email much better...)
- Office Hours: For now, drop by or send email to schedule an appointment.
-
- Teaching Assistant
- None yet....
-
- Course Administrator
- Melissa O'Malley
- omalley@cc.gatech.edu
- GVU Office, CCB 244 CCB (temporary)
- (404)894-4666
-
This course will teach you the basic techniques of Pattern
Recognition. By the end of the cousre you should be able to implement
a pretty standard PR system, and also have enough basis to understand
more complex approaches.
The text for the class is Pattern Classification by Duda, Hart,
and Stork. This is the second edition of the text, and should be in
the bookstore.
The lectures for this class are being captured. To find them go
to
this eclass page.
Requirements, Collaboration, and Grading
The grades will be assessed as follows:
|
Problem Sets (not all PS are created equal) |
50% |
|
Mid-term |
20% |
|
Final project |
20% |
|
Class Participation |
10% |
There will be 5 or so bi-weekly problem sets that will involve some
Matlab and hopefully some thinking. Collaboration on problem sets is
encouraged at the "white board interaction" level. That is, share
ideas and technical conversation, but write your own code. All problem
sets should be in on time. One late problem set is accepted late (but
before the next one is due) without excuse. After that, get prior
permission.
There will be a "mid-term" somewhere around 2/3 the way through,
just to make sure we're all on the same page.
There will be occasional readings assigned that will typically
cover technical material that you may be responsible for.
Undergrads and grads will be graded on separate curves; more is
expected from a graduate project than an undergraduate project.
Approximate Syllabus
This course is designed to be an senior level or first year or two
grad student course covering basic pattern recognition, plus some
more modern techniques. The goal is that by the end of the semester
you know enough Pattern Recognition to be dangerous: can attempt a real
problem but probably need to dig deeper to handle a real example.
Also, you will have exposure to some more current approaches that are
currently being researched. Should allow you to take a more advanced
PR or machine learning course.
When possible we will use examples from computer vision, something
close to my heart.
- [Wk 1-2] Review of probability and some linear algebra
- [Wks 2-3] Bayesian Decision making; linear discriminants,
separability, multi-class discrimination; quadratic classifiers.
- [Wks 4-6] Bayesian estimation; non-parametric estimation; maximum
likelihood, MAP
- [Wks 7-8] Linear discriminants again; eigen vector analysis,
- [Wks 9-10] Clustering, unsupervised learning, K-means and E/M,
neural nets.
- [Wks 11-12] Other topics: Sequence analysis, HMMs
- [Wk 13] Project presentations.
Some extra readings
There will be additional required readings. When they are available
electronically you will be able to find them here:
Problem sets
Problem Set 1: Due Jan 23 The postscript for the PS is here and the PDF file is here.
Problem Set 2: Due Feb 13 The PDF file is here. The data set of question 4 is here
but since it's binary data you need to right click (or ctl-click for
you die hard mac types) and select "Download link to disk".
Problem Set 3: Due Feb 13 The PDF file is here.
Problem Set 4: Due April 5 The PDF is here. The data sets of question 1 are training and testing but since it's binary data you
need to right click (or ctl-click for you die hard mac types) and
select "Download link to disk". The ascii data sets for problem 3 are
class1, class2, and class3. These can just be downloaded
to a local ascii copy.
Project data
For those without their own project data, here are some to play with.
All these files are ASCII (in case not using MATLAB) so you should
just down load them and then load with a command such as LOAD
easy.ascii The files are the easy data,
some slightly harder data, and
labels. The data are 4 rows of 40
dimensional data for each of 25 subjects (data matrices are 100 rows
by 40 columns). See how well you do by leave one out, or some other test.
Contact Information:
Aaron Bobick
afb@cc.gatech.edu
College of Computing
Georgia Institute of Technology
Atlanta, GA 30332-0280
Tel: 404-894-8591
email: afb@cc.gatech.edu