Vinay Bettadapura (Photo)
Vinay Bettadapura
Ph.D. Candidate, Computer Science
College of Computing (CoC)
Georgia Institute of Technology
Computer Vision and Machine Learning Research
Advisor: Prof. Irfan Essa
Member of Computational Perception Lab and Robotics & Intelligent Machines @ GT
Email: vinay [at] gatech.edu
Before coming to Georgia Tech, I did my Masters in Computer Science from Columbia University where I worked with Prof. Peter Belhumeur. Going back further, I used to work as a Software Engineer at Subex Limited's Telecommunication Fraud Management Group. I have done my Bachelor of Engineering in Electronics and Communications from the Visvesvaraya Technological University, India.
[CV] CV | Research Interests | Research Projects | Publications | Courses | Course Projects | Awards | Contact | Calendar |   Blogger Facebook LinkedIn
Research Interests
My research interests are in the areas of Computer Vision, Machine Learning and Knowledge Based AI. In particular, I am interested in activity recognition from videos and more broadly in the area of video understanding.
I am interested in the exploitation of context to become better at activity recognition and video understanding. We humans make use of tremendous amount of knowledge in order to make sense of what we perceive. So, along with context understanding, I am also very interested in knowledge representation techniques that can help us build better activity recognition systems that can go beyond detecting simple short-term activities and generate long-term narratives that describe the scene.

Top

Research Projects
Activity Recognition
Activity Recognition from Wide Area Motion Imagery
This project aims at recognizing anomalous activities from aerial videos. My work is a part of the Persistent Stare Exploitation and Analysis System (PerSEAS) research program which aims to develop software systems that can automatically and interactively discover actionable intelligence from airborne, wide area motion imagery (WAMI) in complex urban environments.

Electronics Field Guide
Leafsnap: An Electronics Field Guide

This project aims to simplify the process of plant species identification using visual recognition software on mobile devices such as the iPhone. This work is part of an ongoing collaboration with researchers at Columbia University, University of Maryland and the Smithsonian's National Museum of Natural History. My major contribution to this project was the server's database integration and management. I also worked on stress-testing the backend server to improve its performance and scalability.

The free iPhone app can be downloaded from the app-store. Here is the project webpage and here is a video explaining the app's usage.


Face Verification
Visual Attributes for Face Verification

The project involves face verification in uncontrolled settings with non-cooperative subjects. The method is based on attribute (binary) classifiers that are trained to recognize the degrees of various visual attributes like gender, race, age, etc. Here is the project page.

I was a part of this research at Columbia University from December 2009 to May 2010. I mainly worked on Boosting to improve the classifiers' performance.

Face Rec
Face Recognition

The face representation is based on a Gabor wavelet transform. The features are extracted using a carefully chosen symmetrical Gabor wavelet matrix and a Multi Layer Perceptron is used for classification. The designed system is insensitive to small changes in head poise and homogenous or step illumination changes and is robust against facial hair and glasses for small datasets.

This was my undergraduate thesis supervised by Dr. C. N. S. Ganesh Murthy, Principal Scientist at Mercedes-Benz Research and Development, Bangalore, India. Here is the project report [FACE REC]

Top

Publications
  • V. Bettadapura, "Face Expression Recognition and Analysis: The State of the Art", Tech Report, arXiv:1203.6722, April 2012. [FACE EXP REC]
  • V. Bettadapura, D. R. Sai Sharan, "Pattern Recognition with Localized Gabor Wavelet Grids", Proceedings of the IEEE International Conference on Computational Intelligence and Multimedia Applications, vol. 2, pp. 517-521, Sivakasi, India, December 2007. [ICCIMA 07]
  • V. Bettadapura, B. S. Shreyas, C. N. S Ganesh Murthy, "A Back Propagation Based Face Recognition Model, Using 2D Symmetric Gabor Features", Proceedings of the IEEE International Conference on Signal Processing, Communications and Networking, pp. 433-437, Chennai, India, February 2007. [ICSCN 07]
  • V. Bettadapura, B. S. Shreyas, "Face Recognition Using Gabor Wavelets", Proceedings of the 40th IEEE Asilomar Conference on Signals, Systems and Computers, pp. 593-597, Pacific Groves (Monterey Bay), California, October 2006. [ASILOMAR 06]

Top

Courses
Spring 2012
Special Problems (CS 8903)

Prep - Doctoral Qualifiers (CS 7999)
Prof. Irfan Essa

N/A
Fall 2011
Knowledge-Based AI (CS 7637)

Numerical Linear Algebra (MATH 6643)

Special Problems (CS 8903)
Prof. Ashok Goel

Prof. Silas Alben

Prof. Irfan Essa
Summer 2011
Special Problems (CS 8903)
Prof. Irfan Essa
Spring 2011
Machine Learning (CS 7641)

Special Problems (CS 8903)
Prof. Charles Isbell

Prof. Irfan Essa
Fall 2010
Computer Vision (CS 7495)

Grad Studies (CS 7001)

Special Problems (CS 8903)
Prof. Jim Rehg

Prof. Gregory Abowd and Prof. Nick Feamster

Prof. Irfan Essa
Spring 2010
Operating Systems (COMS W4118)

Projects in Computer Science (COMS E6901)

Research Assistantship (COMS E9910)
Prof. Junfeng Yang

Prof. Peter Belhumeur

Prof. Peter Belhumeur
Fall 2009
Analysis of Algorithms (COMS W4231)

Biometrics (COMS W4737)

Projects in Computer Science (COMS E6901)
Prof. Clifford Stein

Prof. Peter Belhumeur

Prof. Peter Belhumeur
Spring 2009
Programming Languages and Translators (COMS W4115)

Computational Aspects of Robotics (COMS W4733)

Visual Interfaces to Computers (COMS W4735)

Machine Learning (COMS 4771)
Prof. Alfred Aho

Prof. Peter Allen

Prof. John Kender

Prof. Tony Jebara

Top

Course Projects
Geotagging
Automatic Geo-Tagging of Photos Using Google Street View Images

The goal of this project was to develop a system that automatically geo-tags an image by comparing it with a large collection of geo-tagged images (Google Street View images, in our case). SIFT descriptors are computed for the images and the matching is done using a KD-Tree. This project is an implementation based on the work of Schindler et al. (CVPR 2007) and Zamir et al. (ECCV 2010). This project was done as a part of the 'Computer Vision' course at Georgia Tech (instructor: Prof. Jim M. Rehg).

Here is the project presentation [GEO-TAG]

Raven's Test
Solving Raven's Matrices Using Visual and Propositional Reasoning

The goal of this project is to learn about the close relationship between learning and problem solving. In this project, we explore this relationship by considering several problems from the Raven's test of intelligence (Raven's matrices). We develop techniques to solve the Raven's matrices using both propositional and visual reasoning. This project was done as a part of the 'Knowledge Based AI' course at Georgia Tech (instructor: Prof. Ashok K. Goel).

Here are the project reports: Solving the Raven's matrices using Propositional Reasoning [GEO-TAG] using Visual Reasoning [GEO-TAG] and a combination of Visual and Propositional Reasoning [GEO-TAG]

SNOW Logo
The SN*W Programming Language

The SN*W Programming Language is a special purpose declarative language designed for Genetic Programming by allowing programmers to easily harness the power of Genetic Algorithms (GA). A SN*W program is a simple description of an organism structure along with simple methods for construction, mutation, selection and recombination. The SN*W compiler translates these events into a full environmental simulation. The language was developed by five of us as a part of the Programming Languages and Translators course at Columbia under the guidance of Prof. Alfred V. Aho.

Here is the complete SN*W Report (includes the Reference Manual and Tutorial) [CV]

Guess Who
Guess Who? - An iPhone Application for Real-Time Face Recognition Using Side Profiles

The goal of this project was to develop a face recognition system that could recognize people based on side-profile images. The system was designed to be invariant to head-tilt and pose. An iPhone application was developed to showcase the real-time capabilities of the system. The user takes the profile picture of a person using his/her iPhone and uploads it to the server (the server is a Ruby on Rails application). The server does the recognition and sends the results back which gets displayed on the iPhone UI. The entire request-process-response loops takes no longer than 3.5 seconds (on average). This project was done as a part of the Biometrics course at Columbia (instructor: Prof. Peter N. Belhumeur).

Here is the project report [GUESS WHO]

Visual Combination Lock
Visual Combination Lock

The goal of this project was to take a sequence of visual images, and to determine from them if the user has placed some body part(s) in a predetermined sequence of locations and/or poses. If the sequence of images matches the predetermined sequence, the user's access gets 'APPROVED', else the access gets 'DENIED'. An arbitrary predetermined sequence of hand gestures was used where the user displays a combination of numbers using his fingers followed by a specific hand rotation and closure of the fist. The 'Visual Lock' gets unlocked only if the hand gestures are the same as the predetermined sequence. The recognition sequence can be changed to handle any (controlled) hand gestures. This project was done as a part of the Visual Interfaces to Computer course at Columbia (instructor: Prof. John R. Kender).

Here is the project report [VISUAL LOCK]

Columbia Map Assist
Columbia Map Assist

The goal of this project was to develop a 'Columbia Map Assistant' that would describe the location of a visitor to the Columbia campus and give the visitor directions from one building to another. The first main job was to use the given map to encode the buildings' shapes, to determine their spatial relationships to each other and to filter out any relationships that are unnecessary because they can be easily inferred. The second main job was to use these descriptions to generate a natural language description that unambiguously indicates how to reach the goal from the source. This project was done as a part of the Visual Interfaces to Computer course at Columbia (instructor: Prof. John R. Kender).

Here is the project report [MAP ASSIST]

Visual Information Retrieval
Visual Information Retrieval

The goal of this project was to write and analyze algorithms that explore different ways of deciding the degree of similarities amongst actual images. A set of images of fruits and vegetables along with a few random objects (distracters) were used. The algorithm performs a color-based match and a texture based match and then uses the total match to decide the similarity amongst the images. This kind of an algorithm is useful in retrieving images based on the visual content rather than the associated labels or other metadata. This project was done as a part of the Visual Interfaces to Computer course at Columbia (instructor: Prof. John R. Kender).

Here is the project report [VISUAL RETREIVAL]

Top

Awards

Top

Contact
Email:
vinay [at] gatech.edu
Address:
304C, College of Computing Building
801, Atlantic Drive
Atlanta, GA 30332
Also On:
Blogger Facebook LinkedIn

Top

Calendar

Top

VTU Logo  --- CS@CU Logo  --- COC@GT Logo  --- RIM@GT Logo

Valid HTML 4.01 Transitional Valid CSS!