
Alireza Fathi


Ph.D. Student, Wall Lab, School of Interactive Computing

College of Computing
Georgia Institute of Technology

Email:  afathi3@gatech.edu

 

I am currently a Ph.D. candidate at Georgia Tech working with Jim Rehg. I worked with Greg Mori during my Masters. I have also worked closely with Jessica Hodgins, Frank Dellaert and John Krumm.

News

05/03/2013: I will defend on May 22nd, 2013. My thesis committee members are James Rehg, Martial Hebert, Antonio Torralba, Gregory Abowd, Aaron Bobick and Thad Starner.

02/24/2013: Paper accepted to CVPR 2013, Modeling Actions through State Changes (PDF).

06/25/2012: Paper accepted to ECCV 2012, Learning to Recognize Daily Actions using Gaze (PDF, GTEA Gaze(+) Dataset).

02/26/2012: Paper accepted to CVPR 2012, Social Interactions: A First-Person Perspective (PDF), get the dataset here.

01/21/2012: I am a co-organizer of the 2nd IEEE Workshop on Egocentric (First-Person) Vision in conjunction with CVPR 2012.

 

Publications

Google Scholar

Modeling Actions through State Changes

Alireza Fathi, James M. Rehg

CVPR, 2013 (PDF)

Learning to Recognize Daily Actions using Gaze

Alireza Fathi, Yin Li, James M. Rehg

ECCV, 2012 (PDF, Project Page)

Detecting Eye Contact using Wearable Eye-Tracking Glasses

Zhefan Ye, Yin Li, Alireza Fathi, Yi Han, Agata Rozga, Gregory D. Abowd, James M. Rehg

2nd Workshop on Pervasive Eye Tracking and Mobile Eye-based Interaction (in conjunction with UbiComp), 2012 (PDF)

Social Interactions: A First-Person Perspective

Alireza Fathi, Jessica K. Hodgins, James M. Rehg

CVPR, 2012 (PDF, Dataset)

Understanding Egocentric Activities

Alireza Fathi, Ali Farhadi, James M. Rehg

ICCV, 2011 (PDF, Dataset)

Learning to Recognize Objects in Egocentric Activities

Alireza Fathi, Xiaofeng Ren, James M. Rehg

CVPR, 2011 (PDF, Dataset)

Combining Self Training and Active Learning for Video Segmentation

Alireza Fathi, Maria Florina Balcan, Xiaofeng Ren, James M. Rehg

BMVC, 2011 (PDF, Abstract, Software)

Detecting Road Intersections from GPS Traces

Alireza Fathi, John Krumm

GIScience, 2010 (PDF)

Action Recognition by Learning Mid-Level Motion Features

Alireza Fathi, Greg Mori

CVPR, 2008 (PDF, Bibtex)

Human Pose Estimation using Motion Exemplars

Alireza Fathi, Greg Mori

ICCV, 2007 (PDF, Bibtex, More Information, Slides, Course Project that led to this paper)

Voice Synthesis using the Generalized Pressure-Controlled Valve

Tamara Smyth, Alireza Fathi

International Computer Music Conference (ICMC), 2008 (PDF)

A Standardized Workflow for Illumination-Invariant Image Extraction

Mark S. Drew, Muntaseer Salahuddin, Alireza Fathi

15th Color Imaging Conference, 2007 (PDF)

EasySLAM

Alireza Fathi, Alex Cunningham, Balmanohar Paluri, Kai Ni and Frank Dellaert

GVU Technical Report (GIT-GVU-10-03), 2010. (Link)

Local Exponential Maps: Towards Massively Distributed Multi-Robot Mapping

Frank Dellaert, Alireza Fathi, Alex Cunningham, Balmanohar Paluri, Kai Ni

GVU Technical Report (GIT-GVU-10-04), 2010. (Link)

Poseidon Team Description Paper

Nasrin Mostafazadeh, Saba Ardeshiri, Sepideh Movaghati, Shadi Hariri, Zeinab Jahanzad, Alireza Fathi, Majid Valipour

Ranked 2nd in Rescue Simulation League, RoboCup 2006, Bremen, Germany (PDF)

Impossibles Sony Aibo 4-Legged RoboCup Technical Report

Saman Aliari Zonouz, Hamid Reza Vaezi Joze, Siavash Rahbar, Majid Valipour, Alireza Fathi

RoboCup 2006, Bremen, Germany. (PDF)

Impossibles Sony Aibo 4-Legged RoboCup Team Description Paper

Hamid Reza Vaezi Joze, Saman Aliari Zonouz, Siavash Rahbar, Majid Valipour, Alireza Fathi

RoboCup 2006, Bremen, Germany. (PDF)

Impossibles Team Description Paper

Jafar Habibi, Alireza Fathi, Saeed Hassanpour, Mohammad Reza Ghodsi, Behzad Sadjadi, Hamid Reza Vaezi, Majid Valipour

Ranked 1st in Rescue Simulation League, RoboCup 2005, Osaka, Japan (PDF)

 
Invited Talks

An Egocentric Paradigm for Learning to Understand Daily Activities


 
Datasets/Software
 
Interactive Image Segmentation Toolbox
 
 
My Computer Vision Toolbox
 
 

GTEA Gaze(+)

 
 

Social Interactions at Disney parks

 
 

Georgia Tech Egocentric Activities (GTEA)

 
Projects

Egocentric (First-Person) Vision: An egocentric vision system is a framework built around a wearable camera that continuously captures the scene in front of the first person. In particular, I define an egocentric vision system as a framework that leverages different levels of first-person attention to identify the important objects and faces in the scene that contribute to the subject's activities. The first person's attitude, including where she looks (gaze) and what she does (hands manipulating objects), provides invaluable context for determining which objects grab her attention at any given time. Our goal is to use these structured sources of first-person information to enable weakly supervised recognition of objects and activities.

  • Alireza Fathi, Yin Li, James M. Rehg, Learning to Recognize Daily Actions using Gaze, ECCV, 2012. (PDF, GTEA Gaze(+) Dataset)

  • Alireza Fathi, Jessica K. Hodgins, James M. Rehg, Social Interactions: A First-Person Perspective, CVPR, 2012. (PDF, Dataset)

  • Alireza Fathi, Ali Farhadi, James M. Rehg, Understanding Egocentric Activities, ICCV, 2011. (PDF, Dataset)

  • Alireza Fathi, Xiaofeng Ren, James M. Rehg, Learning to Recognize Objects in Egocentric Activities, CVPR, 2011. (PDF, Dataset)



    Video and Image Segmentation: I believe that segmentation is probably the most fundamental problem in computer vision. If segmentation is solved, many of the big challenges in the field become trivial.

  • Alireza Fathi, Maria Florina Balcan, Xiaofeng Ren, James M. Rehg, Combining Self Training and Active Learning for Video Segmentation, BMVC, 2011 (PDF, Abstract, Software).



    Action Recognition (ICCV07, CVPR08, ICCV11, CVPR12, MSc Thesis): I aim to develop action recognition techniques that rely on semantically meaningful features capturing the interactions of objects with each other. This is in contrast to state-of-the-art techniques that are based on space-time interest points or point trajectories.

  • Alireza Fathi, Greg Mori, Action Recognition by Learning Mid-Level Motion Features, CVPR, 2008. (PDF, Bibtex)

    Human Pose Estimation:

  • Alireza Fathi and Greg Mori, Human Pose Estimation using Motion Exemplars, ICCV, 2007. (PDF, Bibtex, More Information, Slides, Course Project that led to this paper)

  • MSc Thesis: Alireza Fathi, Human Figure Tracking using Motion Exemplars, Department of Computing Science, Simon Fraser University, 2008. (PDF)
     
     
     

    Localization and Mapping:

  • Helped in developing GTSAM as part of Frank Dellaert's team.

  • Alireza Fathi, John Krumm, Detecting Road Intersections from GPS Traces, GIScience, 2010. (PDF)

  • Alireza Fathi, Alex Cunningham, Balmanohar Paluri, Kai Ni and Frank Dellaert, EasySLAM, GVU Technical Report (GIT-GVU-10-03), 2010. (Link)

  • Frank Dellaert, Alireza Fathi, Alex Cunningham, Balmanohar Paluri and Kai Ni, Local Exponential Maps: Towards Massively Distributed Multi-Robot Mapping, GVU Technical Report (GIT-GVU-10-04), 2010. (Link)
     

     

     

    Color Constancy and Illumination Invariance:

  • Mark S. Drew, Muntaseer Salahuddin, Alireza Fathi, A Standardized Workflow for Illumination-Invariant Image Extraction, 15th Color Imaging Conference, New Mexico, 2007. (PDF)
     

    RoboCup:

     
  • Nasrin Mostafazadeh, Saba Ardeshiri, Sepideh Movaghati, Shadi Hariri, Zeinab Jahanzad, Alireza Fathi, Majid Valipour, Poseidon Team Description Paper, RoboCup 2006, Bremen, Germany. (PDF)

  • Saman Aliari Zonouz, Hamid Reza Vaezi Joze, Siavash Rahbar, Majid Valipour, Alireza Fathi, Impossibles Sony Aibo 4-Legged RoboCup Technical Report, RoboCup 2006, Bremen, Germany. (PDF)

  • Hamid Reza Vaezi Joze, Saman Aliari Zonouz, Siavash Rahbar, Majid Valipour, Alireza Fathi, Impossibles Sony Aibo 4-Legged Team Description Paper, RoboCup 2006, Bremen, Germany. (PDF)

  • Jafar Habibi, Alireza Fathi, Saeed Hassanpour, Mohammad Reza Ghodsi, Behzad Sadjadi, Hamid Reza Vaezi, Majid Valipour, Impossibles Team Description Paper, RoboCup 2005, Osaka, Japan. (PDF)

     

     

    Other:

  • Tamara Smyth, Alireza Fathi, Voice Synthesis using the Generalized Pressure-Controlled Valve, International Computer Music Conference (ICMC), 2008. (PDF)
     
  • BSc Thesis: Alireza Fathi, Assembler and Simulator for IBM 360/370, Computer Engineering Department, Sharif University of Technology, 2004. (Director: Dr. Hamid Sarbazi Azad) (Persian PDF)

     

     

     

    Courses

     

    Services
     
    Co-Chair of the 2nd IEEE Workshop on Egocentric (First-Person) Vision in conjunction with CVPR 2012.
    Program Committee member for IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2013.
    Program Committee member for AAAI 2012.
    Reviewer for IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI).
    Reviewer for IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
    Reviewer for International Conference on Computer Vision (ICCV).
    Reviewer for British Machine Vision Conference (BMVC).
    Reviewer for ACM Conference on Ubiquitous Computing (UbiComp).
    Reviewer for IEEE Transactions on Circuits and Systems for Video Technology (TCSVT).
     

    Useful Links

  • Notes on Graduate Studies, Alireza Fathi, 2010.
  • How to Buy a Used Car, Alireza Fathi, 2010.
  • Recent Hot Machine Learning Hammers used in Computer Vision, Alireza Fathi, 2011.
  • My Facebook App for Data Collection
  • Simultaneous Recovery of Shape, Motion and Grouping by Applying Rank Constraints, 2010. (PDF, Poster)



    Experience

    Teaching Experience:

    Research Experience:

    Work Experience:

    Languages:

     

    Fun

    This video has been seen by 20,000 people as of Nov 2010.

     

     
