|
| TalksIrfan Essa and Aaron Bobick, GVU Center, GA Tech "Activity Recognition: From HMMs to Grammars to Network Representations." I will present a brief overview of various methods that have been studied for visual recognition. These methods vary from use of action and object context to recognize activities, to the use of simple and extended grammar representations, to the use of various network representations. The goal of this presentation will be more to highlight the open problems and the need for newer representations. Jianbo Shi, Department of Computer and Information Science University of Pennsylvania "Finding Unusual Activity in Video" Imagine you are given a long video, possibly thousands of hours long, and you are asked to analyze the video to detect unusual events. By definition, unusual events are rare, difficult to describe, and impossible to predict. Any system that detects unusual events must sift through extremely large amount of statistical details to detect a few relevant bits. We present an unsupervised technique for detecting unusual activity in a large video set using many simple features. No complex activity models and no supervised feature selections are used. We divide the video into equal length segments and classify the extracted features into prototypes, from which a prototype-segment co-occurrence matrix is computed. Motivated by a similar problem in document-keyword analysis, we analyze the co-clustering between the documents(videos) and keywords(features). We define a simultaneous clustering and feature selection criterion using the transitive closure constraint. We show that an important sub-family of correspondence functions can be reduced to co-embedding prototypes and segments to N-D Euclidean space. We prove that an efficient, globally optimal algorithm exists for the co-embedding problem. Experimentally, we have tested our algorithm on a variety of videos ranging from nursing home monitoring, poker game cheating, to roadway surveillance. This is a joint work with Mirko Visontai at U.Penn., and Hua Zhong at CMU Chris Wren, MERL Event and Activity Discovery work at MERL Posters"Activity recognition and abnormality detection with the
Switching Hidden Semi Markov Model" "Activity Recognition in the Driving Domain" Kari Torkkola, Motorola, Intelligent Systems Lab, Tempe, AZ Future intelligent systems in automobiles need to be aware of the driving and
driver context. Available sensor data stream has to be modeled and monitored in
order to do so. We are interested in developing intelligent driver assistance
systems that, for example, manage the presentation of information to the driver
from various devices or subsystems in the car, essentially managing the workload
of the driver, or alert the driver when his or her attention is not where it
should be. One necessary sub-component of such an intelligent assistance
system is a driving situation detector that recognizes difficult driving
situations requiring full attention of the driver, and then acts as a gate to
information presentation from other devices to the driver. Another component
could be a system detecting where the attention of the driver is directed, or
what is happening in the cockpit. "Activity Mining for Sensor Networks" Chris Wren, (MERL), David Minnen, (GA Tech). We present results from the exploration of activity discovery based on
impoverished sensors. Networks of low-cost, low-power, low-bandwidth sensors are
a practical way of gathering context awareness in buildings. They are more
widely applicable than dense networks of cameras because of their low component
cost, low installation cost, and low privacy cost. However impoverished sensors
pose a significant challenge for activity monitoring due their low capability.
We build on our behavior understand work with impoverished sensors to show some
results relating to behavior discovery and novel event detection. "Discovery of Multiple Resolution Activity" D. Ashbrook, T. Westeyn, D. Minnen, T. Starner (GA Tech) Propagation Networks for Recognition of Partially Ordered Sequential Action Georgia Institute of Technology ,We present Propagation Networks (P-Nets), a novel approach for representing and recognizing sequential activities that include parallel streams of action. We represent each activity using partially ordered intervals. Each interval is restricted by both temporal and logical constraints, including information about its duration and its temporal relationship with other intervals. P-Nets associate one node with each temporal interval. Each node is triggered according to a probability density function that depends on the state of its parent nodes. Each node also has an associated observation function that characterizes supporting perceptual evidence. To facilitate realtime analysis, we introduce a particle filter framework to explore the conditional state space. We modify the original Condensation algorithm to more efficiently sample a discrete state space (D-Condensation). Experiments in the domain of blood glucose monitor calibration demonstrate both the representational power of P-Nets and the effectiveness of the D-Condensation algorithm. Yifan Shi, Yan Huang, David Minnen, Aaron F. Bobick, Irfan A. Essa: Propagation Networks for Recognition of Partially Ordered Sequential Action. CVPR (2) 2004: 862-869
|
|
|