College of Computing,
Georgia Institute of Technology,
Atlanta, GA 30332-0280 USA
Presented at ICCV 1999 in Corfu, Greece.
We present a method to decompose video sequences into layers that represent the relative depths of complex scenes. Our method combines spatial information with temporal occlusions to determine relative depths of these layers. Spatial information is obtained through edge detection and a customized contour completion algorithm. Activity in a scene is used to extract temporal occlusion events, which are in turn, used to classify objects as occluders or occludees. The path traversed by the moving objects determines the segmentation of the scene. Several examples of decompositing and compositing of video are shown. This approach can be applied in the pre-processing of sequences for compositing or tracking purposes and to determine the approximate 3D structure of a scene.
PowerPoint Slides Used in Oral Presentation of this paper at the conference. (Videos not embedded. Samples are linked below instead.)
of the paper.
PDF Version of this paper.
Simple example of pushing into a background layer. (5.7Mb)
Step-by-step decomposition of "toy" sequence. (17Mb)
&nb sp; Note both pushing and popping occur here.
&n bsp; Results of processing and demonstration of recompositing application. (2.5Mb)
sequence (5Mb): &n
; Results of processing (1.5Mb)
Original sequence (6Mb):
Results appear in PowerPoint presentation.