Next: Modeling The Scene Up: Steady State Tracking Previous: Steady State Tracking

Modeling The Person

Figure 2.1: Analysis of a user in the ALIVE environment. The frame on the left is the video input (n.b. color image shown here in black and white for printing purposes), the center frame shows the segmentation of the user into blobs, and the frame on the right shows a model reconstructed from blob statistics alone (with contour shape ignored).

Figure 2.2: The user's hand and face blobs have very similar color statistics. These two blobs end up very close to each other in this frame.

The human is modeled as a connected set of blobs. Each blob has a spatial (x,y) and color (Y,U,V) Gaussian distribution, and a support map that indicates which pixels are members of the blob. We define to be the mean (x,y,Y,U,V) of blob k, and to be the covariance of that blob's distribution. Because of their different semantics, the spatial and color distributions are assumed to be independent. That is, is block-diagonal, with uncoupled spatial and spectral components.

Each blob has associated with it a support map, that indicates which image pixels are members of a particular blob. We define , the support map for blob k, to be

An aggregate support map s(x,y) over all the blob models is also a useful data structure. Since the individual support maps indicate which image pixels are members of that particular blob, the aggregate support map represents the segmentation of the image into spatial/color classes.

Each blob can also have a detailed representation of its shape and appearance, modeled as differences from the underlying blob statistics. The ability to efficiently compute compact representations of people's appearance is useful for low-bandwidth applications, such as our demonstration of a shared virtual environments at SIGGRAPH '95 [6].

The statistics of each blob are recursively updated to combine information contained in the most recent measurements with knowledge contained in the current class statistics and the priors. Because the detailed dynamics of each blob are unknown, we use approximate models derived from experience with a wide range of users. For instance, blobs that are near the center of mass have substantial inertia, whereas blobs toward the extremities can move much faster.

Next: Modeling The Scene Up: Steady State Tracking Previous: Steady State Tracking

Christopher R. Wren
Wed Feb 25 14:56:43 EST 1998