next up previous
Next: The Inverse Observation Model Up: Mathematical Formulation Previous: Soft Constraints


The Observation Model

The low-level features extracted from video comprise the final element of our system. Our system tracks regions that are visually similar, and spatially coherent: blobs [17,12]. The blob spatial statistics are described in terms of their second-order properties for which we denote ${\mbox{\boldmath$\ \mu $}}$ as the mean and ${\mbox{\boldmath$\ \Lambda $}}$ as the covariance. For computational convenience we will interpret this as a Gaussian model, so the probability of an observaion ${\bf o}$ (of a certain color and position within the image), given a blob model is:

\begin{displaymath}
\Pr( {\bf o} \vert {\mbox{\boldmath$ \mu $}}_k, {\mbox{\bol...
...vert{\mbox{\boldmath$ \Lambda $}}_k\right\vert}^{\frac{1}{2}}}
\end{displaymath}

The Gaussian interpretation is not terribly significant, because we also keep a pixel-by-pixel support map showing the actual occupancy [22].

Like other representations used in computer vision and signal analysis, including superquadrics, modal analysis, and eigen representations, blobs represent the global aspects of the shape and can be augmented with higher-order statistics to attain more detail if the data supports it. The reduction of degrees of freedom from individual pixels to blob parameters is a form of regularization which allows the ill-conditioned problem to be solved in a principled and stable way.

These 2-D features are the input to the 3-D blob estimation formulation used by Azarbayejani and Pentland [2]. This relates the 2-D distribution of pixel values to a tracked object's 3-D position and orientation. In our current system we track the hands and head using their color and shape characteristics; the observation equation therefore relates the distribution of hand/face pixel values to the probability distribution of the 3-D state variables that characterize the skeletal models' hand and head links.

These observations supply constraints on the underlying 3-D human model. Due to their probabilistic nature, observations are easily modeled as soft constraints. Observations are integrated into the dynamic evolution of the system by modeling them as descriptions of potential fields, as discussed in Section 2.1.2. The simplest such model is a linear spring model with a fixed a priori weighting $\rho$:

\begin{displaymath}
{\bf Q}_k = \rho \Vert {\bf q} - {\mbox{\boldmath$ \mu $}}_k \Vert
\end{displaymath}

A more sophisticated model takes into account the covariance of the observation model, ${\mbox{\boldmath$\ \Lambda $}}$, to create a more detailed potential field:

\begin{displaymath}
{\bf Q}_k = \rho ( {\bf q} - {\mbox{\boldmath$ \mu $}}_k ) ...
... \Lambda $}}_k^{-1}
( {\bf q} - {\mbox{\boldmath$ \mu $}}_k )
\end{displaymath}

Here $\rho$ is again an a priori weighting of the influence of the observations on the model. Thus $\rho$ plays the role of the Kalman gain in the recursive estimation process, and its value is set accordingly.




next up previous
Next: The Inverse Observation Model Up: Mathematical Formulation Previous: Soft Constraints

1999-02-13