next up previous
Next: Models of Purposeful Motion Up: The Observation Model Previous: The Observation Model

The Inverse Observation Model

In the open-loop system, the vision system uses a Maximum Likelihood (ML) framework to label individual pixels in the scene:

\begin{displaymath}
\Gamma_{ij} = \arg \max_{k}
\left[ \Pr( {\bf O}_{i j} \ver...
...\boldmath$ \mu $}}_k, {\mbox{\boldmath$ \Sigma $}}_k ) \right]
\end{displaymath} (2.8)

where $\Gamma_{ij}$ is the labeling of pixel (i,j), and $({\mbox{\boldmath$\ \mu $}}_k, {\mbox{\boldmath$\ \Sigma $}}_k)$ are the second-order statistics of model k.

To close the loop, we need to incorporate information from the 3-D model. Given the current state of the model ${\bf q}$, it is possible to compute the state of an individual link that matches a specific tracked feature (say the hand), and call it ${\bf v}$. Then, given a model of the camera, it is possible to calculate the perspective projection of that state into 2-D and call it ${\bf v}^{*}$.

Since the vision system uses a stochastic framework, it is necessary to represent this link projection as a statistical model: $\Pr( {\bf O}_{i j} \vert {\bf v}_k^{*} )$. Integrating this information into the 2-D statistical decision framework results in a Maximum A Posteriori decision rule:

\begin{displaymath}
\Gamma_{ij} = \arg \max_{k} \left[ \alpha \Pr ({\bf O}_{i j...
... \alpha - 1 ) \Pr( {\bf O}_{i j} \vert {\bf v}_k^{*} ) \right]
\end{displaymath} (2.9)


next up previous
Next: Models of Purposeful Motion Up: The Observation Model Previous: The Observation Model

1999-06-15