The Inverse Observation Model

Next: Models of Purposeful Motion Up: The Observation Model Previous: The Observation Model

The Inverse Observation Model

In the open-loop system, the vision system uses a Maximum Likelihood (ML) framework to label individual pixels in the scene:

$\begin{displaymath} \Gamma_{ij} = \arg \max_{k} \left[ \Pr( {\bf O}_{i j} \ver... ...\boldmath$ \mu $}}_k, {\mbox{\boldmath$ \Sigma $}}_k ) \right] \end{displaymath}$

(2.8)

where $\Gamma_{ij}$ is the labeling of pixel (i,j), and $({\mbox{\boldmath$\ \mu $}}_k, {\mbox{\boldmath$\ \Sigma $}}_k)$ are the second-order statistics of model k.

To close the loop, we need to incorporate information from the 3-D model. Given the current state of the model ${\bf q}$ , it is possible to compute the state of an individual link that matches a specific tracked feature (say the hand), and call it ${\bf v}$ . Then, given a model of the camera, it is possible to calculate the perspective projection of that state into 2-D and call it ${\bf v}^{*}$ .

Since the vision system uses a stochastic framework, it is necessary to represent this link projection as a statistical model: $\Pr( {\bf O}_{i j} \vert {\bf v}_k^{*} )$ . Integrating this information into the 2-D statistical decision framework results in a Maximum A Posteriori decision rule:

$\begin{displaymath} \Gamma_{ij} = \arg \max_{k} \left[ \alpha \Pr ({\bf O}_{i j... ... \alpha - 1 ) \Pr( {\bf O}_{i j} \vert {\bf v}_k^{*} ) \right] \end{displaymath}$

(2.9)

Next: Models of Purposeful Motion Up: The Observation Model Previous: The Observation Model

1999-06-15