In addition, we define translation as . Rotation is defined in terms of which are the incremental Euler angles for the interframe rotation. This representation of rotation overcomes the normality constraints of the quaternion representation by linearizing with a tangent hyper-plane on the unit hyper-sphere formed by the quaternion representation.
The final representation of the internal state vector has a total of 7+N parameters where N is the number of feature points being tracked (each of which requires one scalar depth value to determine 3D structure):
At each time step, we also have a measurement or observation vector, of size 2N with the following form:
Where (Xi,Yi) are the positions of a feature point currently being tracked in the image. Unlike other formulations which are underdetermined at every time step, the above parametrization of the SfM problem is well-posed when or when . Thus, if 7 or more feature points are being tracked in 2D simultaneously, a unique, well-constrained solution can be found for the internal state and a recursive filter can be employed.
Due to the non-linearities in the mapping of state vector to measurements, an extended Kalman filter is used as the estimator. The dynamics of the internal state are trivially chosen to be identity with Gaussian noise for each time step.