Figure 1. Various graphical models for visual tracking. A node corresponds to a frame whose color indicates the type of target variation in the node. The temporal order of frames is given by the alphabetical order. (a) is conventional chain model, while (b),(c) and (d) represent the proposed graphical models. |
Although visual tracking problem has been studied extensively for decades, the underlying graphical model of most existing probabilistic tracking algorithms is limited to linear structure, i.e., first-order Markov chain (Figure 1a). In this model, the video is represented by chain of temporally ordered frames with assumptions on smooth variation of the target between two consecutive frames. When the assumption does not hold due to the challenges in real-world videos, e.g. fast motion, shot change, etc., tracking may fail on few challenging frames and the failures would be propagated to the rest of frames.
To resolve such issues and achieve persistent tracking in real-world videos, we believe that more general representation of the video is necessary, and developed various graphical models beyond chain model as follows.
|
|
|
Datasets and results (ZIP, 139 MB)
(Codes are not available at the moment due to proprietary issue, and will be released after the issue is resolved.)