Weighted Multiple Instance-Based Deep Correlation Filter for Video Tracking Processing

With the development of internet technology, the video data has been widely used in multimedia devices, such as video surveillance, webcast, and so on. Lots of visual processing algorithms are developed to handle the corresponding visual task, but the challenging problems still exist. In this paper, we propose a weighted multiple instances based deep correlation filter for visual tracking processing, which utilizes the importance of instances for training of deep learning model and correlation filter. First, the initial object appearance is modeled based on the confidence of the object and background at the first frame. During the tracking, the superpixel is used to capture the object appearance variations. Most importantly, our tracker can enhance the discriminative ability of the object using deep residual network and improve the tracking efficiency with correlation filter. Second, we introduce the sample importance into residual deep learning model to improve the training performance. We define the importance of each instance by computing the sore of all the pixels within the corresponding instance. Third, we update the parameters of deep learning network and correlation filter in a fixed interval frames to reduce the object drift. Extensive experiments on the OTB2015 benchmark and VOT2018 dataset demonstrate that the proposed object tracking algorithm outperforms the state-of-the-art tracking algorithms.