基于动态捕获区域的DC-TLD目标跟踪算法

何俊衡; 刘曙; 狄红卫

doi:10.12086/oee.2018.180030

基于动态捕获区域的DC-TLD目标跟踪算法

- 暨南大学理工学院光电工程系，广东广州 510632
基金项目:
广东省科技计划项目(2013B010204055)

详细信息

作者简介:
何俊衡(1994-)，男，硕士研究生，主要从事目标跟踪和图像处理的研究。E-mail：hjh381@163.com

**^*通讯作者:** 狄红卫(1969-)，男，博士，教授，主要从事图像与视频处理、目标检测和目标跟踪的研究。E-mail：tdhw@jnu.edu.cn

中图分类号: TP394.1; TH691.9

收稿日期: 2018-01-18

修回日期: 2018-04-29

刊出日期: 2018-08-01

TLD target tracking algorithm based on dynamic capture

- Department of Optoelectronic Engineering, College of Science & Engineering, Jinan University, Guangzhou, Guangdong 510632, China
Fund Project: Supported by Guangdong Science and Technology Project (2013B010204055)

More Information

**^*Corresponding author:** Di Hongwei, E-mail:tdhw@jnu.edu.cn

Received Date 18 January 2018

Revised Date 29 April 2018

Published Date 01 August 2018

摘要

摘要

为提升TLD目标跟踪算法的处理速度，以达到在更高分辨率视频中跟踪目标的实时性要求，在TLD算法框架的基础上，提出了一种基于动态捕获区域的TLD目标跟踪算法(DC-TLD)。算法采用前一帧目标位置作为当前帧目标位置的预测值，减小了目标位置的预测误差。研究了检测区域负样本出现需满足的条件，分析了检测区域大小对算法鲁棒性的影响。针对样本的访问方式，提出基于索引的访问方法，极大地减少了访问时间。实验结果表明，该方法不仅有效降低了TLD算法的样本检测时间，而且提高了算法的鲁棒性。
- 目标跟踪 /
- TLD /
- 动态捕获区域 /
- 负样本 /
- 索引访问
Abstract

In order to improve the ability of processing video image with high frame rate and meet the real-time requirement in higher resolution video tracking goal, DC-TLD target tracking algorithm, which is based on TLD algorithm and the theory of capturing sample dynamically, is proposed. Firstly, DC-TLD takes the target location of the previous frame as the predicted value of the current frame. Secondly, it calculates the threshold r of negative samples for ensuring that the number of negative samples is sufficient. Thirdly, it accesses the samples by index. The results of experiments show that DC-TLD is more robust and more efficient.
- target tracking /
- tracking-learning-detection /
- dynamic capture /
- negative sample /
- index access

Overview

Overview

Overview: The target tracking technology based on computer vision is widely used in civil and military fields, such as traffic monitoring, security monitoring, uav tracking and human-computer interaction. However, target tracking is hardly applied to the actual scene. The main reasons are as followed. To begin with, target may suffer from deformation, illumination variation and background clutter during the tracking, which will make the tracking system lose the target. What's more, target may be vague due to the fast motion. Last but not least, target may be blocked by something. In addition, many target tracking algorithms are too complicated to complete real-time tracking.

The tracking-learning-detection algorithm is a new single-target long time tracking algorithm proposed by Zdenek Kalal. The tracking-learning-detection (TLD) algorithm is different with the algorithm which is based on the conventional tracking algorithm. The TLD algorithm combines the tracking algorithm and the detection algorithm to solve the problem about the shape change, fast moving and partial shade of the target during the tracking. At the same time, the target model and related parameters of tracking module and detection module are constantly updated through online learning module, which makes the tracking effect more stable, robust and reliable. However, TLD target tracking algorithm has high complexity and it is difficult to achieve the real-time effect of target tracking. Because the TLD algorithm has good robustness and is suitable for application to actual tracking process, this paper studies how to improve and optimize the TLD algorithm.

By analyzing the characteristics of the TLD algorithm, this paper proposes DC-TLD target tracking algorithm based on dynamic capture region. Firstly, by changing the target location prediction method, the position of the previous frame is used to predict the position of the target in the current frame, which will reduce the detection time of the target sample. Secondly, the sample selection method is improved to obtain enough positive and negative samples in the small sample selection. Thirdly, the sample access method is improved by numbering the sample, and the access speed of the sample is improved by index access. The above measures successfully reduce the complexity of the TLD target tracking algorithm and improve the robustness of the algorithm, which make the algorithm closer to real-time requirements. However, when the tracking target is large, the tracking effect is similar as DC-TLD algorithm and TLD algorithm. And it is hard for DC-TLD to track multiple targets.

HTML全文

图 1 TLD算法框架^[5]

Figure 1. TLD algorithm framework^[5]

下载: 全尺寸图片幻灯片

图 2 DC-TLD算法流程

Figure 2. DC-TLD algorithm framework

下载: 全尺寸图片幻灯片

图 3 BlurFace目标位置预测与误差

Figure 3. BlurFace target location prediction and error

下载: 全尺寸图片幻灯片

图 4 Jumping目标位置预测与误差

Figure 4. Jumping target location prediction and error

下载: 全尺寸图片幻灯片

图 5 临界点分析图

Figure 5. Critical point analysis diagram

下载: 全尺寸图片幻灯片

图 6 R/r和跟踪成功率、失败率及负样本数的关系

Figure 6. R/r and tracking success rate, failure rate, negative sample size relationship

下载: 全尺寸图片幻灯片

图 7 R/r和品质因数Q的关系

Figure 7. R/r and Q relations

下载: 全尺寸图片幻灯片

图 8 Crowds测试集算法每帧运行时间

Figure 8. Crowds test set algorithm running time per frame

下载: 全尺寸图片幻灯片

图 9 BlurFace测试集算法每帧运行时间

Figure 9. BlurFace test set algorithm running time per frame

下载: 全尺寸图片幻灯片

图 10 视频序列测试图。

Figure 10. Video sequence test diagram.

下载: 全尺寸图片幻灯片

表 1 重叠率方法和索引访问方法运行时间对比

Table 1. Overlap ratio method and index access method running time comparison

测试集	重叠率选取样本数N_o	索引访问选取样本数N_i	重叠率耗时T_o/ms	索引访问耗时T_i/ms	加速比γ
Crowds	2523	5278	2.79	0.0282	207
Football	3069	4054	2.87	0.0221	172
Human6	2861	7608	3.15	0.0597	140
Walking	3849	14000	3.55	0.0741	174

下载: 导出CSV

表 2 视频测试实验跟踪时间结果

Table 2. Results of video test tracking time

测试集	视频帧数	TLD/ms	DC-TLD/ms
Crowds	347	85.42	27.47
Football	312	104.37	55.28
Jumping	313	45.16	20.34
Walking	412	149.63	76.47
Fish	475	71.22	34.43
BlurFace	493	130.30	73.85

下载: 导出CSV

表 3 测试视频精确度

Table 3. Test video accuracy

测试集	视频帧数	TLD成功跟踪帧数	DC-TLD成功跟踪帧数
Crowds	347	76	232
Football	312	286	288
Jumping	313	217	309
Walking	412	412	412
Fish	475	399	448
BlurFace	493	493	493

下载: 导出CSV

参考文献(13)

参考文献

[1]	程卫亮, 王向军, 万子敬, 等.压缩域目标跟踪算法在小型化DSP平台上的研究与实现[J].光电工程, 2017, 44(10): 972–982. doi: 10.3969/j.issn.1003-501X.2017.10.005 Cheng W L, Wang X J, Wan Z J, et al. Research and implementation of target tracking algorithm in compression domain on miniaturized DSP platform[J]. Opto-Electronic Engineering, 2017, 44(10): 972–982. doi: 10.3969/j.issn.1003-501X.2017.10.005
[2]	Henriques J F, Rui C, Martins P, et al. High-speed tracking with kernelized correlation filters[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(3): 583–596. doi: 10.1109/TPAMI.2014.2345390
[3]	Nam H, Han B. Learning multi-domain convolutional neural networks for visual tracking[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 4293–4302.http://ieeexplore.ieee.org/document/7780834/
[4]	Nam H, Baek M, Han B. Modeling and propagating CNNs in a tree structure for visual tracking[EB/OL]. https://arxiv.org/abs/1608.07242v1, 2016.
[5]	Kalal Z, Mikolajczyk K, Matas J. Tracking-learning-detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(7): 1409–1422. doi: 10.1109/TPAMI.2011.239
[6]	Hare S, Saffari A, Torr P H S. Struck: structured output tracking with kernels[C]//Proceedings of 2011 IEEE International Conference on Computer Vision. Barcelona, Spain: IEEE, 2012: 263–270.http://ieeexplore.ieee.org/xpls/abs_all.jsp%3Farnumber%3D6126251
[7]	Brox T, Bruhn A, Papenberg N, et al. High accuracy optical flow estimation based on a theory for warping[C]//Proceedings of 2004 Computer Vision-ECCV. Berlin, Heidelberg: Springer, 2004: 25–36.http://link.springer.com/10.1007/978-3-540-24673-2_3
[8]	Kalal Z, Mikolajczyk K, Matas J. Forward-backward error: automatic detection of tracking failures[C]//Proceedings of 2010 20th International Conference on Pattern Recognition. Istanbul, Turkey: IEEE, 2010: 2756–2759.http://doi.ieeecomputersociety.org/10.1109/ICPR.2010.675
[9]	Viola P, Jones M. Rapid object detection using a boosted cascade of simple features[C]//Proceedings of 2011 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Kauai: IEEE, 2003.http://doi.ieeecomputersociety.org/10.1109/CVPR.2001.990517
[10]	Kalal Z, Matas J, Mikolajczyk K. P-N learning: bootstrapping binary classifiers by structural constraints[C]//Proceedings of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Francisco: IEEE, 2014: 49–56.http://doi.ieeecomputersociety.org/10.1109/CVPR.2010.5540231
[11]	江博. 基于Kalman的TLD目标跟踪算法研究[D]. 西安: 西安科技大学, 2013. Jiang B. Research on TLD algorithm based on Kalman filter[D]. Xi'an: Xi'an University of Science and Technology, 2013.http://cdmd.cnki.com.cn/Article/CDMD-10704-1014000315.htm
[12]	王美健, 吴小俊.适于非线性机动目标跟踪的新IMM平滑算法[J].光电工程, 2016, 43(10): 18–24. doi: 10.3969/j.issn.1003-501X.2016.10.004 Wang M J, Wu X J. New IMM smoothing algorithm suitable for nonlinear maneuvering target tracking[J]. Opto-Electronic Engineering, 2016, 43(10): 18–24. doi: 10.3969/j.issn.1003-501X.2016.10.004
[13]	Wu Y, Lim J, Yang M H. Object tracking benchmark[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1834–1848. doi: 10.1109/TPAMI.2014.2388226

施引文献

资源附件(0)

访问统计

访问统计

点击扫一扫

图(10)

表(3)

计量

文章访问数:
PDF下载数:
施引文献: 0

基于动态捕获区域的DC-TLD目标跟踪算法

作者简介:
何俊衡(1994-)，男，硕士研究生，主要从事目标跟踪和图像处理的研究。E-mail：hjh381@163.com

**^*通讯作者:** 狄红卫(1969-)，男，博士，教授，主要从事图像与视频处理、目标检测和目标跟踪的研究。E-mail：tdhw@jnu.edu.cn

TLD target tracking algorithm based on dynamic capture

**^*Corresponding author:** Di Hongwei, E-mail:tdhw@jnu.edu.cn

摘要

Abstract

Overview

参考文献

访问统计

计量

目录

作者须知

其他内容

条款和政策

基于动态捕获区域的DC-TLD目标跟踪算法

作者简介: 何俊衡(1994-)，男，硕士研究生，主要从事目标跟踪和图像处理的研究。E-mail：hjh381@163.com

*通讯作者: 狄红卫(1969-)，男，博士，教授，主要从事图像与视频处理、目标检测和目标跟踪的研究。E-mail：tdhw@jnu.edu.cn

TLD target tracking algorithm based on dynamic capture

*Corresponding author: Di Hongwei, E-mail:tdhw@jnu.edu.cn

摘要

Abstract

Overview

参考文献

访问统计

计量

出版历程

目录

作者须知

其他内容

条款和政策

作者简介:
何俊衡(1994-)，男，硕士研究生，主要从事目标跟踪和图像处理的研究。E-mail：hjh381@163.com

**^*通讯作者:** 狄红卫(1969-)，男，博士，教授，主要从事图像与视频处理、目标检测和目标跟踪的研究。E-mail：tdhw@jnu.edu.cn

**^*Corresponding author:** Di Hongwei, E-mail:tdhw@jnu.edu.cn