A Hybrid Model for Video Compression Based on the Fusion of Feature Compression Framework and Multi-object Tracking Network

被引:0
|
作者
Chen, Yunyu [1 ]
Wang, Lichuan [1 ]
Zhang, Yuan [1 ,2 ]
机构
[1] China Telecom Res Inst, Coll Big Data & Artificial Intelligence, Shanghai, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China
关键词
Feature Compression; Video Compression; Object Tracking;
D O I
10.1007/978-981-99-8552-4_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a video compression algorithm which bases on the combination of feature compression framework and multi-object tracking network. Here, the multi-object tracking network uses joint detection and embedding (JDE) network. The feature compression framework is consisted of extraction network, encoder/decoder and reconstruction network. More specifically, features of the last layer which is before FPN (Feature Pyramid Network) in JDE are first condensed by extraction network to obtain a low-dimensional compact representation. The compact representation allows for losing feature details, whilst capturing several key important cues within the feature at an extremely low bit-rate cost. Then, the condensed features are encoded and decoded by DCT (Discrete Cosine Transform) method. After that, the features are restored by reconstruction network. Therefore, our method combines deep neural network and DCT, so as to constitute a new hybrid compression architecture to ensure robust and efficient compression. The proposed method is evaluated on object tracking task and the experimental results prove that this proposed framework significantly outperforms both image anchor and feature anchor. In particular, it is reported that 46.70% BD-rate saving can be achieved against the image anchor and 92.17% BD-rate saving can be achieved against the feature anchor for the object tracking task.
引用
收藏
页码:14 / 25
页数:12
相关论文
共 50 条
  • [1] Feature Compression for Multimodal Multi-Object Tracking
    Li, Xinlin
    Hanna, Osama A.
    Fragouli, Christina
    Diggavi, Suhas
    Verma, Gunjan
    Bhattacharyya, Joydeep
    MILCOM 2023 - 2023 IEEE MILITARY COMMUNICATIONS CONFERENCE, 2023,
  • [2] Multi-object tracking based on network flow model and ORB feature
    Jieyu Chen
    Zhenghao Xi
    Junxin Lu
    Jingjing Ji
    Applied Intelligence, 2022, 52 : 12282 - 12300
  • [3] Multi-object tracking based on network flow model and ORB feature
    Chen, Jieyu
    Xi, Zhenghao
    Lu, Junxin
    Ji, Jingjing
    APPLIED INTELLIGENCE, 2022, 52 (11) : 12282 - 12300
  • [4] Multi-feature fusion tracking algorithm based on generative compression network
    Wang, Peng
    Fu, Huitong
    Li, Xiaoyan
    Guo, Jia
    Lv, Zhigang
    Di, Ruohai
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 124 : 206 - 214
  • [5] Multiple Feature Fusion in the Dempster-Shafer Framework for Multi-Object Tracking
    Riahi, Dorra
    Bilodeau, Guillaume-Alexandre
    2014 CANADIAN CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2014, : 313 - 320
  • [6] Pedestrian Multi-object Tracking Algorithm Based on Attention Feature Fusion
    Zhou, Yan
    Du, Zhennan
    Wang, Dongli
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2023, PT I, 2023, 14134 : 105 - 118
  • [7] Multi-Object Tracking Based on Feature Fusion and Hierarchical Data Association
    Liu, Yan
    Qin, Pinle
    Zeng, Jianchao
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1742 - 1747
  • [8] Three-Dimensional Multi-Object Tracking Based on Feature Fusion and Similarity Estimation Network
    Chen Wenming
    Hong Ru
    Gai Shaoyan
    Da Feipeng
    ACTA OPTICA SINICA, 2022, 42 (16)
  • [9] Embedded tracking algorithm based on multi-feature crowd fusion and visual object compression
    Zheng Wenyi
    Dong Decun
    EURASIP JOURNAL ON EMBEDDED SYSTEMS, 2016,
  • [10] A Framework to Combine Multi-Object Video Segmentation and Tracking
    Nadeem, Sehr
    Rahman, Anis
    Butt, Asad A.
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 525 - 531