HUMAN-MACHINE COLLABORATIVE VIDEO CODING THROUGH CUBOIDAL PARTITIONING

Cited by: 5
Authors
Ahmmed, Ashek [1, 4]
Paul, Manoranjan [1]
Murshed, Manzur [2]
Taubman, David [3]
Affiliations
[1] Charles Sturt Univ, Sch Comp & Math, Bathurst, NSW, Australia
[2] Federat Univ, Sch Sci Engn & Informat Technol, Ballarat, Vic, Australia
[3] Univ New South Wales, Sch Elect Engn & Telecommun, Kensington, NSW, Australia
[4] Univ New South Wales, Sch Engn & Informat Technol, Kensington, NSW, Australia
Keywords
Cuboid; HEVC; VCM; Object detection;
DOI
10.1109/ICIP42928.2021.9506150
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Video coding algorithms encode and decode an entire video frame, while feature coding techniques preserve and communicate only the most critical information needed for a given application. This is because video coding targets human perception, while feature coding targets machine vision tasks. Recently, attempts have been made to bridge the gap between these two domains. In this work, we propose a video coding framework that uses cuboids to leverage the commonality that exists between human vision and machine vision applications. Cuboids, which are estimated rectangular regions over a video frame, are computationally efficient, have a compact representation, and are object-centric. Such properties have already been shown to add value to traditional video coding systems. Herein, cuboidal feature descriptors are extracted from the current frame and then employed to accomplish a machine vision task in the form of object detection. Experimental results show that a trained classifier yields superior average precision when equipped with a cuboidal-feature-oriented representation of the current test frame. Additionally, this representation costs 7% less in bit rate if the captured frames need to be communicated to a receiver.
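The abstract does not spell out how the cuboids are estimated. As a rough illustration only, the sketch below assumes a greedy scheme in which a grayscale frame is repeatedly cut into axis-aligned rectangles, each time choosing the cut that most reduces the sum of squared errors around each region's mean intensity. The names `sse`, `best_split`, and `partition` are hypothetical and are not taken from the paper; the actual method may differ.

```python
from typing import List, Optional, Tuple

Frame = List[List[float]]
# (top, left, bottom, right); bottom and right are exclusive.
Cuboid = Tuple[int, int, int, int]


def sse(frame: Frame, c: Cuboid) -> float:
    """Sum of squared errors of a region around its own mean intensity."""
    top, left, bottom, right = c
    pixels = [frame[r][col] for r in range(top, bottom) for col in range(left, right)]
    mean = sum(pixels) / len(pixels)
    return sum((p - mean) ** 2 for p in pixels)


def best_split(frame: Frame, c: Cuboid) -> Optional[Tuple[float, Cuboid, Cuboid]]:
    """Return (error reduction, child_a, child_b) for the best cut, or None."""
    top, left, bottom, right = c
    parent = sse(frame, c)
    candidates = []
    # Horizontal cuts: split the region between two rows.
    for r in range(top + 1, bottom):
        candidates.append(((top, left, r, right), (r, left, bottom, right)))
    # Vertical cuts: split the region between two columns.
    for col in range(left + 1, right):
        candidates.append(((top, left, bottom, col), (top, col, bottom, right)))
    best = None
    for a, b in candidates:
        gain = parent - sse(frame, a) - sse(frame, b)
        if best is None or gain > best[0]:
            best = (gain, a, b)
    return best  # None when the region is a single pixel


def partition(frame: Frame, n_cuboids: int) -> List[Cuboid]:
    """Greedily split the frame until n_cuboids regions exist (assumed scheme)."""
    cuboids: List[Cuboid] = [(0, 0, len(frame), len(frame[0]))]
    while len(cuboids) < n_cuboids:
        options = []
        for i, c in enumerate(cuboids):
            split = best_split(frame, c)
            if split is not None:
                options.append((split, i))
        if not options:
            break  # nothing left to split
        (gain, a, b), i = max(options, key=lambda t: t[0][0])
        cuboids[i:i + 1] = [a, b]
    return cuboids


# A 4x4 frame whose left half is dark and right half is bright.
frame = [[0, 0, 10, 10] for _ in range(4)]
print(partition(frame, 2))
```

On this toy frame the first cut falls on the boundary between the dark and bright halves, which is the sense in which such rectangles can be called object-centric. Each region is then describable by four coordinates plus a mean value, which is one way to read the "compact representation" mentioned above.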
Pages: 2074 - 2078
Number of pages: 5