HUMAN-MACHINE COLLABORATIVE VIDEO CODING THROUGH CUBOIDAL PARTITIONING

Cited by: 5
Authors
Ahmmed, Ashek [1, 4]
Paul, Manoranjan [1]
Murshed, Manzur [2]
Taubman, David [3]
Affiliations
[1] Charles Sturt Univ, Sch Comp & Math, Bathurst, NSW, Australia
[2] Federat Univ, Sch Sci Engn & Informat Technol, Ballarat, Vic, Australia
[3] Univ New South Wales, Sch Elect Engn & Telecommun, Kensington, NSW, Australia
[4] Univ New South Wales, Sch Engn & Informat Technol, Kensington, NSW, Australia
Keywords
Cuboid; HEVC; VCM; Object detection
DOI
10.1109/ICIP42928.2021.9506150
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Video coding algorithms encode and decode an entire video frame, whereas feature coding techniques preserve and communicate only the most critical information needed for a given application. This is because video coding targets human perception, while feature coding targets machine vision tasks. Recently, attempts have been made to bridge the gap between these two domains. In this work, we propose a video coding framework that uses cuboids to leverage the commonality between human vision and machine vision applications. Cuboids, which are estimated rectangular regions over a video frame, are computationally efficient, have a compact representation, and are object-centric. Such properties have already been shown to add value to traditional video coding systems. Herein, cuboidal feature descriptors are extracted from the current frame and then employed to accomplish a machine vision task in the form of object detection. Experimental results show that a trained classifier yields superior average precision when given a cuboidal-feature-oriented representation of the current test frame. Additionally, this representation costs 7% less in bit rate if the captured frames need to be communicated to a receiver.
Pages: 2074-2078
Page count: 5