Anchor-based Detection for Natural Language Localization in Ego-centric Videos

被引:1
|
作者
Liu, Bei [1 ]
Zheng, Sipeng [2 ]
Fu, Jianlong [1 ]
Cheng, Wen-Huang [3 ]
机构
[1] Microsoft Res Asia, Beijing, Peoples R China
[2] Renmin Univ China, Beijing, Peoples R China
[3] Natl Yang Ming Chiao Tung Univ, Hsinchu, Taiwan
关键词
Embodied AI; ego-centric video; cross-modality; video understanding;
D O I
10.1109/ICCE56470.2023.10043460
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Natural Language Localization (NLL) task aims to localize a sentence in a video with starting and ending timestamps. It requires a comprehensive understanding of both language and videos. We have seen a lot of work conducted for third-person view videos, while the task on ego-centric videos is still under-explored, which is critical for the understanding of increasing ego-centric videos and further facilitating embodied AI tasks. Directly adapting existing methods of NLL to egocentric video datasets is challenging due to two reasons. Firstly, there is a temporal duration gap between different datasets. Secondly, queries in ego-centric videos usually require a better understanding of more complex and long-term temporal orders. For the above reason, we propose an anchor-based detection model for NLL in ego-centric videos.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] Fundamental limits of single anchor-based cooperative localization in millimeter wave systems
    Feng Zhao
    Tiancheng Huang
    Donglin Wang
    EURASIP Journal on Advances in Signal Processing, 2020
  • [32] Fundamental limits of single anchor-based cooperative localization in millimeter wave systems
    Zhao, Feng
    Huang, Tiancheng
    Wang, Donglin
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2020, 2020 (01)
  • [33] Indian pothole detection based on CNN and anchor-based deep learning method
    Anandhalli M.
    Tanuja A.
    Baligar V.P.
    Baligar P.
    International Journal of Information Technology, 2022, 14 (7) : 3343 - 3353
  • [34] Analysis and a Solution of Momentarily Missed Detection for Anchor-based Object Detectors
    Hosoya, Yusuke
    Suganuma, Masanori
    Okatani, Takayuki
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1399 - 1407
  • [35] Efficient Hardware Post Processing of Anchor-Based Object Detection on FPGA
    Zhang, Hui
    Wu, Wei
    Ma, Yufei
    Wang, Zhongfeng
    2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 580 - 585
  • [36] SQUARE GRID PATH PLANNING FOR MOBILE ANCHOR-BASED LOCALIZATION IN WIRELESS SENSOR NETWORKS
    Boukhari, Nawel
    Bouamama, Salim
    COMPUTER SCIENCE-AGH, 2023, 24 (04): : 513 - 535
  • [37] An Anchor-Based Localization in Underwater Wireless Sensor Networks for Industrial Oil Pipeline Monitoring
    Goyal, Nitin
    Nain, Mamta
    Singh, Aman
    Abualsaud, Khalid
    Alsubhi, Khalid
    Ortega-Mansilla, Arturo
    Zorba, Nizar
    IEEE CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2022, 45 (04): : 466 - 474
  • [38] Anchor-Based Transformer for Temporal LiDAR 3D Object Detection
    Gu, Rongqi
    Wu, Fei
    Liu, Peigen
    Yang, Chu
    Lu, Yaohan
    Chen, Guang
    2024 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS, ICARM 2024, 2024, : 45 - 50
  • [39] Single Anchor-Based Infrastructure-less Localization Performance using UWB Radios
    Bhushan, Shashank
    Ahluwalia, Ashish
    Deshwal, Ashutosh
    Pal, Amitangshu
    2024 20TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SMART SYSTEMS AND THE INTERNET OF THINGS, DCOSS-IOT 2024, 2024, : 639 - 646
  • [40] SWDet: Anchor-Based Object Detector for Solid Waste Detection in Aerial Images
    Zhou, Liming
    Rao, Xiaohan
    Li, Yahui
    Zuo, Xianyu
    Liu, Yang
    Lin, Yinghao
    Yang, Yong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 306 - 320