Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes

被引:0
|
作者
Kim, Byoungjun [1 ]
Shim, Ji Yea [1 ]
Park, Minho [1 ]
Ro, Yong Man [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon, South Korea
来源
关键词
Scene graph; Scene text; Audio classification;
D O I
10.1007/978-3-030-37734-2_73
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a video retrieval tool for the 2020 Video Browser Showdown (VBS). The tool enhances the user's video browsing experience by ensuring full use of video analysis database constructed prior to the Showdown. Deep learning based object detection, scene text detection, scene color detection, audio classification and relation detection with scene graph generation methods have been used to construct the data. The data is composed of visual, textual, and auditory information, broadening the scope to which a user can search beyond visual information. In addition, the tool provides a simple and user-friendly interface for novice users to adapt to the tool in little time.
引用
收藏
页码:803 / 808
页数:6
相关论文
共 50 条
  • [1] Person Retrieval in Video Surveillance Using Deep Learning-Based Instance Segmentation
    Tseng, Chien-Hao
    Hsieh, Chia-Chien
    Jwo, Dah-Jing
    Wu, Jyh-Horng
    Sheu, Ruey-Kai
    Chen, Lun-Chi
    [J]. JOURNAL OF SENSORS, 2021, 2021
  • [2] Object of Interest and Unsupervised Learning-based Framework for an Effective Video Summarization Using Deep Learning
    Negi, Alok
    Kumar, Krishan
    Saini, Parul
    [J]. IETE JOURNAL OF RESEARCH, 2024, 70 (05) : 5019 - 5030
  • [3] A Deep Transfer Learning-Based Object Tracking Algorithm for Hyperspectral Video
    Tang Yiming
    Liu Yufei
    Huang Hong
    Zhang Chao
    Yuan Li
    [J]. IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 811 - 820
  • [4] Learning-based interactive video retrieval system
    Wu, Chi-Jiunn
    Zeng, Hui-Chi
    Huang, Szu-Hao
    Lai, Shang-Hong
    Wang, Wen-Hao
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1785 - 1788
  • [5] A Deep Learning-Based Coyote Detection System Using Audio Data
    Jung, Heesun
    Kwon, Bokyung
    Kim, Youngbin
    Lee, Yejin
    Park, Jihyeon
    Pegg, Griffin
    Wang, Yaqin
    Smith, Anthony H.
    [J]. 2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 170 - 175
  • [6] Object detection and recognition using deep learning-based techniques
    Sharma, Preksha
    Gupta, Surbhi
    Vyas, Sonali
    Shabaz, Mohammad
    [J]. IET COMMUNICATIONS, 2023, 17 (13) : 1589 - 1599
  • [7] Deep Learning-based Anomaly Detection for Compressors Using Audio Data
    Mobtahej, Pooyan
    Zhang, Xulong
    Hamidi, Maryam
    Zhang, Jing
    [J]. 67TH ANNUAL RELIABILITY & MAINTAINABILITY SYMPOSIUM (RAMS 2021), 2021,
  • [8] An Intelligent Retrieval Method for Audio and Video Content: Deep Learning Technology Based on Artificial Intelligence
    Sun, Maojin
    [J]. IEEE ACCESS, 2024, 12 : 123430 - 123446
  • [9] Deep Learning-Based Multi-class Multiple Object Tracking in UAV Video
    Micheal, A. Ancy
    Vani, K.
    [J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (12) : 2543 - 2552
  • [10] A Deep Learning-Based Real-Time Video Object Contextualizing and Archiving System
    Pham, Dinh-Lam
    Yoon, Byeongnam
    Vu, Viet-Vu
    Kim, Joo-Chang
    Ahn, Sang-Eun
    Chang, Jeong-Hyun
    Yoo, Hyun
    Sun, Kyonghee
    Kim, Kyong-Sook
    Kim, Kwanghoon Pio
    [J]. 2023 25TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY, ICACT, 2023, : 137 - 144