News video retrieval by learning multimodal semantic information

Authors
Yu, Hui [1]
Su, Bolan [1]
Lu, Hong [1]
Xue, Xiangyang [1]
Affiliations
[1] Fudan Univ, Dept Comp Sci & Engn, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
Source
Keywords
video retrieval; rich semantic information; TRECVID; manual search task;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
With the explosion of multimedia data, especially video data, efficient video retrieval has become increasingly important. Years of TREC Video Retrieval Evaluation (TRECVID) research have provided a benchmark for the video search task. The video data in TRECVID are mainly news video. This paper introduces a compound model for news video retrieval that consists of several atom search modules, i.e., textual and visual. First, analysis of the query topics helps to improve retrieval performance. Furthermore, multimodal fusion of all atom search modules ensures good overall performance. Experimental results on the TRECVID 2005 and TRECVID 2006 search tasks demonstrate the effectiveness of the proposed method.
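The abstract does not say how the atom search modules are fused, so the following is only a minimal sketch of one common approach, weighted late fusion, and not the authors' method. It assumes each module returns a score per video shot, rescales each module's scores to [0, 1], and combines them with per-module weights. The names `min_max_normalize`, `fuse`, the shot identifiers, and the weight values are all hypothetical and chosen for illustration.

```python
from typing import Dict, List, Tuple


def min_max_normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale one module's scores to [0, 1] so modules are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All shots scored the same: the module carries no ranking signal.
        return {shot: 0.0 for shot in scores}
    return {shot: (s - lo) / (hi - lo) for shot, s in scores.items()}


def fuse(
    module_scores: Dict[str, Dict[str, float]],
    weights: Dict[str, float],
) -> List[Tuple[str, float]]:
    """Weighted late fusion (an assumed scheme, not taken from the paper).

    module_scores maps a module name (e.g. "text", "visual") to that
    module's {shot_id: score}. Shots missing from a module contribute 0.
    Returns (shot_id, fused_score) pairs, best first.
    """
    fused: Dict[str, float] = {}
    for name, scores in module_scores.items():
        w = weights.get(name, 0.0)
        for shot, s in min_max_normalize(scores).items():
            fused[shot] = fused.get(shot, 0.0) + w * s
    return sorted(fused.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical scores from a textual and a visual module.
example_scores = {
    "text": {"shot_a": 2.0, "shot_b": 1.0, "shot_c": 0.0},
    "visual": {"shot_a": 0.0, "shot_b": 4.0, "shot_c": 2.0},
}
# Hypothetical weights; query-topic analysis could select them per topic.
example_weights = {"text": 0.5, "visual": 0.5}

ranking = fuse(example_scores, example_weights)
```

With these illustrative inputs, `shot_b` ranks first because it scores reasonably in both modules. In a setting like the one the abstract describes, the query-topic analysis might be used to pick a different weight set per topic type, but that link is an assumption here.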
Pages: 403-414
Number of pages: 12