News video retrieval by learning multimodal semantic information

Cited by: 0
Authors
Yu, Hui [1]
Su, Bolan [1]
Lu, Hong [1]
Xue, Xiangyang [1]
Affiliations
[1] Fudan Univ, Dept Comp Sci & Engn, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
Source
Keywords
video retrieval; rich semantic information; TRECVID; manual search task;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the explosion of multimedia data, especially video data, efficient video retrieval has become increasingly important. Years of TREC Video Retrieval Evaluation (TRECVID) research provide a benchmark for the video search task. The video data in TRECVID are mainly news video. This paper introduces a compound model for news video retrieval that consists of several atom search modules, i.e., textual and visual. First, analysis of the query topics helps to improve retrieval performance. Furthermore, multimodal fusion of all atom search modules helps ensure good performance. Experimental results on the TRECVID 2005 and TRECVID 2006 search tasks demonstrate the effectiveness of the proposed method.
Pages: 403 - 414
Number of pages: 12
Related papers
50 in total
  • [31] SEMANTIC-PRESERVING METRIC LEARNING FOR VIDEO-TEXT RETRIEVAL
    Choo, Sungkwon
    Ha, Seong Jong
    Lee, Joonsoo
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2388 - 2392
  • [32] Personalized Video Browsing and Retrieval in a Semantic-Based Learning Environment
    Carbonaro, Antonella
    OPEN KNOWLEDGE SOCIETY: A COMPUTER SCIENCE AND INFORMATION SYSTEMS MANIFESTO, 2008, 19 : 163 - 171
  • [33] Multimodal Learning of Geometry-Preserving Binary Codes for Semantic Image Retrieval
    Irie, Go
    Arai, Hiroyuki
    Taniguchi, Yukinobu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (04) : 600 - 609
  • [34] Not all fake news is semantically similar: Contextual semantic representation learning for multimodal fake news detection
    Peng, Liwen
    Jian, Songlei
    Kan, Zhigang
    Qiao, Linbo
    Li, Dongsheng
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (01)
  • [35] Multimodal Indexing of Multilingual News Video
    Ghosh, Hiranmay
    Kopparapu, Sunil Kumar
    Chattopadhyay, Tanushyam
    Khare, Ashish
    Wattamwar, Sujal Subhash
    Gorai, Amarendra
    Pandharipande, Meghna
    INTERNATIONAL JOURNAL OF DIGITAL MULTIMEDIA BROADCASTING, 2010, 2010
  • [36] Semantic indexing of news video sequences: A multimodal hierarchical approach based on hidden Markov model
    Kolekar, M. H.
    Sengupta, S.
    TENCON 2005 - 2005 IEEE REGION 10 CONFERENCE, VOLS 1-5, 2006, : 2647 - 2652
  • [37] Large Scale News Video Database Browsing and Retrieval via Information Visualization
    Luo, Hangzai
    Fan, Jianping
    Satoh, Shin'ichi
    Ribarsky, William
    APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 1086 - +
  • [38] Multimodal search for effective video retrieval
    Natsev, Apostol
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 525 - 528
  • [39] Towards Semantic Multimodal Video Annotation
    Grassi, Marco
    Morbidoni, Christian
    Piazza, Francesco
    TOWARD AUTONOMOUS, ADAPTIVE, AND CONTEXT-AWARE MULTIMODAL INTERFACES: THEORETICAL AND PRACTICAL ISSUES, 2011, 6456 : 305 - 316
  • [40] News Video Clip Retrieval Based on Topic Caption Text and Audio Information
    Zhao Yaqin
    Zheng Jiaqiang
    Zhou Hongping
    PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL IV, 2009, : 477 - 481