Integrating multi-modal content analysis and hyperbolic visualization for large-scale news video retrieval and exploration

被引:5
|
作者
Luo, H. [2 ]
Fan, J. [1 ]
Satoh, S. [3 ]
Yang, J. [1 ]
Ribarsky, W. [1 ]
机构
[1] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA
[2] E China Normal Univ, Inst Software Engn, Shanghai 200062, Peoples R China
[3] Natl Inst Informat, Tokyo 1018430, Japan
关键词
multi-modal content analysis; interestingness assignment; association determination; hyperbolic visualization;
D O I
10.1016/j.image.2008.04.014
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we have developed a novel scheme to achieve more effective analysis, retrieval and exploration of large-scale news video collections by performing multi-modal video content analysis and synchronization. First, automatic keyword extraction is performed on news closed captions and audio channels to detect the most interesting news topics (i.e., keywords for news topic interpretation), and the associations among these news topics (i.e., contextual relationships among the news topics) are further determined according to their co-occurrence probabilities. Second, visual semantic items, such as human faces, text captions, video concepts. are extracted automatically by using our semantic video analysis techniques. The news topics are automatically synchronized with the most relevant visual semantic items. In addition, an interestingness weight is assigned for each news topic to characterize its importance. Finally, a novel hyperbolic visualization scheme is incorporated to visualize large-scale news topics according to their associations and interestingness. With a better global overview of large-scale news video collections, users can specify their queries more precisely and explore large-scale news video collections interactively. Our experiments on large-scale news video collections have provided very positive results. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:538 / 553
页数:16
相关论文
共 50 条
  • [1] Flexible Online Multi-modal Hashing for Large-scale Multimedia Retrieval
    Lu, Xu
    Zhu, Lei
    Cheng, Zhiyong
    Li, Jingjing
    Nie, Xiushan
    Zhang, Huaxiang
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1129 - 1137
  • [2] Fast Discrete Collaborative Multi-Modal Hashing for Large-Scale Multimedia Retrieval
    Zheng, Chaoqun
    Zhu, Lei
    Lu, Xu
    Li, Jingjing
    Cheng, Zhiyong
    Zhang, Hanwang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (11) : 2171 - 2184
  • [3] Retrieval From and Understanding of Large-Scale Multi-modal Medical Datasets: A Review
    Mueller, Henning
    Unay, Devrim
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (09) : 2093 - 2104
  • [4] Towards Good Practices for Multi-modal Fusion in Large-Scale Video Classification
    Liu, Jinlai
    Yuan, Zehuan
    Wang, Changhu
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 287 - 296
  • [5] Efficient Large-Scale Multi-Modal Classification
    Kiela, Douwe
    Grave, Edouard
    Joulin, Armand
    Mikolov, Tomas
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5198 - 5204
  • [6] A Hierarchical Framwork with Improved Loss for Large-scale Multi-modal Video Identification
    Zhang, Shichuan
    Tang, Zengming
    Pan, Hao
    Wei, Xinyu
    Huang, Jun
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2539 - 2542
  • [7] Analyzing large-scale news video databases to support knowledge visualization and intuitive retrieval
    Luo, Hangzai
    Fan, Jianping
    Yang, Jing
    Ribarsky, William
    Satoh, Shinichi
    [J]. VAST: IEEE SYMPOSIUM ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY 2007, PROCEEDINGS, 2007, : 107 - +
  • [8] Face Retrieval in Large-Scale News Video Datasets
    Thanh Duc Ngo
    Hung Thanh Vu
    Duy-Dinh Le
    Satoh, Shin'ichi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (08): : 1811 - 1825
  • [9] Large-Scale Multi-modal Distance Metric Learning with Application to Content-Based Information Retrieval and Image Classification
    Rasheed, Ali Salim
    Zabihzadeh, Davood
    Al-Obaidi, Sumia Abdulhussien Razooqi
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (13)
  • [10] Multi-Modal Learning: Study on A Large-Scale Micro-Video Data Collection
    Chen, Jingyuan
    [J]. MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 1454 - 1458