Integrating multi-modal content analysis and hyperbolic visualization for large-scale news video retrieval and exploration

被引：5

作者：

Luo, H. ^{[2
]}

Fan, J. ^{[1
]}

Satoh, S. ^{[3
]}

Yang, J. ^{[1
]}

Ribarsky, W. ^{[1
]}

机构：

[1] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA

[2] E China Normal Univ, Inst Software Engn, Shanghai 200062, Peoples R China

[3] Natl Inst Informat, Tokyo 1018430, Japan

来源：

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2008年 / 23卷 / 07期

关键词：

multi-modal content analysis; interestingness assignment; association determination; hyperbolic visualization;

D O I：

10.1016/j.image.2008.04.014

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we have developed a novel scheme to achieve more effective analysis, retrieval and exploration of large-scale news video collections by performing multi-modal video content analysis and synchronization. First, automatic keyword extraction is performed on news closed captions and audio channels to detect the most interesting news topics (i.e., keywords for news topic interpretation), and the associations among these news topics (i.e., contextual relationships among the news topics) are further determined according to their co-occurrence probabilities. Second, visual semantic items, such as human faces, text captions, video concepts. are extracted automatically by using our semantic video analysis techniques. The news topics are automatically synchronized with the most relevant visual semantic items. In addition, an interestingness weight is assigned for each news topic to characterize its importance. Finally, a novel hyperbolic visualization scheme is incorporated to visualize large-scale news topics according to their associations and interestingness. With a better global overview of large-scale news video collections, users can specify their queries more precisely and explore large-scale news video collections interactively. Our experiments on large-scale news video collections have provided very positive results. (C) 2008 Elsevier B.V. All rights reserved.

引用

页码：538 / 553

页数：16

共 50 条

[1] Flexible Online Multi-modal Hashing for Large-scale Multimedia Retrieval
Lu, Xu
Zhu, Lei
Cheng, Zhiyong
Li, Jingjing
Nie, Xiushan
Zhang, Huaxiang
[J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1129 - 1137
[2] Fast Discrete Collaborative Multi-Modal Hashing for Large-Scale Multimedia Retrieval
Zheng, Chaoqun
Zhu, Lei
Lu, Xu
Li, Jingjing
Cheng, Zhiyong
Zhang, Hanwang
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (11) : 2171 - 2184
[3] Retrieval From and Understanding of Large-Scale Multi-modal Medical Datasets: A Review
Mueller, Henning
Unay, Devrim
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (09) : 2093 - 2104
[4] Towards Good Practices for Multi-modal Fusion in Large-Scale Video Classification
Liu, Jinlai
Yuan, Zehuan
Wang, Changhu
[J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 287 - 296
[5] Efficient Large-Scale Multi-Modal Classification
Kiela, Douwe
Grave, Edouard
Joulin, Armand
Mikolov, Tomas
[J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5198 - 5204
[6] A Hierarchical Framwork with Improved Loss for Large-scale Multi-modal Video Identification
Zhang, Shichuan
Tang, Zengming
Pan, Hao
Wei, Xinyu
Huang, Jun
[J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2539 - 2542
[7] Analyzing large-scale news video databases to support knowledge visualization and intuitive retrieval
Luo, Hangzai
Fan, Jianping
Yang, Jing
Ribarsky, William
Satoh, Shinichi
[J]. VAST: IEEE SYMPOSIUM ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY 2007, PROCEEDINGS, 2007, : 107 - +
[8] Face Retrieval in Large-Scale News Video Datasets
Thanh Duc Ngo
Hung Thanh Vu
Duy-Dinh Le
Satoh, Shin'ichi
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (08): : 1811 - 1825
[9] Large-Scale Multi-modal Distance Metric Learning with Application to Content-Based Information Retrieval and Image Classification
Rasheed, Ali Salim
Zabihzadeh, Davood
Al-Obaidi, Sumia Abdulhussien Razooqi
[J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (13)
[10] Multi-Modal Learning: Study on A Large-Scale Micro-Video Data Collection
Chen, Jingyuan
[J]. MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 1454 - 1458

← 1 2 3 4 5 →