Integrating audio-visual features and text information for story segmentation of news video

被引：0

作者：

Liu, Hua-Yong ^{[1
]}

Zhou, Dong-Ru ^{[1
]}

机构：

[1] Sch. of Comp., Wuhan Univ., Wuhan 430072, China

来源：

Wuhan University Journal of Natural Sciences | 2003年 / 8卷 / 04期

关键词：

School of Computer; Wuhan University; Wuhan; 430072; Hubei; China Abstract: Video data are composed of multimodal information streams including visual; auditory and textual streams; so an approach of story segmentation for news video using multimodal analysis is described in this paper. The proposed approach detects the topic-caption frames; and integrates them with silence clips detection results; as well as shot segmentation results to locate the news story boundaries. The integration of audio-visual features and text information overcomes the weakness of the approach using only image analysis techniques. On test data with 135 400 frames; when the boundaries between news stories are detected; the accuracy rate 85.8~ and the recall rate 97.5~ are obtained. The experimental results show the approach is valid and robust. Key words: news video; story segmentation; audio-visual features analysis; text detection CLC number: TP 311. 5 Received date: 2002-12-23 Foundation item: Supported by the Nanonal Natural Science Foundation of China (60173045) Biogi~phg: Liu Hua-yong (1978-); male; Ph.D; can&date; research direetton:vldeo retneval and speech ~ignal processing. E-mad: hyhut9 _en@ sina. corn 1 To whom correspondence should be addressed;

D O I：

10.1007/bf02903674

中图分类号：

学科分类号：

摘要：

引用

页码：1070 / 1074

共 50 条

[1] Integrating Audio-Visual Features and Text Information for Story Segmentation of News Video
Liu Hua-yong
WuhanUniversityJournalofNaturalSciences, 2003, (04) : 1070 - 1074
[2] Automatic story segmentation of news video based on audio-visual features and text information
Wang, C
Wang, Y
Liu, HY
He, YX
2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 3008 - 3011
[3] News Video Story Segmentation Based on Topic Caption Text and Audio Information
Zhao Yaqin
Zhou Xianzhong
Chen Huiming
PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL IV, 2009, : 482 - +
[4] news video story segmentation silence clip shot detection audio-visual fusion
Song, Yu
Wang, Wenhong
Guo, Fengjuan
ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 1065 - +
[5] Integrating visual, audio and text analysis for news video
Qi, W
Gu, L
Jiang, H
Chen, XR
Zhang, HJ
2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 520 - 523
[6] Combining text and audio-visual features in video indexing
Chang, SF
Manmatha, R
Chua, TS
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1005 - 1008
[7] News video story segmentation method using fusion of audio-visual features - art. no. 67904G
Wen, Jun
Wu, Ling-da
Zeng, Pu
Luan, Xi-dao
Xie, Yu-xiang
REMOTE SENSING AND GIS DATA PROCESSING AND APPLICATIONS; AND INNOVATIVE MULTISPECTRAL TECHNOLOGY AND APPLICATIONS, PTS 1 AND 2, 2007, 6790 : G7904 - G7904
[8] Integrating Audio-Visual Contexts with Refinement for Segmentation
Geng, Qingwei
Gu, Xiaodong
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT III, 2024, 15018 : 31 - 44
[9] Content-based TV sports video retrieval based on audio-visual features and text information
Liu, HY
IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 481 - 484
[10] Bootstrapping Audio-Visual Video Segmentation by Strengthening Audio Cues
Chen, Tianxiang
Tan, Zhentao
Gong, Tao
Chu, Qi
Wu, Yue
Liu, Bin
Yu, Nenghai
Lu, Le
Ye, Jieping
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2398 - 2409

← 1 2 3 4 5 →