Integrating audio-visual features and text information for story segmentation of news video

被引:0
|
作者
Liu, Hua-Yong [1 ]
Zhou, Dong-Ru [1 ]
机构
[1] Sch. of Comp., Wuhan Univ., Wuhan 430072, China
关键词
School of Computer; Wuhan University; Wuhan; 430072; Hubei; China Abstract: Video data are composed of multimodal information streams including visual; auditory and textual streams; so an approach of story segmentation for news video using multimodal analysis is described in this paper. The proposed approach detects the topic-caption frames; and integrates them with silence clips detection results; as well as shot segmentation results to locate the news story boundaries. The integration of audio-visual features and text information overcomes the weakness of the approach using only image analysis techniques. On test data with 135 400 frames; when the boundaries between news stories are detected; the accuracy rate 85.8~ and the recall rate 97.5~ are obtained. The experimental results show the approach is valid and robust. Key words: news video; story segmentation; audio-visual features analysis; text detection CLC number: TP 311. 5 Received date: 2002-12-23 Foundation item: Supported by the Nanonal Natural Science Foundation of China (60173045) Biogi~phg: Liu Hua-yong (1978-); male; Ph.D; can&date; research direetton:vldeo retneval and speech ~ignal processing. E-mad: hyhut9 _en@ sina. corn 1 To whom correspondence should be addressed;
D O I
10.1007/bf02903674
中图分类号
学科分类号
摘要
8
引用
收藏
页码:1070 / 1074
相关论文
共 50 条
  • [2] Automatic story segmentation of news video based on audio-visual features and text information
    Wang, C
    Wang, Y
    Liu, HY
    He, YX
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 3008 - 3011
  • [3] News Video Story Segmentation Based on Topic Caption Text and Audio Information
    Zhao Yaqin
    Zhou Xianzhong
    Chen Huiming
    PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL IV, 2009, : 482 - +
  • [4] news video story segmentation silence clip shot detection audio-visual fusion
    Song, Yu
    Wang, Wenhong
    Guo, Fengjuan
    ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 1065 - +
  • [5] Integrating visual, audio and text analysis for news video
    Qi, W
    Gu, L
    Jiang, H
    Chen, XR
    Zhang, HJ
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 520 - 523
  • [6] Combining text and audio-visual features in video indexing
    Chang, SF
    Manmatha, R
    Chua, TS
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1005 - 1008
  • [7] News video story segmentation method using fusion of audio-visual features - art. no. 67904G
    Wen, Jun
    Wu, Ling-da
    Zeng, Pu
    Luan, Xi-dao
    Xie, Yu-xiang
    REMOTE SENSING AND GIS DATA PROCESSING AND APPLICATIONS; AND INNOVATIVE MULTISPECTRAL TECHNOLOGY AND APPLICATIONS, PTS 1 AND 2, 2007, 6790 : G7904 - G7904
  • [8] Integrating Audio-Visual Contexts with Refinement for Segmentation
    Geng, Qingwei
    Gu, Xiaodong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT III, 2024, 15018 : 31 - 44
  • [9] Content-based TV sports video retrieval based on audio-visual features and text information
    Liu, HY
    IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 481 - 484
  • [10] Bootstrapping Audio-Visual Video Segmentation by Strengthening Audio Cues
    Chen, Tianxiang
    Tan, Zhentao
    Gong, Tao
    Chu, Qi
    Wu, Yue
    Liu, Bin
    Yu, Nenghai
    Lu, Le
    Ye, Jieping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2398 - 2409