news video story segmentation silence clip shot detection audio-visual fusion

被引:3
|
作者
Song, Yu [1 ]
Wang, Wenhong [1 ]
Guo, Fengjuan [2 ]
机构
[1] North China Elect Power Univ, Dept Comp, Baoding 071003, Peoples R China
[2] North China Elect Power Univ, Sci & Technol Coll, Baoding 071003, Peoples R China
关键词
news video; story segmentation; silence clip; shot detection; audio-visual fusion; VIDEO;
D O I
10.1109/ICCSE.2009.5228544
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
this paper presents a method for news video story segmentation, which fuses multi-feature including audio and visual. At first, this paper detects the anchorperson shot for news video and determines the beginning of news story, and then detects topic caption between anchorperson shots. In the next step, silence clips in news video are detected using short-time energy and short-time average zero-crossing rate parameters, and then voice features of anchorperson is analyzed. At last, this method fuses multi-feature such as anchorperson shot, topic caption, silence and voice feature to segment news stories. Experimental results show that the approach is valid and avoid the deficiency of detecting news story by a single feature.
引用
收藏
页码:1065 / +
页数:2
相关论文
共 50 条
  • [31] Unsupervised Audio-Visual Lecture Segmentation
    Singh, S. Darshan
    Gupta, Anchit
    Jawahar, C. V.
    Tapaswi, Makarand
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5221 - 5230
  • [32] An unsupervised shot classification system for news video story detection
    De Santo, M
    Percannella, G
    Sansone, C
    Vento, M
    MULTIMEDIA DATABASES AND IMAGE COMMUNICATION, 2004, 17 : 93 - 104
  • [33] CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation
    Li, Kexin
    Yang, Zongxin
    Chen, Lei
    Yang, Yi
    Xiao, Jun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1485 - 1494
  • [34] Story segmentation in news video
    Feng, HM
    Zhai, XF
    Fan, JW
    Fang, Y
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 831 - 835
  • [35] News video story segmentation
    Fang, Yong
    Zhai, Xiaofei
    Fan, Jingwang
    12TH INTERNATIONAL MULTI-MEDIA MODELLING CONFERENCE PROCEEDINGS, 2006, : 397 - 400
  • [36] SEGMENTATION OF MUSIC VIDEO STREAMS IN MUSIC PIECES THROUGH AUDIO-VISUAL ANALYSIS
    Sargent, Gabriel
    Hanna, Pierre
    Nicolas, Henri
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [37] See hear now: is audio-visual QoE now just a fusion of audio and video metrics?
    Martinez, Helard B.
    Hines, Andrew
    Farias, Mylene C. Q.
    2022 14TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE, QOMEX, 2022,
  • [38] A unified scheme of shot boundary detection and anchor shot detection in news video story parsing
    Hansung Lee
    Jaehak Yu
    Younghee Im
    Joon-Min Gil
    Daihee Park
    Multimedia Tools and Applications, 2011, 51 : 1127 - 1145
  • [39] Fusion and combination in audio-visual integration
    Omata, Kei
    Mogi, Ken
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2008, 464 (2090): : 319 - 340
  • [40] A unified scheme of shot boundary detection and anchor shot detection in news video story parsing
    Lee, Hansung
    Yu, Jaehak
    Im, Younghee
    Gil, Joon-Min
    Park, Daihee
    MULTIMEDIA TOOLS AND APPLICATIONS, 2011, 51 (03) : 1127 - 1145