news video story segmentation silence clip shot detection audio-visual fusion

被引:3
|
作者
Song, Yu [1 ]
Wang, Wenhong [1 ]
Guo, Fengjuan [2 ]
机构
[1] North China Elect Power Univ, Dept Comp, Baoding 071003, Peoples R China
[2] North China Elect Power Univ, Sci & Technol Coll, Baoding 071003, Peoples R China
关键词
news video; story segmentation; silence clip; shot detection; audio-visual fusion; VIDEO;
D O I
10.1109/ICCSE.2009.5228544
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
this paper presents a method for news video story segmentation, which fuses multi-feature including audio and visual. At first, this paper detects the anchorperson shot for news video and determines the beginning of news story, and then detects topic caption between anchorperson shots. In the next step, silence clips in news video are detected using short-time energy and short-time average zero-crossing rate parameters, and then voice features of anchorperson is analyzed. At last, this method fuses multi-feature such as anchorperson shot, topic caption, silence and voice feature to segment news stories. Experimental results show that the approach is valid and avoid the deficiency of detecting news story by a single feature.
引用
收藏
页码:1065 / +
页数:2
相关论文
共 50 条
  • [1] Integrating audio-visual features and text information for story segmentation of news video
    Liu, Hua-Yong
    Zhou, Dong-Ru
    Wuhan University Journal of Natural Sciences, 2003, 8 (04) : 1070 - 1074
  • [3] Automatic story segmentation of news video based on audio-visual features and text information
    Wang, C
    Wang, Y
    Liu, HY
    He, YX
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 3008 - 3011
  • [4] News video story segmentation method using fusion of audio-visual features - art. no. 67904G
    Wen, Jun
    Wu, Ling-da
    Zeng, Pu
    Luan, Xi-dao
    Xie, Yu-xiang
    REMOTE SENSING AND GIS DATA PROCESSING AND APPLICATIONS; AND INNOVATIVE MULTISPECTRAL TECHNOLOGY AND APPLICATIONS, PTS 1 AND 2, 2007, 6790 : G7904 - G7904
  • [5] Audio, video and audio-visual signatures for short video clip detection:: Experiments on Trecvid2003
    Senechal, B
    Pellerin, D
    Besacier, L
    Simand, I
    Brès, S
    2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 221 - 224
  • [6] AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
    Oorloff, Trevine
    Koppisetti, Surya
    Bonettini, Nicole
    Solanki, Divyaraj
    Ben Colman
    Yacoob, Yaser
    Shahriyari, Ali
    Bharaj, Gaurav
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 27092 - 27102
  • [7] Bootstrapping Audio-Visual Video Segmentation by Strengthening Audio Cues
    Chen, Tianxiang
    Tan, Zhentao
    Gong, Tao
    Chu, Qi
    Wu, Yue
    Liu, Bin
    Yu, Nenghai
    Lu, Le
    Ye, Jieping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2398 - 2409
  • [8] Audio-visual speaker recognition for video broadcast news
    Maison, B
    Neti, C
    Senior, A
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2001, 29 (1-2): : 71 - 79
  • [9] Audio-Visual Speaker Recognition for Video Broadcast News
    Benoît Maison
    Chalapathy Neti
    Andrew Senior
    Journal of VLSI signal processing systems for signal, image and video technology, 2001, 29 : 71 - 79
  • [10] Audio-Visual Segmentation
    Zhou, Jinxing
    Wang, Jianyuan
    Zhang, Jiayi
    Sun, Weixuan
    Zhang, Jing
    Birchfield, Stan
    Guo, Dan
    Kong, Lingpeng
    Wang, Meng
    Zhong, Yiran
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 386 - 403