news video story segmentation silence clip shot detection audio-visual fusion

Cited by: 3
Authors
Song, Yu [1]
Wang, Wenhong [1]
Guo, Fengjuan [2]
Affiliations
[1] North China Elect Power Univ, Dept Comp, Baoding 071003, Peoples R China
[2] North China Elect Power Univ, Sci & Technol Coll, Baoding 071003, Peoples R China
Keywords
news video; story segmentation; silence clip; shot detection; audio-visual fusion; VIDEO;
DOI
10.1109/ICCSE.2009.5228544
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
This paper presents a method for news video story segmentation that fuses multiple audio and visual features. First, anchorperson shots are detected to determine where each news story begins, and topic captions are then detected between anchorperson shots. Next, silence clips in the news video are detected using short-time energy and short-time average zero-crossing rate, and the voice features of the anchorperson are analyzed. Finally, the method fuses these features (anchorperson shot, topic caption, silence, and voice features) to segment news stories. Experimental results show that the approach is valid and avoids the deficiencies of detecting news stories with a single feature.
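The silence-detection step mentioned in the abstract can be illustrated with a minimal sketch. The record gives no frame sizes, thresholds, or decision rule, so everything below is an assumption made for illustration: the function names, the default parameters, and the rule that a frame counts as silent only when both its short-time energy and its zero-crossing rate fall below fixed thresholds. It is not the authors' implementation.

```python
def frame_signal(samples, frame_len, hop):
    """Split a sequence of samples into overlapping fixed-length frames."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def detect_silence(samples, frame_len=400, hop=200,
                   energy_thresh=1e-3, zcr_thresh=0.1, min_frames=3):
    """Return (first_frame, last_frame) index pairs for silent runs.

    Assumed rule (hypothetical): a frame is silent when its energy AND
    its zero-crossing rate are both below their thresholds; runs shorter
    than min_frames are discarded.
    """
    frames = frame_signal(samples, frame_len, hop)
    silent = [short_time_energy(f) < energy_thresh
              and zero_crossing_rate(f) < zcr_thresh
              for f in frames]
    clips, start = [], None
    # A trailing False closes any run that reaches the end of the signal.
    for i, is_silent in enumerate(silent + [False]):
        if is_silent and start is None:
            start = i
        elif not is_silent and start is not None:
            if i - start >= min_frames:
                clips.append((start, i - 1))
            start = None
    return clips
```

In a real system the thresholds would likely be tuned to the recording conditions, and the detected silent runs would be one input among several (anchorperson shots, topic captions, voice features) to the story-boundary decision.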
Pages: 1065 / +
Page count: 2
Related papers
50 in total
  • [41] News Video Story Segmentation Based on Topic Caption Text and Audio Information
    Zhao Yaqin
    Zhou Xianzhong
    Chen Huiming
    PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL IV, 2009, : 482 - +
  • [42] End-to-End Bloody Video Recognition by Audio-Visual Feature Fusion
    Hou, Congcong
    Wu, Xiaoyu
    Wang, Ge
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 501 - 510
  • [43] Audio-visual event detection based on mining of semantic audio-visual labels
    Goh, KS
    Miyahara, K
    Radhakrishan, R
    Xiong, ZY
    Divakaran, A
    STORAGE AND RETRIEVAL METHODS AND APPLICATIONS FOR MULTIMEDIA 2004, 2004, 5307 : 292 - 299
  • [44] Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
    Feng, Chao
    Chen, Ziyang
    Owens, Andrew
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10491 - 10503
  • [45] Biometric person authentication with liveness detection based on audio-visual fusion
    Chetty, Girija
    Wagner, Michael
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2009, 1 (04) : 463 - 478
  • [46] A Novel Audio-Visual Information Fusion System for Mental Disorders Detection
    Li, Yichun
    Li, Shuanglin
    Naqvi, Syed Mohsen
    2024 27TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, FUSION 2024, 2024,
  • [47] Enhance audio-visual segmentation with hierarchical encoder and audio guidance
    Guo, Cunhan
    Huang, Heyan
    Zhou, Yanghao
    NEUROCOMPUTING, 2024, 594
  • [48] Integrating Audio-Visual Contexts with Refinement for Segmentation
    Geng, Qingwei
    Gu, Xiaodong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT III, 2024, 15018 : 31 - 44
  • [49] Audio-visual segmentation and "the cocktail party effect"
    Darrell, T
    Fisher, JW
    Viola, P
    Freeman, W
    ADVANCES IN MULTIMODAL INTERFACES - ICMI 2000, PROCEEDINGS, 2000, 1948 : 32 - 40
  • [50] Weakly-Supervised Audio-Visual Segmentation
    Mo, Shentong
    Raj, Bhiksha
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,