news video story segmentation silence clip shot detection audio-visual fusion

被引：3

作者：

Song, Yu ^{[1
]}

Wang, Wenhong ^{[1
]}

Guo, Fengjuan ^{[2
]}

机构：

[1] North China Elect Power Univ, Dept Comp, Baoding 071003, Peoples R China

[2] North China Elect Power Univ, Sci & Technol Coll, Baoding 071003, Peoples R China

来源：

ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION | 2009年

关键词：

news video; story segmentation; silence clip; shot detection; audio-visual fusion; VIDEO;

D O I：

10.1109/ICCSE.2009.5228544

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

this paper presents a method for news video story segmentation, which fuses multi-feature including audio and visual. At first, this paper detects the anchorperson shot for news video and determines the beginning of news story, and then detects topic caption between anchorperson shots. In the next step, silence clips in news video are detected using short-time energy and short-time average zero-crossing rate parameters, and then voice features of anchorperson is analyzed. At last, this method fuses multi-feature such as anchorperson shot, topic caption, silence and voice feature to segment news stories. Experimental results show that the approach is valid and avoid the deficiency of detecting news story by a single feature.

引用

页码：1065 / +

页数：2

共 50 条

[41] News Video Story Segmentation Based on Topic Caption Text and Audio Information
Zhao Yaqin
Zhou Xianzhong
Chen Huiming
PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL IV, 2009, : 482 - +
[42] End-to-End Bloody Video Recognition by Audio-Visual Feature Fusion
Hou, Congcong
Wu, Xiaoyu
Wang, Ge
PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 501 - 510
[43] Audio-visual event detection based on mining of semantic audio-visual labels
Goh, KS
Miyahara, K
Radhakrishan, R
Xiong, ZY
Divakaran, A
STORAGE AND RETRIEVAL METHODS AND APPLICATIONS FOR MULTIMEDIA 2004, 2004, 5307 : 292 - 299
[44] Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Feng, Chao
Chen, Ziyang
Owens, Andrew
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10491 - 10503
[45] Biometric person authentication with liveness detection based on audio-visual fusion
Chetty, Girija
Wagner, Michael
INTERNATIONAL JOURNAL OF BIOMETRICS, 2009, 1 (04) : 463 - 478
[46] A Novel Audio-Visual Information Fusion System for Mental Disorders Detection
Li, Yichun
Li, Shuanglin
Naqvi, Syed Mohsen
2024 27TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, FUSION 2024, 2024,
[47] Enhance audio-visual segmentation with hierarchical encoder and audio guidance
Guo, Cunhan
Huang, Heyan
Zhou, Yanghao
NEUROCOMPUTING, 2024, 594
[48] Integrating Audio-Visual Contexts with Refinement for Segmentation
Geng, Qingwei
Gu, Xiaodong
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT III, 2024, 15018 : 31 - 44
[49] Audio-visual segmentation and "the cocktail party effect"
Darrell, T
Fisher, JW
Viola, P
Freeman, W
ADVANCES IN MULTIMODAL INTERFACES - ICMI 2000, PROCEEDINGS, 2000, 1948 : 32 - 40
[50] Weakly-Supervised Audio-Visual Segmentation
Mo, Shentong
Raj, Bhiksha
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,

← 1 2 3 4 5 →