news video story segmentation silence clip shot detection audio-visual fusion

Cited by: 3

Authors:

Song, Yu [1]

Wang, Wenhong [1]

Guo, Fengjuan [2]

Affiliations:

[1] North China Elect Power Univ, Dept Comp, Baoding 071003, Peoples R China

[2] North China Elect Power Univ, Sci & Technol Coll, Baoding 071003, Peoples R China

Source:

ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION | 2009

Keywords:

news video; story segmentation; silence clip; shot detection; audio-visual fusion; VIDEO;

DOI:

10.1109/ICCSE.2009.5228544

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Subject classification code:

081202

Abstract:

This paper presents a method for news video story segmentation that fuses multiple audio and visual features. First, anchorperson shots are detected to determine the beginning of each news story, and topic captions are then detected between anchorperson shots. Next, silence clips in the news video are detected using short-time energy and the short-time average zero-crossing rate, and the voice features of the anchorperson are analyzed. Finally, the method fuses these features (anchorperson shots, topic captions, silence clips, and voice features) to segment news stories. Experimental results show that the approach is valid and avoids the deficiencies of detecting news stories with a single feature.
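The abstract names short-time energy and short-time average zero-crossing rate as the parameters used for silence detection but gives no frame size, thresholds, or decision rule. The following is a minimal sketch of how such a detector might look, not the authors' implementation. The function names, the non-overlapping 160-sample frames, the threshold values, the rule "a frame is silent when both measures fall below their thresholds", and the minimum run length are all assumptions made for illustration.

```python
import math

def frame_features(samples, frame_len):
    """Yield (short-time energy, zero-crossing rate) per non-overlapping frame."""
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        # Mean squared amplitude of the frame.
        energy = sum(x * x for x in frame) / frame_len
        # Fraction of adjacent sample pairs whose signs differ.
        crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
        yield energy, crossings / (frame_len - 1)

def silence_clips(samples, frame_len=160, energy_thresh=1e-4,
                  zcr_thresh=0.1, min_frames=3):
    """Return (start_frame, end_frame) pairs, end exclusive, for silent runs.

    All default values are illustrative assumptions, not values from the paper.
    """
    clips, run_start, index = [], None, -1
    for index, (energy, zcr) in enumerate(frame_features(samples, frame_len)):
        silent = energy < energy_thresh and zcr < zcr_thresh
        if silent and run_start is None:
            run_start = index                      # a silent run begins
        elif not silent and run_start is not None:
            if index - run_start >= min_frames:    # keep only long enough runs
                clips.append((run_start, index))
            run_start = None
    n_frames = index + 1
    if run_start is not None and n_frames - run_start >= min_frames:
        clips.append((run_start, n_frames))        # run that reaches the end
    return clips

# Usage: 0.1 s of a 440 Hz tone, 0.1 s of digital silence, 0.1 s of tone at 8 kHz.
rate = 8000
tone = [0.5 * math.sin(2 * math.pi * 440 * n / rate) for n in range(800)]
signal = tone + [0.0] * 800 + tone
print(silence_clips(signal))  # [(5, 10)]
```

In a fusion scheme like the one the abstract describes, the frame ranges returned here would be converted to timestamps and compared against anchorperson shot and caption boundaries; how the paper combines them is not stated in this record.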


Pages: 1065 / +

Number of pages: 2
