A multi-modal approach to story segmentation for news video

被引：30

作者：

Chaisorn, L ^{[1
]}

Chua, TS ^{[1
]}

Lee, CH ^{[1
]}

机构：

[1] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore

来源：

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2003年 / 6卷 / 02期

关键词：

news story segmentation; shot classification; multi-modal approach; learning-based approach;

D O I：

10.1023/A:1023622605600

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This research proposes a two-level, multi-modal framework to perform the segmentation and classification of news video into single-story semantic units. The video is analyzed at the shot and story unit (or scene) levels using a variety of features and techniques. At the shot level, we employ Decision Trees technique to classify the shots into one of 13 predefined categories or mid-level features. At the scene/story level, we perform the HMM (Hidden Markov Models) analysis to locate story boundaries. Our initial results indicate that we could achieve a high accuracy of over 95% for shot classification, and over 89% in F-1 measure on scene/story boundary detection. Detailed analysis reveals that HMM is effective in identifying dominant features, which helps in locating story boundaries. Our eventual goal is to support the retrieval of news video at story unit level, together with associated texts retrieved from related news sites on the web.

引用

页码：187 / 208

页数：22

共 50 条

[1] A Multi-Modal Approach to Story Segmentation for News Video
Lekha Chaisorn
Tat-Seng Chua
Chin-Hui Lee
World Wide Web, 2003, 6 : 187 - 208
[2] MULTI-MODAL INFORMATION FUSION FOR NEWS STORY SEGMENTATION IN BROADCAST VIDEO
Feng, Bailan
Ding, Peng
Chen, Jiansong
Bai, Jinfeng
Xu, Su
Xu, Bo
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 1417 - 1420
[3] News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003
Hsu, W
Kennedy, L
Huang, CW
Chang, SF
Lin, CY
Iyengar, G
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 645 - 648
[4] Generative, discriminative, and ensemble learning on multi-modal perceptual fusion toward news video story segmentation
Hsu, WHM
Chang, SF
2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1091 - 1094
[5] A hybrid approach to news video classification with multi-modal features
Wang, P
Cai, R
Yang, SQ
ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 787 - 791
[6] Discovery and fusion of salient multi-modal features towards news story segmentation
Hsu, W
Chang, SF
Huang, CW
Kennedy, L
Lin, CY
Iyengar, G
STORAGE AND RETRIEVAL METHODS AND APPLICATIONS FOR MULTIMEDIA 2004, 2004, 5307 : 244 - 258
[7] Multi-modal fusion for associated news story retrieval
Younessian, Ehsan
Rajan, Deepu
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (08) : 2563 - 2585
[8] Multi-modal fusion for associated news story retrieval
Ehsan Younessian
Deepu Rajan
Multimedia Tools and Applications, 2015, 74 : 2563 - 2585
[9] Multi-modal Solution for Unconstrained News Story Retrieval
Younessian, Ehsan
Rajan, Deepu
ADVANCES IN MULTIMEDIA MODELING, 2012, 7131 : 186 - 195
[10] Combining Multi-Modal Features for News Story Correlation Analysis
Chen Dan-wen
Deng Li-qiong
Yuan Zhi-min
Wu Ling-da
COMPUTATIONAL MATERIALS SCIENCE, PTS 1-3, 2011, 268-270 : 1040 - 1045

← 1 2 3 4 5 →