A multi-modal approach to story segmentation for news video

被引：30

作者：

Chaisorn, L ^{[1
]}

Chua, TS ^{[1
]}

Lee, CH ^{[1
]}

机构：

[1] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore

来源：

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2003年 / 6卷 / 02期

关键词：

news story segmentation; shot classification; multi-modal approach; learning-based approach;

D O I：

10.1023/A:1023622605600

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This research proposes a two-level, multi-modal framework to perform the segmentation and classification of news video into single-story semantic units. The video is analyzed at the shot and story unit (or scene) levels using a variety of features and techniques. At the shot level, we employ Decision Trees technique to classify the shots into one of 13 predefined categories or mid-level features. At the scene/story level, we perform the HMM (Hidden Markov Models) analysis to locate story boundaries. Our initial results indicate that we could achieve a high accuracy of over 95% for shot classification, and over 89% in F-1 measure on scene/story boundary detection. Detailed analysis reveals that HMM is effective in identifying dominant features, which helps in locating story boundaries. Our eventual goal is to support the retrieval of news video at story unit level, together with associated texts retrieved from related news sites on the web.

引用

页码：187 / 208

页数：22

共 50 条

[21] Multi-modal person-profiles from broadcast news video
Dagli, Charlie K.
Rao, Sharad V.
Huang, Thomas S.
2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1559 - 1562
[22] Hardware accelerator design for video segmentation with multi-modal background modelling
Jiang, H
Ardö, H
Öwall, V
2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 1142 - 1145
[23] Multi-modal Video Summarization
Huang, Jia-Hong
ICMR 2024 - Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024, : 1214 - 1218
[24] Multi-modal Video Summarization
Huang, Jia-Hong
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1214 - 1218
[25] New Approach to Multi-Modal Multi-View Video Coding
Zhang Yun
Yu Mei
Jiang Gangyi
CHINESE JOURNAL OF ELECTRONICS, 2009, 18 (02): : 338 - 342
[26] The segmentation of news video into story units
Chaisorn, L
Chua, TS
Lee, CH
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : 73 - 76
[27] The segmentation of news video into story units
Liu, HY
Zhang, H
ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 870 - 875
[28] Multi-modal Action Segmentation in the Kitchen with a Feature Fusion Approach
Kogure, Shunsuke
Aoki, Yoshimitsu
FIFTEENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2021, 11794
[29] A DETECTION-BASED APPROACH TO BROADCAST NEWS VIDEO STORY SEGMENTATION
Ma, Chengyuan
Byun, Byungki
Kim, Ilseo
Lee, Chin-Hui
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1957 - 1960
[30] A hierarchical approach to story segmentation of large broadcast news video corpus
Chaisorn, L
Chua, TS
Lee, CH
Tian, Q
2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1095 - 1098

← 1 2 3 4 5 →