A multi-modal approach to story segmentation for news video

被引:30
|
作者
Chaisorn, L [1 ]
Chua, TS [1 ]
Lee, CH [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore
关键词
news story segmentation; shot classification; multi-modal approach; learning-based approach;
D O I
10.1023/A:1023622605600
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This research proposes a two-level, multi-modal framework to perform the segmentation and classification of news video into single-story semantic units. The video is analyzed at the shot and story unit (or scene) levels using a variety of features and techniques. At the shot level, we employ Decision Trees technique to classify the shots into one of 13 predefined categories or mid-level features. At the scene/story level, we perform the HMM (Hidden Markov Models) analysis to locate story boundaries. Our initial results indicate that we could achieve a high accuracy of over 95% for shot classification, and over 89% in F-1 measure on scene/story boundary detection. Detailed analysis reveals that HMM is effective in identifying dominant features, which helps in locating story boundaries. Our eventual goal is to support the retrieval of news video at story unit level, together with associated texts retrieved from related news sites on the web.
引用
收藏
页码:187 / 208
页数:22
相关论文
共 50 条
  • [21] Multi-modal person-profiles from broadcast news video
    Dagli, Charlie K.
    Rao, Sharad V.
    Huang, Thomas S.
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1559 - 1562
  • [22] Hardware accelerator design for video segmentation with multi-modal background modelling
    Jiang, H
    Ardö, H
    Öwall, V
    2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 1142 - 1145
  • [23] Multi-modal Video Summarization
    Huang, Jia-Hong
    ICMR 2024 - Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024, : 1214 - 1218
  • [24] Multi-modal Video Summarization
    Huang, Jia-Hong
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1214 - 1218
  • [25] New Approach to Multi-Modal Multi-View Video Coding
    Zhang Yun
    Yu Mei
    Jiang Gangyi
    CHINESE JOURNAL OF ELECTRONICS, 2009, 18 (02): : 338 - 342
  • [26] The segmentation of news video into story units
    Chaisorn, L
    Chua, TS
    Lee, CH
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : 73 - 76
  • [27] The segmentation of news video into story units
    Liu, HY
    Zhang, H
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 870 - 875
  • [28] Multi-modal Action Segmentation in the Kitchen with a Feature Fusion Approach
    Kogure, Shunsuke
    Aoki, Yoshimitsu
    FIFTEENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2021, 11794
  • [29] A DETECTION-BASED APPROACH TO BROADCAST NEWS VIDEO STORY SEGMENTATION
    Ma, Chengyuan
    Byun, Byungki
    Kim, Ilseo
    Lee, Chin-Hui
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1957 - 1960
  • [30] A hierarchical approach to story segmentation of large broadcast news video corpus
    Chaisorn, L
    Chua, TS
    Lee, CH
    Tian, Q
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1095 - 1098