A multi-modal approach to story segmentation for news video

被引:30
|
作者
Chaisorn, L [1 ]
Chua, TS [1 ]
Lee, CH [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore
关键词
news story segmentation; shot classification; multi-modal approach; learning-based approach;
D O I
10.1023/A:1023622605600
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This research proposes a two-level, multi-modal framework to perform the segmentation and classification of news video into single-story semantic units. The video is analyzed at the shot and story unit (or scene) levels using a variety of features and techniques. At the shot level, we employ Decision Trees technique to classify the shots into one of 13 predefined categories or mid-level features. At the scene/story level, we perform the HMM (Hidden Markov Models) analysis to locate story boundaries. Our initial results indicate that we could achieve a high accuracy of over 95% for shot classification, and over 89% in F-1 measure on scene/story boundary detection. Detailed analysis reveals that HMM is effective in identifying dominant features, which helps in locating story boundaries. Our eventual goal is to support the retrieval of news video at story unit level, together with associated texts retrieved from related news sites on the web.
引用
收藏
页码:187 / 208
页数:22
相关论文
共 50 条
  • [1] A Multi-Modal Approach to Story Segmentation for News Video
    Lekha Chaisorn
    Tat-Seng Chua
    Chin-Hui Lee
    World Wide Web, 2003, 6 : 187 - 208
  • [2] MULTI-MODAL INFORMATION FUSION FOR NEWS STORY SEGMENTATION IN BROADCAST VIDEO
    Feng, Bailan
    Ding, Peng
    Chen, Jiansong
    Bai, Jinfeng
    Xu, Su
    Xu, Bo
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 1417 - 1420
  • [3] News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003
    Hsu, W
    Kennedy, L
    Huang, CW
    Chang, SF
    Lin, CY
    Iyengar, G
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 645 - 648
  • [4] Generative, discriminative, and ensemble learning on multi-modal perceptual fusion toward news video story segmentation
    Hsu, WHM
    Chang, SF
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1091 - 1094
  • [5] A hybrid approach to news video classification with multi-modal features
    Wang, P
    Cai, R
    Yang, SQ
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 787 - 791
  • [6] Discovery and fusion of salient multi-modal features towards news story segmentation
    Hsu, W
    Chang, SF
    Huang, CW
    Kennedy, L
    Lin, CY
    Iyengar, G
    STORAGE AND RETRIEVAL METHODS AND APPLICATIONS FOR MULTIMEDIA 2004, 2004, 5307 : 244 - 258
  • [7] Multi-modal fusion for associated news story retrieval
    Younessian, Ehsan
    Rajan, Deepu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (08) : 2563 - 2585
  • [8] Multi-modal fusion for associated news story retrieval
    Ehsan Younessian
    Deepu Rajan
    Multimedia Tools and Applications, 2015, 74 : 2563 - 2585
  • [9] Multi-modal Solution for Unconstrained News Story Retrieval
    Younessian, Ehsan
    Rajan, Deepu
    ADVANCES IN MULTIMEDIA MODELING, 2012, 7131 : 186 - 195
  • [10] Combining Multi-Modal Features for News Story Correlation Analysis
    Chen Dan-wen
    Deng Li-qiong
    Yuan Zhi-min
    Wu Ling-da
    COMPUTATIONAL MATERIALS SCIENCE, PTS 1-3, 2011, 268-270 : 1040 - 1045