n-gram Models for Video Semantic Indexing

被引:1
|
作者
Inoue, Nakamasa [1 ]
Shinoda, Koichi [1 ]
机构
[1] Tokyo Inst Technol, Tokyo, Japan
关键词
Semantic Indexing; Video Search; Gaussian Mixture Models; n-gram Models;
D O I
10.1145/2647868.2654961
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We propose n-gram modeling of shot sequences for video semantic indexing, in which semantic concepts are extracted from a video shot. Most previous studies for this task have assumed that video shots in a video clip are independent from each other. We model the time-dependency between them assuming that n-consecutive video shots are dependent. Our models improve the robustness against occlusion and camera-angle changes by effectively using information from the previous video shots. In our experiments on the TRECVID 2012 Semantic Indexing Benchmark, we applied the proposed models to a system using Gaussian mixture models and support vector machines. Mean average precision was improved from 30.62% to 32.14%, which is the best performance on the TRECVID 2012 Semantic Indexing to the best of our knowledge.
引用
收藏
页码:777 / 780
页数:4
相关论文
共 50 条
  • [1] A study on N-gram indexing of musical features
    Yip, CL
    Kao, B
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 869 - 872
  • [2] Semantic N-Gram Topic Modeling
    Kherwa, Pooja
    Bansal, Poonam
    [J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2020, 7 (26): : 1 - 12
  • [3] N-Gram FST Indexing for Spoken Term Detection
    Liu, Chao
    Wang, Dong
    Tejedor, Javier
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2091 - 2094
  • [4] Tokenization and N-gram for Indexing Indonesian Translation of the Quran
    Putra, Syopiansyah Jaya
    Gunawan, Muhamad Nur
    Suryatno, Agung
    [J]. 2018 6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2018, : 158 - 161
  • [5] On compressing n-gram language models
    Hirsimaki, Teemu
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 949 - 952
  • [6] STATISTICAL N-GRAM INDEXING OF NATURAL-LANGUAGE DOCUMENTS
    TEUFEL, B
    [J]. INTERNATIONAL FORUM ON INFORMATION AND DOCUMENTATION, 1988, 13 (04): : 3 - 10
  • [7] MIXTURE OF MIXTURE N-GRAM LANGUAGE MODELS
    Sak, Hasim
    Allauzen, Cyril
    Nakajima, Kaisuke
    Beaufays, Francoise
    [J]. 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 31 - 36
  • [8] Perplexity of n-Gram and Dependency Language Models
    Popel, Martin
    Marecek, David
    [J]. TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 173 - 180
  • [9] Searching Polyphonic Indonesian Folksongs Based on N-gram Indexing Technique
    Marsye, Aurora
    Adriani, Mirna
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839 : 387 - 396
  • [10] A Methodology to Identify Topic of Video via N-Gram Approach
    Pervaiz, Ramsha
    Aloufi, Khalid
    Zaidi, Syed Shabbar Raza
    Malik, Kaleem Razzaq
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (01): : 79 - 94