FACETED TOPIC RETRIEVAL OF NEWS VIDEO USING JOINT TOPIC MODELING OF VISUAL FEATURES AND SPEECH TRANSCRIPTS

Authors
Wan, Kong-Wah [1]
Tan, Ah-Hwee [2]
Lim, Joo-Hwee [1]
Chia, Liang-Tien [2]
Affiliations
[1] Inst Infocomm Res, 1 Fusionopolis Way, Singapore 138632, Singapore
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
Keywords
Faceted Topic Retrieval; Multimedia Topic Modeling; Latent Dirichlet Allocation
DOI
10.1109/ICME.2010.5583061
Chinese Library Classification
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Because user queries are inherently ambiguous, an important task for modern retrieval systems is faceted topic retrieval (FTR): returning diverse or novel information that covers the wide range of topics, or facets, of the query need. We introduce a generative model for hypothesizing facets in the news video domain by combining the complementary information in visual keyframes and speech transcripts. We evaluate the multimodal model on the standard TRECVID-2005 video corpus annotated with facets. We find that (1) jointly modeling the visual and text (speech transcript) information can achieve a significant F-score improvement over a text-only system, and (2) our model compares favorably with standard diverse ranking algorithms such as Maximal Marginal Relevance (MMR) [1]. Our FTR model has been implemented in a news search prototype that is undergoing commercial trial.
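For readers unfamiliar with the MMR baseline named in the abstract, the following is a minimal sketch of greedy Maximal Marginal Relevance re-ranking. It is not the authors' model or their experimental setup: the function names `cosine` and `mmr_rank`, the toy two-dimensional vectors, and the trade-off parameter `lam` are all assumptions chosen for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def mmr_rank(query_vec, doc_vecs, k, lam=0.5):
    """Greedy MMR: return the indices of up to k documents, in selection order.

    lam (hypothetical name) trades off relevance to the query (lam = 1.0)
    against dissimilarity to already-selected documents (lam = 0.0).
    """
    selected = []
    candidates = list(range(len(doc_vecs)))
    while candidates and len(selected) < k:

        def score(i):
            relevance = cosine(query_vec, doc_vecs[i])
            # Redundancy is the highest similarity to any document already chosen.
            redundancy = max(
                (cosine(doc_vecs[i], doc_vecs[j]) for j in selected),
                default=0.0,
            )
            return lam * relevance - (1.0 - lam) * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


# Toy data (assumed): documents 0 and 1 are near-duplicates, document 2 is a different facet.
query = [1.0, 0.0]
docs = [[1.0, 0.0], [0.99, 0.01], [0.6, 0.8]]
relevance_only = mmr_rank(query, docs, k=3, lam=1.0)
diversified = mmr_rank(query, docs, k=3, lam=0.3)
print(relevance_only, diversified)
```

With `lam=1.0` the ranking reduces to plain relevance ordering, while a smaller `lam` promotes the document that covers a different facet ahead of the near-duplicate.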
Pages: 843-848
Page count: 6