A Robust Passage Retrieval Algorithm for Video Question Answering

Cited by: 13
Authors
Wu, Yu-Chieh [1]
Yang, Jie-Chi [2]
Affiliations
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Jhongli 32001, Taiwan
[2] Natl Cent Univ, Grad Inst Network Learning Technol, Jhongli 32001, Taiwan
Keywords
Multimedia retrieval; question answering (Q/A); video question answering (videoQ/A)
DOI
10.1109/TCSVT.2008.2002831
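Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809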
Abstract
In this paper, we present a robust passage retrieval algorithm that extends conventional text question answering (Q/A) to videos. Users interact with our videoQ/A system through natural language queries, and the top-ranked passage fragments, with their associated video clips, are returned as answers. We compare our method with five high-performance ranking algorithms that are portable to different languages and domains. The experiments were conducted on 75.3 h of Chinese videos and 253 questions. The results show that our method outperformed the second-best retrieval model (language models) by a relative 1.43% in mean reciprocal rank (MRR) score, and by 11.36% when employing a Chinese word segmentation tool. When the initial retrieval results from the retrieval models are adopted, our method yields an improvement of at least 5.94% in MRR score. This makes it very attractive for Asian languages, since a well-developed word tokenizer is unnecessary.
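For readers unfamiliar with the metric, the sketch below shows how a mean reciprocal rank and a relative improvement are commonly computed. It is an illustration only: the function names and the rank lists are hypothetical and not taken from the paper, and the abstract does not state details such as a rank cutoff, so none is applied here.

```python
from typing import Optional, Sequence


def mean_reciprocal_rank(first_correct_ranks: Sequence[Optional[int]]) -> float:
    """Average of 1/rank over all questions.

    Each entry is the 1-based rank of the first correct passage for one
    question, or None when no correct passage was returned (scores 0).
    """
    if not first_correct_ranks:
        raise ValueError("need at least one question")
    total = sum(1.0 / rank for rank in first_correct_ranks if rank is not None)
    return total / len(first_correct_ranks)


def relative_improvement(new_score: float, baseline_score: float) -> float:
    """Relative gain of new_score over baseline_score, as a percentage."""
    return (new_score - baseline_score) / baseline_score * 100.0


# Hypothetical ranks for four questions (illustrative, not data from the paper).
baseline_ranks = [1, 2, None, 4]
reranked_ranks = [1, 1, None, 2]

baseline_mrr = mean_reciprocal_rank(baseline_ranks)  # (1 + 0.5 + 0 + 0.25) / 4
reranked_mrr = mean_reciprocal_rank(reranked_ranks)  # (1 + 1 + 0 + 0.5) / 4
gain = relative_improvement(reranked_mrr, baseline_mrr)

print(f"baseline MRR = {baseline_mrr:.4f}")
print(f"reranked MRR = {reranked_mrr:.4f}")
print(f"relative improvement = {gain:.2f}%")
```

Note that a relative improvement is measured against the baseline score itself, so a small absolute change in MRR can correspond to a larger relative percentage.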
Pages: 1411-1421
Page count: 11
Related papers
50 in total
  • [31] A Multi-lingual Approach to Improve Passage Retrieval for Automatic Question Answering
    Othman, Nouha
    Faiz, Rim
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2016, 2016, 9612 : 127 - 139
  • [32] A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering
    Sarrouti, Mourad
    Ouatik El Alaoui, Said
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 68 : 96 - 103
  • [33] Structured retrieval for question answering
    Bilotti, Matthew W.
    Ogilvie, Paul
    Callan, Jamie
    Nyberg, Eric
    [J]. Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07, 2007 : 351 - 358
  • [34] Maintaining Passage Retrieval Information Need Using Analogical Reasoning in a Question Answering Task
    Toba, Hapnes
    Adriani, Mirna
    Manurung, Ruli
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, 2011, 7097 : 489 - 498
  • [35] Cross-Modal Dense Passage Retrieval for Outside Knowledge Visual Question Answering
    Reichman, Benjamin
    Heck, Larry
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023 : 2829 - 2834
  • [36] Affective question answering on video
    Ruwa, Nelson
    Mao, Qirong
    Wang, Liangjun
    Gou, Jianping
    [J]. NEUROCOMPUTING, 2019, 363 : 125 - 139
  • [37] Question Answering Passage Retrieval and Re-ranking Using N-grams and SVM
    Othman, Nouha
    Faiz, Rim
    [J]. COMPUTACION Y SISTEMAS, 2016, 20 (03) : 483 - 494
  • [38] RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
    Qu, Yingqi
    Ding, Yuchen
    Liu, Jing
    Liu, Kai
    Ren, Ruiyang
    Zhao, Wayne Xin
    Dong, Daxiang
    Wu, Hua
    Wang, Haifeng
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021 : 5835 - 5847
  • [39] Video Graph Transformer for Video Question Answering
    Xiao, Junbin
    Zhou, Pan
    Chua, Tat-Seng
    Yan, Shuicheng
    [J]. COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 39 - 58
  • [40] A Markov Network Based Passage Retrieval Method for Multimodal Question Answering in the Cultural Heritage Domain
    Sheng, Shurong
    Venkitasubramanian, Aparna Nurani
    Moens, Marie-Francine
    [J]. MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 3 - 15