Harnessing the Power of Metadata for Enhanced Question Retrieval in Community Question Answering

Cited by: 0
Authors
Ghasemi, Shima [1]
Shakery, Azadeh [1, 2]
Affiliations
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran 1439957131, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran 193955746, Iran
Source
IEEE Access | 2024 / Vol. 12
Keywords
Community question answering; metadata; question retrieval
DOI
10.1109/ACCESS.2024.3395449
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Community Question Answering (CQA) forums such as Yahoo! Answers and Stack Overflow have become popular. The main goal of a CQA forum is to provide the most suitable answer in the shortest possible time. Because these forums hold a rich archive of answered questions, similar question retrieval has received much attention as a way to answer new questions immediately after they are asked. One of the main challenges in this task is the lexical gap between questions, that is, the differences in the terminology that different users choose when asking about the same thing. Additional context about the questions can help overcome this gap, and metadata, the supplementary data associated with each question, is a rich source of such context. In this paper, we use metadata and two transformer-based techniques to improve the translation-based language model, a traditional technique for addressing the lexical gap in retrieval systems. The metadata used in this article are subject, category, and answer. First, to utilize category information, we build category-specific dictionaries to obtain more accurate translation probabilities; a BERT model predicts the categories of the questions. Second, to utilize answer information, we propose a question expansion technique: a transformer-based retrieval-augmented generation (RAG) model generates answers, and each new question is expanded with its generated answer. Finally, candidate questions are ranked according to their similarity to the expanded new question. Our proposed method achieves a MAP of 51.47, outperforming all state-of-the-art approaches in question retrieval.
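
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the standard textbook form of the translation-based language model (a candidate-question model that mixes exact-match and translation probabilities, smoothed with a collection model), and every name in it (trlm_score, rank, tables, lam, beta) is hypothetical. The predicted category and the generated answer are passed in as plain arguments; in the paper they come from a BERT classifier and a RAG model, which are not reproduced here.

```python
import math
from collections import Counter


def trlm_score(query, candidate, collection, trans, lam=0.2, beta=0.5):
    """Log-likelihood of the query tokens under a translation-based
    language model of one candidate question.

    trans maps (query_word, candidate_word) -> translation probability.
    lam and beta are illustrative smoothing weights, not values from the paper.
    """
    cand_counts = Counter(candidate)
    coll_counts = Counter(collection)
    cand_len = len(candidate)
    coll_len = len(collection)
    score = 0.0
    for w in query:
        # Exact-match (maximum likelihood) probability in the candidate.
        p_ml = cand_counts[w] / cand_len
        # Probability that some candidate word "translates" into w.
        p_tr = sum(
            trans.get((w, t), 0.0) * c / cand_len
            for t, c in cand_counts.items()
        )
        p_mix = (1 - beta) * p_ml + beta * p_tr
        # Collection-level smoothing.
        p_coll = coll_counts[w] / coll_len
        p = (1 - lam) * p_mix + lam * p_coll
        # Floor the probability so unseen words do not produce log(0).
        score += math.log(max(p, 1e-12))
    return score


def rank(query, category, answer_tokens, candidates, tables):
    """Rank candidate questions for a new question.

    category: predicted category of the new question (assumed given).
    answer_tokens: tokens of a generated answer used to expand the
        question (assumed given).
    tables: category -> translation table, standing in for the
        category-specific dictionaries.
    """
    expanded = query + answer_tokens
    collection = [tok for cand in candidates for tok in cand]
    trans = tables.get(category, {})
    scored = [
        (trlm_score(expanded, cand, collection, trans), i)
        for i, cand in enumerate(candidates)
    ]
    # Highest log-likelihood first; return candidate indices.
    return [i for _, i in sorted(scored, reverse=True)]


candidates = [
    ["repair", "notebook", "screen"],
    ["bake", "bread", "oven"],
]
tables = {
    "computers": {("fix", "repair"): 0.6, ("laptop", "notebook"): 0.7},
}
ranking = rank(["fix", "laptop"], "computers", ["screen"], candidates, tables)
print(ranking)
```

In this toy example the new question shares no words with either candidate, so the translation table for the predicted category is what links "fix" to "repair" and "laptop" to "notebook", and the expansion token "screen" adds an exact match with the first candidate.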
Pages: 65768-65779
Number of pages: 12