LSTM-based Deep Learning Models for Answer Ranking

被引:4
|
作者
Li, Zhenzhen [1 ]
Huang, Jiuming [1 ]
Zhou, Zhongcheng [1 ]
Zhang, Haoyu [1 ]
Chang, Shoufeng [2 ]
Huang, Zhijie [3 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha, Hunan, Peoples R China
[2] Beijing Satellite Nav Ctr, Beijing, Peoples R China
[3] Beijing Gaodi Informat Technol Co Ltd, Beijing, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
long short-term memory; learning to rank; Question Answering; hypernyms;
D O I
10.1109/DSC.2016.37
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The learning problem of ranking arises in many tasks, including the question answering, information retrieval, and movie recommendation. In these tasks, the ordering of the answers, documents or movies returned is a critical aspect of the system. Recently, deep learning approaches have gained a lot of attention from the research community and industry for their ability to automatically learn optimal feature representation for a given task. We aim to solve the answer ranking problem in practical question answering system with deep learning approaches. In this paper, we define a composite representation for questions and answers by combining convolutional neural network (CNN) with bidirectional long short-term memory (biLSTM) models, and learn a similarity function to relate them in a supervised way from the available training data. Considering the limited training data, we propose a hypernym strategy to get more general text pairs and test the effectiveness of different strategies. Experimental results on a public benchmark dataset from TREC demonstrate that our system outperforms previous work which requires syntactic features and some deep learning models.
引用
收藏
页码:90 / 97
页数:8
相关论文
共 50 条
  • [1] OneHotEncoding and LSTM-based deep learning models for protein secondary structure prediction
    Enireddy, Vamsidhar
    Karthikeyan, C.
    Babu, D. Vijendra
    SOFT COMPUTING, 2022, 26 (08) : 3825 - 3836
  • [2] OneHotEncoding and LSTM-based deep learning models for protein secondary structure prediction
    Vamsidhar Enireddy
    C. Karthikeyan
    D. Vijendra Babu
    Soft Computing, 2022, 26 : 3825 - 3836
  • [3] A novel approach to fake news classification using LSTM-based deep learning models
    Padalko, Halyna
    Chomko, Vasyl
    Chumachenko, Dmytro
    FRONTIERS IN BIG DATA, 2024, 6
  • [4] LSTM-Based Deep Learning Models for Long-Term Tourism Demand Forecasting
    Salamanis, Athanasios
    Xanthopoulou, Georgia
    Kehagias, Dionysios
    Tzovaras, Dimitrios
    ELECTRONICS, 2022, 11 (22)
  • [5] Short-Term Traffic Forecasting using LSTM-based Deep Learning Models
    Haputhanthri, Dilantha
    Wijayasiri, Adeesha
    MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON 2021) / 7TH INTERNATIONAL MULTIDISCIPLINARY ENGINEERING RESEARCH CONFERENCE, 2021, : 602 - 607
  • [6] LSTM-based deep learning for spatial–temporal software testing
    Lei Xiao
    Huaikou Miao
    Tingting Shi
    Yu Hong
    Distributed and Parallel Databases, 2020, 38 : 687 - 712
  • [7] Analysis and forecasting of financial time series using CNN and LSTM-based deep learning models
    Mehtab, Sidra
    Sen, Jaydip
    Dasgupta, Subhasis
    arXiv, 2020,
  • [8] Deep learning methods for LSTM-based personalized search: a comparative analysis
    Sara Abri
    Rayan Abri
    International Journal of Machine Learning and Cybernetics, 2025, 16 (4) : 2747 - 2759
  • [9] LSTM-based deep learning for spatial-temporal software testing
    Xiao, Lei
    Miao, Huaikou
    Shi, Tingting
    Hong, Yu
    DISTRIBUTED AND PARALLEL DATABASES, 2020, 38 (03) : 687 - 712
  • [10] LSTM-based Models for Earthquake Prediction
    Berhich, Asmae
    Belouadha, Fatima-Zahra
    Kabbaj, Mohammed Issam
    3RD INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEM & SECURITY (NISS'20), 2020,