A Sequential Latent Topic-Based Readability Model for Domain-Specific Information Retrieval

被引:3
|
作者
Zhang, Wenya [1 ]
Song, Dawei [1 ,2 ]
Zhang, Peng [1 ]
Zhao, Xiaozhao [1 ]
Hou, Yuexian [1 ]
机构
[1] Tianjin Univ, Tianjin Key Lab Cognit Comp & Applicat, Tianjin 300072, Peoples R China
[2] Open Univ, Dept Comp, Milton Keynes, Bucks, England
关键词
Domain-specific retrieval; Readability; Documents reranking;
D O I
10.1007/978-3-319-28940-3_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In domain-specific information retrieval (IR), an emerging problem is how to provide different users with documents that are both relevant and readable, especially for the lay users. In this paper, we propose a novel document readability model to enhance the domain-specific IR. Our model incorporates the coverage and sequential dependency of latent topics in a document. Accordingly, two topical readability indicators, namely Topic Scope and Topic Trace are developed. These indicators, combined with the classical Surface-level indicator, can be used to rerank the initial list of documents returned by a conventional search engine. In order to extract the structured latent topics without supervision, the hierarchical Latent Dirichlet Allocation (hLDA) is used. We have evaluated our model from the user-oriented and system-oriented perspectives, in the medical domain. The user-oriented evaluation shows a good correlation between the readability scores given by our model and human judgments. Furthermore, our model also gains significant improvement in the system-oriented evaluation in comparison with one of the state-of-the-art readability methods.
引用
收藏
页码:241 / 252
页数:12
相关论文
共 50 条
  • [1] Domain-specific readability measures to improve information retrieval in the Persian language
    Arastoopoor, Sholeh
    [J]. ELECTRONIC LIBRARY, 2018, 36 (03): : 430 - 444
  • [2] Domain-specific information retrieval based on improved language model
    Kang, Kai
    Lin, Kunhui
    Zhou, Changle
    Guo, Feng
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, : 374 - +
  • [3] Analogy-based Matching Model for Domain-specific Information Retrieval
    Bounhas, Myriam
    Elayeb, Bilel
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 496 - 505
  • [4] Sentence retrieval with a topic-based language model
    National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China
    [J]. Jisuanji Yanjiu yu Fazhan, 2007, 2 (288-295):
  • [6] Toward a Semantic Granularity Model for Domain-Specific Information Retrieval
    Yan, Xin
    Lau, Raymond Y. K.
    Song, Dawei
    Li, Xue
    Ma, Jian
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2011, 29 (03)
  • [7] A Probabilistic Topic-Based Ranking Framework for Location-Sensitive Domain Information Retrieval
    Li, Huajing
    Li, Zhisheng
    Lee, Wang-Chien
    Lee, Dik Lun
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 331 - 338
  • [8] A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval
    Arun, K. S.
    Govindan, V. K.
    [J]. DATA SCIENCE AND ENGINEERING, 2018, 3 (02) : 166 - 195
  • [9] Domain-Specific Information Retrieval Using Recommenders
    Li, Wei
    [J]. PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1327 - 1327
  • [10] Patent Information Retrieval An Instance of Domain-specific Search
    Lupu, Mihai
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1189 - 1190