Semantically-enhanced information retrieval using multiple knowledge sources

被引:10
|
作者
Jiang, Yuncheng [1 ]
机构
[1] South China Normal Univ, Sch Comp Sci, Guangzhou 510631, Peoples R China
基金
中国国家自然科学基金;
关键词
Information retrieval; Keyword search; Semantic relatedness; Multiple knowledge sources; WORD SENSE DISAMBIGUATION; LINKED DATA; SEARCH; ONTOLOGY; WEB; SIMILARITY; WIKIPEDIA; CONSTRUCTION; POINT;
D O I
10.1007/s10586-020-03057-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Classical or traditional Information Retrieval (IR) approaches rely on the word-based representations of query and documents in the collection. The specification of the user information need is completely based on words figuring in the original query in order to retrieve documents containing those words. Such approaches have been limited due to the absence of relevant keywords as well as the term variation in documents and user's query. The purpose of this paper is to present a new method to Semantic Information Retrieval (SIR) to solve the limitations of existing approaches. Concretely, we propose a novel method SIRWWO (Semantic Information Retrieval using Wikipedia, WordNet, and domain Ontologies) for SIR by combining multiple knowledge sources Wikipedia, WordNet, and Description Logic (DL) ontologies. In order to illustrate the approach SIRWWO, we first present the notion of Labeled Dynamic Semantic Network (LDSN) by extending the notions of dynamic semantic network and extended semantic net based on WordNet (and DAML ontology library). According to the notion of LDSN, we obtain the notion of Weighted Dynamic Semantic Network (WDSN, intuitively, each edge in WDSN is assigned to a number in the [0, 1] interval) and give the WDSN construction method using Wikipedia, WordNet, and DL ontology. We then propose a novel metric to measure the semantic relatedness between concepts based on WDSN. Lastly, we investigate the approach SIRWWO by using semantic relatedness between users' query keywords and digital documents. The experimental results show that our proposals obtain comparable and better performance results than other traditional IR system Lucene.
引用
收藏
页码:2925 / 2944
页数:20
相关论文
共 50 条
  • [41] Predicting Users' Domain Knowledge in Information Retrieval Using Multiple Regression Analysis of Search Behaviors
    Zhang, Xiangmin
    Liu, Jingjing
    Cole, Michael
    Belkin, Nicholas
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2015, 66 (05) : 980 - 1000
  • [42] Intelligent mobile agents for information retrieval and knowledge discovery from distributed data and knowledge sources
    Yang, J
    Honavar, V
    Miller, L
    Wong, J
    1998 IEEE INFORMATION TECHNOLOGY CONFERENCE, PROCEEDINGS, 1998, : 99 - 102
  • [43] Exploiting semantic linkages among multiple sources for semantic information retrieval
    Li, JianQiang
    Yang, Ji-Jiang
    Liu, Chunchen
    Zhao, Yu
    Liu, Bo
    Shi, Yuliang
    ENTERPRISE INFORMATION SYSTEMS, 2014, 8 (04) : 464 - 489
  • [44] Improving the Applicability of Knowledge-Enhanced Dialogue Generation Systems by Using Heterogeneous Knowledge from Multiple Sources
    Wu, Sixing
    Wang, Minghui
    Li, Ying
    Zhang, Dawei
    Wu, Zhonghai
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1149 - 1157
  • [45] Using knowledge-based relatedness for information retrieval
    Arantxa Otegi
    Xabier Arregi
    Olatz Ansa
    Eneko Agirre
    Knowledge and Information Systems, 2015, 44 : 689 - 718
  • [46] Using knowledge-based relatedness for information retrieval
    Otegi, Arantxa
    Arregi, Xabier
    Ansa, Olatz
    Agirre, Eneko
    KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 44 (03) : 689 - 718
  • [47] Enhancing information retrieval using problem specific knowledge
    Morioka, Nobuyuki
    Mahidadia, Ashesh
    ADVANCES IN KNOWLEDGE ACQUISITION AND MANAGEMENT, 2006, 4303 : 244 - +
  • [48] Information retrieval using ontology for sharing knowledge on safety
    Ogure, T
    Furuta, K
    PROBABILISTIC SAFETY ASSESSMENT AND MANAGEMENT, VOL 1- 6, 2004, : 531 - 536
  • [49] Enhanced information retrieval by using HTML']HTML tags
    Werner, L
    Böttcher, S
    Beckmann, R
    DMIN '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON DATA MINING, 2005, : 24 - 29
  • [50] Disaster linguistics, climate change semantics and public discourse studies: a semantically-enhanced discourse study of 2011 Queensland Floods
    Bromhead, Helen
    LANGUAGE SCIENCES, 2021, 85