Use of N-grams Model and Semantic Similarity to Improve the Results of Search Engine

被引:0
|
作者
El Hadi, Amine [1 ]
Madani, Youness [1 ]
El Ayachi, Rachid [1 ]
Erritali, Mohamed [1 ]
机构
[1] Sultan Moulay Slimane Univ, Fac Sci & Tech, Beni Mellal, Morocco
关键词
Semantic similarity; N-grams model; Search engine; Query reformulation;
D O I
10.1007/978-3-030-90639-9_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic indexing and document similarity is a significant problem of the information retrieval systems (IRS). Recently many works use semantics in information retrieval systems to find the information easily. A search engine is the best example of the application of information retrieval to get the most relevant results. In this work, to facilitate the use of search engines for users for getting the most relevant documents, we propose a new approach for calculating the semantic similarity between user's query and a list of documents based on the N-grams model (i.e., a sub-sequence of n constructed elements from a given sequence). Results show that our method give us better results than some methods from the literature.
引用
收藏
页码:437 / 444
页数:8
相关论文
共 50 条
  • [1] Visualizing Document Similarity Using N-Grams and Latent Semantic Analysis
    Hussein, Ashraf S.
    [J]. PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 269 - 279
  • [2] Finding Relevant Documents in a Search Engine Using N-Grams Model and Reinforcement Learning
    El Hadi, Amine
    Madani, Youness
    El Ayachi, Rachid
    Erritali, Mohamed
    [J]. JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)
  • [3] Malware Detection and Classification Based on n-grams Attribute Similarity
    Zhang Fuyong
    Zhao Tiezhou
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 1, 2017, : 793 - 796
  • [4] Interpolated N-Grams for Model Based Testing
    Tonella, Paolo
    Tiella, Roberto
    Cu Duy Nguyen
    [J]. 36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014), 2014, : 562 - 572
  • [5] Algorithms for correcting recognition results using N-grams
    Manzhikov T.V.
    Slavin O.A.
    Faradjev I.A.
    Janiszewski I.M.
    [J]. Pattern Recognition and Image Analysis, 2017, 27 (4) : 832 - 837
  • [6] Efficient and Effective KNN Sequence Search with Approximate n-grams
    Wang, Xiaoli
    Ding, Xiaofeng
    Tung, Anthony K. H.
    Zhang, Zhenjie
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 7 (01): : 1 - 12
  • [7] GAUGING SIMILARITY WITH N-GRAMS - LANGUAGE-INDEPENDENT CATEGORIZATION OF TEXT
    DAMASHEK, M
    [J]. SCIENCE, 1995, 267 (5199) : 843 - 848
  • [8] A new semantic similarity approach for improving the results of an Arabic search engine
    El Hadi, Amine
    Madani, Youness
    El Ayachi, Rachid
    Erritali, Mohamed
    [J]. 10TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2019) / THE 2ND INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40 2019) / AFFILIATED WORKSHOPS, 2019, 151 : 1170 - 1175
  • [9] Classifying disease outbreak reports using n-grams and semantic features
    Conway, Mike
    Doan, Son
    Kawazoe, Ai
    Collier, Nigel
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2009, 78 (12) : E47 - E58
  • [10] An improved N-grams based Model for Authorship Attribution
    Boughaci, Dalila
    Benmesbah, Mounir
    Zebiri, Aniss
    [J]. 2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, : 70 - 75