Approximate keyword search inn web search engines

被引:0
|
作者
Wu, Sun
Chang, Hsien-Tsung
Hsu, Ting-Chao
Liu, Pei-Shin
机构
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a new index method to provide approximate keyword search in search engines. Our approximate keyword matching adopts a new similarity measurement called Listance model, which is a variation of the LCS (Longest Common Subsequence) model. Two keywords are considered approximately matched, if their Listance is no more than a predefined parameter k. Suppose the length of keywords A and B are m and n respectively, the Listance between A and B is defined to be max(m, n) - LCS(A, B). The index method uses a new data structure called LBS index (Listance Bounded Subsequence index), which was designed to allow for very fast approximate keyword matching. In the index phase, a collection of keywords is used as a reference dictionary. We transform keywords in the web pages into a special form to be indexed if they match one of the keywords approximately in the reference dictionary. During the query processing, a similar keyword transformation is conducted to search the approximate index. The experimental result shows that our approach is efficient and can provide approximate keyword search capability that could be practically interesting.
引用
收藏
页码:404 / 411
页数:8
相关论文
共 50 条
  • [1] A Performance Evaluation of Semantic based Search Engines and Keyword based Search Engines
    Khan, Javed Ahinad
    Sangroha, Deepak
    Ahmad, Masroor
    Rahman, Md Tanzillur
    [J]. 2014 INTERNATIONAL CONFERENCE ON MEDICAL IMAGING, M-HEALTH & EMERGING COMMUNICATION SYSTEMS (MEDCOM), 2015, : 168 - 173
  • [2] A keyword searching algorithm for Search Engines
    Gupta, Vishal
    [J]. 2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 517 - 521
  • [3] Web search engines
    Schwartz, C
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1998, 49 (11): : 973 - 982
  • [4] Approximate Nearest Neighbor Search on Standard Search Engines
    Carrara, Fabio
    Vadicamo, Lucia
    Gennaro, Claudio
    Amato, Giuseppe
    [J]. SIMILARITY SEARCH AND APPLICATIONS (SISAP 2022), 2022, 13590 : 214 - 221
  • [5] Web search engines: Search syntax and features
    Ojala, M
    [J]. ONLINE, 2002, 26 (05): : 28 - 31
  • [6] Mimicking Web search engines for expert search
    Santos, Rodrygo L. T.
    Macdonald, Craig
    Ounis, Iadh
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (04) : 467 - 481
  • [7] Web search engines: Search syntax and features
    Ojala, Marydee
    [J]. Online (Wilton, Connecticut), 2002, 26 (05):
  • [8] Keyword stuffing and the big three search engines
    Zuze, Herbert
    Weideman, Melius
    [J]. ONLINE INFORMATION REVIEW, 2013, 37 (02) : 268 - 286
  • [9] Evaluation of Web Search Engines
    Luo XiaoLing
    Xue He Ru
    [J]. EMERGING RESEARCH IN WEB INFORMATION SYSTEMS AND MINING, 2011, 238 : 448 - 454
  • [10] Search engines and Web dynamics
    Risvik, KM
    Michelsen, R
    [J]. COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2002, 39 (03): : 289 - 302