Genetic programming-based discovery of ranking functions for effective Web search

被引:46
|
作者
Fan, WG [1 ]
Gordon, MD
Pathak, P
机构
[1] Virginia Polytech Inst & State Univ, Blacksburg, VA 24061 USA
[2] Univ Michigan, Ross Sch Business, Ann Arbor, MI 48109 USA
[3] Univ Florida, Warrington Coll Business, Gainesville, FL 32611 USA
关键词
business intelligence; genetic programming; information retrieval; machine learning; ranking function; search engine; text mining; Web mining;
D O I
10.1080/07421222.2005.11045828
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web search engines have become an integral part of the daily life of a knowledge worker, who depends on these search engines to retrieve relevant information from the Web or from the company's vast document databases. Current search engines are very fast in terms of their response time to a user query. But their usefulness to the user in terms of retrieval performance leaves a lot to be desired. Typically, the user has to sift through a lot of nonrelevant documents to get only a few relevant ones for the user's information needs. Ranking functions play a very important role in the search engine retrieval performance. In this paper, we describe a methodology using genetic programming to discover new ranking functions for the Web-based information-seeking task. We exploit the content as well as structural information in the Web documents in the discovery process. The discovery process is carried out for both the ad hoc task and the routing task in retrieval. For either of the retrieval tasks, the retrieval performance of these newly discovered ranking functions has been found to be superior to the performance obtained by well-known ranking strategies in the information retrieval literature.
引用
收藏
页码:37 / 56
页数:20
相关论文
共 50 条
  • [21] Quality of Service Timeseries Forecasting for Web Services: A Machine Learning, Genetic Programming-Based Approach
    Yang Syu
    Yong-Yi Fanjiang
    Jong-Yih Kuo
    Jui-Lung Su
    [J]. 2016 ANNUAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (CISS), 2016,
  • [22] A novel approach of cluster based optimal ranking of clicked URLs using genetic algorithm for effective personalized web search
    Chawla, Suruchi
    [J]. APPLIED SOFT COMPUTING, 2016, 46 : 90 - 103
  • [23] Genetic Programming-Based Inverse Kinematics for Robotic Manipulators
    Reuter, Julia
    Steup, Christoph
    Mostaghim, Sanaz
    [J]. GENETIC PROGRAMMING (EUROGP 2022), 2022, : 130 - 145
  • [24] Genetic Programming-Based Prediction Model for Microseismic Data
    Wang, Man
    Zhou, Hongwei
    Zhang, Dongming
    Wang, Yingwei
    Du, Weihang
    Yu, Beichen
    [J]. GEOFLUIDS, 2022, 2022
  • [25] Genetic programming-based modeling on chaotic time series
    Zhang, W
    Yang, GK
    Wu, ZM
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2347 - 2352
  • [26] MAHATMA: a Genetic Programming-Based Tool for Protein Classification
    Tsunoda, Denise F.
    Freitas, Alex A.
    Lopes, Heitor S.
    [J]. 2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009, : 1136 - +
  • [27] A Genetic Programming-Based Model for Colloid Retention in Fractures
    Yosri, Ahmed
    Siam, Ahmad
    El-Dakhakhni, Wael
    Dickson-Anderson, Sarah
    [J]. GROUNDWATER, 2019, 57 (05) : 693 - 703
  • [28] Genetic programming-based chaotic time series modeling
    张伟
    吴智铭
    杨根科
    [J]. Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2004, (11) : 121 - 128
  • [29] Genetic Programming-Based Machine Degradation Modeling Methodology
    Yan, Tongtong
    Wang, Dong
    [J]. IEEE Open Journal of Instrumentation and Measurement, 2022, 1
  • [30] Genetic programming-based chaotic time series modeling
    Zhang W.
    Wu Z.-M.
    Yang G.-K.
    [J]. Journal of Zhejiang University-SCIENCE A, 2004, 5 (11): : 1432 - 1439