Toward creating a fairer ranking in search engine results

被引:66
|
作者
Gao, Ruoyuan [1 ]
Shah, Chirag [2 ]
机构
[1] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08854 USA
[2] Univ Washington, Informat Sch, Seattle, WA 98195 USA
关键词
Information retrieval; Search engine bias; Fairness ranking; Relevance; Diversity; Novelty; BIAS; IMPACT;
D O I
10.1016/j.ipm.2019.102138
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the increasing popularity and social influence of search engines in IR, various studies have raised concerns on the presence of bias in search engines and the social responsibilities of IR systems. As an essential component of search engine, ranking is a crucial mechanism in presenting the search results or recommending items in a fair fashion. In this article, we focus on the top-k diversity fairness ranking in terms of statistical parity fairness and disparate impact fairness. The former fairness definition provides a balanced overview of search results where the number of documents from different groups are equal; The latter enables a realistic overview where the proportion of documents from different groups reflect the overall proportion. Using 100 queries and top 100 results per query from Google as the data, we first demonstrate how topical diversity bias is present in the top web search results. Then, with our proposed entropy-based metrics for measuring the degree of bias, we reveal that the top search results are unbalanced and disproportionate to their overall diversity distribution. We explore several fairness ranking strategies to investigate the relationship between fairness, diversity, novelty and relevance. Our experimental results show that using a variant of fair epsilon-greedy strategy, we could bring more fairness and enhance diversity in search results without a cost of relevance. In fact, we can improve the relevance and diversity by introducing the diversity fairness. Additional experiments with TREC datasets containing 50 queries demonstrate the robustness of our proposed strategies and our findings on the impact of fairness. We present a series of correlation analysis on the amount of fairness and diversity, showing that statistical parity fairness highly correlates with diversity while disparate impact fairness does not. This provides clear and tangible implications for future works where one would want to balance fairness, diversity and relevance in search results.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Search results diversification for effective fair ranking in academic search
    McDonald, Graham
    Macdonald, Craig
    Ounis, Iadh
    INFORMATION RETRIEVAL JOURNAL, 2022, 25 (01): : 1 - 26
  • [42] Search results diversification for effective fair ranking in academic search
    Graham McDonald
    Craig Macdonald
    Iadh Ounis
    Information Retrieval Journal, 2022, 25 : 1 - 26
  • [43] Achieving Higher Ranking to Webpages Through Search Engine Optimization
    Swapna, B.
    Anuradha, T.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING, 2018, 9 : 105 - 112
  • [44] Examining the Impact of Ranking on Consumer Behavior and Search Engine Revenue
    Ghose, Anindya
    Ipeirotis, Panagiotis G.
    Li, Beibei
    MANAGEMENT SCIENCE, 2014, 60 (07) : 1632 - 1654
  • [45] EMPIRICAL STUDY OF LOAD TIME FACTOR IN SEARCH ENGINE RANKING
    Marszalkowski, Jakub
    Marszalkowski, Jedrzej M.
    Drozdowski, Maciej
    JOURNAL OF WEB ENGINEERING, 2014, 13 (1-2): : 114 - 128
  • [46] Method to Recommend Personalized Ranking Factors in the Google Search Engine
    Injante, Richard
    Mauricio, David
    REVISTA ESPANOLA DE DOCUMENTACION CIENTIFICA, 2020, 43 (01):
  • [47] EagleRank: A novel ranking model for web image search engine
    Liu, Kangmiao
    Chen, Wei
    Chen, Chun
    Bu, Jiajun
    Wang, Can
    Huang, Peng
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2006, PROCEEDINGS, 2006, 4261 : 749 - +
  • [48] The LAILAPS Search Engine: Relevance Ranking in Life Science Databases
    Lange, Matthias
    Spies, Karl
    Bargsten, Jachim
    Haberhauer, Gregor
    Klapperstueck, Matthias
    Leps, Michael
    Weinel, Christian
    Wuenschiers, Roebbe
    Weissbach, Mandy
    Stein, Jens
    Scholz, Uwe
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2010, 7 (02):
  • [49] Search Engine Optimization to Enhance Web Site Visibility and Ranking
    Al-Badi, Ali H.
    Al Majeeni, Ali O.
    Mayhew, Pam J.
    Al-Rashdi, Abdullah S.
    KNOWLEDGE MANAGEMENT AND INNOVATION IN ADVANCING ECONOMIES-ANALYSES & SOLUTIONS, VOLS 1-3, 2009, : 1379 - +
  • [50] Ranking Keyword Search Results with Query Logs
    Zhou, Jing
    Yu, Xiaohui
    Liu, Yang
    Yu, Ziqiang
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 770 - 771