Towards understanding and mitigating unintended biases in language model-driven conversational recommendation

被引:14
|
作者
Shen, Tianshu [1 ]
Li, Jiaru [1 ]
Bouadjenek, Mohamed Reda [2 ]
Mai, Zheda [1 ]
Sanner, Scott [1 ]
机构
[1] Univ Toronto, Dept Mech & Ind Engn, Toronto, ON, Canada
[2] Deakin Univ, Sch Informat Technol, Waurn Ponds Campus, Geelong, Vic 3216, Australia
关键词
Conversational recommendation systems; BERT; Contextual language models; Bias and discrimination; FOOD CRAVINGS; SOCIOECONOMIC-STATUS; ALCOHOL-CONSUMPTION; UNITED-STATES; GENDER; HEALTH; DISCRIMINATION; RACE/ETHNICITY; CONSEQUENCES; PATTERNS;
D O I
10.1016/j.ipm.2022.103139
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conversational Recommendation Systems (CRSs) have recently started to leverage pretrained language models (LM) such as BERT for their ability to semantically interpret a wide range of preference statement variations. However, pretrained LMs are prone to intrinsic biases in their training data, which may be exacerbated by biases embedded in domain-specific language data (e.g., user reviews) used to fine-tune LMs for CRSs. We study a simple LM-driven recom-mendation backbone (termed LMRec) of a CRS to investigate how unintended bias - i.e., bias due to language variations such as name references or indirect indicators of sexual orientation or location that should not affect recommendations - manifests in substantially shifted price and category distributions of restaurant recommendations. For example, offhand mention of names associated with the black community substantially lowers the price distribution of recommended restaurants, while offhand mentions of common male-associated names lead to an increase in recommended alcohol-serving establishments. While these results raise red flags regarding a range of previously undocumented unintended biases that can occur in LM -driven CRSs, there is fortunately a silver lining: we show that train side masking and test side neutralization of non-preferential entities nullifies the observed biases without significantly impacting recommendation performance.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Towards Understanding and Mitigating Social Biases in Language Models
    Liang, Paul Pu
    Wu, Chiyu
    Morency, Louis-Philippe
    Salakhutdinov, Ruslan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [2] Towards a Model-Driven Datacube Analytics Language
    Baumann, Peter
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3740 - 3746
  • [3] Towards model-driven communications
    Natali, Antonio
    Molesini, Ambra
    World Academy of Science, Engineering and Technology, 2010, 40 : 73 - 85
  • [4] Towards model-driven communications
    Natali, Antonio
    Molesini, Ambra
    World Academy of Science, Engineering and Technology, 2010, 64 : 73 - 84
  • [5] Measuring and mitigating language model biases in abusive language detection
    Song, Rui
    Giunchiglia, Fausto
    Li, Yingji
    Shi, Lida
    Xu, Hao
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [6] Towards a model-driven approach to reuse
    France, RB
    Ghosh, S
    Turk, DE
    OOIS 2001: 7TH INTERNATIONAL CONFERENCE ON OBJECT-ORIENTED INFORMATION SYSTEMS, PROCEEDINGS, 2001, : 181 - 190
  • [7] Towards model-driven unit testing
    Engels, Gregor
    Gueldali, Baris
    Lohmann, Marc
    MODELS IN SOFTWARE ENGINEERING, 2007, 4364 : 182 - +
  • [8] A Model-Driven Approach for Context-Aware Recommendation
    Haddad, Mohamed Ramzi
    Baazaoui, Hajer
    Ziou, Djemel
    Ben Ghezala, Henda
    2012 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2012, : 755 - 760
  • [9] Language Architecture: An Architecture Language for Model-Driven Engineering
    Brouwers, Niels
    Hamilton, Marc
    Kurtev, Ivan
    Luo, Yaping
    MODELSWARD: PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON MODEL-DRIVEN ENGINEERING AND SOFTWARE DEVELOPMENT, 2017, : 147 - 156
  • [10] Understanding the Anatomy of Virtual Coaches - A Step Towards the Model-Driven Development of Digital Therapeutics
    Gisske, Carola
    Weimann, Thure Georg
    Schlieter, Hannes
    2024 26TH INTERNATIONAL CONFERENCE ON BUSINESS INFORMATICS, CBI 2024, 2024, : 238 - 246