"They only care to show us the wheelchair": Disability Representation in Text-to-Image AI Models

被引:0
|
作者
Mack, Kelly Avery [1 ,2 ]
Qadri, Rida [3 ]
Denton, Remi [4 ]
Kane, Shaun K. [5 ]
Bennett, Cynthia L. [4 ]
机构
[1] Google Res, Seattle, WA 98101 USA
[2] Univ Washington, Paul G Allen Sch Comp Sci, Seattle, WA 98195 USA
[3] Google Res, San Francisco, CA USA
[4] Google Res, New York, NY USA
[5] Google Res, Boulder, CO USA
关键词
disability representation; generative AI; algorithmic harms; human-centered AI; AI harms; text-to-image models;
D O I
10.1145/3613904.3642166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper reports on disability representation in images output from text-to-image (T2I) generative AI systems. Through eight focus groups with 25 people with disabilities, we found that models repeatedly presented reductive archetypes for different disabilities. Often these representations reflected broader societal stereotypes and biases, which our participants were concerned to see reproduced through T2I. Our participants discussed further challenges with using these models including the current reliance on prompt engineering to reach satisfactorily diverse results. Finally, they offered suggestions for how to improve disability representation with solutions like showing multiple, heterogeneous images for a single prompt and including the prompt with images generated. Our discussion reflects on tensions and tradeoffs we found among the diverse perspectives shared to inform future research on representation-oriented generative AI system evaluation metrics and development processes.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Negative Capabilities: Investigating Apophasis in AI Text-to-Image Models
    Lucas, Hannah
    [J]. RELIGIONS, 2023, 14 (06)
  • [2] AI's Regimes of Representation: A Community-centered Study of Text-to-Image Models in South Asia
    Qadri, Rida
    Shelby, Renee
    Bennett, Cynthia L.
    Denton, Emily
    [J]. PROCEEDINGS OF THE 6TH ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2023, 2023, : 506 - 517
  • [3] Fantasy on Demand: The Temptation Of Text-to-Image AI
    Nagele, Julia
    [J]. CTBUH Journal, 2023, 2023 (03) : 46 - 51
  • [4] Text-to-image AI tools and tourism experiences
    Miao, Li
    Yang, Fiona X.
    [J]. ANNALS OF TOURISM RESEARCH, 2023, 102
  • [5] Adversarial Representation Learning for Text-to-Image Matching
    Sarafianos, Nikolaos
    Xu, Xiang
    Kakadiaris, Ioannis A.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5813 - 5823
  • [6] Holistic Evaluation of Text-to-Image Models
    Lee, Tony
    Yasunaga, Michihiro
    Meng, Chenlin
    Mai, Yifan
    Park, Joon Sung
    Gupta, Agrim
    Zhang, Yunzhi
    Narayanan, Deepak
    Teufel, Hannah Benita
    Bellagente, Marco
    Kang, Minguk
    Park, Taesung
    Leskovec, Jure
    Zhu, Jun-Yan
    Li Fei-Fei
    Wu, Jiajun
    Ermon, Stefano
    Liang, Percy
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [7] Unleashing the AI revolution: exploring the capabilities and challenges of large language models and text-to-image AI programs
    Youssef, A.
    [J]. ULTRASOUND IN OBSTETRICS & GYNECOLOGY, 2023, 62 (02) : 308 - 312
  • [8] Personalizing Text-to-Image Diffusion Models by Fine-Tuning Classification for AI Applications
    Hidalgo, Rafael
    Salah, Nesreen
    Jetty, Rajiv Chandra
    Jetty, Anupama
    Varde, Aparna S.
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, INTELLISYS 2023, 2024, 822 : 642 - 658
  • [9] AI for conceptual architecture: Reflections on designing with text-to-text, text-to-image, and image-to-image generators
    AncaSimona Horvath
    Panagiota Pouliou
    [J]. Frontiers of Architectural Research., 2024, 13 (03) - 612
  • [10] AI for conceptual architecture: Reflections on designing with text-to-text, text-to-image, and image-to-image generators
    Horvath, Anca-Simona
    Pouliou, Panagiota
    [J]. FRONTIERS OF ARCHITECTURAL RESEARCH, 2024, 13 (03) : 593 - 612