Partiality and Misconception: Investigating Cultural Representativeness in Text-To-Image Models

被引：0

作者：

Zhang, Lili ^{[1
]}

Liao, Xi ^{[1
]}

Yang, Zaijia ^{[1
]}

Gao, Baihang ^{[1
]}

Wang, Chunjie ^{[1
]}

Yang, Qiuling ^{[2
]}

Li, Deshun ^{[2
]}

机构：

[1] Hainan Univ, Haikou, Hainan, Peoples R China

[2] Hainan Univ, China Innovat Platform Acad Hainan Prov, Haikou, Hainan, Peoples R China

来源：

PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024) | 2024年

基金：

中国国家自然科学基金;

关键词：

text-to-image generation; cultural representativeness; cultural cluster; bias; stereotype;

D O I：

10.1145/3613904.3642877

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Text-to-image (T2I) models enable users worldwide to create high-defnition and realistic images through text prompts, where the underrepresentation and potential misinformation of images have raised growing concerns. However, few existing works examine cultural representativeness, especially involving whether the generated content can fairly and accurately refect global cultures. Combining automated and human methods, we investigate this issue in multiple dimensions quantifcationally and conduct a set of evaluations on three prevailing T2I models (DALL-E v2, Stable Difusion v1.5 and v2.1). Introducing attributes of cultural cluster and subject, we provide a fresh interdisciplinary perspective to bias analysis. The benchmark dataset UCOGC is presented, which encompasses authentic images of unique cultural objects from global clusters. Our results reveal that the culture of a disadvantaged country is prone to be neglected, some specifed subjects often present a stereotype or a simple patchwork of elements, and over half of cultural objects are mispresented.

引用

页数：25

共 50 条

[1] Inspecting the Geographical Representativeness of Images from Text-to-Image Models
Basu, Abhipsa
Babu, R. Venkatesh
Pruthi, Danish
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5113 - 5124
[2] Negative Capabilities: Investigating Apophasis in AI Text-to-Image Models
Lucas, Hannah
[J]. RELIGIONS, 2023, 14 (06)
[3] Holistic Evaluation of Text-to-Image Models
Lee, Tony
Yasunaga, Michihiro
Meng, Chenlin
Mai, Yifan
Park, Joon Sung
Gupta, Agrim
Zhang, Yunzhi
Narayanan, Deepak
Teufel, Hannah Benita
Bellagente, Marco
Kang, Minguk
Park, Taesung
Leskovec, Jure
Zhu, Jun-Yan
Li Fei-Fei
Wu, Jiajun
Ermon, Stefano
Liang, Percy
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[4] Evaluating Data Attribution for Text-to-Image Models
Wang, Sheng-Yu
Efros, Alexei A.
Zhu, Jun-Yan
Zhang, Richard
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7158 - 7169
[5] Multilingual Conceptual Coverage in Text-to-Image Models
Saxon, Michael
Wang, William Yang
[J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4831 - 4848
[6] Ablating Concepts in Text-to-Image Diffusion Models
Kumari, Nupur
Zhang, Bingliang
Wang, Sheng-Yu
Shechtman, Eli
Zhang, Richard
Zhu, Jun-Yan
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22634 - 22645
[7] Resolving Ambiguities in Text-to-Image Generative Models
Mehrabi, Ninareh
Goyal, Palash
Verma, Apurv
Dhamala, Jwala
Kumar, Varun
Hu, Qian
Chang, Kai-Wei
Zemel, Richard
Galstyan, Aram
Gupta, Rahul
[J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14367 - 14388
[8] Typology of Risks of Generative Text-to-Image Models
Bird, Charlotte
Ungless, Eddie L.
Kasirzadeh, Atoosa
[J]. PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 396 - 410
[9] SINE: SINgle Image Editing with Text-to-Image Diffusion Models
Zhang, Zhixing
Han, Ligong
Ghosh, Arnab
Metaxas, Dimitris
Ren, Jian
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6027 - 6037
[10] Unleashing Text-to-Image Diffusion Models for Visual Perception
Zhao, Wenliang
Rao, Yongming
Liu, Zuyan
Liu, Benlin
Zhou, Jie
Lu, Jiwen
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5706 - 5716

← 1 2 3 4 5 →