Learning and Evaluating Human Preferences for Conversational Head Generation

被引:0
|
作者
Zhou, Mohan [1 ]
Bai, Yalong [2 ]
Zhang, Wei [3 ]
Yao, Ting [4 ]
Zhao, Tiejun [1 ]
Mei, Tao [4 ]
机构
[1] Harbin Inst Technol, Harbin, Peoples R China
[2] JD Explore Acad, Beijing, Peoples R China
[3] Gaoding AI, Beijing, Peoples R China
[4] HiDream Ai, Beijing, Peoples R China
关键词
evaluation metric; digital human; conversational head generation;
D O I
10.1145/3581783.3612831
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A reliable and comprehensive evaluation metric that aligns with manual preference assessments is crucial for conversational head video synthesis methods development. Existing quantitative evaluations often fail to capture the full complexity of human preference, as they only consider limited evaluation dimensions. Qualitative evaluations and user studies offer a solution but are time-consuming and labor-intensive. This limitation hinders the advancement of conversational head generation algorithms and systems. In this paper, we propose a novel learning-based evaluation metric named Preference Score (PS) for fitting human preference according to the quantitative evaluations across different dimensions. PS can serve as a quantitative evaluation without the need for human annotation. Experimental results validate the superiority of Preference Score in aligning with human perception, and also demonstrate robustness and generalizability to unseen data, making it a valuable tool for advancing conversation head generation. We expect this metric could facilitate new advances in conversational head generation. Project page: https://github.com/dc3ea9f/PreferenceScore.
引用
收藏
页码:9615 / 9619
页数:5
相关论文
共 50 条
  • [1] ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
    Xu, Jiazheng
    Liu, Xiao
    Wu, Yuchen
    Tong, Yuxuan
    Li, Qinkai
    Ding, Ming
    Tang, Jie
    Dong, Yuxiao
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] A Baseline for ViCo Conversational Head Generation Challenge
    Liu, Meng
    Zhai, Shuyan
    Li, Yongqiang
    Guan, Weili
    Nie, Liqiang
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 7013 - 7015
  • [3] An Evaluation of Human Conversational Preferences in Social Human-Robot Interaction
    Sirithunge, Chapa
    Jayasekara, A. G. Buddhika P.
    Chandima, D. P.
    [J]. APPLIED BIONICS AND BIOMECHANICS, 2021, 2021
  • [4] Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer
    Huang, Ailin
    Huang, Zhewei
    Zhou, Shuchang
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 7050 - 7054
  • [5] Evaluating Housing Needs and Preferences of Generation Y in Malaysia
    Kam, Kenn Jhun
    Lim, Anthony Sheng Hui
    Al-Obaidi, Karam M.
    Lim, Tze Shwan
    [J]. PLANNING PRACTICE AND RESEARCH, 2018, 33 (02): : 172 - 185
  • [6] Learning to Abstract for Memory-augmented Conversational Response Generation
    Tian, Zhiliang
    Bi, Wei
    Li, Xiaopeng
    Zhang, Nevin L.
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3816 - 3825
  • [7] Dual Learning for Conversational Emotion Recognition and Emotional Response Generation
    Zhang, Shuhe
    Hu, Haifeng
    Xing, Songlong
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (03) : 1241 - 1252
  • [8] Corpus-based generation of head and eyebrow motion for an embodied conversational agent
    Foster, Mary Ellen
    Oberlander, Jon
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2007, 41 (3-4) : 305 - 323
  • [9] Improvements on SadTalker-based Approach for ViCo Conversational Head Generation Challenge
    Dai, Wei
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9566 - 9570
  • [10] Towards Realistic Conversational Head Generation: A Comprehensive Framework for Lifelike Video Synthesis
    Liu, Meng
    Li, Yongqiang
    Zhai, Shuyan
    Guan, Weili
    Nie, Liqiang
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9441 - 9445