Aligning Human and Computational Coherence Evaluations

Authors
Lim, Jia Peng [1]
Lauw, Hady W. [1]
Affiliations
[1] Singapore Management Univ, Sch Comp & Informat Syst, PreferredAI Res Grp, Singapore, Singapore
Funding
National Research Foundation, Singapore
Keywords
VOCABULARY;
DOI
10.1162/coli_a_00518
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Automated coherence metrics constitute an efficient and popular way to evaluate topic models. Previous work presents a mixed picture of their presumed correlation with human judgment. This work proposes a novel sampling approach to mining topic representations at a large scale while seeking to mitigate bias from sampling, enabling the investigation of widely used automated coherence metrics via large corpora. Additionally, this article proposes a novel user study design, an amalgamation of different proxy tasks, to derive a finer insight into the human decision-making processes. This design subsumes the purpose of simple rating and outlier-detection user studies. Similar to the sampling approach, the user study conducted is extensive, comprising 40 study participants split into eight different study groups tasked with evaluating their respective set of 100 topic representations. Usually, when substantiating the use of these metrics, human responses are treated as the gold standard. This article further investigates the reliability of human judgment by flipping the comparison and conducting a novel extended analysis of human response at the group and individual level against a generic corpus. The investigation results show a moderate to good correlation between these metrics and human judgment, especially for generic corpora, and derive further insights into the human perception of coherence. Analyzing inter-metric correlations across corpora shows moderate to good correlation among these metrics. As these metrics depend on corpus statistics, this article further investigates the topical differences between corpora, revealing nuances in applications of these metrics.
Pages: 893-952
Page count: 60