A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks

被引:0
|
作者
Braylan, Alexander [1 ]
Marabella, Madalyn [2 ]
Alonso, Omar [3 ]
Lease, Matthew [4 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[2] Univ Texas Austin, McCombs Sch Business, Austin, TX USA
[3] Amazon, Seattle, WA USA
[4] Univ Texas Austin, Sch Informat, Austin, TX USA
基金
美国国家科学基金会;
关键词
AGREEMENT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human annotations are vital to supervised learning, yet annotators often disagree on the correct label, especially as annotation tasks increase in complexity. A common strategy to improve label quality is to ask multiple annotators to label the same item and then aggregate their labels. To date, many aggregation models have been proposed for simple categorical or numerical annotation tasks, but far less work has considered more complex annotation tasks, such as those involving open-ended, multivariate, or structured responses. Similarly, while a variety of bespoke models have been proposed for specific tasks, our work is the first we are aware of to introduce aggregation methods that generalize across many, diverse complex tasks, including sequence labeling, translation, syntactic parsing, ranking, bounding boxes, and keypoints. This generality is achieved by applying readily available task-specific distance functions, then devising a task-agnostic method to model these distances between labels, rather than the labels themselves.This article presents a unified treatment of our prior work on complex annotation modeling and extends that work with investigation of three new research questions. First, how do complex annotation task and dataset properties impact aggregation accuracy? Second, how should a task owner navigate the many modeling choices in order to maximize aggregation accuracy? Finally, what tests and diagnoses can verify that aggregation models are specified correctly for the given data? To understand how various factors impact accuracy and to inform model selection, we conduct large-scale simulation studies and broad experiments on real, complex datasets. Regarding testing, we introduce the concept of unit tests for aggregation models and present a suite of such tests to ensure that a given model is not mis-specified and exhibits expected behavior.Beyond investigating these research questions above, we discuss the foundational con-cept and nature of annotation complexity, present a new aggregation model as a concep-tual bridge between traditional models and our own, and contribute a new general semi -supervised learning method for complex label aggregation that outperforms prior work.
引用
收藏
页码:901 / 973
页数:73
相关论文
共 50 条
  • [1] A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks
    Braylan A.
    Marabella M.
    Alonso O.
    Lease M.
    Journal of Artificial Intelligence Research, 2023, 78 : 901 - 973
  • [2] A General Model for Aggregating Annotations AcrossSimple, Complex, and Multi-object Annotation Tasks (Abstract Reprint)
    Braylan, Alexander
    Marabella, Madalyn
    Alonso, Omar
    Lease, Matthew
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22693 - 22693
  • [3] Measuring Annotator Agreement Generally across Complex Structured, Multi-object, and Free-text Annotation Tasks
    Braylan, Alexander
    Alonso, Omar
    Lease, Matthew
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1720 - 1730
  • [4] Crowdsourcing System for Multi-object Annotation in Surveillance Videos
    Zhang, Zheng
    Zhao, Zixin
    Zhang, Lan
    Li, Xiangyang
    2022 8TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS, BIGCOM, 2022, : 389 - 397
  • [5] Learning Multi-Object Tracking and Segmentation from Automatic Annotations
    Porzi, Lorenzo
    Hofinger, Markus
    Ruiz, Idoia
    Serrat, Joan
    Bulo, Samuel Rota
    Kontschieder, Peter
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6845 - 6854
  • [6] General multi-object filtering and association measure
    Houssineau, Jeremie
    Del Moral, Pierre
    Clark, Daniel E.
    2013 IEEE 5TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2013), 2013, : 33 - +
  • [7] Simple Cues Lead to a Strong Multi-Object Tracker
    Seidenschwarz, Jenny
    Braso, Guillem
    Serrano, Victor Castro
    Elezi, Ismail
    Leal-Taixe, Laura
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13813 - 13823
  • [8] Increasing the Availability of Multi-object Tasks on Multi-region Distributed System
    Liu, Lihui
    Song, Junping
    Wang, Haibo
    Lv, Pin
    2017 IEEE 21ST INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2017, : 532 - 539
  • [9] Basketball-SORT: an association method for complex multi-object occlusion problems in basketball multi-object tracking
    Qingrui Hu
    Atom Scott
    Calvin Yeung
    Keisuke Fujii
    Multimedia Tools and Applications, 2024, 83 (38) : 86281 - 86297
  • [10] Learning Relational Affordance Models for Robots in Multi-Object Manipulation Tasks
    Moldovan, Bogdan
    Moreno, Plinio
    van Otterlo, Martijn
    Santos-Victor, Jose
    De Raedt, Luc
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 4373 - 4378