A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks

被引:0
|
作者
Braylan, Alexander [1 ]
Marabella, Madalyn [2 ]
Alonso, Omar [3 ]
Lease, Matthew [4 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[2] Univ Texas Austin, McCombs Sch Business, Austin, TX USA
[3] Amazon, Seattle, WA USA
[4] Univ Texas Austin, Sch Informat, Austin, TX USA
基金
美国国家科学基金会;
关键词
AGREEMENT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human annotations are vital to supervised learning, yet annotators often disagree on the correct label, especially as annotation tasks increase in complexity. A common strategy to improve label quality is to ask multiple annotators to label the same item and then aggregate their labels. To date, many aggregation models have been proposed for simple categorical or numerical annotation tasks, but far less work has considered more complex annotation tasks, such as those involving open-ended, multivariate, or structured responses. Similarly, while a variety of bespoke models have been proposed for specific tasks, our work is the first we are aware of to introduce aggregation methods that generalize across many, diverse complex tasks, including sequence labeling, translation, syntactic parsing, ranking, bounding boxes, and keypoints. This generality is achieved by applying readily available task-specific distance functions, then devising a task-agnostic method to model these distances between labels, rather than the labels themselves.This article presents a unified treatment of our prior work on complex annotation modeling and extends that work with investigation of three new research questions. First, how do complex annotation task and dataset properties impact aggregation accuracy? Second, how should a task owner navigate the many modeling choices in order to maximize aggregation accuracy? Finally, what tests and diagnoses can verify that aggregation models are specified correctly for the given data? To understand how various factors impact accuracy and to inform model selection, we conduct large-scale simulation studies and broad experiments on real, complex datasets. Regarding testing, we introduce the concept of unit tests for aggregation models and present a suite of such tests to ensure that a given model is not mis-specified and exhibits expected behavior.Beyond investigating these research questions above, we discuss the foundational con-cept and nature of annotation complexity, present a new aggregation model as a concep-tual bridge between traditional models and our own, and contribute a new general semi -supervised learning method for complex label aggregation that outperforms prior work.
引用
收藏
页码:901 / 973
页数:73
相关论文
共 50 条
  • [31] Algorithms to model the multi-object spectrograph JWST/NIRSpec instrument
    Gnata, X.
    Ferruit, P.
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVII, 2008, 394 : 673 - 676
  • [32] Algorithm and evaluation model for programming multi-object of some material
    Lu, F.
    Sun, D.
    Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 2001, 29 (09): : 43 - 45
  • [33] Rehearsing the complex data flow of Multi-Object Spectrograph Survey projects
    Worley, C. C.
    Walton, N. A.
    Murphy, D. N. A.
    Paz-Chinchon, F.
    Irwin, M. J.
    Molaeinezhad, A.
    Gonneau, A.
    MODELING, SYSTEMS ENGINEERING, AND PROJECT MANAGEMENT FOR ASTRONOMY X, 2022, 12187
  • [34] Labeled Multi-object Tracking Algorithms for Generic Observation Model
    Li, Suqi
    Yi, Wei
    Wang, Bailu
    Kong, Lingjiang
    2016 19TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2016, : 1125 - 1131
  • [35] Nonlinear Multi-Object Differential Game Simulation Model in LabVIEW
    Lisowski, Jozef
    ELECTRONICS, 2023, 12 (18)
  • [36] An improved Faster R-CNN model for multi-object tomato maturity detection in complex scenarios
    Wang, Zan
    Ling, Yiming
    Wang, Xuanli
    Meng, Dezhang
    Nie, Lixiu
    An, Guiqin
    Wang, Xuanhui
    ECOLOGICAL INFORMATICS, 2022, 72
  • [37] A multi-agent, multi-object and multi-attribute intelligent negotiation model
    Fei, Yulian
    Chen, Wenjuan
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 440 - +
  • [38] MFACNet: A Multi-Frame Feature Aggregating and Inter-Feature Correlation Framework for Multi-Object Tracking in Satellite Videos
    Zhao, Hu
    Shen, Yanyun
    Wang, Zhipan
    Zhang, Qingling
    REMOTE SENSING, 2024, 16 (09)
  • [39] A Simple Multi-Frame Fusion Baseline For Long-Term Multi-Object Tracking
    Ke, Junmin
    Guo, Shengting
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 39 - 45
  • [40] Exploring Simple 3D Multi-Object Tracking for Autonomous Driving
    Luo, Chenxu
    Yang, Xiaodong
    Yuille, Alan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10468 - 10477