A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks

被引：0

作者：

Braylan, Alexander ^{[1
]}

Marabella, Madalyn ^{[2
]}

Alonso, Omar ^{[3
]}

Lease, Matthew ^{[4
]}

机构：

[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA

[2] Univ Texas Austin, McCombs Sch Business, Austin, TX USA

[3] Amazon, Seattle, WA USA

[4] Univ Texas Austin, Sch Informat, Austin, TX USA

来源：

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH | 2023年 / 78卷

基金：

美国国家科学基金会;

关键词：

AGREEMENT;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human annotations are vital to supervised learning, yet annotators often disagree on the correct label, especially as annotation tasks increase in complexity. A common strategy to improve label quality is to ask multiple annotators to label the same item and then aggregate their labels. To date, many aggregation models have been proposed for simple categorical or numerical annotation tasks, but far less work has considered more complex annotation tasks, such as those involving open-ended, multivariate, or structured responses. Similarly, while a variety of bespoke models have been proposed for specific tasks, our work is the first we are aware of to introduce aggregation methods that generalize across many, diverse complex tasks, including sequence labeling, translation, syntactic parsing, ranking, bounding boxes, and keypoints. This generality is achieved by applying readily available task-specific distance functions, then devising a task-agnostic method to model these distances between labels, rather than the labels themselves.This article presents a unified treatment of our prior work on complex annotation modeling and extends that work with investigation of three new research questions. First, how do complex annotation task and dataset properties impact aggregation accuracy? Second, how should a task owner navigate the many modeling choices in order to maximize aggregation accuracy? Finally, what tests and diagnoses can verify that aggregation models are specified correctly for the given data? To understand how various factors impact accuracy and to inform model selection, we conduct large-scale simulation studies and broad experiments on real, complex datasets. Regarding testing, we introduce the concept of unit tests for aggregation models and present a suite of such tests to ensure that a given model is not mis-specified and exhibits expected behavior.Beyond investigating these research questions above, we discuss the foundational con-cept and nature of annotation complexity, present a new aggregation model as a concep-tual bridge between traditional models and our own, and contribute a new general semi -supervised learning method for complex label aggregation that outperforms prior work.

引用

页码：901 / 973

页数：73

共 50 条

[31] Algorithms to model the multi-object spectrograph JWST/NIRSpec instrument
Gnata, X.
Ferruit, P.
ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVII, 2008, 394 : 673 - 676
[32] Algorithm and evaluation model for programming multi-object of some material
Lu, F.
Sun, D.
Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 2001, 29 (09): : 43 - 45
[33] Rehearsing the complex data flow of Multi-Object Spectrograph Survey projects
Worley, C. C.
Walton, N. A.
Murphy, D. N. A.
Paz-Chinchon, F.
Irwin, M. J.
Molaeinezhad, A.
Gonneau, A.
MODELING, SYSTEMS ENGINEERING, AND PROJECT MANAGEMENT FOR ASTRONOMY X, 2022, 12187
[34] Labeled Multi-object Tracking Algorithms for Generic Observation Model
Li, Suqi
Yi, Wei
Wang, Bailu
Kong, Lingjiang
2016 19TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2016, : 1125 - 1131
[35] Nonlinear Multi-Object Differential Game Simulation Model in LabVIEW
Lisowski, Jozef
ELECTRONICS, 2023, 12 (18)
[36] An improved Faster R-CNN model for multi-object tomato maturity detection in complex scenarios
Wang, Zan
Ling, Yiming
Wang, Xuanli
Meng, Dezhang
Nie, Lixiu
An, Guiqin
Wang, Xuanhui
ECOLOGICAL INFORMATICS, 2022, 72
[37] A multi-agent, multi-object and multi-attribute intelligent negotiation model
Fei, Yulian
Chen, Wenjuan
FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 440 - +
[38] MFACNet: A Multi-Frame Feature Aggregating and Inter-Feature Correlation Framework for Multi-Object Tracking in Satellite Videos
Zhao, Hu
Shen, Yanyun
Wang, Zhipan
Zhang, Qingling
REMOTE SENSING, 2024, 16 (09)
[39] A Simple Multi-Frame Fusion Baseline For Long-Term Multi-Object Tracking
Ke, Junmin
Guo, Shengting
2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 39 - 45
[40] Exploring Simple 3D Multi-Object Tracking for Autonomous Driving
Luo, Chenxu
Yang, Xiaodong
Yuille, Alan
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10468 - 10477

← 1 2 3 4 5 →