Deep Active Learning for Address Parsing Tasks with BERT

Cited by: 1

Authors:

Guler, Berkay [1]

Aygun, Betul [2]

Gerek, Aydin [2]

Gurel, Alaeddin Selcuk [2]

Affiliations:

[1] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci, Irvine, CA 92697 USA

[2] Huawei Turkey Res & Dev Ctr, Istanbul, Turkiye

Source:

2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2023

Keywords:

active learning; token classification; address data; BERT;

DOI:

10.1109/SIU59756.2023.10223996

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Deep learning models tend to perform better with larger datasets. With decreasing data handling costs, researchers have the means to gather and store vast amounts of unlabeled data. Supervised learning, on the other hand, requires training data to be labeled by annotators. However, high annotation costs pose challenges to labeling an optimum portion of the available data. One proposed method to mitigate this problem is to employ active learning (AL). AL strategies use a machine learning model to select the most informative and representative samples among unlabeled data points. Here, we demonstrate the effectiveness of uncertainty-based active learning strategies, including a new strategy, for address parsing with a BERT model on an in-house Arabic address dataset manually annotated for two different tasks. We compare AL methods with random sampling and longest-sentence baselines. We show that AL strategies' usefulness greatly depends on dataset characteristics, being less effective on datasets with fewer classes. We conclude that AL for address parsing with BERT decreases annotation costs, if measured in the number of queries. Yet, due to AL methods' tendency to select longer queries, some strategies may increase labeling costs, measured in the total number of words.
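The abstract's two cost measures (number of queries versus total number of words) can be illustrated with a minimal sketch of generic entropy-based uncertainty sampling for token classification. This is not the authors' new strategy, which the abstract does not specify; the function names, the toy probability vectors, and the `mean`/`sum` aggregation choices are all assumptions made for illustration. In a real setting, the per-token probabilities would come from a BERT token classifier.

```python
import math

def token_entropy(probs):
    """Shannon entropy (in nats) of one token's predicted label distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def sentence_uncertainty(token_probs, aggregate="mean"):
    """Aggregate per-token entropies into a single score for a sentence."""
    entropies = [token_entropy(p) for p in token_probs]
    if not entropies:
        return 0.0
    total = sum(entropies)
    return total if aggregate == "sum" else total / len(entropies)

def select_queries(pool, k, aggregate="mean"):
    """Return indices of the k most uncertain sentences in the unlabeled pool.

    `pool` is a list of sentences; each sentence is a list of per-token
    probability vectors (hypothetical model outputs in this sketch).
    """
    scores = [sentence_uncertainty(s, aggregate) for s in pool]
    ranked = sorted(range(len(pool)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]

def annotation_cost(pool, selected):
    """Cost measured two ways: (number of queries, total number of words)."""
    return len(selected), sum(len(pool[i]) for i in selected)

# Toy unlabeled pool with two labels per token (assumed values).
pool = [
    [[0.9, 0.1], [0.8, 0.2]],                          # short, confident
    [[0.5, 0.5]],                                      # short, maximally uncertain
    [[0.6, 0.4], [0.6, 0.4], [0.6, 0.4], [0.6, 0.4]],  # long, moderately uncertain
]

by_mean = select_queries(pool, k=1, aggregate="mean")
by_sum = select_queries(pool, k=1, aggregate="sum")
print(by_mean, annotation_cost(pool, by_mean))
print(by_sum, annotation_cost(pool, by_sum))
```

In this toy pool, both selections cost one query, but summing token entropies picks the four-word sentence while averaging picks the one-word sentence. This shows one way a scoring rule can favor longer queries and so raise the word-level labeling cost; whether the strategies in the paper behave this way for the same reason is not stated in the abstract.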


Pages: 4

50 items in total

[1] Parsing Address Texts with Deep Learning Method
Delil, Selman
Kuyumcu, Birol
Aksakalli, Cuneyt
Akcira, Isa Semih
[J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020
[2] Active learning for deep semantic parsing
Duong, Long
Afshar, Hadi
Estival, Dominique
Pink, Glen
Cohen, Philip
Johnson, Mark
[J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018: 43 - 48
[3] Scaling Address Parsing Sequence Models through Active Learning
Craig, Helen
Yankov, Dragomir
Wang, Renzhong
Berkhin, Pavel
Wu, Wei
[J]. 27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019: 424 - 427
[4] MII: A Novel Text Classification Model Combining Deep Active Learning with BERT
Zhang, Anman
Li, Bohan
Wang, Wenhuan
Wan, Shuo
Chen, Weitong
[J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (03): 1499 - 1514
[5] MII: A novel text classification model combining deep active learning with BERT
Zhang, Anman
Li, Bohan
Wang, Wenhuan
Wan, Shuo
Chen, Weitong
[J]. Computers, Materials and Continua, 2020, 63 (03): 1499 - 1514
[6] Active Learning for BERT: An Empirical Study
Ein-Dor, Liat
Halfon, Alon
Gera, Ariel
Shnarch, Eyal
Dankin, Lena
Choshen, Leshem
Danilevsky, Marina
Aharonov, Ranit
Katz, Yoav
Slonim, Noam
[J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020: 7949 - 7962
[7] Deep Learning for Natural Language Parsing
Jaf, Sardar
Calder, Calum
[J]. IEEE ACCESS, 2019, 7 : 131363 - 131373
[8] Deep Active Learning for Computer Vision Tasks: Methodologies, Applications, and Challenges
Wu, Mingfei
Li, Chen
Yao, Zehuan
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (16)
[9] Deep Human Parsing with Active Template Regression
Liang, Xiaodan
Liu, Si
Shen, Xiaohui
Yang, Jianchao
Liu, Luoqi
Dong, Jian
Lin, Liang
Yan, Shuicheng
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (12) : 2402 - 2414
[10] On the differences between BERT and MT encoder spaces and how to address them in translation tasks
Vazquez, Raul
Celikkanat, Hande
Creutz, Mathias
Tiedemann, Jorg
[J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021: 337 - 347
