Deep Active Learning for Address Parsing Tasks with BERT

被引:1
|
作者
Guler, Berkay [1 ]
Aygun, Betul [2 ]
Gerek, Aydin [2 ]
Gurel, Alaeddin Selcuk [2 ]
机构
[1] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci, Irvine, CA 92697 USA
[2] Huawei Turkey Res & Dev Ctr, Istanbul, Turkiye
关键词
active learning; token classification; address data; BERT;
D O I
10.1109/SIU59756.2023.10223996
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models tend to perform better with larger datasets. With decreasing data handling costs, researchers have the means to gather and store vast amounts of unlabeled data. Supervised learning, on the other hand, requires training data to be labeled by annotators. However, high annotation costs pose challenges to labeling an optimum portion of the available data. One proposed method to mitigate this problem is to employ active learning (AL). AL strategies use a machine learning model to select the most informative and representative samples among unlabeled data points. Here, we demonstrate the effectiveness of uncertainty-based active learning strategies, including a new strategy, for address parsing with a BERT model on an in-house Arabic address dataset manually annotated for two different tasks. We compare AL methods with random sampling and longest-sentence baselines. We show that AL strategies' usefulness greatly depends on dataset characteristics, being less effective on datasets with fewer classes. We conclude that AL for address parsing with BERT decreases annotation costs, if measured in the number of queries. Yet, due to AL methods' tendency to select longer queries, some strategies may increase labeling costs, measured in the total number of words.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Parsing Address Texts with Deep Learning Method
    Delil, Selman
    Kuyumcu, Birol
    Aksakalli, Cuneyt
    Akcira, Isa Semih
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [2] Active learning for deep semantic parsing
    Duong, Long
    Afshar, Hadi
    Estival, Dominique
    Pink, Glen
    Cohen, Philip
    Johnson, Mark
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 43 - 48
  • [3] Scaling Address Parsing Sequence Models through Active Learning
    Craig, Helen
    Yankov, Dragomir
    Wang, Renzhong
    Berkhin, Pavel
    Wu, Wei
    [J]. 27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 424 - 427
  • [4] MII: A Novel Text Classification Model Combining Deep Active Learning with BERT
    Zhang, Anman
    Li, Bohan
    Wang, Wenhuan
    Wan, Shuo
    Chen, Weitong
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (03): : 1499 - 1514
  • [5] MII: A novel text classification model combining deep active learning with BERT
    Zhang, Anman
    Li, Bohan
    Wang, Wenhuan
    Wan, Shuo
    Chen, Weitong
    [J]. Computers, Materials and Continua, 2020, 63 (03): : 1499 - 1514
  • [6] Active Learning for BERT: An Empirical Study
    Ein-Dor, Liat
    Halfon, Alon
    Gera, Ariel
    Shnarch, Eyal
    Dankin, Lena
    Choshen, Leshem
    Danilevsky, Marina
    Aharonov, Ranit
    Katz, Yoav
    Slonim, Noam
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 7949 - 7962
  • [7] Deep Learning for Natural Language Parsing
    Jaf, Sardar
    Calder, Calum
    [J]. IEEE ACCESS, 2019, 7 : 131363 - 131373
  • [8] Deep Active Learning for Computer Vision Tasks: Methodologies, Applications, and Challenges
    Wu, Mingfei
    Li, Chen
    Yao, Zehuan
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (16):
  • [9] Deep Human Parsing with Active Template Regression
    Liang, Xiaodan
    Liu, Si
    Shen, Xiaohui
    Yang, Jianchao
    Liu, Luoqi
    Dong, Jian
    Lin, Liang
    Yan, Shuicheng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (12) : 2402 - 2414
  • [10] On the differences between BERT and MT encoder spaces and how to address them in translation tasks
    Vazquez, Raul
    Celikkanat, Hande
    Creutz, Mathias
    Tiedemann, Jorg
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021, : 337 - 347