Combining Self-supervised Learning and Active Learning for Disfluency Detection

Cited by: 4

Authors:

Wang, Shaolei [1]

Wang, Zhongyuan [1]

Che, Wanxiang [1]

Zhao, Sendong [1]

Liu, Ting [1]

Affiliations:

[1] Harbin Inst Technol, 2 YiKuang St,Tech & Innovat Bldg,HIT Sci Pk, Harbin 150001, Heilongjiang, Peoples R China

Source:

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING | 2022 / Vol. 21 / Issue 03

Funding:

National Key R&D Program of China; National Natural Science Foundation of China;

Keywords:

Disfluency detection; self-supervised learning; active learning; pre-training technology;

DOI:

10.1145/3487290

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Spoken language is fundamentally different from written language in that it contains frequent disfluencies, that is, parts of an utterance that are corrected by the speaker. Disfluency detection (removing these disfluencies) is desirable to clean the input for use in downstream NLP tasks. Most existing approaches to disfluency detection rely heavily on human-annotated data, which is scarce and expensive to obtain in practice. To tackle this training data bottleneck, we investigate methods for combining self-supervised learning and active learning for disfluency detection. First, we construct large-scale pseudo training data by randomly adding or deleting words from unlabeled data and propose two self-supervised pre-training tasks: (i) a tagging task to detect the added noisy words and (ii) a sentence classification task to distinguish original sentences from grammatically incorrect sentences. We then combine these two tasks to jointly pre-train a neural network. The pre-trained neural network is then fine-tuned using human-annotated disfluency detection training data. The self-supervised learning method can capture task-specific knowledge for disfluency detection and achieves better performance when fine-tuned on a small annotated dataset than other supervised methods. However, because the pseudo training data are generated with simple heuristics and cannot cover all disfluency patterns, a performance gap remains relative to supervised models trained on the full training dataset. We further explore how to bridge this gap by integrating active learning into the fine-tuning process. Active learning strives to reduce annotation costs by choosing the most critical examples to label, and it can address the weakness of self-supervised learning with a small annotated dataset. We show that by combining self-supervised learning with active learning, our model is able to match state-of-the-art performance with just about 10% of the original training data on both the commonly used English Switchboard test set and a set of in-house annotated Chinese data.
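
The data construction and example selection described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the noise rates (p_add, p_del), the filler vocabulary, and the entropy-based selection rule are all assumptions made for illustration; the abstract does not state which sampling rates or which active learning criterion the paper uses.

```python
import math
import random


def add_noise(tokens, vocab, rng, p_add=0.15):
    """Tagging task: insert random words; tag 1 = added word, 0 = original word.

    p_add and vocab are illustrative assumptions, not values from the paper.
    """
    k = max(1, round(p_add * len(tokens)))  # always insert at least one word
    noisy, tags = list(tokens), [0] * len(tokens)
    for _ in range(k):
        pos = rng.randrange(len(noisy) + 1)
        noisy.insert(pos, rng.choice(vocab))
        tags.insert(pos, 1)
    return noisy, tags


def delete_noise(tokens, rng, p_del=0.15):
    """Drop random words, keeping at least one word in the sentence."""
    if len(tokens) < 2:
        return list(tokens)
    k = min(len(tokens) - 1, max(1, round(p_del * len(tokens))))
    drop = set(rng.sample(range(len(tokens)), k))
    return [tok for i, tok in enumerate(tokens) if i not in drop]


def sentence_examples(tokens, vocab, rng):
    """Sentence classification task: label 0 = original, 1 = corrupted."""
    if rng.random() < 0.5:
        corrupted = delete_noise(tokens, rng)
    else:
        corrupted = add_noise(tokens, vocab, rng)[0]
    return [(list(tokens), 0), (corrupted, 1)]


def mean_token_entropy(probs):
    """Average binary entropy of per-token disfluency probabilities."""
    def h(p):
        if p <= 0.0 or p >= 1.0:
            return 0.0
        return -(p * math.log(p) + (1 - p) * math.log(1 - p))
    return sum(h(p) for p in probs) / len(probs)


def select_for_annotation(pool_probs, budget):
    """Assumed selection rule: pick the `budget` most uncertain sentences."""
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: mean_token_entropy(pool_probs[i]),
        reverse=True,
    )
    return ranked[:budget]
```

In this sketch, add_noise yields training pairs for the tagging task, sentence_examples yields pairs for the sentence classification task, and select_for_annotation would be called on the fine-tuned model's per-token probabilities over an unlabeled pool to decide which sentences to send to annotators next.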


Pages: 25
