MAPGN: MASKED POINTER-GENERATOR NETWORK FOR SEQUENCE-TO-SEQUENCE PRE-TRAINING

Times Cited: 2
Authors
Ihori, Mana [1 ]
Makishima, Naoki [1 ]
Tanaka, Tomohiro [1 ]
Takashima, Akihiko [1 ]
Orihashi, Shota [1 ]
Masumura, Ryo [1 ]
Affiliations
[1] NTT Corp, NTT Media Intelligence Labs, Tokyo, Japan
Keywords
sequence-to-sequence pre-training; pointer-generator networks; self-supervised learning; spoken-text normalization;
DOI
10.1109/ICASSP39728.2021.9414738
CLC Number
O42 [Acoustics]
Subject Classification Codes
070206; 082403
Abstract
This paper presents a self-supervised learning method for pointer-generator networks to improve spoken-text normalization. Spoken-text normalization, which converts spoken-style text into style-normalized text, is becoming an important technology for improving subsequent processing such as machine translation and summarization. The most successful spoken-text normalization method to date is sequence-to-sequence (seq2seq) mapping using pointer-generator networks, which possess a copy mechanism over the input sequence. However, these models require a large amount of paired spoken-style and style-normalized text, and it is difficult to prepare such a volume of data. To construct a spoken-text normalization model from limited paired data, we focus on self-supervised learning, which can utilize unpaired text data to improve seq2seq models. Unfortunately, conventional self-supervised learning methods do not assume the use of pointer-generator networks. Therefore, we propose a novel self-supervised learning method, MAsked Pointer-Generator Network (MAPGN). The proposed method can effectively pre-train the pointer-generator network by learning to fill masked tokens using the copy mechanism. Our experiments demonstrate that MAPGN is more effective for pointer-generator networks than conventional self-supervised learning methods in two spoken-text normalization tasks.
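The record contains no code. The minimal PyTorch sketch below is only meant to make concrete the two ideas the abstract combines: a pointer-generator output layer that mixes vocabulary generation with copying from the source, and a masked-filling pre-training objective in which masked source tokens must be generated while unmasked tokens can be recovered through the copy path. All names and sizes here (PointerGeneratorHead, masked_fill_loss, HIDDEN, MASK_ID, the toy tensors) are illustrative assumptions, not the authors' implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F

# Illustrative toy sizes and mask-token id (assumptions, not from the paper).
VOCAB, HIDDEN, MASK_ID = 1000, 64, 3

class PointerGeneratorHead(nn.Module):
    """Output layer that mixes a vocabulary distribution with a copy
    distribution over the source tokens, gated by p_gen."""
    def __init__(self, hidden: int, vocab: int):
        super().__init__()
        self.gen_proj = nn.Linear(hidden, vocab)  # produces p_vocab logits
        self.gate = nn.Linear(hidden, 1)          # produces p_gen in [0, 1]

    def forward(self, dec_state, attn_weights, src_ids):
        # dec_state: (B, H), attn_weights: (B, S) over source, src_ids: (B, S)
        p_vocab = F.softmax(self.gen_proj(dec_state), dim=-1)        # (B, V)
        p_gen = torch.sigmoid(self.gate(dec_state))                  # (B, 1)
        # Scatter attention mass onto the vocabulary ids of the source tokens.
        copy_dist = torch.zeros_like(p_vocab).scatter_add_(1, src_ids, attn_weights)
        return p_gen * p_vocab + (1.0 - p_gen) * copy_dist           # (B, V)

def masked_fill_loss(head, dec_states, attn, src_ids, targets):
    """Masked-filling objective: the target is the original sequence, while the
    source contains MASK_ID at some positions, so masked tokens must be generated
    and unmasked tokens can be recovered through the copy path."""
    losses = []
    for t in range(targets.size(1)):  # teacher-forced decoding steps
        p_final = head(dec_states[:, t], attn[:, t], src_ids)
        nll = -torch.log(p_final.gather(1, targets[:, t:t + 1]) + 1e-9)
        losses.append(nll)
    return torch.cat(losses, dim=1).mean()

# Toy usage: random tensors stand in for a real seq2seq encoder/decoder backbone.
B, S, T = 2, 7, 5
head = PointerGeneratorHead(HIDDEN, VOCAB)
dec_states = torch.randn(B, T, HIDDEN)
attn = F.softmax(torch.randn(B, T, S), dim=-1)
src_ids = torch.randint(4, VOCAB, (B, S))
src_ids[:, 2:4] = MASK_ID                    # mask a span of the source
targets = torch.randint(4, VOCAB, (B, T))    # original (unmasked) tokens
print(masked_fill_loss(head, dec_states, attn, src_ids, targets).item())

In an actual MAPGN-style setup the random tensors would be replaced by encoder/decoder states of the real seq2seq model and masked spans of unpaired text; the sketch only shows the shape of the objective.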
Pages: 7563-7567
Number of Pages: 5
Related Papers
50 records in total
  • [1] Improving AMR Parsing with Sequence-to-Sequence Pre-training
    Xu, Dongqin
    Li, Junhui
    Zhu, Muhua
    Zhang, Min
    Zhou, Guodong
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2501 - 2511
  • [2] Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
    Zhou, Wangchunshu
    Ge, Tao
    Xu, Canwen
    Xu, Ke
    Wei, Furu
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 571 - 582
  • [3] Denoising based Sequence-to-Sequence Pre-training for Text Generation
    Wang, Liang
    Zhao, Wei
    Jia, Ruoyu
    Li, Sujian
    Liu, Jingming
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 4003 - 4015
  • [4] MASS: Masked Sequence to Sequence Pre-training for Language Generation
    Song, Kaitao
    Tan, Xu
    Qin, Tao
    Lu, Jianfeng
    Liu, Tie-Yan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [5] ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training
    Qi, Weizhen
    Yan, Yu
    Gong, Yeyun
    Liu, Dayiheng
    Duan, Nan
    Chen, Jiusheng
    Zhang, Ruofei
    Zhou, Ming
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2401 - 2410
  • [6] Code Question Answering via Task-Adaptive Sequence-to-Sequence Pre-training
    Yu, Tingrui
    Gu, Xiaodong
    Shen, Beijun
    [J]. 2022 29TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC, 2022, : 229 - 238
  • [7] SPT-Code: Sequence-to-Sequence Pre-Training for Learning Source Code Representations
    Niu, Changan
    Li, Chuanyi
    Ng, Vincent
    Ge, Jidong
    Huang, Liguo
    Luo, Bin
    [J]. 2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022), 2022, : 2006 - 2018
  • [8] Masked Hard Coverage Mechanism on Pointer-generator Network for Natural Language Generation
    Hu, Ting
    Meinel, Christoph
    [J]. ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 1177 - 1183
  • [9] Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive Bias to Sequence-to-sequence Models
    Mueller, Aaron
    Frank, Robert
    Linzen, Tal
    Wang, Luheng
    Schuster, Sebastian
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 1352 - 1368
  • [10] SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training
    Yan, Hong
    Liu, Yang
    Wei, Yushen
    Li, Zhen
    Li, Guanbin
    Lin, Liang
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5583 - 5595