NCLS: Neural Cross-Lingual Summarization

Authors
Zhu, Junnan [1, 2]
Wang, Qian [1, 2]
Wang, Yining [1, 2]
Zhou, Yu [1, 2]
Zhang, Jiajun [1, 2]
Wang, Shaonan [1, 2]
Zong, Chengqing [1, 2, 3]
Affiliations
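[1] Chinese Academy of Sciences, National Laboratory of Pattern Recognition, Institute of Automation, Beijing, People's Republic of China
[2] University of Chinese Academy of Sciences, Beijing, People's Republic of China
[3] CAS Center for Excellence in Brain Science and Intelligence Technology, Beijing, People's Republic of China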
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cross-lingual summarization (CLS) is the task of producing a summary in one language for a source document in a different language. Existing methods simply divide this task into two steps, summarization and translation, which leads to error propagation. To address this, we present the first end-to-end CLS framework, which we refer to as Neural Cross-Lingual Summarization (NCLS). Moreover, we propose to further improve NCLS by incorporating two related tasks, monolingual summarization and machine translation, into the training process of CLS under multi-task learning. Due to the lack of supervised CLS data, we propose a round-trip translation strategy to acquire two high-quality large-scale CLS datasets based on existing monolingual summarization datasets. Experimental results show that NCLS achieves remarkable improvement over traditional pipeline methods on both English-to-Chinese and Chinese-to-English CLS human-corrected test sets. In addition, NCLS with multi-task learning further significantly improves the quality of the generated summaries. We make our dataset and code publicly available here: http://www.nlpr.ia.ac.cn/cip/dataset.htm.
Pages: 3054-3064
Page count: 11