Domain-oriented Language Modeling with Adaptive Hybrid Masking and Optimal Transport Alignment

被引:1
|
作者
Zhang, Denghui [1 ]
Yuan, Zixuan [1 ]
Liu, Yanchi [2 ]
Liu, Hao [4 ]
Zhuang, Fuzhen [3 ]
Xiong, Hui [1 ]
Chen, Haifeng [2 ]
机构
[1] Rutgers State Univ, New Brunswick, NJ 08854 USA
[2] NEC Labs Amer, Princeton, NJ 08540 USA
[3] Beihang Univ, Inst Artificial Intelligence, Sch Comp Sci, Beijing, Peoples R China
[4] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
Domain language modeling; pre-training; masked language model; optimal transport;
D O I
10.1145/3447548.3467215
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Motivated by the success of pre-trained language models such as BERT in a broad range of natural language processing (NLP) tasks, recent research efforts have been made for adapting these models for different application domains. Along this line, existing domain-oriented models have primarily followed the vanilla BERT architecture, and have a straightforward use of the domain corpus. However, domain-oriented tasks usually require accurate understanding of domain phrases, and such fine-grained phrase-level knowledge is hard to be captured by existing pre-training scheme. Also, the word co-occurrences guided semantic learning of pre-training models can be largely augmented by entity-level association knowledge. But meanwhile, there is a risk of introducing noise due to the lack of groundtruth word-level alignment. To address the above issues, we provide a generalized domain-oriented approach, which leverages auxiliary domain knowledge to improve the existing pre-training framework from two aspects. First, to preserve phrase knowledge effectively, we build a domain phrase pool as auxiliary knowledge, meanwhile we introduce Adaptive I Hybrid Masked Model to incorporate such knowledge. It integrates two learning modes, word learning and phrase learning, and allows them to switch between each other. Second, we introduce Cross Entity Alignment to leverage entity association as weak supervision to augment the semantic learning of pre-trained models. To alleviate the potential noise in this process, we introduce an interpretable, Optimal Transport based approach to guide alignment learning. Experiments on four domain oriented tasks demonstrate the superiority of our framework.
引用
收藏
页码:2145 / 2153
页数:9
相关论文
共 42 条
  • [1] A Domain-Oriented, Java']Java Specification Language
    Duc Minh Le
    [J]. 2015 Seventh International Conference on Knowledge and Systems Engineering (KSE), 2015, : 25 - 30
  • [2] CPA on COLM Authenticated Cipher and the Protection Using Domain-Oriented Masking
    Jahanbani, Mohsen
    Bagheri, Nasour
    Norouzi, Zeinolabedin
    [J]. ISECURE-ISC INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2020, 12 (02): : 67 - 80
  • [3] Domain-Oriented Multilevel Ontology for Adaptive Data Processing
    Man Tianxing
    Stankova, Elena
    Vodyaho, Alexander
    Zhukova, Nataly
    Shichkina, Yulia
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2020, PT I, 2020, 12249 : 634 - 649
  • [4] Domain-oriented variability modeling for reuse of simulation models
    Lee, Hyesun
    Yang, Jin-Seok
    Kang, Kyo Chul
    Pyun, Jai-Jeong
    [J]. SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2014, 90 (04): : 438 - 459
  • [5] Domain-Oriented Masking Compact Masked Hardware Implementations with Arbitrary Protection Order
    Gross, Hannes
    Mangard, Stefan
    Korak, Thomas
    [J]. PROCEEDINGS OF THE 2016 ACM WORKSHOP ON THE THEORY OF IMPLEMENTATION SECURITY (TIS'16), 2016, : 3 - 3
  • [6] Domain-oriented edge-based alignment of protein interaction networks
    Guo, Xin
    Hartemink, Alexander J.
    [J]. BIOINFORMATICS, 2009, 25 (12) : I240 - I246
  • [7] Protecting Triple-DES Against DPA A Practical Application of Domain-Oriented Masking
    Sasdrich, Pascal
    Hutter, Michael
    [J]. CONSTRUCTIVE SIDE-CHANNEL ANALYSIS AND SECURE DESIGN, COSADE 2018, 2018, 10815 : 207 - 226
  • [8] QUERYING DATABASES WITH A DOMAIN-ORIENTED NATURAL-LANGUAGE UNDERSTANDING SYSTEM
    BERNORIO, M
    BERTONI, M
    DABBENE, A
    SOMALVICO, M
    [J]. INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1980, 9 (02): : 141 - 159
  • [9] DomBERT: Domain-oriented Language Model for Aspect-based Sentiment Analysis
    Xu, Hu
    Liu, Bing
    Shu, Lei
    Yu, Philip S.
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1725 - 1731
  • [10] KUSANAGI - A DOMAIN-ORIENTED SPECIFICATION LANGUAGE FOR BUSINESS APPLICATIONS AND ITS DEVELOPMENT ENVIRONMENT
    HIKITA, T
    MATSUMOTO, MJ
    [J]. NEC RESEARCH & DEVELOPMENT, 1995, 36 (03): : 438 - 444