Ensembling Transformers for Cross-domain Automatic Term Extraction

被引:3
|
作者
Hanh Thi Hong Tran [1 ,2 ,3 ]
Martinc, Matej [1 ]
Pelicon, Andraz [1 ]
Doucet, Antoine [3 ]
Pollak, Senja [2 ]
机构
[1] Jozef Stefan Int Postgrad Sch, Jamova Cesta 39, Ljubljana 1000, Slovenia
[2] Jozef Stefan Inst, Jamova Cesta 39, Ljubljana 1000, Slovenia
[3] Univ La Rochelle, 23 Av Albert Einstein, La Rochelle, France
关键词
Automatic term extraction; ATE; Low resource; ACTER; RSDO5; Monolingual; Cross-domain;
D O I
10.1007/978-3-031-21756-2_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic term extraction plays an essential role in domain language understanding and several natural language processing downstream tasks. In this paper, we propose a comparative study on the predictive power of Transformers-based pretrained language models toward term extraction in a multi-language cross-domain setting. Besides evaluating the ability of monolingual models to extract single- and multiword terms, we also experiment with ensembles of mono- and multilingual models by conducting the intersection or union on the term output sets of different language models. Our experiments have been conducted on the ACTER corpus covering four specialized domains (Corruption, Wind energy, Equitation, and Heart failure) and three languages (English, French, and Dutch), and on the RSDO5 Slovenian corpus covering four additional domains (Biomechanics, Chemistry, Veterinary, and Linguistics). The results show that the strategy of employing monolingual models outperforms the state-of-the-art approaches from the related work leveraging multilingual models, regarding all the languages except Dutch and French if the term extraction task excludes the extraction of named entity terms. Furthermore, by combining the outputs of the two best performing models, we achieve significant improvements.
引用
收藏
页码:90 / 100
页数:11
相关论文
共 50 条
  • [41] Domain-invariant feature extraction and fusion for cross-domain person re-identification
    Zhaoqian Jia
    Ye Li
    Zhuofu Tan
    Wenchao Wang
    Zhiguo Wang
    Guangqiang Yin
    The Visual Computer, 2023, 39 : 1205 - 1216
  • [42] Cross-Domain Crawling for Innovation
    Assogna, Pierluigi
    Taglino, Francesco
    ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS (CAISE), 2013, 148 : 286 - 297
  • [43] Cross-Domain Data Fusion
    Yang, Qiang
    COMPUTER, 2016, 49 (04) : 18 - 18
  • [44] Cross-Domain Relation Adaptation
    Kessler, Ido
    Lifshitz, Omri
    Benaim, Sagie
    Wolf, Lior
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [45] Cross-Domain Secure Computation
    Cho, Chongwon
    Garg, Sanjam
    Ostrovsky, Rafail
    PUBLIC-KEY CRYPTOGRAPHY - PKC 2014, 2014, 8383 : 650 - 668
  • [46] Cross-domain network representations
    Xue, Shan
    Lu, Jie
    Zhang, Guangquan
    PATTERN RECOGNITION, 2019, 94 : 135 - 148
  • [47] Cross-Domain Activity Recognition
    Zheng, Vincent Wenchen
    Hu, Derek Hao
    Yang, Qiang
    UBICOMP'09: PROCEEDINGS OF THE 11TH ACM INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING, 2009, : 61 - 70
  • [48] Cross-Domain Sparse Coding
    Wang, Jim Jing-Yan
    Bensmail, Halima
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1461 - 1464
  • [49] Cross-domain fault diagnosis of rotating machinery based on graph feature extraction
    Wang, Pei
    Liu, Jie
    Zhou, Jianzhong
    Duan, Ran
    Jiang, Wei
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (02)
  • [50] Unsupervised Cross-Domain Feature Extraction for Single Blood Cell Image Classification
    Salehi, Raheleh
    Sadafi, Ario
    Gruber, Armin
    Lienemann, Peter
    Navab, Nassir
    Albarqouni, Shadi
    Marr, Carsten
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT III, 2022, 13433 : 739 - 748