Transfer Joint Embedding for Cross-Domain Named Entity Recognition

被引:26
|
作者
Pan, Sinno Jialin [1 ]
Toh, Zhiqiang [1 ]
Su, Jian [1 ]
机构
[1] Inst Infocomm Res, Data Analyt Dept, Singapore 138632, Singapore
关键词
Algorithms; Experimentation; Named entity recognition; transfer learning; multiclass classification;
D O I
10.1145/2457465.2457467
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named Entity Recognition (NER) is a fundamental task in information extraction from unstructured text. Most previous machine-learning-based NER systems are domain-specific, which implies that they may only perform well on some specific domains (e.g., Newswire) but tend to adapt poorly to other related but different domains (e.g., Weblog). Recently, transfer learning techniques have been proposed to NER. However, most transfer learning approaches to NER are developed for binary classification, while NER is a multiclass classification problem in nature. Therefore, one has to first reduce the NER task to multiple binary classification tasks and solve them independently. In this article, we propose a new transfer learning method, named Transfer Joint Embedding (TJE), for cross-domain multiclass classification, which can fully exploit the relationships between classes (labels), and reduce domain difference in data distributions for transfer learning. More specifically, we aim to embed both labels (outputs) and high-dimensional features (inputs) from different domains (e.g., a source domain and a target domain) into a unified low-dimensional latent space, where 1) each label is represented by a prototype and the intrinsic relationships between labels can be measured by Euclidean distance; 2) the distance in data distributions between the source and target domains can be reduced; 3) the source domain labeled data are closer to their corresponding label-prototypes than others. After the latent space is learned, classification on the target domain data can be done with the simple nearest neighbor rule in the latent space. Furthermore, in order to scale up TJE, we propose an efficient algorithm based on stochastic gradient descent (SGD). Finally, we apply the proposed TJE method for NER across different domains on the ACE 2005 dataset, which is a benchmark in Natural Language Processing (NLP). Experimental results demonstrate the effectiveness of TJE and show that TJE can outperform state-of-the-art transfer learning approaches to NER.
引用
收藏
页数:27
相关论文
共 50 条
  • [41] Stratified Transfer Learning for Cross-domain Activity Recognition
    Wang, Jindong
    Chen, Yiqiang
    Hu, Lisha
    Peng, Xiaohui
    Yu, Philip S.
    2018 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS (PERCOM), 2018, : 115 - 124
  • [42] Deep Transfer Learning for Cross-domain Activity Recognition
    Wang, Jindong
    Zheng, Vincent W.
    Chen, Yiqiang
    Huang, Meiyu
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON CROWD SCIENCE AND ENGINEERING (ICCSE 2018), 2018,
  • [43] Cross-domain activity recognition via transfer learning
    Hu, Derek Hao
    Zheng, Vincent Wenchen
    Yang, Qiang
    PERVASIVE AND MOBILE COMPUTING, 2011, 7 (03) : 344 - 358
  • [44] Adversarial Named Entity Recognition with POS label embedding
    Bai, Yuxuan
    Wang, Yu
    Xia, Bin
    Li, Yun
    Zhu, Ziye
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [45] Common Latent Embedding Space for Cross-Domain Facial Expression Recognition
    Wang, Run
    Song, Peng
    Li, Shaokai
    Ji, Liang
    Zheng, Wenming
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 2046 - 2056
  • [46] Joint Speech Translation and Named Entity Recognition
    Gaido, Marco
    Papi, Sara
    Negri, Matteo
    Turchi, Marco
    INTERSPEECH 2023, 2023, : 47 - 51
  • [47] Named Entity Recognition in the Domain of Geographical Subject
    Xu, Feifei
    Li, Huiying
    Li, Xuelian
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 2229 - 2234
  • [48] A framework for Named Entity Recognition in the Open domain
    Evans, RJ
    RECENT ADVANCES IN NATURAL LANGUAGE PROCESSING III, 2004, 260 : 267 - 276
  • [49] Named Entity Recognition System for the Biomedical Domain
    Sharma, Raghav
    Chauhan, Deependra
    Sharma, Raksha
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 837 - 840
  • [50] Named Entity Recognition in a Very Homogeneous Domain
    Agarwal, Oshin
    Nenkova, Ani
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1850 - 1855