Transfer Joint Embedding for Cross-Domain Named Entity Recognition

Cited by: 26

Authors:

Pan, Sinno Jialin [1]

Toh, Zhiqiang [1]

Su, Jian [1]

Affiliations:

[1] Inst Infocomm Res, Data Analyt Dept, Singapore 138632, Singapore

Source:

ACM TRANSACTIONS ON INFORMATION SYSTEMS | 2013 / Volume 31 / Issue 02

Keywords:

Algorithms; Experimentation; Named entity recognition; transfer learning; multiclass classification

DOI:

10.1145/2457465.2457467

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Named Entity Recognition (NER) is a fundamental task in information extraction from unstructured text. Most previous machine-learning-based NER systems are domain-specific, which implies that they may only perform well on some specific domains (e.g., Newswire) but tend to adapt poorly to other related but different domains (e.g., Weblog). Recently, transfer learning techniques have been applied to NER. However, most transfer learning approaches to NER are developed for binary classification, while NER is a multiclass classification problem in nature. Therefore, one has to first reduce the NER task to multiple binary classification tasks and solve them independently. In this article, we propose a new transfer learning method, named Transfer Joint Embedding (TJE), for cross-domain multiclass classification, which can fully exploit the relationships between classes (labels) and reduce the domain difference in data distributions for transfer learning. More specifically, we aim to embed both labels (outputs) and high-dimensional features (inputs) from different domains (e.g., a source domain and a target domain) into a unified low-dimensional latent space, where 1) each label is represented by a prototype and the intrinsic relationships between labels can be measured by Euclidean distance; 2) the distance in data distributions between the source and target domains can be reduced; 3) the source domain labeled data are closer to their corresponding label-prototypes than to the others. After the latent space is learned, classification on the target domain data can be done with the simple nearest neighbor rule in the latent space. Furthermore, in order to scale up TJE, we propose an efficient algorithm based on stochastic gradient descent (SGD). Finally, we apply the proposed TJE method to NER across different domains on the ACE 2005 dataset, which is a benchmark in Natural Language Processing (NLP). Experimental results demonstrate the effectiveness of TJE and show that TJE can outperform state-of-the-art transfer learning approaches to NER.
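
The nearest neighbor rule mentioned in the abstract can be illustrated with a minimal sketch. It covers only the prediction step, after a latent space has already been learned; it is not the authors' implementation. The linear projection matrix `W`, the label names, and all numeric values are assumptions made for illustration, since the abstract does not specify the form of the embedding or how it is trained.

```python
import math

def project(W, x):
    """Map a feature vector x (length d) into the latent space.
    W is a k-by-d matrix given as a list of rows; a linear map is an assumption."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def euclidean(a, b):
    """Euclidean distance between two latent vectors of equal length."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

def predict(W, prototypes, x):
    """Return the label whose prototype is nearest to the embedded input."""
    z = project(W, x)
    return min(prototypes, key=lambda label: euclidean(z, prototypes[label]))

# Toy, hand-picked values (hypothetical, not learned from data).
W = [[1.0, 0.0, 0.5],
     [0.0, 1.0, 0.5]]
prototypes = {"PER": [1.0, 0.0], "ORG": [0.0, 1.0], "O": [0.0, 0.0]}

print(predict(W, prototypes, [0.9, 0.1, 0.0]))  # prints "PER"
```

In this sketch each label is a point in the same space as the embedded inputs, so the distance between two prototypes can also be read as a measure of how related the two labels are, which is the property described in item 1) of the abstract.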


Pages: 27

50 records in total

[1] Dynamically Transfer Entity Span Information for Cross-domain Chinese Named Entity Recognition
Wu B.-C.
Deng C.-L.
Guan B.
Chen X.-L.
Zan D.-G.
Chang Z.-J.
Xiao Z.-Y.
Qu D.-C.
Wang Y.-J.
Ruan Jian Xue Bao/Journal of Software, 2022, 33 (10): 3776 - 3792
[2] Data Augmentation for Cross-Domain Named Entity Recognition
Chen, Shuguang
Aguilar, Gustavo
Neves, Leonardo
Solorio, Thamar
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021: 5346 - 5356
[3] CrossNER: Evaluating Cross-Domain Named Entity Recognition
Liu, Zihan
Xu, Yan
Yu, Tiezheng
Dai, Wenliang
Ji, Ziwei
Cahyawijaya, Samuel
Madotto, Andrea
Fung, Pascale
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35: 13452 - 13460
[4] Zero-Resource Cross-Domain Named Entity Recognition
Liu, Zihan
Winata, Genta Indra
Fung, Pascale
5TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2020), 2020: 1 - 6
[5] Dual Contrastive Learning for Cross-Domain Named Entity Recognition
Xu, Jingyun
Yu, Junnan
Cai, Yi
Chua, Tat-Seng
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (06)
[6] Cross-domain Named Entity Recognition via Graph Matching
Zheng, Junhao
Chen, Haibin
Ma, Qianli
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022: 2670 - 2680
[7] Neural Adaptation Layers for Cross-domain Named Entity Recognition
Lin, Bill Yuchen
Lu, Wei
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018: 2012 - 2022
[8] Domain-Adapted Dependency Parsing for Cross-Domain Named Entity Recognition
Dou, Chenxiao
Sun, Xianghui
Wang, Yaoshu
Ji, Yunjie
Ma, Baochang
Li, Xiangang
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023: 12737 - 12744
[9] Exploring Modular Task Decomposition in Cross-domain Named Entity Recognition
Zhang, Xinghua
Yu, Bowen
Wang, Yubin
Liu, Tingwen
Su, Taoyu
Xu, Hongbo
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022: 301 - 311
[10] Causal Relationship Representation Enhanced Cross-Domain Named Entity Recognition
Liu, Xiaoming
Cao, Mengyuan
Yang, Guan
Liu, Jie
Wang, Hang
Computer Engineering and Applications, 60 (18): 176 - 188
