Cross-Domain and Semisupervised Named Entity Recognition in Chinese Social Media: A Unified Model

被引:21
|
作者
Xu, Jingjing [1 ]
He, Hangfeng [2 ]
Sun, Xu [1 ]
Ren, Xuancheng [1 ]
Li, Sujian [1 ]
机构
[1] Peking Univ, MOE Key Lab Computat Linguist, Beijing 100871, Peoples R China
[2] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
基金
中国国家自然科学基金;
关键词
Named entity recognition; Chinese social media; cross-domain learning; semi-supervised learning;
D O I
10.1109/TASLP.2018.2856625
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Named entity recognition (NER) in Chinese social media is an important, but challenging task because Chinese social media language is informal and noisy. Most previous methods on NER focus on in-domain supervised learning, which is limited by scarce annotated data in social media. In this paper, we present that sufficient corpora in formal domains and massive unannotated text can be combined to improve the NER performance in social media. We propose a unified model which can learn from out-of-domain corpora and in-domain unannotated text. The unified model is composed of two parts. One is for cross-domain learning and the other is for semisupervised learning. Cross-domain learning can learn out-of-domain information based on domain similarity. Semisupervised learning can learn in-domain unannotated information by self-training. Experimental results show that our unified model yields a 9.57% improvement over strong baselines and achieves the state-of-the-art performance.
引用
收藏
页码:2142 / 2152
页数:11
相关论文
共 50 条
  • [1] A Unified Model for Cross-Domain and Semi-Supervised Named Entity Recognition in Chinese Social Media
    He, Hangfeng
    Sun, Xu
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3216 - 3222
  • [2] Dynamically Transfer Entity Span Information for Cross-domain Chinese Named Entity Recognition
    Wu B.-C.
    Deng C.-L.
    Guan B.
    Chen X.-L.
    Zan D.-G.
    Chang Z.-J.
    Xiao Z.-Y.
    Qu D.-C.
    Wang Y.-J.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (10): : 3776 - 3792
  • [3] Data Augmentation for Cross-Domain Named Entity Recognition
    Chen, Shuguang
    Aguilar, Gustavo
    Neves, Leonardo
    Solorio, Thamar
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5346 - 5356
  • [4] CrossNER: Evaluating Cross-Domain Named Entity Recognition
    Liu, Zihan
    Xu, Yan
    Yu, Tiezheng
    Dai, Wenliang
    Ji, Ziwei
    Cahyawijaya, Samuel
    Madotto, Andrea
    Fung, Pascale
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13452 - 13460
  • [5] Zero-Resource Cross-Domain Named Entity Recognition
    Liu, Zihan
    Winata, Genta Indra
    Fung, Pascale
    5TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2020), 2020, : 1 - 6
  • [6] Dual Contrastive Learning for Cross-Domain Named Entity Recognition
    Xu, Jingyun
    Yu, Junnan
    Cai, Yi
    Chua, Tat-Seng
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (06)
  • [7] Transfer Joint Embedding for Cross-Domain Named Entity Recognition
    Pan, Sinno Jialin
    Toh, Zhiqiang
    Su, Jian
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2013, 31 (02)
  • [8] Cross-domain Named Entity Recognition via Graph Matching
    Zheng, Junhao
    Chen, Haibin
    Ma, Qianli
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 2670 - 2680
  • [9] Neural Adaptation Layers for Cross-domain Named Entity Recognition
    Lin, Bill Yuchen
    Lu, Wei
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2012 - 2022
  • [10] Domain-Adapted Dependency Parsing for Cross-Domain Named Entity Recognition
    Dou, Chenxiao
    Sun, Xianghui
    Wang, Yaoshu
    Ji, Yunjie
    Ma, Baochang
    Li, Xiangang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 12737 - 12744