Cross-Domain Labeled LDA for Cross-Domain Text Classification

被引:10
|
作者
Jing, Baoyu [1 ]
Lu, Chenwei [2 ]
Wang, Deqing [2 ]
Zhuang, Fuzhen [3 ,4 ]
Niu, Cheng [5 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[3] Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing, Peoples R China
[4] Univ Chinese Acad Sci, Beijing, Peoples R China
[5] Pattern Recognit Ctr, WeChat Search Applicat Dept, Tencent, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross Domain Text Classification; Topic Modeling; Group Alignment; Partial Supervision;
D O I
10.1109/ICDM.2018.00034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-domain text classification aims at building a classifier for a target domain which leverages data from both source and target domain. One promising idea is to minimize the feature distribution differences of the two domains. Most existing studies explicitly minimize such differences by an exact alignment mechanism (aligning features by one-to-one feature alignment, projection matrix etc.). Such exact alignment, however, will restrict models' learning ability and will further impair models' performance on classification tasks when the semantic distributions of different domains are very different. To address this problem, we propose a novel group alignment which aligns the semantics at group level. In addition, to help the model learn better semantic groups and semantics within these groups, we also propose a partial supervision for model's learning in source domain. To this end, we embed the group alignment and a partial supervision into a cross-domain topic model, and propose a Cross-Domain Labeled LDA (CDL-LDA). On the standard 20Newsgroup and Reuters dataset, extensive quantitative (classification, perplexity etc.) and qualitative (topic detection) experiments are conducted to show the effectiveness of the proposed group alignment and partial supervision.
引用
收藏
页码:187 / 196
页数:10
相关论文
共 50 条
  • [1] Iterative Reinforcement Cross-Domain Text Classification
    Zhang, Di
    Xue, Gui-Rong
    Yu, Yong
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2008, 5139 : 282 - 293
  • [2] Cross-domain knowledge distillation for text classification
    Zhang, Shaokang
    Jiang, Lei
    Tan, Jianlong
    [J]. NEUROCOMPUTING, 2022, 509 : 11 - 20
  • [3] Cross-Domain Text Classification Based on BERT Model
    Zhang, Kuan
    Hei, Xinhong
    Fei, Rong
    Guo, Yufan
    Jiao, Rui
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS: DASFAA 2021 INTERNATIONAL WORKSHOPS, 2021, 12680 : 197 - 208
  • [4] Research Progress on Cross-domain Text Sentiment Classification
    Zhao, Chuan-Jun
    Wang, Su-Ge
    Li, De-Yu
    [J]. Ruan Jian Xue Bao/Journal of Software, 2020, 31 (06): : 1723 - 1746
  • [5] Cross-Domain NER using Cross-Domain Language Modeling
    Jia, Chen
    Liang, Xiaobo
    Zhang, Yue
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2464 - 2474
  • [6] Cross-Domain Text Sentiment Classification Based on Wasserstein Distance
    Cai, Guoyong
    Lin, Qiang
    Chen, Nannan
    [J]. SECURITY WITH INTELLIGENT COMPUTING AND BIG-DATA SERVICES, 2020, 895 : 280 - 291
  • [7] A Structure-Aware Method for Cross-domain Text Classification
    Zhang, Yuhong
    Qian, Lin
    Zhang, Qi
    Li, Peipei
    Liu, Guocheng
    [J]. PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2022, 13630 : 283 - 296
  • [8] Automatic Classification of Cross-Domain Opinions
    Guzman Cabrera, Rafael
    [J]. COMPUTACION Y SISTEMAS, 2019, 23 (04): : 1541 - 1548
  • [9] Cross-Domain Collaborative Filtering with Review Text
    Xin, Xin
    Liu, Zhirun
    Lin, Chin-Yew
    Huang, Heyan
    Wei, Xiaochi
    Guo, Ping
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1827 - 1833
  • [10] Unsupervised Energy-based Adversarial Domain Adaptation for Cross-domain Text Classification
    Zou, Han
    Yang, Jianfei
    Wu, Xiaojian
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1208 - 1218