Text Length Adaptation in Sentiment Classification

被引:0
|
作者
Amplayo, Reinald Kim [1 ,3 ]
Lim, Seonjae [2 ]
Hwang, Seung-won [3 ]
机构
[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[2] Samsung Elect, Seoul, South Korea
[3] Yonsei Univ, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Can a text classifier generalize well for datasets where the text length is different? For example, when short reviews are sentiment-labeled, can these transfer to predict the sentiment of long reviews (i.e., short to long transfer), or vice versa? While unsupervised transfer learning has been well-studied for cross domain/lingual transfer tasks, Cross Length Transfer (CLT) has not yet been explored. One reason is the assumption that length difference is trivially transferable in classification. We show that it is not, because short/long texts differ in context richness and word intensity. We devise new benchmark datasets from diverse domains and languages, and show that existing models from similar tasks cannot deal with the unique challenge of transferring across text lengths. We introduce a strong baseline model called BaggedCNN that treats long texts as bags containing short texts. We propose a state-of-the-art CLT model called Length Transfer Networks (LeTraNets) that introduces a two-way encoding scheme for short and long texts using multiple training mechanisms. We test our models and find that existing models perform worse than the BaggedCNN baseline, while LeTraNets outperforms all models.
引用
收藏
页码:646 / 661
页数:16
相关论文
共 50 条
  • [1] A Text Classifier with Domain Adaptation for Sentiment Classification
    Chen, Wei
    Zhou, Jingyu
    INFORMATION RETRIEVAL TECHNOLOGY, 2010, 6458 : 61 - 72
  • [2] Sentiment classification of Hinglish text
    Ravi, Kumar
    Ravi, Vadlamani
    2016 3RD INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN INFORMATION TECHNOLOGY (RAIT), 2016, : 641 - 645
  • [3] Automated Classification of Text Sentiment
    Dufourq, Emmanuel
    Bassett, Bruce A.
    SOUTH AFRICAN INSTITUTE OF COMPUTER SCIENTISTS AND INFORMATION TECHNOLOGISTS (SACSIT 2017), 2017, : 96 - +
  • [4] Connecting Text Classification with Image Classification: A New Preprocessing Method for Implicit Sentiment Text Classification
    Chen, Meikang
    Ubul, Kurban
    Xu, Xuebin
    Aysa, Alimjan
    Muhammat, Mahpirat
    SENSORS, 2022, 22 (05)
  • [5] Timeline Adaptation for Text Classification
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    Takasu, Atsuhiro
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1517 - 1520
  • [6] An Adaptive Text Representation Method for Sentiment Classification
    Zhao, Huan
    Zhang, Xi-xiang
    Chen, Zuo
    2015 INTERNATIONAL CONFERENCE ON SOFTWARE, MULTIMEDIA AND COMMUNICATION ENGINEERING (SMCE 2015), 2015, : 148 - 157
  • [7] Text sentiment classification based on feature fusion
    Zhang C.
    Li Q.
    Cheng X.
    Revue d'Intelligence Artificielle, 2020, 34 (04) : 515 - 520
  • [8] Text Mining: Sentiment Analysis on news classification
    Gomes, Helder
    Neto, Miguel de Castro
    Henriques, Roberto
    PROCEEDINGS OF THE 2013 8TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI 2013), 2013,
  • [9] Tibetan text Sentiment Classification Based on Rules
    Huang, Tao
    Yan, Xiaodong
    PROCEEDINGS 2015 18TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2015), 2015, : 566 - 569
  • [10] Hierarchical Classification in Text Mining for Sentiment Analysis
    Li, Jinyan
    Fong, Simon
    Zhuang, Yan
    Khoury, Richard
    2014 INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE ISCMI 2014, 2014, : 46 - 51