Data Quality Controlling for Cross-Lingual Sentiment Classification

被引:2
|
作者
Li, Shoushan [1 ,2 ]
Xue, Yunxia [1 ]
Wang, Zhongqing [1 ]
Lee, Sophia Yat Mei [2 ]
Huang, Chu-Ren [2 ]
机构
[1] Soochow Univ, Nat Language Proc Lab, Suzhou, Peoples R China
[2] Hong Kong Polytech Univ, Dept Chinese & Bilingual Studies, Hong Kong, Peoples R China
关键词
D O I
10.1109/IALP.2013.43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-lingual sentiment classification aims to perform sentiment classification in a language (named as the target language) with the help of the resources from another language (named as the source language). Previous studies are prone to using all available data in the source language while using all data is observed to perform no better or even worse than using a partion of good data. In this paper, we propose a novel task called data quality controlling in the source language to select high quality samples from the source language. To tackle this task, we propose two kinds of data quality measurements: intra- and extra-quality measurements which are implemented with the certainty and similarity measurements respectively. The empirical studies demonstrate the effectiveness of the proposed approach to data quality controlling in the source language.
引用
收藏
页码:125 / 128
页数:4
相关论文
共 50 条
  • [1] Reinforced Transformer with Cross-Lingual Distillation for Cross-Lingual Aspect Sentiment Classification
    Wu, Hanqian
    Wang, Zhike
    Qing, Feng
    Li, Shoushan
    [J]. ELECTRONICS, 2021, 10 (03) : 1 - 14
  • [2] A Comparative Study of Cross-Lingual Sentiment Classification
    Wan, Xiaojun
    [J]. 2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 24 - 31
  • [3] Cross-lingual sentiment classification with stacked autoencoders
    Guangyou Zhou
    Zhiyuan Zhu
    Tingting He
    Xiaohua Tony Hu
    [J]. Knowledge and Information Systems, 2016, 47 : 27 - 44
  • [4] Cross-lingual sentiment classification with stacked autoencoders
    Zhou, Guangyou
    Zhu, Zhiyuan
    He, Tingting
    Hu, Xiaohua Tony
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 47 (01) : 27 - 44
  • [5] Active Learning for Cross-Lingual Sentiment Classification
    Li, Shoushan
    Wang, Rong
    Liu, Huanhuan
    Huang, Chu-Ren
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 236 - 246
  • [6] Cross-Lingual Sentiment Relation Capturing for Cross-Lingual Sentiment Analysis
    Chen, Qiang
    Li, Wenjie
    Lei, Yu
    Liu, Xule
    Luo, Chuwei
    He, Yanxiang
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 54 - 67
  • [7] Cross-lingual sentiment classification: Similarity discovery plus training data adjustment
    Zhang, Peng
    Wang, Suge
    Li, Deyu
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 107 : 129 - 141
  • [8] A Subspace Learning Framework For Cross-Lingual Sentiment Classification With Partial Parallel Data
    Zhou, Guangyou
    He, Tingting
    Zhao, Jun
    Wu, Wensheng
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1426 - 1432
  • [9] Cross-Lingual Classification of Crisis Data
    Khare, Prashant
    Burel, Gregoire
    Maynard, Diana
    Alani, Harith
    [J]. SEMANTIC WEB - ISWC 2018, PT I, 2018, 11136 : 617 - 633
  • [10] Cross-Lingual Sentiment Quantification
    Esuli, Andrea
    Moreo, Alejandro
    Sebastiani, Fabrizio
    [J]. IEEE INTELLIGENT SYSTEMS, 2020, 35 (03) : 106 - 113