Sentimental feature selection for sentiment analysis of Chinese online reviews

被引:74
|
作者
Zheng, Lijuan [1 ,2 ]
Wang, Hongwei [2 ]
Gao, Song [2 ]
机构
[1] Liaocheng Univ, Sch Business, Liaocheng 252000, Peoples R China
[2] Tongji Univ, Sch Econ & Management, Shanghai 200092, Peoples R China
关键词
Online reviews; Sentiment; Feature selection; Statistical machine learning; CLASSIFICATION;
D O I
10.1007/s13042-015-0347-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the growing availability and popularity of online reviews, the sentiment analysis arises in response to the requirement of organizing useful information in speed. Feature selection directly affects the representation of online reviews and brings a lot of challenges to the domain of sentiment analysis. However, little attention has been paid to feature selection of Chinese online reviews so far. Therefore, we are motivated to explore the effects of feature selection on sentiment analysis of Chinese online reviews. Firstly, N-char-grams and N-POS-grams are selected as the potential sentimental features. Then, the improved Document Frequency method is used to select feature subsets, and the Boolean Weighting method is adopted to calculate feature weight. At last, experiments based on online reviews of mobile phone are conducted, and Chi-square test is carried out to test the significance of experimental results. The results suggest that sentiment analysis of Chinese online reviews obtains higher accuracy when taking 4-POS-grams as features. Besides that, low order N-char-grams can achieve a better performance than high order N-char-grams when taking N-char-grams as features. Furthermore, the improved document frequency achieves significant improvement in sentiment analysis of Chinese online reviews.
引用
收藏
页码:75 / 84
页数:10
相关论文
共 50 条
  • [21] A machine learning-based sentiment analysis of online product reviews with a novel term weighting and feature selection approach
    Zhao, Huiliang
    Liu, Zhenghong
    Yao, Xuemei
    Yang, Qin
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (05)
  • [22] Feature Selection using Particle Swarm Optimization for Sentiment Analysis of Drug Reviews
    Asri, Afifah Mohd
    Ahmad, Siti Rohaidah
    Yusop, Nurhafizah Moziyana Mohd
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 286 - 295
  • [23] A Novel Feature-based Method for Sentiment Analysis of Chinese Product Reviews
    Liu Lizhen
    Song Wei
    Wang Hanshi
    Li Chuchu
    Lu Jingli
    CHINA COMMUNICATIONS, 2014, 11 (03) : 154 - 164
  • [24] An Unsupervised Fine-grained Sentiment Analysis Model for Chinese Online Reviews
    Shi, Hanxiao
    Zhou, Guodong
    Qian, Peide
    Li, Xiaojun
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (10): : 4277 - 4294
  • [25] Feature Based Sentiment Analysis for Service Reviews
    Abirami, Ariyur Mahadevan
    Askarunisa, Abdulkhader
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2016, 22 (05) : 650 - 670
  • [26] Feature level Sentiment Analysis on Movie Reviews
    Sharma, Pallavi
    Mishra, Nidhi
    PROCEEDINGS ON 2016 2ND INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2016, : 306 - 311
  • [27] FEATURE SELECTION USING IMPROVED SHUFFLED FROG ALGORITHM FOR SENTIMENT ANALYSIS OF BOOK REVIEWS
    Madhusudhanan
    Srivatsa
    IIOAB JOURNAL, 2016, 7 (09) : 526 - 534
  • [28] Interactions Between Term Weighting and Feature Selection Methods on the Sentiment Analysis of Turkish Reviews
    Parlar, Tuba
    Ozel, Selma Ayse
    Song, Fei
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT II, 2018, 9624 : 335 - 346
  • [29] Sentiment Analysis of Japanese Tourism Online Reviews
    Chuanming Yu
    Xingyu Zhu
    Bolin Feng
    Lin Cai
    Lu An
    Journal of Data and Information Science, 2019, (01) : 89 - 113
  • [30] Sentiment Analysis of Japanese Tourism Online Reviews
    Yu, Chuanming
    Zhu, Xingyu
    Feng, Bolin
    Cai, Lin
    An, Lu
    JOURNAL OF DATA AND INFORMATION SCIENCE, 2019, 4 (01) : 89 - 113