Opinion mining using principal component analysis based ensemble model for e-commerce application

被引:2
|
作者
G. Vinodhini
R M Chandrasekaran
机构
[1] Annamalai University,Department of Computer Science and Engineering
关键词
Opinion; Classification; Unigram; N-grams; Feature; Mining; Reviews;
D O I
10.1007/s40012-014-0055-3
中图分类号
学科分类号
摘要
With the rapid expansion of e-commerce over the decades, more and more product reviews emerge on e-commerce sites. In order to effectively utilize the information available in the form of reviews, an automatic opinion mining system is needed to organize the reviews and to help the users and organizations in making an informed decision about the products. Opinion mining systems based on machine learning approaches are used to categorize the reviews containing the customer opinion into positive or negative reviews. In this paper we explore this new research area of applying a hybrid combination of machine learning approaches tied with principal component analysis as a feature reduction technique. We introduce two hybrid ensemble based models (i.e. bagging and bayesian boosting based) for opinion classification. The results are compared with two individual classifier models based on statistical learning (i.e. logistic regression and support vector machine) using a dataset of product reviews. The other objective is to compare the influence of using different n-gram schemes (unigrams, bigrams and trigrams). We found that ensemble based hybrid methods perform better in terms of various quality measures in classifying the opinion into positive and negative reviews. We also applied a pairwise statistical test to compare the significance of the classifiers.
引用
收藏
页码:169 / 179
页数:10
相关论文
共 50 条
  • [31] Research of Data Mining Based on E-Commerce
    Li Yong-hong
    Liu Xiao-liang
    ICCSIT 2010 - 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 4, 2010, : 719 - 722
  • [32] Application Research of Web Log Mining in the E-commerce
    Wang, Yuanyuan
    Liu, Hailin
    Liu, Qianqian
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 349 - 352
  • [33] The Application of Visualization Technology on E-commerce Data Mining
    Zhang, Fangfang
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL II, PROCEEDINGS, 2008, : 563 - 566
  • [34] An Application of Data Mining System in Tourism E-commerce
    Jiang, Hua
    Cui, Zhenxing
    PROCEEDINGS OF 2009 CONFERENCE ON SYSTEMS SCIENCE, MANAGEMENT SCIENCE & SYSTEM DYNAMICS, VOL 3, 2009, : 83 - 86
  • [35] Application of parallel and distributed data mining in e-commerce
    Beg, MMS
    Ravikumar, CP
    IETE TECHNICAL REVIEW, 2000, 17 (04): : 189 - 195
  • [36] The Application of Web Usage Mining In E-commerce Security
    Tamimi, Reyhaneh
    Ebrahim, Mohammad
    Mohammadpourzarandi
    2013 7TH INTERNATIONAL CONFERENCE ON E-COMMERCE IN DEVELOPING COUNTRIES: WITH FOCUS ON E-SECURITY (ECDC), 2013,
  • [37] The Application of Clustering Mining Technology in E-commerce Website
    Li, Bin
    Chen, Yeh-Cheng
    2018 9TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP 2018), 2018, : 104 - 107
  • [38] The Research of Dynamic Mining Technology in the Application of E-Commerce
    Gao, Rencai
    ADVANCES IN COMPUTER SCIENCE, ENVIRONMENT, ECOINFORMATICS, AND EDUCATION, PT II, 2011, 215 : 477 - 481
  • [39] Application of parallel and distributed data mining in e-Commerce
    Sufyan Beg, M.M.
    Ravikumar, C.P.
    IETE Technical Review (Institution of Electronics and Telecommunication Engineers, India), 2000, 17 (04): : 189 - 195
  • [40] Application of sensor-based speech data mining in E-commerce operations data analysis
    Yao, Kang
    Measurement: Sensors, 2024, 33