Improving Sentiment Classification Accuracy of Financial News using N-gram Approach and Feature Weighting Methods

被引:0
|
作者
Foroozan, S. [1 ]
Murad, M. A. Azmi [1 ]
Sharef, N. M. [1 ]
Latiff, A. R. Abdul [2 ]
机构
[1] Univ Putra Malaysia, Fac Comp Sci & IT, Serdang 43400, Malaysia
[2] Univ Putra Malaysia, Putra Business Sch, Serdang 43400, Malaysia
关键词
sentiment classification; financial news; support vector machine; Radial Basis Function; linear kernel; document frequency; TF-IDF;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment classification of financial news deals with the identification of positive and negative news so that they can be applied in decision support system to perform stock trend predictions. This paper explores several types of feature space as different datasets for sentiment classification of the news article. Experiments are conducted based on n-gram approach (unigram, bigram and the combination of unigram and bigram) used as feature extraction with different feature weighting methods, while, document frequency (DF) is used as feature selection method. We performed experiments to measure the classification accuracy of support vector machine (SVM) with two kernel methods of linear and Radial Basis Function (RBF). Results showed that an efficient feature extraction increased classification accuracy when it is used as a combination of unigram and bigram. Moreover, we also found that DF can be applied as a dimension reduction method to reduce the feature space without loss of accuracy.
引用
收藏
页码:211 / 214
页数:4
相关论文
共 50 条
  • [1] Classification of sentiment reviews using n-gram machine learning approach
    Tripathy, Abinash
    Agrawal, Ankit
    Rath, Santanu Kumar
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 57 : 117 - 126
  • [2] Sentiment Analysis Using N-gram Technique
    Chidananda, Himadri Tanaya
    Das, Debashis
    Sagnika, Santwana
    [J]. PROGRESS IN COMPUTING, ANALYTICS AND NETWORKING, ICCAN 2017, 2018, 710 : 359 - 367
  • [3] Classification of facemarks using N-gram
    Yamada, Thichi
    Tsuchiya, Seiji
    Kuroiwa, Shiongo
    Ren, Fuji
    [J]. PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 322 - +
  • [4] Alphabet Flatting as a variant of n-gram feature extraction method in ensemble classification of fake news
    Ksieniewicz, Pawel
    Zyblewski, Pawel
    Borek-Marciniec, Weronika
    Kozik, Rafal
    Choras, Michal
    Wozniak, Michal
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [5] Sentiment Classification Using N-Gram Inverse Document Frequency and Automated Machine Learning
    Maipradit, Rungroj
    Hata, Hideaki
    Matsumoto, Kenichi
    [J]. IEEE SOFTWARE, 2019, 36 (05) : 65 - 70
  • [6] Lipreading Using n-Gram Feature Vector
    Singh, Preety
    Laxmi, Vijay
    Gupta, Deepika
    Gaur, M. S.
    [J]. COMPUTATIONAL INTELLIGENCE IN SECURITY FOR INFORMATION SYSTEMS 2010, 2010, 85 : 81 - 88
  • [7] N-gram Based WSD for Improving Accuracy of Machine Translation using TM
    Rawat, Sunita
    Chandak, Manoj
    Khan, Tabassum
    [J]. HELIX, 2018, 8 (05): : 3916 - 3918
  • [8] Sentiment Analysis Approach Based on N-gram and KNN Classifier
    Kaur, Sumandeep
    Sikka, Geeta
    Awasthi, Lalit Kumar
    [J]. 2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), 2018, : 13 - 16
  • [9] Short Text Classification Based on Feature Extension Using The N-Gram Model
    Zhang, Xinwei
    Wu, Bin
    [J]. 2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 710 - 716
  • [10] Partitioning Based N-Gram Feature Selection for Malware Classification
    Hu, Weiwei
    Tan, Ying
    [J]. DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 187 - 195