Machine Learning-based USD/PKR Exchange Rate Forecasting Using Sentiment Analysis of Twitter Data

被引:12
|
作者
Naeem, Samreen [1 ]
Mashwani, Wali Khan [2 ]
Ali, Aqib [1 ,3 ]
Uddin, M. Irfan [4 ]
Mahmoud, Marwan [5 ]
Jamal, Farrukh [6 ]
Chesneau, Christophe [7 ]
机构
[1] Glim Inst Modern Studies, Dept Comp Sci & IT, Bahawalpur 63100, Pakistan
[2] Kohat Univ Sci & Technol, Inst Numer Sci, Kohat 26000, Pakistan
[3] Concordia Coll Bahawalpur, Dept Comp Sci, Bahawalpur 63100, Pakistan
[4] Kohat Univ Sci & Technol, Inst Comp, Kohat 26000, Pakistan
[5] King Abdulaziz Univ, Fac Appl Studies, Jeddah 21577, Saudi Arabia
[6] Islamia Univ Bahawalpur, Dept Stat, Bahawalpur 63100, Pakistan
[7] Univ Caen, LMNO, Dept Math, Campus 2,Sci 3, F-14032 Caen, France
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 67卷 / 03期
关键词
Machine learning; exchange rate; sentiment analysis; linear discriminant analysis; principal component analysis; simple logistic;
D O I
10.32604/cmc.2021.015872
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study proposes an approach based on machine learning to forecast currency exchange rates by applying sentiment analysis to messages on Twitter (called tweets). A dataset of the exchange rates between the United States Dollar (USD) and the Pakistani Rupee (PKR) was formed by collecting information from a forex website as well as a collection of tweets from the business community in Pakistan containing finance-related words. The dataset was collected in raw form, and was subjected to natural language processing by way of data preprocessing. Response variable labeling was then applied to the standardized dataset, where the response variables were divided into two classes: "1" indicated an increase in the exchange rate and "-1" indicated a decrease in it. To better represent the dataset, we used linear discriminant analysis and principal component analysis to visualize the data in three-dimensional vector space. Clusters that were obtained using a sampling approach were then used for data optimization. Five machine learning classifiers-the simple logistic classifier, the random forest, bagging, naive Bayes, and the support vector machine-were applied to the optimized dataset. The results show that the simple logistic classifier yielded the highest accuracy of 82.14% for the USD and the PKR exchange rates forecasting.
引用
收藏
页码:3451 / 3461
页数:11
相关论文
共 50 条
  • [1] Machine Learning-Based Sentiment Analysis of Twitter Data
    Karthiga, M.
    Kumar, Sathish G.
    Aravindhraj, N.
    Priyanka, S.
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATION ENGINEERING (ICACCE-2019), 2019,
  • [2] Machine Learning-Based Sentiment Analysis for Twitter Accounts
    Hasan, Ali
    Moin, Sana
    Karim, Ahmad
    Shamshirband, Shahaboddin
    [J]. MATHEMATICAL AND COMPUTATIONAL APPLICATIONS, 2018, 23 (01)
  • [3] Solving the twitter sentiment analysis problem based on a machine learning-based approach
    Fatemeh Zarisfi Kermani
    Faramarz Sadeghi
    Esfandiar Eslami
    [J]. Evolutionary Intelligence, 2020, 13 : 381 - 398
  • [4] Solving the twitter sentiment analysis problem based on a machine learning-based approach
    Zarisfi Kermani, Fatemeh
    Sadeghi, Faramarz
    Eslami, Esfandiar
    [J]. EVOLUTIONARY INTELLIGENCE, 2020, 13 (03) : 381 - 398
  • [5] A Machine Learning-Based Technique with Intelligent WordNet Lemmatize for Twitter Sentiment Analysis
    Saranya, S.
    Usha, G.
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (01): : 339 - 352
  • [6] Sentiment Analysis of Twitter Data Using Machine Learning Approaches and Semantic Analysis
    Gautam, Geetika
    Yadav, Divakar
    [J]. 2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 437 - 442
  • [7] Twitter Sentiment Analysis Using Machine Learning Techniques
    Le, Bac
    Huy Nguyen
    [J]. ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2015, 358 : 279 - 289
  • [8] Sentiment Analysis in Twitter using Machine Learning Techniques
    Neethu, M. S.
    Rajasree, R.
    [J]. 2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
  • [9] Forecasting and Analysis of Exchange Rate of RMB to USD
    Wang Dan
    Xiao Chunlai
    [J]. CONTEMPORARY INNOVATION AND DEVELOPMENT IN STATISTICAL SCIENCE, 2012, : 441 - 446
  • [10] Machine learning tool for exploring sentiment analysis on twitter data
    Biradar, Shanta H.
    Gorabal, J. V.
    Gupta, Gaurav
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 56 : 1927 - 1934