Using Classification with K-means Clustering to Investigate Transaction Anomaly

被引:0
|
作者
Tan, Xing Scott [1 ]
Yang, Zijiang [1 ]
Benlimane, Younes [1 ]
Liu, Eric [2 ]
机构
[1] York Univ, Fac Liberal Arts & Profess Studies, Sch Informat Technol, N York, ON, Canada
[2] Bayview Secondary Sch, Richmond Hill, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
E-Commerce; machine learning; decision analysis;
D O I
10.1109/ieem45057.2020.9309909
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Applications of machine learning and related algorithms in Electronic Commerce (hereafter E-Commerce) have the potential to build robust analytical models that help examine transaction data and successfully detect and predict anomalies. Nonetheless, the robustness of such models can be undermined in the case of highly unbalanced data set. This paper presents a classification method built on K-means Clustering that addresses the issue of highly unbalanced data. In this method, we first pre-process our E-Commerce data and then apply clustering and classifying procedures to create a number of clusters where each resulting cluster includes similar transaction records. Next, four classifiers including Logistic Regression, Naive Bayes, RBFNetwork and NBtree classifiers are used to assess the resulting solution. Findings based on real-word data show that this method provides a better solution for transaction anomaly detection and prediction than traditional approaches. They also show that it straightforwardly resolves classification problems with data imbalance.
引用
收藏
页码:171 / 174
页数:4
相关论文
共 50 条
  • [1] Classification of Moving Vehicles using K-Means Clustering
    Changalasetty, Suresh Babu
    Thota, Lalitha Saroja
    Badawy, Ahmed Said
    Ghribi, Wade
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES, 2015,
  • [2] Acute Leukemia Classification by Using SVM and K-Means Clustering
    Laosai, Jakkrich
    Chamnongthai, Kosin
    [J]. 2014 INTERNATIONAL ELECTRICAL ENGINEERING CONGRESS (IEECON), 2014,
  • [3] Classification of patients with bipolar disorder using k-means clustering
    de la Fuente-Tomas, Lorena
    Arranz, Belen
    Safont, Gemma
    Sierra, Pilar
    Sanchez-Autet, Monica
    Garcia-Blanco, Ana
    Garcia-Portilla, Maria P.
    [J]. PLOS ONE, 2019, 14 (01):
  • [4] Android Malware Classification Using K-Means Clustering Algorithm
    Hamid, Isredza Rahmi A.
    Khalid, Nur Syafiqah
    Abdullah, Nurul Azma
    Ab Rahman, Nurul Hidayah
    Wen, Chuah Chai
    [J]. INTERNATIONAL RESEARCH AND INNOVATION SUMMIT (IRIS2017), 2017, 226
  • [5] Opinion Classification Using Maximum Entropy and K-Means Clustering
    Hamzah, Amir
    Widyastuti, Naniek
    [J]. PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEMS (ICTS), 2016, : 162 - 166
  • [6] Comparing document classification schemes using K-means clustering
    Silic, Artur
    Moens, Marie-Francine
    Zmak, Lovro
    Basic, Bojana Dalbelo
    [J]. KNOWLEDGE - BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2008, 5177 : 615 - +
  • [7] Anomaly Detection by Using Streaming K-Means and Batch K-Means
    Wang, Zhuo
    Zhou, Yanghui
    Li, Gangmin
    [J]. 2020 5TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (IEEE ICBDA 2020), 2020, : 11 - 17
  • [8] Clustering of Image Data Using K-Means and Fuzzy K-Means
    Rahmani, Md. Khalid Imam
    Pal, Naina
    Arora, Kamiya
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (07) : 160 - 163
  • [9] Spectral Classification of Retinal Features Using K-Means Clustering Algorithm
    Cho, Julie
    Kashani, Amir H.
    Humayun, Mark S.
    [J]. INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2015, 56 (07)
  • [10] Poverty Classification Using Analytic Hierarchy Process and K-Means Clustering
    Sarwosri
    Sunaryono, Dwi
    Akbar, Rizky Januar
    Setiyawan, Risky Dwi
    [J]. PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEMS (ICTS), 2016, : 266 - 269