A Hybrid Support Vector Machine Algorithm for Big Data Heterogeneity Using Machine Learning

Cited by: 1
Authors
Ul Ahsaan, Shafqat [1]
Kaur, Harleen [1]
Mourya, Ashish Kumar [1]
Naaz, Sameena [1]
Affiliations
[1] Jamia Hamdard, Dept Comp Sci, New Delhi 110062, India
Source
SYMMETRY-BASEL | 2022 / Vol. 14 / Issue 11
Keywords
big data; Euclidean distance; heterogeneity; heterogeneous Euclidean overlap metric (HEOM); hybrid support vector machine (H-SVM); k-nearest neighbor (kNN); SENTIMENT ANALYSIS; SYSTEMS;
DOI
10.3390/sym14112344
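Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code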
07 ; 0710 ; 09 ;
Abstract
Big data technology has gained attention in all fields, particularly in research and financial institutions, and has changed the world tremendously. Researchers and data scientists are currently working on its applicability in domains such as health care, medicine, and the stock market. Data generated at an unexpected pace from multiple sources, such as social media, health care contexts, and the Internet of Things, have given rise to big data. Managing and processing big data is a challenge for researchers and data scientists because the data are heterogeneous and ambiguous. Heterogeneity is considered an important characteristic of big data. Analyzing heterogeneous data is a very complex task, as it involves compiling, storing, and processing varied data based on diverse patterns and rules. This research focuses on the heterogeneity problem in big data. It introduces the hybrid support vector machine (H-SVM) classifier, which uses the support vector machine as its base. In the proposed algorithm, the heterogeneous Euclidean overlap metric (HEOM) and Euclidean distance are used to form clusters and classify the data on the basis of ordinal and nominal values. The performance of the proposed classifier is compared with linear SVM, random forest, and k-nearest neighbor. The proposed algorithm attained the highest accuracy among the compared classifiers.
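For readers unfamiliar with HEOM, the following is a minimal sketch of the metric as it is commonly defined: an overlap distance for nominal attributes, a range-normalized absolute difference for numeric attributes, and the maximal distance of 1 when a value is missing. It is not taken from the paper and may differ from the authors' implementation. The function name `heom`, its parameters, and the sample records are hypothetical and chosen only for illustration.

```python
import math


def heom(x, y, nominal, ranges):
    """Heterogeneous Euclidean-Overlap Metric between two records.

    x, y    : sequences of attribute values (None marks a missing value)
    nominal : set of attribute indices treated as nominal
    ranges  : dict mapping each numeric attribute index to (max - min)
              of that attribute over the data set
    """
    total = 0.0
    for a, (xa, ya) in enumerate(zip(x, y)):
        if xa is None or ya is None:
            d = 1.0                        # missing value: maximal distance
        elif a in nominal:
            d = 0.0 if xa == ya else 1.0   # overlap distance for nominal values
        else:
            r = ranges[a]
            d = abs(xa - ya) / r if r else 0.0  # range-normalized difference
        total += d * d
    return math.sqrt(total)


# Hypothetical records: (age, income, blood type); values are illustrative only.
a = (30, 50000.0, "A")
b = (40, 50000.0, "B")
ranges = {0: 50, 1: 100000.0}  # assumed attribute ranges
print(heom(a, b, nominal={2}, ranges=ranges))
```

Because each per-attribute distance lies in [0, 1], numeric and nominal attributes contribute on a comparable scale, which is what makes the metric suitable for mixed-type records.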
Pages: 18