Performance of Machine Learning Algorithms and Diversity in Data

被引:6
|
作者
Sug, Hyontai [1 ]
机构
[1] Dongseo Univ, Div Comp Engn, 47 Jurye Ro, Busan 47011, South Korea
关键词
NEURAL-NETWORKS;
D O I
10.1051/matecconf/201821004019
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent world events in go games between human and artificial intelligence called AlphaGo showed the big advancement in machine learning technologies. While AlphaGo was trained using real world data, AlphaGo Zero was trained using massive random data, and the fact that AlphaGo Zero won AlphaGo completely revealed that diversity and size in training data is important for better performance for the machine learning algorithms, especially in deep learning algorithms of neural networks. On the other hand, artificial neural networks and decision trees are widely accepted machine learning algorithms because of their robustness in errors and comprehensibility respectively. In this paper in order to prove that diversity and size in data are important factors for better performance of machine learning algorithms empirically, the two representative algorithms are used for experiment. A real world data set called breast tissue was chosen, because the data set consists of real numbers that is very good property for artificial random data generation. The result of the experiment proved the fact that the diversity and size of data are very important factors for better performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Predictive Performance of Machine Learning Algorithms Trained with Sparse Data
    Dewey, H. Heath
    DeVries, Derek R.
    [J]. 2021 IEEE AEROSPACE CONFERENCE (AEROCONF 2021), 2021,
  • [2] Comparative Performance of Deep Learning and Machine Learning Algorithms on Imbalanced Handwritten Data
    Amri, A'Inur A'Fifah
    Ismail, Amelia Ritahani
    Zarir, Abdullah Ahmad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (02) : 258 - 264
  • [3] Performance and efficiency of machine learning algorithms for analyzing rectangular biomedical data
    Deng, Fei
    Huang, Jibing
    Yuan, Xiaoling
    Cheng, Chao
    Zhang, Lanjing
    [J]. LABORATORY INVESTIGATION, 2021, 101 (04) : 430 - 441
  • [4] Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance
    Ahsan, Md Manjurul
    Mahmud, M. A. Parvez
    Saha, Pritom Kumar
    Gupta, Kishor Datta
    Siddique, Zahed
    [J]. TECHNOLOGIES, 2021, 9 (03)
  • [5] Algorithms for Data Mining and Machine Learning
    Schulz, Volker H.
    [J]. SIAM REVIEW, 2020, 62 (03) : 739 - 739
  • [6] Classification and prediction of student performance data using various machine learning algorithms
    Pallathadka, Harikumar
    Wenda, Alex
    Ramirez-Asís, Edwin
    Asís-López, Maximiliano
    Flores-Albornoz, Judith
    Phasinam, Khongdet
    [J]. Materials Today: Proceedings, 2023, 80 : 3782 - 3785
  • [7] Predicting the performance of anaerobic digestion using machine learning algorithms and genomic data
    Long, Fei
    Wang, Luguang
    Cai, Wenfang
    Lesnik, Keaton
    Liu, Hong
    [J]. WATER RESEARCH, 2021, 199
  • [8] The Application of Machine Learning Algorithms in Data Mining
    Zhang, Wei
    [J]. 2016 INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING AND COMMUNICATIONS TECHNOLOGY (IECT 2016), 2016, : 521 - 527
  • [9] Comparison of Machine Learning Algorithms in Data classification
    ul Hassan, Ch Anwar
    Khan, Muhammad Sufyan
    Shah, Munam Ali
    [J]. 2018 24TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC' 18), 2018, : 270 - 275
  • [10] Comparison of Machine Learning Algorithms on Noisy Data
    Oreski, Dijana
    Visnjic, Dunja
    Kadoic, Nikola
    [J]. CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS, CECIIS, 2023, : 383 - 389