Performance of Machine Learning Algorithms and Diversity in Data

被引:6
|
作者
Sug, Hyontai [1 ]
机构
[1] Dongseo Univ, Div Comp Engn, 47 Jurye Ro, Busan 47011, South Korea
关键词
NEURAL-NETWORKS;
D O I
10.1051/matecconf/201821004019
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent world events in go games between human and artificial intelligence called AlphaGo showed the big advancement in machine learning technologies. While AlphaGo was trained using real world data, AlphaGo Zero was trained using massive random data, and the fact that AlphaGo Zero won AlphaGo completely revealed that diversity and size in training data is important for better performance for the machine learning algorithms, especially in deep learning algorithms of neural networks. On the other hand, artificial neural networks and decision trees are widely accepted machine learning algorithms because of their robustness in errors and comprehensibility respectively. In this paper in order to prove that diversity and size in data are important factors for better performance of machine learning algorithms empirically, the two representative algorithms are used for experiment. A real world data set called breast tissue was chosen, because the data set consists of real numbers that is very good property for artificial random data generation. The result of the experiment proved the fact that the diversity and size of data are very important factors for better performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Integrating relational chemical data with machine learning algorithms
    Henry, DR
    Chen, LL
    Grier, DL
    Durant, JL
    Leland, BA
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2003, 225 : U783 - U783
  • [32] Advanced machine learning algorithms for biomedical data and imaging
    [J]. Multimedia Tools and Applications, 2021, 80 : 30005 - 30005
  • [33] A SURVEY OF MACHINE LEARNING ALGORITHMS FOR BIG DATA ANALYTICS
    Athmaja, S.
    Hanumanthappa, M.
    Kavitha, Vasantha
    [J]. 2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [34] Machine learning algorithms for analysis and modeling of GeoSpatial data
    Pozdnoukhov, Alexei
    Kanevski, Mikhail
    [J]. PROCEEDINGS OF THE IAMG '07: GEOMATHEMATICS AND GIS ANALYSIS OF RESOURCES, ENVIRONMENT AND HAZARDS, 2007, : 216 - +
  • [35] Data Analytics for Cybersecurity Based on Machine Learning Algorithms
    Wang, Lidong
    Mosher, Reed L.
    Duett, Patti
    Falls, Terril C.
    [J]. SOUTHEASTCON 2023, 2023, : 810 - 814
  • [36] Data poisoning attacks against machine learning algorithms
    Yerlikaya, Fahri Anil
    Bahtiyar, Serif
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 208
  • [37] Classification of Logging Data Using Machine Learning Algorithms
    Mukhamediev, Ravil
    Kuchin, Yan
    Yunicheva, Nadiya
    Kalpeyeva, Zhuldyz
    Muhamedijeva, Elena
    Gopejenko, Viktors
    Rystygulov, Panabek
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [38] Advanced machine learning algorithms for biomedical data and imaging
    Tanveer, Mohammad
    Rastogi, Reshma
    Lin, Chin-Teng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) : 30005 - 30005
  • [39] Streaming Machine Learning Algorithms with Big Data Systems
    Abeykoon, Vibhatha
    Kamburugamuve, Supun
    Govindrarajan, Kannan
    Wickramasinghe, Pulasthi
    Widanage, Chathura
    Perera, Niranda
    Uyar, Ahmet
    Gunduz, Gurhan
    Akkas, Selahattin
    Von Laszewski, Gregor
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 5661 - 5666
  • [40] Analysis of Clustering Algorithms in Machine Learning for Healthcare Data
    Zhang, Jingyou
    Zhong, Haiping
    [J]. Journal of Commercial Biotechnology, 2022, 27 (05) : 82 - 91