Machine Learning Models for Automatic Labeling: A Systematic Literature Review

被引:6
|
作者
Fredriksson, Teodor [1 ]
Bosch, Jan [1 ]
Olsson, Helena [2 ]
机构
[1] Chalmers Univ Technol, Dept Comp Sci & Engn, Div Software Engn, Gothenburg, Sweden
[2] Malm Univ, Dept Comp Sci & Media Technol, Malm, Sweden
关键词
Semi-supervised Learning; Active Machine Learning; Automatic Labeling;
D O I
10.5220/0009972705520561
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Automatic labeling is a type of classification problem. Classification has been studied with the help of statistical methods for a long time. With the explosion of new better computer processing units (CPUs) and graphical processing units (GPUs) the interest in machine learning has grown exponentially and we can use both statistical learning algorithms as well as deep neural networks (DNNs) to solve the classification tasks. Classification is a supervised machine learning problem and there exists a large amount of methodology for performing such task. However, it is very rare in industrial applications that data is fully labeled which is why we need good methodology to obtain error-free labels. The purpose of this paper is to examine the current literature on how to perform labeling using ML, we will compare these models in terms of popularity and on what datatypes they are used on. We performed a systematic literature review of empirical studies for machine learning for labeling. We identified 43 primary studies relevant to our search. From this we were able to determine the most common machine learning models for labeling. Lack of unlabeled instances is a major problem for industry as supervised learning is the most widely used. Obtaining labels is costly in terms of labor and financial costs. Based on our findings in this review we present alternate ways for labeling data for use in supervised learning tasks.
引用
收藏
页码:552 / 561
页数:10
相关论文
共 50 条
  • [41] Conceptual Modeling Interacts with Machine Learning - A Systematic Literature Review
    Zaidi, Moayid Ali
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT IX, 2021, 12957 : 522 - 532
  • [42] Application of machine learning in dementia diagnosis: A systematic literature review
    Kantayeva, Gauhar
    Lima, Jose
    Pereira, Ana I.
    [J]. HELIYON, 2023, 9 (11)
  • [43] Use of machine learning in osteoarthritis research: a systematic literature review
    Binvignat, Marie
    Pedoia, Valentina
    Butte, Atul J.
    Louati, Karine
    Klatzmann, David
    Berenbaum, Francis
    Mariotti-Ferrandiz, Encarnita
    Sellam, Jeremie
    [J]. RMD OPEN, 2022, 8 (01):
  • [44] Machine learning in business process management: A systematic literature review
    Weinzierl, Sven
    Zilker, Sandra
    Dunzer, Sebastian
    Matzner, Martin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 253
  • [45] Machine/Deep Learning for Software Engineering: A Systematic Literature Review
    Wang, Simin
    Huang, Liguo
    Gao, Amiao
    Ge, Jidong
    Zhang, Tengfei
    Feng, Haitao
    Satyarth, Ishna
    Li, Ming
    Zhang, He
    Ng, Vincent
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (03) : 1188 - 1231
  • [46] A Systematic Literature Review on Machine Learning for Automated Requirements Classification
    Manuel Perez-Verdejo, J.
    Sanchez-Garcia, Angel J.
    Octavio Ocharan-Hernandez, Jorge
    [J]. 2020 8TH EDITION OF THE INTERNATIONAL CONFERENCE IN SOFTWARE ENGINEERING RESEARCH AND INNOVATION (CONISOFT 2020), 2020, : 21 - 28
  • [47] Machine Learning Techniques for Knowledge Tracing: A Systematic Literature Review
    Ramirez Luelmo, Sergio Ivan
    El Mawas, Nour
    Heutte, Jean
    [J]. CSEDU: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED EDUCATION - VOL 1, 2021, : 60 - 70
  • [48] Systematic literature review: machine learning for software fault prediction
    Navarro Cedeno, Gabriel Omar
    Cortes Moya, Katherine
    Somarribas Dormond, Ahmed
    Gonzalez-Torres, Antonio
    Rojas-Hernandez, Yenory
    [J]. 2023 IEEE 41ST CENTRAL AMERICA AND PANAMA CONVENTION, CONCAPAN XLI, 2023, : 134 - 139
  • [49] Systematic literature review of machine learning for manufacturing supply chain
    Ganjare, Smita Abhijit
    Satao, Sunil M.
    Narwane, Vaibhav
    [J]. TQM JOURNAL, 2023, 36 (08): : 2236 - 2259
  • [50] Machine Learning for Credit Risk Prediction: A Systematic Literature Review
    Noriega, Jomark Pablo
    Rivera, Luis Antonio
    Herrera, Jose Alfredo
    [J]. DATA, 2023, 8 (11)