Comparison of Visual Datasets for Machine Learning

Cited by: 23
Authors
Gauen, Kent [1]
Dailey, Ryan [1]
Laiman, John [1]
Zi, Yuxiang [1]
Asokan, Nirmal [1]
Lu, Yung-Hsiang [1]
Thiruvathukal, George K. [2]
Shyu, Mei-Ling [3]
Chen, Shu-Ching [4]
Affiliations
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[2] Loyola Univ, Dept Comp Sci, Chicago, IL 60611 USA
[3] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL 33124 USA
[4] Florida Int Univ, Sch Comp & Informat Sci, Miami, FL 33199 USA
Funding
National Science Foundation (USA);
Keywords
OBJECT;
DOI
10.1109/IRI.2017.59
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
One of the greatest technological improvements in recent years is the rapid progress in using machine learning to process visual data. Among the factors that contribute to this development, labeled datasets play a crucial role. Several datasets are widely reused for investigating and analyzing different machine learning solutions. Many systems, such as autonomous vehicles, rely on machine learning components to recognize objects. This paper compares different visual datasets and frameworks for machine learning. The comparison is both qualitative and quantitative, and it investigates object detection labels with respect to size, location, and contextual information. This paper also presents a new approach to creating datasets from real-time, geo-tagged visual data, which greatly improves the contextual information of the data. The data could be labeled automatically by cross-referencing information from other sources (such as weather).
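As an illustration of what a quantitative comparison of object detection labels by size and location might involve, the following is a minimal sketch, not the authors' method. It assumes labels are axis-aligned bounding boxes in pixel coordinates; the names `Box`, `relative_size`, `relative_center`, and `summarize` are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass(frozen=True)
class Box:
    """Hypothetical axis-aligned bounding box in pixel coordinates."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def relative_size(box: Box, image_width: float, image_height: float) -> float:
    """Fraction of the image area covered by the box (0.0 to 1.0)."""
    box_area = (box.x_max - box.x_min) * (box.y_max - box.y_min)
    return box_area / (image_width * image_height)


def relative_center(box: Box, image_width: float, image_height: float) -> Tuple[float, float]:
    """Box center normalized so that (0.5, 0.5) is the image center."""
    cx = (box.x_min + box.x_max) / 2 / image_width
    cy = (box.y_min + box.y_max) / 2 / image_height
    return cx, cy


def summarize(boxes: Iterable[Box], image_width: float, image_height: float) -> dict:
    """Mean relative size and mean normalized center over a set of labels."""
    boxes = list(boxes)
    if not boxes:
        raise ValueError("at least one box is required")
    sizes = [relative_size(b, image_width, image_height) for b in boxes]
    centers = [relative_center(b, image_width, image_height) for b in boxes]
    n = len(boxes)
    return {
        "mean_size": sum(sizes) / n,
        "mean_center": (
            sum(c[0] for c in centers) / n,
            sum(c[1] for c in centers) / n,
        ),
    }
```

Computing these statistics per dataset would make it possible to check, for example, whether labeled objects cluster near the image center or tend to occupy a large share of the frame. Both statistics are normalized by image dimensions so that datasets with different resolutions remain comparable.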
Pages: 346-355
Page count: 10