Specifics of Data Collection and Data Processing during Formation of RailVista Dataset for Machine Learning- and Deep Learning-Based Applications

Cited by: 0
Authors
Abisheva, Gulsipat [1]
Goranin, Nikolaj [2]
Razakhova, Bibigul [1]
Aidynov, Tolegen [3]
Satybaldina, Dina [3]
Affiliations
[1] LN Gumilyov Eurasian Natl Univ, Fac Informat Technol, Dept Artificial Intelligence Technol, KZ-010000 Astana, Kazakhstan
[2] Vilnius Gediminas Tech Univ, Fac Fundamental Sci, Dept Informat Syst, LT-08412 Vilnius, Lithuania
[3] LN Gumilyov Eurasian Natl Univ, Fac Informat Technol, Dept Informat Secur, KZ-010000 Astana, Kazakhstan
Keywords
dataset; data collection; machine learning; railway; railway track defects; defect detection
DOI
10.3390/s24165239
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
This paper presents the methodology and outcomes of creating the RailVista dataset, designed for detecting defects on railway tracks using machine learning and deep learning techniques. The dataset comprises 200,000 high-resolution images categorized into 19 distinct classes covering various railway infrastructure defects. The data collection involved a meticulous process including complex image capture methods, distortion techniques for data enrichment, and secure storage in a data warehouse using efficient binary file formats. This structured dataset facilitates effective training of machine/deep learning models, enhancing automated defect detection systems in railway safety and maintenance applications. The study underscores the critical role of high-quality datasets in advancing machine learning applications within the railway domain, highlighting future prospects for improving safety and reliability through automated recognition technologies.
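The abstract mentions distortion techniques for data enrichment and storage in binary file formats, but does not name the specific distortions or the file layout. The following is only a minimal sketch under stated assumptions: the two distortions (horizontal flip, bounded pixel noise), the 5-byte record header, and every function name here are hypothetical illustrations, not the authors' pipeline.

```python
import random
import struct
from typing import List, Tuple

# Assumption: a grayscale image is a row-major list of rows, pixel values 0..255.
Image = List[List[int]]

# Assumption: hypothetical record header = height (uint16), width (uint16),
# class label (uint8), little-endian. 19 classes fit in one byte.
HEADER = "<HHB"


def flip_horizontal(img: Image) -> Image:
    """Mirror each row left to right."""
    return [row[::-1] for row in img]


def add_noise(img: Image, amplitude: int, rng: random.Random) -> Image:
    """Add uniform integer noise in [-amplitude, amplitude], clamped to 0..255."""
    return [
        [min(255, max(0, p + rng.randint(-amplitude, amplitude))) for p in row]
        for row in img
    ]


def pack_record(img: Image, label: int) -> bytes:
    """Serialize one labeled image as header bytes followed by raw pixels."""
    height, width = len(img), len(img[0])
    body = bytes(p for row in img for p in row)
    return struct.pack(HEADER, height, width, label) + body


def unpack_record(buf: bytes) -> Tuple[Image, int]:
    """Inverse of pack_record."""
    height, width, label = struct.unpack_from(HEADER, buf, 0)
    offset = struct.calcsize(HEADER)
    pixels = buf[offset:offset + height * width]
    img = [list(pixels[r * width:(r + 1) * width]) for r in range(height)]
    return img, label


if __name__ == "__main__":
    original = [[0, 64], [128, 255]]
    rng = random.Random(0)  # fixed seed so the enrichment is reproducible
    variants = [original, flip_horizontal(original), add_noise(original, 10, rng)]
    records = [pack_record(v, label=3) for v in variants]
    print(len(records), "records,", len(records[0]), "bytes each")
```

A fixed-size header followed by raw pixels keeps each record compact and lets a reader recover the image shape and class label without a separate index; a production dataset would more likely use an established container format.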
Pages: 18