Classification of lidar measurements using supervised and unsupervised machine learning methods

Cited by: 6
Authors
Farhani, Ghazal [1 ]
Sica, Robert J. [1 ]
Daley, Mark Joseph [2 ]
Affiliations
[1] Univ Western Ontario, Dept Phys & Astron, 1151 Richmond St, London, ON N6A 3K7, Canada
[2] Univ Western Ontario, Vector Inst Artificial Intelligence, Dept Comp Sci, 1151 Richmond St, London, ON N6A 3K7, Canada
DOI
10.5194/amt-14-391-2021
Chinese Library Classification
P4 [Atmospheric sciences (meteorology)];
Discipline classification code
0706 ; 070601 ;
Abstract
While it is relatively straightforward to automate the processing of lidar signals, it is more difficult to choose periods of "good" measurements to process. Groups use various ad hoc procedures, ranging from very simple criteria (e.g. signal-to-noise ratio) to more complex ones (e.g. Wing et al., 2018), to perform a task that humans can easily be trained to do but that is time-consuming. Here, we use machine learning techniques to train the machine to sort the measurements before processing. The presented method is generic and can be applied to most lidars. We test the techniques using measurements from the Purple Crow Lidar (PCL) system located in London, Canada. The PCL has over 200 000 raw profiles in Rayleigh and Raman channels available for classification. We classify raw (level-0) lidar measurements as "clear" sky profiles with strong lidar returns, "bad" profiles, and profiles which are significantly influenced by clouds or aerosol loads. We examined different supervised machine learning algorithms, including the random forest, the support vector machine, and gradient boosting trees, all of which can successfully classify profiles. The algorithms were trained using about 1500 profiles for each PCL channel, selected randomly from different nights of measurements in different years. The success rate of identification for all the channels is above 95 %. We also used the t-distributed stochastic neighbour embedding (t-SNE) method, which is an unsupervised algorithm, to cluster our lidar profiles. Because t-SNE is a data-driven method in which no labelling of the training set is needed, it is an attractive algorithm for finding anomalies in lidar profiles. The method has been tested on several nights of PCL measurements. The t-SNE can successfully cluster the PCL data profiles into meaningful categories. To demonstrate the use of the technique, we have used the algorithm to identify stratospheric aerosol layers due to wildfires.
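The workflow summarised in the abstract can be illustrated with a minimal sketch. This is not the authors' code and uses no PCL data: the profiles are synthetic, the toy model in `make_profile`, the bin count, the class names, and all hyperparameters (scikit-learn defaults) are assumptions made only for illustration. It trains the three supervised classifiers named in the abstract on labelled toy profiles and then computes an unsupervised t-SNE embedding of the same profiles.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.manifold import TSNE
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)
N_BINS = 64                                # hypothetical number of altitude bins
altitude = np.linspace(0.0, 1.0, N_BINS)   # normalised altitude (assumption)
LABELS = ["clear", "cloud", "bad"]         # toy stand-ins for the three classes


def make_profile(kind):
    """Return one synthetic photon-count profile (toy model, not PCL data)."""
    signal = 1000.0 * np.exp(-5.0 * altitude)  # clear-sky decay with altitude
    if kind == "cloud":
        # Scattering layer: local enhancement, attenuated return above it.
        centre = rng.uniform(0.2, 0.5)
        signal = signal * (1.0 + 3.0 * np.exp(-((altitude - centre) / 0.03) ** 2))
        signal[altitude > centre] *= 0.3
    elif kind == "bad":
        signal = signal * 0.01                 # very weak lidar return
    background = 5.0
    return rng.poisson(signal + background).astype(float)


# Labelled data set: 100 profiles per class, log-scaled photon counts.
y = np.repeat(np.arange(len(LABELS)), 100)
X = np.log1p(np.array([make_profile(LABELS[k]) for k in y]))

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# The three supervised algorithms named in the abstract.
models = {
    "random forest": RandomForestClassifier(random_state=0),
    "support vector machine": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
    "gradient boosting": GradientBoostingClassifier(random_state=0),
}
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = model.score(X_test, y_test)  # fraction classified correctly
    print(f"{name}: test accuracy = {scores[name]:.2f}")

# Unsupervised view: t-SNE ignores the labels and maps each profile to 2-D.
embedding = TSNE(n_components=2, perplexity=30, init="pca",
                 random_state=0).fit_transform(X)
print("t-SNE embedding shape:", embedding.shape)
```

On real measurements, the labelled training set would come from manually classified profiles rather than a generator, and the t-SNE embedding would be inspected (for example, as a scatter plot) to see whether profiles group into meaningful clusters.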
Pages: 391 - 402
Number of pages: 12