A Topological Machine Learning Pipeline for Classification

被引:6
|
作者
Conti, Francesco [1 ,2 ]
Moroni, Davide [2 ]
Pascali, Maria Antonietta [2 ]
机构
[1] Univ Pisa, Dept Math, I-56126 Pisa, Italy
[2] Natl Res Council Italy CNR, Inst Informat Sci & Technol A Faedo, I-56124 Pisa, Italy
关键词
topological machine learning; persistent homology; classification; vectorization; SIZE FUNCTIONS; REGRESSION; SELECTION;
D O I
10.3390/math10173086
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In this work, we develop a pipeline that associates Persistence Diagrams to digital data via the most appropriate filtration for the type of data considered. Using a grid search approach, this pipeline determines optimal representation methods and parameters. The development of such a topological pipeline for Machine Learning involves two crucial steps that strongly affect its performance: firstly, digital data must be represented as an algebraic object with a proper associated filtration in order to compute its topological summary, the Persistence Diagram. Secondly, the persistence diagram must be transformed with suitable representation methods in order to be introduced in a Machine Learning algorithm. We assess the performance of our pipeline, and in parallel, we compare the different representation methods on popular benchmark datasets. This work is a first step toward both an easy and ready-to-use pipeline for data classification using persistent homology and Machine Learning, and to understand the theoretical reasons why, given a dataset and a task to be performed, a pair (filtration, topological representation) is better than another.
引用
收藏
页数:33
相关论文
共 50 条
  • [21] Video Image Enhancement and Machine Learning Pipeline for Underwater Animal Detection and Classification at Cabled Observatories
    Lopez-Vazquez, Vanesa
    Lopez-Guede, Jose Manuel
    Marini, Simone
    Fanelli, Emanuela
    Johnsen, Espen
    Aguzzi, Jacopo
    SENSORS, 2020, 20 (03)
  • [22] A Machine Learning Processing Pipeline for Reliable Hand Gesture Classification of FMG Signals with Stochastic Variance
    Asfour, Mohammed
    Menon, Carlo
    Jiang, Xianta
    SENSORS, 2021, 21 (04) : 1 - 16
  • [23] Topological data analysis and machine learning
    Leykam, Daniel
    Angelakis, Dimitris G.
    ADVANCES IN PHYSICS-X, 2023, 8 (01):
  • [24] A Survey of Topological Machine Learning Methods
    Hensel, Felix
    Moor, Michael
    Rieck, Bastian
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2021, 4
  • [25] Detection of topological materials with machine learning
    Claussen, Nikolas
    Bernevig, B. Andrei
    Regnault, Nicolas
    PHYSICAL REVIEW B, 2020, 101 (24)
  • [26] Machine-Learning-Based Hotspot Detection Using Topological Classification and Critical Feature Extraction
    Yu, Yen-Ting
    Lin, Geng-He
    Jiang, Iris Hui-Ru
    Chiang, Charles
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2015, 34 (03) : 460 - 470
  • [27] Machine-Learning-Based Hotspot Detection Using Topological Classification and Critical Feature Extraction
    Yu, Yen-Ting
    Lin, Geng-He
    Jiang, Iris Hui-Ru
    Chiang, Charles
    2013 50TH ACM / EDAC / IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2013,
  • [28] Teaching Machine Learning in Argentina: the ClusterAI pipeline
    Palazzo, Martin
    Velazquez, Agustin
    Breda, Melisa
    Callara, Matias
    Aguirre, Nicolas
    SECOND TEACHING MACHINE LEARNING AND ARTIFICIAL INTELLIGENCE WORKSHOP, VOL 170, 2021, 170 : 83 - 87
  • [29] PiPar: : Pipeline parallelism for collaborative machine learning
    Zhang, Zihan
    Kilpatrick, Peter
    Spence, Ivor
    Varghese, Blesson
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 193
  • [30] Leveraging Machine Learning for Pipeline Condition Assessment
    Lu, Hongfang
    Xu, Zhao-Dong
    Zang, Xulei
    Xi, Dongmin
    Iseley, Tom
    Matthews, John C.
    Wang, Niannian
    JOURNAL OF PIPELINE SYSTEMS ENGINEERING AND PRACTICE, 2023, 14 (03)