Interpretable multi-morphology and multi-scale microalgae classification based on machine learning

被引:0
|
作者
Yan, Huchao [1 ,2 ]
Peng, Xinggan [3 ,4 ]
Wang, Chao [1 ,2 ]
Xia, Ao [1 ,2 ]
Huang, Yun [1 ,2 ]
Zhu, Xianqing [1 ,2 ]
Zhang, Jingmiao [1 ,2 ]
Zhu, Xun [1 ,2 ]
Liao, Qiang [1 ,2 ]
机构
[1] Chongqing Univ, Key Lab Low grade Energy Utilizat Technol & Syst, Minist Educ, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Inst Engn Thermophys, Sch Energy & Power Engn, Chongqing 400044, Peoples R China
[3] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[4] CMCU Engn Co Ltd, Chongqing 400039, Peoples R China
基金
中国国家自然科学基金;
关键词
Microalgae classification; Bioprocess; Machine learning; Feature dimensionality reduction; Explanatory analysis; SUPPORT VECTOR MACHINE; RANDOM FOREST; IDENTIFICATION;
D O I
10.1016/j.algal.2024.103812
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The multi-morphology and multi-scale mixed microalgae are widely distributed in natural and artificial systems. There is an urgent need to develop an efficient approach to classify the mixed microalgae for natural water system monitoring and microalgae bioprocesses, such as wastewater treatment, carbon dioxide capture and prevention of harmful algal blooms. The numerical feature datasets of pure and mixed cultures of multi-morphic microalgae with a size range between 5 and 500 mu m are established in the study. A large number of input features increases model complexity and computational costs, and the feature space dimension was reduced from 24 dimensions to 11 dimensions using the Pearson coefficient matrix and principal component analysis to reduce the impact of unimportant factors. Research indicates that the classification performance of the ensemble model is significantly better than that of the linear and nonlinear models. The average F1_score of the random forest optimized by grid search classified pure and mixed microalgae are 0.952 and 0.943, respectively, which are 2.2 % and 1.0 % higher than those without optimization. The Shapley Additive exPlanations theory and the ensemble model are combined to analyze the critical factors for microalgae classification, and the texture features play a crucial role in all the numerical features of microalgae images.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Statistical Characteristics of Multi-Scale Auroral Arc Width Based on Machine Learning
    Yang, Qiuju
    Xie, Minghao
    Su, Hang
    Han, Desheng
    He, Qiqi
    JOURNAL OF GEOPHYSICAL RESEARCH-SPACE PHYSICS, 2024, 129 (01)
  • [32] A machine learning-based multi-scale computational framework for granular materials
    Guan, Shaoheng
    Qu, Tongming
    Feng, Y. T.
    Ma, Gang
    Zhou, Wei
    ACTA GEOTECHNICA, 2023, 18 (04) : 1699 - 1720
  • [33] Characterizing multi-scale shale pore structure based on multi-experimental imaging and machine learning
    Yao, Jun
    Liu, Lei
    Yang, Yongfei
    Sun, Hai
    Zhang, Lei
    NATURAL GAS INDUSTRY B, 2023, 10 (04) : 361 - 371
  • [34] An edge detection algorithm based on multi-scale morphology
    Wang, Kun
    Gao, Liqun
    Shi, Zhengang
    Guo, Li
    Pian, Zhaoyu
    ICIEA 2007: 2ND IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-4, PROCEEDINGS, 2007, : 2210 - 2213
  • [35] Remote sensing image classification based on support vector machine with the multi-scale segmentation
    Bao, Wenxing
    Feng, Wei
    Ma, Ruishi
    SEVENTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2015), 2015, 9817
  • [36] Hyperspectral Image Denoising and Classification Using Multi-Scale Weighted EMAPs and Extreme Learning Machine
    Liu, Meizhuang
    Cao, Faxian
    Yang, Zhijing
    Hong, Xiaobin
    Huang, Yuezhen
    ELECTRONICS, 2020, 9 (12) : 1 - 17
  • [37] Multi-Scale Feature Based Medical Image Classification
    Li, Bo
    Li, Wei
    Zhao, Dazhe
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1182 - 1186
  • [38] Multi-Scale and Attention based ResNet for Heartbeat Classification
    Zhang, Haojie
    Yang, Gongping
    Huang, Yuwen
    Yuan, Feng
    Yin, Yilong
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1529 - 1535
  • [39] Audio segmentation based on multi-scale audio classification
    Zhang, YB
    Zhou, J
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: AUDIO AND ELECTROACOUSTICS SIGNAL PROCESSING FOR COMMUNICATIONS, 2004, : 349 - 352
  • [40] A Multi-scale CNN and Curriculum Learning Strategy for Mammogram Classification
    Lotter, William
    Sorensen, Greg
    Cox, David
    DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, 2017, 10553 : 169 - 177