A freshwater algae classification system based on machine learning with StyleGAN2-ADA augmentation for limited and imbalanced datasets

被引:11
|
作者
Chan, Wang Hin [1 ]
Fung, Benjamin S. B. [2 ]
Tsang, Danny H. K. [2 ,3 ]
Lo, Irene M. C. [1 ,2 ,4 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Civil & Environm Engn, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol Guangzhou, Internet Things Thrust, Guangzhou 510230, Guangdong, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Clear Water Bay, Hong Kong, Peoples R China
[4] Hong Kong Univ Sci & Technol, Inst Adv Study, Hong Kong, Peoples R China
关键词
Algae; Deep learning; Eutrophication; GAN; Machine learning; Microscopy; CONVOLUTIONAL NEURAL-NETWORKS; EUTROPHICATION; BLOOMS;
D O I
10.1016/j.watres.2023.120409
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Automated algae classification using machine learning is a more efficient and effective solution compared to manual classification, which can be tedious and time-consuming. However, the practical application of such a classification approach is restricted by the scarcity of labeled freshwater algae datasets, especially for rarer algae. To overcome these challenges, this study proposes to generate artificial algal images with StyleGAN2-ADA and use both the generated and real images to train machine-learning-driven algae classification models. This approach significantly enhances the performance of classification models, particularly in their ability to identify rare algae. Overall, the proposed approach improves the F1-score of lightweight MobileNetV3 classification models covering all 20 freshwater algae covered in this research from 88.4% to 96.2%, while for the models that cover only the rarer algae, the experiments show an improvement from 80% to 96.5% in terms of F1-score. The results show that the approach enables the trained algae classification systems to effectively cover algae with limited image data.
引用
收藏
页数:11
相关论文
共 7 条
  • [1] Enhancing automated strabismus classification with limited data: Data augmentation using StyleGAN2-ADA
    Joo, Jaehan
    Kim, Sang Yoon
    Kim, Donghwan
    Lee, Ji-Eun
    Lee, Seung Min
    Suh, Su Youn
    Kim, Su-Jin
    Kim, Suk Chan
    PLOS ONE, 2024, 19 (05):
  • [2] A machine learning-based data augmentation strategy for structural damage classification in civil infrastructure system
    Lechen Li
    Raimondo Betti
    Journal of Civil Structural Health Monitoring, 2023, 13 : 1265 - 1285
  • [3] A machine learning-based data augmentation strategy for structural damage classification in civil infrastructure system
    Li, Lechen
    Betti, Raimondo
    JOURNAL OF CIVIL STRUCTURAL HEALTH MONITORING, 2023, 13 (6-7) : 1265 - 1285
  • [4] Machine learning based urban land cover classification using PolInSAR data: a study with ALOS-2 and RADARSAT-2 datasets
    Luvkesh Attri
    Shashi Kumar
    Sandeep Maithani
    Discover Geoscience, 2 (1):
  • [5] Machine and deep learning-based trophic state classification of national freshwater reservoirs in Taiwan using Sentinel-2 data
    Mukonza, Sabastian Simbarashe
    Chiang, Jie-Lun
    PHYSICS AND CHEMISTRY OF THE EARTH, 2024, 134
  • [6] Exergoeconomic analysis and machine learning-based optimization for a near-zero CO2 emission multi-generation system with freshwater production
    Liang, Wenxing
    Sun, Yanxia
    Han, Cong
    Yang, Jinwen
    Rong, Fanhua
    Han, Jitian
    DESALINATION, 2025, 601
  • [7] Machine learning-based D2D communication for a cloud-secure e-health system and data analysis by feature selection with classification
    Awasthi, Aishwary
    Suchithra, R.
    Chakravarty, Ajay
    Shah, Jaymeel
    Ghosh, Debanjan
    Kumar, Avneesh
    SOFT COMPUTING, 2023,