Analysis of handmade paper by Raman spectroscopy combined with machine learning

被引:5
|
作者
Yan, Chunsheng [1 ,2 ]
Cheng, Zhongyi [3 ,4 ]
Luo, Si [5 ]
Huang, Chen [1 ]
Han, Songtao [1 ]
Han, Xiuli [1 ]
Du, Yuandong [1 ]
Ying, Chaonan [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310058, Peoples R China
[2] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou, Peoples R China
[3] Zhejiang Univ, Coll Environm & Resource Sci, Inst Soil & Water Resources & Environm Sci, Hangzhou, Peoples R China
[4] Zhejiang Univ, Zhejiang Prov Key Lab Agr Resources & Environm, Hangzhou, Peoples R China
[5] Zhejiang Normal Univ, Hangzhou Inst Adv Studies, Hangzhou, Peoples R China
关键词
confocal Raman microspectroscopy; handmade paper; machine learning; principal component analysis-linear regression; random forest; ANCIENT; DISCRIMINATION; IDENTIFICATION; DEGRADATION; MICROSCOPY; WOOD; TOOL;
D O I
10.1002/jrs.6280
中图分类号
O433 [光谱学];
学科分类号
0703 ; 070302 ;
摘要
Handmade paper is a major carrier and restoration material of traditional Chinese ancient books, calligraphies, and paintings. In this study, we carried out a Raman spectroscopy analysis of 18 types of handmade paper samples. The main components of the handmade paper were cellulose and lignin, according to the wavenumber and Raman vibration assignment. We divided its Raman spectrum into eight subbands. Five machine learning models were employed: principal component analysis (PCA), partial least squares (PLS), support vector machine (SVM), k-nearest neighbors (KNN), and random forest (RF). The Raman spectral data were normalized, and the fluorescence envelope was subtracted using the airPLS algorithm to obtain four types of data, raw, normalized, defluorescence, and fluorescence data. An RF variable importance analysis of data processing showed that data normalization eliminated the intensity differences of fluorescence signals caused by lignin, which contained important information of raw materials and papermaking technology, let alone the data defluorescence. The data processing also reduced the importance of the average variables in almost all spectral bands. Nevertheless, the data processing is worthwhile because it significantly improves the accuracy of machine learning, and the information loss does not affect the prediction. Using the machine learning models of PCA, PLS, and SVM combined with linear regression (LR), KNN, and RF, the classification and prediction of handmade paper samples were realized. For almost all processed data, including the fluorescence data, PCA-LR had the highest classification and prediction accuracy (R-2 = 1) in almost all spectral bands. PLS-LR and SVM-LR had the second-highest accuracies (R-2 = 0.4-0.9), whereas KNN and RF had the lowest accuracies (R-2 = 0.1-0.4) for full band spectral data. Our results suggest that the abundant information contained in Raman spectroscopy combined with powerful machine learning models could inspire further studies on handmade paper and related cultural relics.
引用
收藏
页码:260 / 271
页数:12
相关论文
共 50 条
  • [41] Machine Learning Protocol for Surface-Enhanced Raman Spectroscopy
    Hu, Wei
    Ye, Sheng
    Zhang, Yujin
    Li, Tianduo
    Zhang, Guozhen
    Luo, Yi
    Mukamel, Shaul
    Jiang, Jun
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2019, 10 (20): : 6026 - 6031
  • [42] Raman spectroscopy and machine-learning for edible oils evaluation
    Berghian-Grosan, Camelia
    Magdas, Dana Alina
    TALANTA, 2020, 218
  • [43] Decoding Phases of Matter by Machine-Learning Raman Spectroscopy
    Cui, Anyang
    Jiang, Kai
    Jiang, Minhong
    Shang, Liyan
    Zhu, Liangqing
    Hu, Zhigao
    Xu, Guisheng
    Chu, Junhao
    PHYSICAL REVIEW APPLIED, 2019, 12 (05)
  • [44] Classification and Recognition of Lilies Based on Raman Spectroscopy and Machine Learning
    Wang Zhi-xin
    Wang Hui-hui
    Zhang Wen-bo
    Wang Zhong
    Li Yue-e
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43 (01) : 183 - 189
  • [45] Employing Raman Spectroscopy and Machine Learning for the Identification of Breast Cancer
    Zhang, Ya
    Li, Zheng
    Li, Zhongqiang
    Wang, Huaizhi
    Regmi, Dinkar
    Zhang, Jian
    Feng, Jiming
    Yao, Shaomian
    Xu, Jian
    BIOLOGICAL PROCEDURES ONLINE, 2024, 26 (01)
  • [46] Optimized preprocessing and machine learning for quantitative Raman spectroscopy in biology
    Storey, Emily E.
    Helmy, Amr S.
    JOURNAL OF RAMAN SPECTROSCOPY, 2019, 50 (07) : 958 - 968
  • [47] Non-destructive distinction between geogenic and anthropogenic calcite by Raman spectroscopy combined with machine learning workflow
    Calandra, Sara
    Conti, Claudia
    Centauro, Irene
    Cantisani, Emma
    ANALYST, 2023, 148 (12) : 2861 - 2870
  • [48] Identification of pathogens and detection of antibiotic susceptibility at single-cell resolution by Raman spectroscopy combined with machine learning
    Lu, Weilai
    Li, Haifei
    Qiu, Haoning
    Wang, Lu
    Feng, Jie
    Fu, Yu Vincent
    FRONTIERS IN MICROBIOLOGY, 2023, 13
  • [49] Raman spectroscopy combined with machine learning and chemometrics analyses as a tool for identification atherosclerotic carotid stenosis from serum
    Kesik, Jan Jakub
    Paja, Wieslaw
    Terlecki, Piotr
    Ilzecki, Marek
    Klebowski, Bartosz
    Depciuch, Joanna
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2025, 326
  • [50] Analysis of Machine Learning Methods for COVID-19 Detection Using Serum Raman Spectroscopy
    Chen, David
    APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (14) : 1147 - 1168