Analysis of handmade paper by Raman spectroscopy combined with machine learning

被引:5
|
作者
Yan, Chunsheng [1 ,2 ]
Cheng, Zhongyi [3 ,4 ]
Luo, Si [5 ]
Huang, Chen [1 ]
Han, Songtao [1 ]
Han, Xiuli [1 ]
Du, Yuandong [1 ]
Ying, Chaonan [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310058, Peoples R China
[2] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou, Peoples R China
[3] Zhejiang Univ, Coll Environm & Resource Sci, Inst Soil & Water Resources & Environm Sci, Hangzhou, Peoples R China
[4] Zhejiang Univ, Zhejiang Prov Key Lab Agr Resources & Environm, Hangzhou, Peoples R China
[5] Zhejiang Normal Univ, Hangzhou Inst Adv Studies, Hangzhou, Peoples R China
关键词
confocal Raman microspectroscopy; handmade paper; machine learning; principal component analysis-linear regression; random forest; ANCIENT; DISCRIMINATION; IDENTIFICATION; DEGRADATION; MICROSCOPY; WOOD; TOOL;
D O I
10.1002/jrs.6280
中图分类号
O433 [光谱学];
学科分类号
0703 ; 070302 ;
摘要
Handmade paper is a major carrier and restoration material of traditional Chinese ancient books, calligraphies, and paintings. In this study, we carried out a Raman spectroscopy analysis of 18 types of handmade paper samples. The main components of the handmade paper were cellulose and lignin, according to the wavenumber and Raman vibration assignment. We divided its Raman spectrum into eight subbands. Five machine learning models were employed: principal component analysis (PCA), partial least squares (PLS), support vector machine (SVM), k-nearest neighbors (KNN), and random forest (RF). The Raman spectral data were normalized, and the fluorescence envelope was subtracted using the airPLS algorithm to obtain four types of data, raw, normalized, defluorescence, and fluorescence data. An RF variable importance analysis of data processing showed that data normalization eliminated the intensity differences of fluorescence signals caused by lignin, which contained important information of raw materials and papermaking technology, let alone the data defluorescence. The data processing also reduced the importance of the average variables in almost all spectral bands. Nevertheless, the data processing is worthwhile because it significantly improves the accuracy of machine learning, and the information loss does not affect the prediction. Using the machine learning models of PCA, PLS, and SVM combined with linear regression (LR), KNN, and RF, the classification and prediction of handmade paper samples were realized. For almost all processed data, including the fluorescence data, PCA-LR had the highest classification and prediction accuracy (R-2 = 1) in almost all spectral bands. PLS-LR and SVM-LR had the second-highest accuracies (R-2 = 0.4-0.9), whereas KNN and RF had the lowest accuracies (R-2 = 0.1-0.4) for full band spectral data. Our results suggest that the abundant information contained in Raman spectroscopy combined with powerful machine learning models could inspire further studies on handmade paper and related cultural relics.
引用
收藏
页码:260 / 271
页数:12
相关论文
共 50 条
  • [21] Raman spectroscopy and machine learning for forensic document examination
    Lee, Yong Ju
    Jeong, Chang Woo
    Kim, Hong Taek
    Lee, Tai-Ju
    Kim, Hyoung Jin
    ANALYST, 2025,
  • [22] Differentiation of Plastics by Combining Raman Spectroscopy and Machine Learning
    Y. Yang
    W. Zhang
    Zh. Wang
    Y. Li
    Journal of Applied Spectroscopy, 2022, 89 : 790 - 798
  • [23] Recent Progresses in Machine Learning Assisted Raman Spectroscopy
    Qi, Yaping
    Hu, Dan
    Jiang, Yucheng
    Wu, Zhenping
    Zheng, Ming
    Chen, Esther Xinyi
    Liang, Yong
    Sadi, Mohammad A. A.
    Zhang, Kang
    Chen, Yong P. P.
    ADVANCED OPTICAL MATERIALS, 2023, 11 (14)
  • [24] Raman spectroscopy and topological machine learning for cancer grading
    Conti, Francesco
    D'Acunto, Mario
    Caudai, Claudia
    Colantonio, Sara
    Gaeta, Raffaele
    Moroni, Davide
    Pascali, Maria Antonietta
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [25] Differentiation of Plastics by Combining Raman Spectroscopy and Machine Learning
    Yang, Y.
    Zhang, W.
    Wang, Zh
    Li, Y.
    JOURNAL OF APPLIED SPECTROSCOPY, 2022, 89 (04) : 790 - 798
  • [26] Advancing Hemoglobinopathy Screening with Raman Spectroscopy and Machine Learning
    Abbasi, Sara
    Feizpour, Mehdi
    Weets, Ilse
    Liu, Qing
    Thienpont, Hugo
    Ferranti, Francesco
    Ottevaere, Heidi
    BIOMEDICAL SPECTROSCOPY, MICROSCOPY, AND IMAGING III, 2024, 13006
  • [27] Raman spectroscopy and topological machine learning for cancer grading
    Francesco Conti
    Mario D’Acunto
    Claudia Caudai
    Sara Colantonio
    Raffaele Gaeta
    Davide Moroni
    Maria Antonietta Pascali
    Scientific Reports, 13
  • [28] Rapid Analysis of Ruminant Fat Adulteration by Spectroscopy Combined with Machine Learning.
    Tong, Pei-Jin
    Zhang, Hong-Chao
    Wei, Ting-Ting
    Cao, Wen-Ming
    JOURNAL OF THE AMERICAN OIL CHEMISTS SOCIETY, 2021, 98 : 21 - 21
  • [29] Rapid Raman spectroscopy analysis assisted with machine learning: a case study on Radix Bupleuri
    Guo, Fangjie
    Yang, Xudong
    Zhang, Zhengyong
    Liu, Shuren
    Zhang, Yinsheng
    Wang, Haiyan
    JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE, 2025, 105 (04) : 2412 - 2419
  • [30] Discriminative feature analysis of dairy products based on machine learning algorithms and Raman spectroscopy
    Li, Jia-Xin
    Qing, Chun-Chun
    Wang, Xiu-Qian
    Zhu, Mei-Jia
    Zhang, Bo-Ya
    Zhang, Zheng-Yong
    CURRENT RESEARCH IN FOOD SCIENCE, 2024, 8