Analysis of handmade paper by Raman spectroscopy combined with machine learning

被引:5
|
作者
Yan, Chunsheng [1 ,2 ]
Cheng, Zhongyi [3 ,4 ]
Luo, Si [5 ]
Huang, Chen [1 ]
Han, Songtao [1 ]
Han, Xiuli [1 ]
Du, Yuandong [1 ]
Ying, Chaonan [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310058, Peoples R China
[2] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou, Peoples R China
[3] Zhejiang Univ, Coll Environm & Resource Sci, Inst Soil & Water Resources & Environm Sci, Hangzhou, Peoples R China
[4] Zhejiang Univ, Zhejiang Prov Key Lab Agr Resources & Environm, Hangzhou, Peoples R China
[5] Zhejiang Normal Univ, Hangzhou Inst Adv Studies, Hangzhou, Peoples R China
关键词
confocal Raman microspectroscopy; handmade paper; machine learning; principal component analysis-linear regression; random forest; ANCIENT; DISCRIMINATION; IDENTIFICATION; DEGRADATION; MICROSCOPY; WOOD; TOOL;
D O I
10.1002/jrs.6280
中图分类号
O433 [光谱学];
学科分类号
0703 ; 070302 ;
摘要
Handmade paper is a major carrier and restoration material of traditional Chinese ancient books, calligraphies, and paintings. In this study, we carried out a Raman spectroscopy analysis of 18 types of handmade paper samples. The main components of the handmade paper were cellulose and lignin, according to the wavenumber and Raman vibration assignment. We divided its Raman spectrum into eight subbands. Five machine learning models were employed: principal component analysis (PCA), partial least squares (PLS), support vector machine (SVM), k-nearest neighbors (KNN), and random forest (RF). The Raman spectral data were normalized, and the fluorescence envelope was subtracted using the airPLS algorithm to obtain four types of data, raw, normalized, defluorescence, and fluorescence data. An RF variable importance analysis of data processing showed that data normalization eliminated the intensity differences of fluorescence signals caused by lignin, which contained important information of raw materials and papermaking technology, let alone the data defluorescence. The data processing also reduced the importance of the average variables in almost all spectral bands. Nevertheless, the data processing is worthwhile because it significantly improves the accuracy of machine learning, and the information loss does not affect the prediction. Using the machine learning models of PCA, PLS, and SVM combined with linear regression (LR), KNN, and RF, the classification and prediction of handmade paper samples were realized. For almost all processed data, including the fluorescence data, PCA-LR had the highest classification and prediction accuracy (R-2 = 1) in almost all spectral bands. PLS-LR and SVM-LR had the second-highest accuracies (R-2 = 0.4-0.9), whereas KNN and RF had the lowest accuracies (R-2 = 0.1-0.4) for full band spectral data. Our results suggest that the abundant information contained in Raman spectroscopy combined with powerful machine learning models could inspire further studies on handmade paper and related cultural relics.
引用
收藏
页码:260 / 271
页数:12
相关论文
共 50 条
  • [31] Chemometric analysis in Raman spectroscopy from experimental design to machine learning–based modeling
    Shuxia Guo
    Jürgen Popp
    Thomas Bocklitz
    Nature Protocols, 2021, 16 : 5426 - 5459
  • [32] Qualitative and quantitative analysis of chlorinated solvents using Raman spectroscopy and machine learning.
    Conroy, J
    Ryder, AG
    Leger, MN
    Hennessey, K
    Madden, MG
    Opto-Ireland 2005: Optical Sensing and Spectroscopy, 2005, 5826 : 131 - 142
  • [33] Tensor product based 2-D correlation data preprocessing methods for Raman spectroscopy of Chinese handmade paper
    Yan, Chunsheng
    Luo, Si
    Cheng, Zhongyi
    Zhang, Hui
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2023, 302
  • [34] Quantitative analysis of excipient dominated drug formulations by Raman spectroscopy combined with deep learning
    Fu, Xiang
    Zhong, Li-min
    Cao, Yong-bing
    Chen, Hui
    Lu, Feng
    ANALYTICAL METHODS, 2021, 13 (01) : 64 - 68
  • [35] Garlic bulb classification by combining Raman spectroscopy and machine learning
    Wang, Zhixin
    Li, Chenming
    Wang, Zhong
    Li, Yuee
    Hu, Bin
    VIBRATIONAL SPECTROSCOPY, 2023, 125
  • [36] Visible Particle Identification Using Raman Spectroscopy and Machine Learning
    Han Sheng
    Yinping Zhao
    Xiangan Long
    Liwen Chen
    Bei Li
    Yiyan Fei
    Lan Mi
    Jiong Ma
    AAPS PharmSciTech, 23
  • [37] Raman spectroscopy and machine learning for the classification of esophageal squamous carcinoma
    Huang, Wenhua
    Shang, Qixin
    Xiao, Xin
    Zhang, Hanlu
    Gu, Yimin
    Yang, Lin
    Shi, Guidong
    Yang, Yushang
    Hu, Yang
    Yuan, Yong
    Ji, Aifang
    Chen, Longqi
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2022, 281
  • [38] Early diagnosis of citrus Huanglongbing by Raman spectroscopy and machine learning
    Kong, Lili
    Liu, Tianyuan
    Qiu, Honglin
    Yu, Xinna
    Wang, Xianda
    Huang, Zhiwei
    Huang, Meizhen
    LASER PHYSICS LETTERS, 2024, 21 (01)
  • [39] Machine learning prediction of lignin content in poplar with Raman spectroscopy
    Gao, Wenli
    Zhou, Liang
    Liu, Shengquan
    Guan, Ying
    Gao, Hui
    Hui, Bin
    BIORESOURCE TECHNOLOGY, 2022, 348
  • [40] Visible Particle Identification Using Raman Spectroscopy and Machine Learning
    Sheng, Han
    Zhao, Yinping
    Long, Xiangan
    Chen, Liwen
    Li, Bei
    Fei, Yiyan
    Mi, Lan
    Ma, Jiong
    AAPS PHARMSCITECH, 2022, 23 (06)