Establishment of Quantitative Analysis Model of Gentiopicroside in Gentiana Macrophylla by Wavelet Transform with Orthogonal Matching Pursuit Based on Near Infrared Spectroscopy
摘 要
针对近红外光谱高维小样本数据以及信号具有稀疏先验的特点,提出了小波变换(WT)结合正交匹配追踪(OMP)建立秦艽中龙胆苦苷定量分析模型的方法。将204个秦艽样品按照Kennard-Stone法以3∶1的比例进行划分,得到153个校正集样品和51个测试集样品,利用傅里叶变换近红外光谱仪采集样品的近红外光谱,采用WT对原始近红外光谱数据进行预处理,选择小波基函数为Daubechies(db5),小波分解层数为5,小波阈值为0.1,采用OMP建立龙胆苦苷的定量分析模型。结果表明:该模型预测性能较好,校正集对应的决定系数(RC2)为0.994 0,校正均方根误差(RMSEC)为0.081 9,测试集对应的决定系数(RP2)为0.985 4,预测均方根误差(RMSEP)为0.112 4;利用所建模型分析204个秦艽样品,龙胆苦苷预测值与参考值基本一致。
Abstract
Aiming at the characteristics of sparse prior of high-dimensional small samples data and signals in near infrared spectroscopy, a method for the establishment of quantitative analysis model of gentiopicroside in Gentiana macrophylla by wavelet transform (WT) with orthogonal matching pursuit (OMP) was proposed. According to Kennard-Stone method, 204 Gentiana macrophylla samples were divided into 153 calibration set samples and 51 testing set samples at ratio of 3∶1. Fourier transform near infrared spectrometer was used to collect the near infrared spectra of samples, and WT was used to pretreat the original near infrared spectra data. The wavelet basis function was Daubechies (db5), the wavelet decomposition layer was 5, and the wavelet threshold value was 0.1. The OMP was used to establish the quantitative analysis model of gentiopicroside. As shown by the results, the prediction performance of this model was good. The determination coefficient of calibration set (RC2) was 0.994 0, and root mean square error of calibration (RMSEC) was 0.081 9. The determination coefficient of testing set (RP2) was 0.985 4, and root mean square error of prediction (RMSEP) was 0.112 4. The proposed model was used to analyze 204 Gentiana macrophylla samples, and the predicted values of gentiopicroside were basically consistent with the reference values.
中图分类号 O657.33 DOI 10.11973/lhjy-hx202307011
所属栏目 专题报道(化学计量学方法在食品药品分析中的应用)
基金项目 甘肃省科技计划项目(21JR1RA272);兰州市科技计划项目(2018-3-41)
收稿日期 2022/8/7
修改稿日期
网络出版日期
作者单位点击查看
联系人作者李四海(lshroom@163.com)
备注陈方方,硕士研究生,主要从事机器学习、光谱分析研究工作
引用该论文: CHEN Fangfang,LI Sihai,DING Yuewu,YANG You. Establishment of Quantitative Analysis Model of Gentiopicroside in Gentiana Macrophylla by Wavelet Transform with Orthogonal Matching Pursuit Based on Near Infrared Spectroscopy[J]. Physical Testing and Chemical Analysis part B:Chemical Analysis, 2023, 59(7): 812~817
陈方方,李四海,丁跃武,杨友. 基于近红外光谱的小波变换结合正交匹配追踪建立秦艽中龙胆苦苷的定量分析模型[J]. 理化检验-化学分册, 2023, 59(7): 812~817
共有人对该论文发表了看法,其中:
人认为该论文很差
人认为该论文较差
人认为该论文一般
人认为该论文较好
人认为该论文很好
参考文献
【1】国家中医药管理局《中华本草》编委会.中华本草[M].上海:上海科学技术出版社, 1999.
【2】钟镥,罗世英,张静,等.藏药秦艽花的质量标准提升研究[J].中国药房, 2022,33(1):26-31.
【3】季文静,张玉萱,赵志礼,等.云南丽江产粗茎秦艽溯源及道地药材初加工方法评价[J].药学学报, 2022,57(2):507-513.
【4】李海伦,李恒,孙飞,等.经典名方大秦艽汤HPLC指纹图谱及含量测定方法研究[J].中草药, 2021,52(1):99-107.
【5】梁向平,戢爽,杜少波,等.基于UPLC-Q-TOF-MS/MS技术的麻花秦艽不同部位化学成分分析[J].中国实验方剂学杂志, 2022,28(8):139-148.
【6】郑姜彬,陈宝宝,陈千良,等.HPLC-ELSD法测定麻花秦艽中4种脂溶性成分的含量[J].药物分析杂志, 2010,30(5):787-790.
【7】林岩,郭培源,王昕琨.基于近红外光谱的猪肉蛋白质及脂肪含量检测[J].食品科技, 2014,39(2):262-266.
【8】白晓丽,郭卫华,孔俊豪,等.速溶普洱茶中水分、咖啡碱和茶多酚含量近红外光谱快速测定方法的建立[J].食品工业科技, 2019,40(1):234-238.
【9】冯镇,刘馨,张震,等.基于近红外光谱技术对小麦中毒死蜱农药残留测定方法的研究[J].食品工业科技, 2022,43(4):271-277.
【10】陈梓云,彭梦侠.复合维生素B中VB1的近红外快速检测与定量分析研究[J].食品科技, 2020,45(10):273-278.
【11】安思宇,张磊,尚献召,等.红参提取物总皂苷近红外定量分析建模中的变量筛选[J].光谱学与光谱分析, 2021,41(1):206-209.
【12】LU Y, LI X L, LI W J, et al. Detection of chlorpyrifos and carbendazim residues in the cabbage using visible/near-infrared spectroscopy combined with chemometrics[J]. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2021,257:119759.
【13】QIAN L L, LI D W, SONG X J, et al. Identification of Baha'sib mung beans based on Fourier transform near infrared spectroscopy and partial least squares[J]. Journal of Food Composition and Analysis, 2022,105:104203.
【14】NÆS T, MARTENS H. Principal component regression in NIR analysis: Viewpoints, background details and selection of components[J]. Journal of Chemometrics, 1988,2(2):155-167.
【15】WU L, GAO Y, REN W C, et al. Rapid determination and origin identification of total polysaccharides contents in Schisandra chinensis by near-infrared spectroscopy[J]. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2022,264:120327.
【16】LIU J, SUN S Q, TAN Z L, et al. Nondestructive detection of sunset yellow in cream based on near-infrared spectroscopy and interval random forest[J]. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2020,242:118718.
【17】ŞERMENT M, KAHRıMAN F. Ability of near infrared spectroscopy and chemometrics to measure the phytic acid content in maize flour[J]. Spectroscopy Letters, 2021,54(7):520-527.
【18】KAMBOJ U, GUHA P, MISHRA S. Comparison of PLSR, MLR, SVM regression methods for determination of crude protein and carbohydrate content in stored wheat using near infrared spectroscopy[J]. Materials Today: Proceedings, 2022,48:576-582.
【19】TROPP J A, GILBERT A C. Signal recovery from random measurements via orthogonal matching pursuit[J]. IEEE Transactions on Information Theory, 2007,53(12):4655-4666.
【20】李四海,刘东玲.正交匹配追踪算法的近红外光谱定量分析[J].光谱学与光谱分析, 2021,41(4):1097-1101.
【21】张金鹏,钱慧,王仁平.正交匹配追踪算法的优化及FPGA实现[J].小型微型计算机系统, 2022,43(4):707-711.
【22】AS'G MIGIEL S. ECG classification using orthogonal matching pursuit and machine learning[J]. Sensors, 2022,22(13):4960.
【23】CAI T T, WANG L. Orthogonal matching pursuit for sparse signal recovery with noise[J]. IEEE Transactions on Information Theory, 2011,57(7):4680-4688.
【24】刘璐,刘洋,刘财,等.地震随机噪声压缩感知迭代压制方法[J].地球物理学报, 2021,64(12):4629-4643.
【2】钟镥,罗世英,张静,等.藏药秦艽花的质量标准提升研究[J].中国药房, 2022,33(1):26-31.
【3】季文静,张玉萱,赵志礼,等.云南丽江产粗茎秦艽溯源及道地药材初加工方法评价[J].药学学报, 2022,57(2):507-513.
【4】李海伦,李恒,孙飞,等.经典名方大秦艽汤HPLC指纹图谱及含量测定方法研究[J].中草药, 2021,52(1):99-107.
【5】梁向平,戢爽,杜少波,等.基于UPLC-Q-TOF-MS/MS技术的麻花秦艽不同部位化学成分分析[J].中国实验方剂学杂志, 2022,28(8):139-148.
【6】郑姜彬,陈宝宝,陈千良,等.HPLC-ELSD法测定麻花秦艽中4种脂溶性成分的含量[J].药物分析杂志, 2010,30(5):787-790.
【7】林岩,郭培源,王昕琨.基于近红外光谱的猪肉蛋白质及脂肪含量检测[J].食品科技, 2014,39(2):262-266.
【8】白晓丽,郭卫华,孔俊豪,等.速溶普洱茶中水分、咖啡碱和茶多酚含量近红外光谱快速测定方法的建立[J].食品工业科技, 2019,40(1):234-238.
【9】冯镇,刘馨,张震,等.基于近红外光谱技术对小麦中毒死蜱农药残留测定方法的研究[J].食品工业科技, 2022,43(4):271-277.
【10】陈梓云,彭梦侠.复合维生素B中VB1的近红外快速检测与定量分析研究[J].食品科技, 2020,45(10):273-278.
【11】安思宇,张磊,尚献召,等.红参提取物总皂苷近红外定量分析建模中的变量筛选[J].光谱学与光谱分析, 2021,41(1):206-209.
【12】LU Y, LI X L, LI W J, et al. Detection of chlorpyrifos and carbendazim residues in the cabbage using visible/near-infrared spectroscopy combined with chemometrics[J]. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2021,257:119759.
【13】QIAN L L, LI D W, SONG X J, et al. Identification of Baha'sib mung beans based on Fourier transform near infrared spectroscopy and partial least squares[J]. Journal of Food Composition and Analysis, 2022,105:104203.
【14】NÆS T, MARTENS H. Principal component regression in NIR analysis: Viewpoints, background details and selection of components[J]. Journal of Chemometrics, 1988,2(2):155-167.
【15】WU L, GAO Y, REN W C, et al. Rapid determination and origin identification of total polysaccharides contents in Schisandra chinensis by near-infrared spectroscopy[J]. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2022,264:120327.
【16】LIU J, SUN S Q, TAN Z L, et al. Nondestructive detection of sunset yellow in cream based on near-infrared spectroscopy and interval random forest[J]. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2020,242:118718.
【17】ŞERMENT M, KAHRıMAN F. Ability of near infrared spectroscopy and chemometrics to measure the phytic acid content in maize flour[J]. Spectroscopy Letters, 2021,54(7):520-527.
【18】KAMBOJ U, GUHA P, MISHRA S. Comparison of PLSR, MLR, SVM regression methods for determination of crude protein and carbohydrate content in stored wheat using near infrared spectroscopy[J]. Materials Today: Proceedings, 2022,48:576-582.
【19】TROPP J A, GILBERT A C. Signal recovery from random measurements via orthogonal matching pursuit[J]. IEEE Transactions on Information Theory, 2007,53(12):4655-4666.
【20】李四海,刘东玲.正交匹配追踪算法的近红外光谱定量分析[J].光谱学与光谱分析, 2021,41(4):1097-1101.
【21】张金鹏,钱慧,王仁平.正交匹配追踪算法的优化及FPGA实现[J].小型微型计算机系统, 2022,43(4):707-711.
【22】AS'G MIGIEL S. ECG classification using orthogonal matching pursuit and machine learning[J]. Sensors, 2022,22(13):4960.
【23】CAI T T, WANG L. Orthogonal matching pursuit for sparse signal recovery with noise[J]. IEEE Transactions on Information Theory, 2011,57(7):4680-4688.
【24】刘璐,刘洋,刘财,等.地震随机噪声压缩感知迭代压制方法[J].地球物理学报, 2021,64(12):4629-4643.
相关信息