[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN109632761B - Processing method and system of Raman spectrum data - Google Patents

Processing method and system of Raman spectrum data Download PDF

Info

Publication number
CN109632761B
CN109632761B CN201811533229.6A CN201811533229A CN109632761B CN 109632761 B CN109632761 B CN 109632761B CN 201811533229 A CN201811533229 A CN 201811533229A CN 109632761 B CN109632761 B CN 109632761B
Authority
CN
China
Prior art keywords
raman
spectrum data
data
raman spectrum
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811533229.6A
Other languages
Chinese (zh)
Other versions
CN109632761A (en
Inventor
张健伟
段贵娇
陈鲁
张志彬
陈迟
霍剑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Huankai Microbial Sci and Tech Co Ltd
Original Assignee
Guangdong Huankai Microbial Sci and Tech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Huankai Microbial Sci and Tech Co Ltd filed Critical Guangdong Huankai Microbial Sci and Tech Co Ltd
Priority to CN201811533229.6A priority Critical patent/CN109632761B/en
Publication of CN109632761A publication Critical patent/CN109632761A/en
Application granted granted Critical
Publication of CN109632761B publication Critical patent/CN109632761B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/62Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light
    • G01N21/63Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light optically excited
    • G01N21/65Raman scattering

Landscapes

  • Health & Medical Sciences (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Investigating, Analyzing Materials By Fluorescence Or Luminescence (AREA)

Abstract

The invention discloses a processing method and a system of Raman spectrum data, wherein the method comprises the following steps: collecting first Raman spectrum data of a test sample through a Raman spectrometer; background removing processing is carried out on the first Raman spectrum data to obtain second Raman spectrum data; and identifying the components in the test sample according to the Raman spectrum database of the standard sample and the second Raman spectrum data to obtain an identification result. According to the method, background removal processing is carried out on the first Raman spectrum data of the test sample to obtain second Raman spectrum data, then components in the test sample are identified according to the Raman spectrum database of the standard sample and the second Raman spectrum data to obtain an identification result, and through the background removal processing, the influence of a background spectrum on the test result in the identification process is avoided, and the accuracy of the identification result is ensured. The invention can be widely applied to the field of spectral data processing.

Description

Processing method and system of Raman spectrum data
Technical Field
The invention relates to the field of spectral data processing, in particular to a Raman spectral data processing method and system.
Background
The Raman spectrum analysis technology is characterized in that a tested sample is stimulated by laser, the tested sample is stimulated to radiate Raman signals, and due to different internal molecular structures of different substances, the sample can be stimulated to generate a specific Raman scattering spectrum, so that the molecular structure and the composition can be researched by analyzing the Raman signals. With the progress of laser technology and photoelectric detection technology, the application of raman technology in the field of component detection is more and more extensive. In recent years, the domestic strength of food and drug safety supervision is getting bigger and bigger, and compared with the traditional gas-phase liquid-phase analysis equipment, the Raman spectrum detection equipment has the advantages of short detection period, low cost and convenient carrying when being used for component detection. However, because the raman signal is very weak, especially the laser energy of the small and light-weight device is weak, the collection capability of the raman signal to the scattered light is also limited, so that the raman signal obtained by the detection device is easily interfered by the background spectrum, noise and stray light, and if the data collected by the detector is directly processed and analyzed, the accuracy of component judgment on the test sample can be affected.
Disclosure of Invention
To solve the above technical problems, the present invention aims to: provided are a processing method and system capable of effectively detecting whether a test sample contains a Raman spectrum data of a standard sample component.
The first technical scheme adopted by the invention is as follows: a method of processing raman spectral data, comprising the steps of:
collecting first Raman spectrum data of a test sample through a Raman spectrometer;
background removing processing is carried out on the first Raman spectrum data to obtain second Raman spectrum data;
and identifying the components in the test sample according to the Raman spectrum database of the standard sample and the second Raman spectrum data to obtain an identification result.
Further, the background removing processing is performed on the first raman spectrum data to obtain second raman spectrum data, which specifically includes:
background removing processing is carried out on the first Raman spectrum data through a high-pass filter matrix and a penalty function to obtain second Raman spectrum data, and the calculation formula of the second Raman spectrum data is as follows:
Figure BDA0001906229380000011
wherein,
Figure BDA0001906229380000012
is the second Raman spectrum data, y is the first Raman spectrum data, x is the iterative spectrum data, F (x) is the evaluation function, H is the finite field high pass filter matrix, phi is the penalty function, lambda is the weight coefficient, DiIs the i-th order differential, M is the total order of the differential, NiIs the i-th order differential data length.
Further, the finite field high-pass filter matrix is obtained by the following steps:
acquiring a first differential operator and a truncation operator;
obtaining a first augmentation matrix and a second augmentation matrix according to the first differential operator and the truncation operator;
acquiring an extension mode transformation matrix;
obtaining a truncation matrix according to the first augmentation matrix and the extension mode transformation matrix;
obtaining a differential matrix according to the second augmentation matrix and the extension mode transformation matrix;
and obtaining a finite field high-pass filter matrix according to the truncation matrix and the differential matrix.
Further, before background removing processing is carried out on the first Raman spectrum data, the method further comprises the following steps:
calculating the minimum resolution of the wave number according to the wavelength resolution of the Raman spectrometer to obtain a wave number division value, wherein the calculation formula of the wave number division value is as follows:
Figure BDA0001906229380000021
wherein k isΔIs the division value of wave number, lambdamaxIs the maximum of the measurable wavelength, λ, in the first Raman spectral dataΔIs the wavelength resolution of the raman spectrometer.
Further, the Raman spectrum database of the standard sample is obtained by the following steps:
acquiring third Raman spectrum data of the standard sample;
background removing processing is carried out on the third Raman spectrum data to obtain fourth Raman spectrum data;
performing interpolation processing on the fourth Raman spectrum data according to the wave number scale values to obtain first Raman shift spectrum data;
taking the maximum value in the first Raman shift spectrum data as a normalization parameter, and carrying out normalization processing on the first Raman shift spectrum data to obtain second Raman shift spectrum data;
obtaining a mask array according to the magnitude relation between the first Raman shift spectrum data and a first threshold;
and performing background interference removal processing on the second Raman shift spectrum data according to the mask module group to obtain third Raman shift spectrum data, wherein a calculation formula of the third Raman shift spectrum data is as follows:
yk=yk0οαk
wherein, ykAs third Raman shift spectral data, αkAs a mask array, yk0Performing interpolation processing on Raman spectrum data with a sequence number of k in a Raman spectrum database by using a wave number dividing value to obtain first Raman shift spectrum data with a dimension of n;
and marking and storing the third Raman shift spectrum data to obtain a Raman spectrum database of the standard sample.
Further, the identifying components in the test sample specifically includes:
performing interpolation processing on the second Raman spectrum data according to the wave number scale values to obtain fourth Raman shift spectrum data;
and identifying the components in the test sample according to the Raman spectrum database of the standard sample and the fourth Raman shift spectrum data.
Further, the identifying the components in the test sample according to the raman spectrum database and the fourth raman shift spectrum data of the standard sample specifically comprises:
removing background interference from the fourth Raman shift spectrum data according to the mask module group to obtain fifth Raman shift spectrum data;
calculating the energy ratio of the fifth Raman shift spectrum data in the test sample, wherein the calculation formula of the energy ratio is as follows:
Figure BDA0001906229380000031
wherein eta iskN is the data dimension of the fourth Raman shift spectrum data, zkProjection of fourth Raman shift spectrum data on a vector subspace spanned by a k mask of a standard sample is carried out, and z is the fourth Raman shift spectrum data;
and judging the relation between the energy ratio and the second threshold value to obtain a first judgment result.
Further, the identifying the components in the test sample according to the Raman spectrum database of the standard sample and the fourth Raman shift spectrum data further comprises the following steps:
calculating a correlation coefficient of the third Raman shift spectrum data and the fifth Raman shift spectrum data, wherein the calculation formula of the correlation coefficient is as follows:
Figure BDA0001906229380000032
where ρ iskIs the correlation coefficient, ykAs third Raman shift spectral data, zkFifth raman shift spectral data;
and judging the magnitude relation between the correlation coefficient and the third threshold value to obtain a second judgment result.
Further, the identifying the components in the test sample according to the Raman spectrum database of the standard sample and the fourth Raman shift spectrum data further comprises the following steps:
if the first judgment result is that the energy ratio is greater than a second threshold value and the second judgment result is that the correlation coefficient is greater than a third threshold value, determining that the test sample contains the components of the standard sample; otherwise, the test sample is judged to contain no components of the standard sample.
The second technical scheme adopted by the invention is as follows:
a system for processing raman spectral data, comprising:
at least one memory for storing a program;
at least one processor for loading the program to implement the method for processing raman spectral data.
The invention has the beneficial effects that: the method comprises the steps of removing a background spectrum from first Raman spectrum data of a test sample to obtain second Raman spectrum data, and identifying components in the test sample according to a Raman spectrum database of a standard sample and the second Raman spectrum data to obtain an identification result.
Drawings
FIG. 1 is a flow chart of a method of processing Raman spectral data according to the present invention;
fig. 2 is a system block diagram of a raman spectrum data processing system according to the present invention.
Detailed Description
The invention is described in further detail below with reference to the figures and the specific embodiments. The step numbers in the following embodiments are provided only for convenience of illustration, the order between the steps is not limited at all, and the execution order of each step in the embodiments can be adapted according to the understanding of those skilled in the art.
Referring to fig. 1, a method for processing raman spectrum data includes the following steps:
s101, collecting first Raman spectrum data of a test sample through a Raman spectrometer;
specifically, the first raman spectrum data is initial spectrum data of the test sample collected by the raman spectrometer, and the data dimension is N and the unit is wavelength.
S102, background removing processing is carried out on the first Raman spectrum data to obtain second Raman spectrum data;
specifically, the second raman spectrum data is target spectrum data of the test sample obtained after background removal processing is performed on the initial spectrum data of the test sample.
S103, identifying components in the test sample according to the Raman spectrum database of the standard sample and the second Raman spectrum data to obtain an identification result.
Specifically, identifying the components of the test sample refers to identifying the same components in the test sample as the standard sample. And performing interpolation and mask processing on the second Raman spectrum data to obtain Raman shift spectrum data of the test sample, comparing and identifying the Raman shift spectrum data of the test sample with Raman shift spectrum data in a Raman spectrum database of the standard sample, and judging whether the test sample contains components of the standard sample according to an identification result.
Specifically, background spectrum in the first Raman spectrum data is removed through background removing processing of the first Raman spectrum data of the test sample, second Raman spectrum data is obtained, and then components in the test sample are identified according to a Raman spectrum database of the standard sample and the second Raman spectrum data, so that an identification result is obtained.
Further, as a preferred embodiment, the background removing processing is performed on the first raman spectrum data to obtain second raman spectrum data, specifically:
background removing processing is carried out on the first Raman spectrum data through a high-pass filter matrix and a penalty function to obtain second Raman spectrum data, and the calculation formula of the second Raman spectrum data is as follows:
Figure BDA0001906229380000051
wherein,
Figure BDA0001906229380000052
is the second Raman spectrum data, y is the first Raman spectrum data, x is the iterative spectrum data, F (x) is the evaluation function, H is the finite field high pass filter matrix, phi is the penalty function, lambda is the weight coefficient, DiIs the i-th order differential, M is the total order of the differential, NiIs the i-th order differential data length.
Specifically, the expression of the finite field high-pass filter matrix is as follows:
H=BA-1
wherein H is a high-pass filter matrix, and B is a differential matrix; a is a truncated matrix, and H, B and A are both N × N dimensional matrices.
The expression of the penalty function is:
φ=cx2
where φ is a penalty function, c is a coefficient, and x is iterative spectral data.
The iterative spectrum data is obtained by processing the first Raman spectrum data by adopting a BEADS algorithm, and the processing steps are as follows:
step one, acquiring first Raman spectrum data y, a truncation matrix A, a differential matrix B and an ith order differential DiThe first derivative of the penalty function phi' (x), and a weighting factor lambdaiWherein i is 0,1, … M, and M is the total order of differentiation;
step two, obtaining first intermediate data according to the truncation matrix A and the differential matrix B, wherein a calculation formula of the first intermediate data is as follows:
b=BTBA-1x;
step three, setting iterative spectrum data x as y;
step four, according to the ith order differential DiAnd the first derivative phi' (x) of the penalty function to obtain second intermediate dataThe calculation formula of the second intermediate data is as follows:
Figure BDA0001906229380000053
step five, according to the weight coefficient lambdaiThe ith order differential DiAnd obtaining third intermediate data from the second intermediate data, wherein the calculation formula of the third intermediate data is as follows:
Figure BDA0001906229380000061
sixthly, obtaining fourth intermediate data according to the truncation matrix A, the differential matrix B and the third intermediate data D, wherein a calculation formula of the fourth intermediate data is as follows:
Q=BTB+ATDA;
seventhly, obtaining first iteration spectrum data g according to the truncation matrix A, the first intermediate data b and the fourth intermediate data Q, wherein a calculation formula of the first iteration spectrum data g is as follows:
g=AQ-1b;
step eight, judging whether the first iteration spectrum data g reaches a preset iteration frequency, if not, making x equal to g, and returning to the step four; otherwise, let x be g, end iteration, and output x.
Specifically, the background spectrum and the interference noise in the first raman spectrum data are separated by processing the first raman spectrum data with the BEADS algorithm. The penalty function is used to sparsify the initial spectral data. And after background removal processing is carried out on the first Raman spectrum data through a high-pass filter and a penalty function, second Raman spectrum data with the background spectrum removed are obtained.
Further, as a preferred embodiment, the finite field high-pass filter matrix is obtained by:
acquiring a first differential operator and a truncation operator;
specifically, the first differential operator is a 2d order differential operator, which can be obtained by:
with P [ -1,2, -1] as the second order differential operator, the 2d order differential operator can be obtained by d convolutions of P, so the formula for the d convolutions of P is:
Pd=[ad......a0......ad]
wherein, a0、adAnd d are both constants.
Specifically, the truncation operator is obtained by:
setting the cut-off frequency to fcThen the truncation operator is QdSo as to truncate operator QdThe calculation formula of (2) is as follows:
Qd=convd([-1 2 -1])+αconvd([1 2 1])
wherein d is a constant, and d is a constant,
Figure BDA0001906229380000062
obtaining a first augmentation matrix and a second augmentation matrix according to the first differential operator and the truncation operator;
specifically, the first augmentation matrix is AEAnd a second amplification matrix BEThe amplification matrix AEAnd BEIs obtained by the following steps:
convolution of P d timesdAnd truncation operator QdConstructing an N x (N + d) -dimensional augmented matrix A of a truncated matrix A and a differential matrix BEAnd BEFirst augmentation matrix AEThe expression of (a) is:
Figure BDA0001906229380000071
second augmentation matrix BEThe expression of (a) is:
Figure BDA0001906229380000072
wherein, a0、ad、b0、bdAnd d are both constants, AEIs a first augmented matrix, BEIs a second augmented matrix.
Acquiring an extension mode transformation matrix;
specifically, the expression of the boundary extension matrix is:
Figure BDA0001906229380000073
wherein G is a boundary extension matrix, E is an NxN dimensional identity matrix, and L and R are a dxN dimensional left boundary and a right boundary, respectively.
Obtaining a truncation matrix according to the first augmentation matrix and the extension mode transformation matrix;
specifically, the expression of the truncation matrix is:
A=AEG
where A is a truncation matrix, AEIs the first augmented matrix and G is the boundary extended matrix.
Obtaining a differential matrix according to the second augmentation matrix and the extension mode transformation matrix;
specifically, the expression of the differential matrix is:
B=BEG
wherein B is a differential matrix, BEIs the second augmented matrix and G is the boundary extended matrix.
And obtaining a finite field high-pass filter matrix according to the truncation matrix and the differential matrix.
Specifically, the calculation formula of the finite field high-pass filter matrix is as follows:
H=BA-1
wherein H is a high-pass filter matrix, B is a differential matrix, A is a truncation matrix, and H, B and A are both NxN dimensional matrices.
Specifically, by introducing a boundary extension matrix into a finite field high-pass filter matrix, the baseline shift phenomenon caused by the fact that the differential value of the boundary position is a non-zero value in the prior art is eliminated.
As a further preferred embodiment, before background removing processing is performed on the first raman spectrum data, the method further comprises the following steps:
calculating the minimum resolution of the wave number according to the wavelength resolution of the Raman spectrometer to obtain a wave number division value, wherein the calculation formula of the wave number division value is as follows:
Figure BDA0001906229380000081
wherein k isΔIs the division value of wave number, lambdamaxIs the maximum of the measurable wavelength, λ, in the first Raman spectral dataΔIs the wavelength resolution of the raman spectrometer.
Specifically, the wavenumber division value is wavenumber resolution accurate to a finite fraction of a digit, which is calculated from parameters of the raman spectrometer. And a uniform wave number division value is provided for interpolation processing of Raman spectrum data in subsequent steps, and the uniqueness of variables is ensured.
Further, as a preferred embodiment, the raman spectrum database of the standard sample is obtained by:
acquiring third Raman spectrum data of the standard sample;
specifically, the third raman spectrum data is initial spectrum data of the standard sample obtained by acquiring raman spectrum data of the standard sample by the raman spectrometer.
Background removing processing is carried out on the third Raman spectrum data to obtain fourth Raman spectrum data;
specifically, the fourth raman spectrum data is standard spectrum data of the standard sample obtained after background removal processing is performed on the standard sample, and each standard spectrum data of the standard sample is provided with a serial number, so that processing in subsequent steps is facilitated. The step of background removal of the standard sample and the step of background removal of the test sample are the same, i.e. the step of background removal of the third raman spectral data and the step of background removal of the first raman spectral data are the same.
Performing interpolation processing on the fourth Raman spectrum data according to the wave number scale values to obtain first Raman shift spectrum data;
specifically, each raman spectrum data in the fourth raman spectrum data is interpolated by using the wave number scale values to obtain global raman shift spectrum data, that is, the first raman shift spectrum data. For example, the raman spectrum data of the standard sample with number k in the fourth raman spectrum data is selected, and interpolation processing is performed using the wave number division value, so as to obtain the raman shift spectrum data with dimension n and unit of wave number.
Taking the maximum value in the first Raman shift spectrum data as a normalization parameter, and carrying out normalization processing on the first Raman shift spectrum data to obtain second Raman shift spectrum data;
obtaining a mask array according to the magnitude relation between the first Raman shift spectrum data and a first threshold;
specifically, the first threshold is a raman shift spectrum data threshold, which is set according to actual needs. The dimensions of the mask array and the dimensions of the first raman shift spectral data are the same. The calculation formula of the mask array is as follows:
Figure BDA0001906229380000091
wherein alpha iskIs a mask array, ε is a first threshold value, yk0The method is characterized in that after the Raman spectrum data with the serial number of k in a Raman spectrum database is subjected to interpolation processing by using a wave number dividing value, first Raman shift spectrum data with the dimensionality of n is obtained.
And performing background interference removal processing on the second Raman shift spectrum data according to the mask module group to obtain third Raman shift spectrum data, wherein a calculation formula of the third Raman shift spectrum data is as follows:
yk=yk0οαk"omicron" is the Hadamard product;
wherein, ykAs third Raman shift spectral data, αkAs a mask array, yk0To use the wave number scale value to perform Raman spectrum data with the serial number of k in the Raman spectrum databaseAfter interpolation processing, obtaining first Raman shift spectrum data with the dimensionality of n;
specifically, the calculation formula of the third raman shift spectral data represents the third raman shift spectral data ykEqual to the first Raman shift spectrum data yk0And mask array alphakAnd carrying out composite operation to obtain an operation result. The background interference removing processing is zero setting processing of tiny values in the second Raman shift spectrum data, and the scope of the tiny values is set according to actual needs.
And marking and storing the third Raman shift spectrum data to obtain a Raman spectrum database of the standard sample.
Specifically, a Raman spectrum database of the standard sample is constructed, standard reference data are provided for component detection of the test sample, and accuracy of a test result is guaranteed.
Further, as a preferred embodiment, the identifying the components in the test sample specifically includes:
performing interpolation processing on the second Raman spectrum data according to the wave number scale values to obtain fourth Raman shift spectrum data;
and identifying the components in the test sample according to the Raman spectrum database of the standard sample and the fourth Raman shift spectrum data.
Specifically, the fourth raman shift spectrum data is raman shift spectrum data of the test sample having a dimension n and a unit of wavenumber, which is obtained by interpolating the second raman spectrum data using the wavenumber scale. Interpolation processing is carried out on the second Raman spectrum data by using the wave number scale values, so that the singularity of variables in the detection process can be ensured, and the influence of other factors on the detection result is avoided.
Further as a preferred embodiment, the identifying the components in the test sample according to the raman spectrum database and the fourth raman shift spectrum data of the standard sample specifically includes:
removing background interference from the fourth Raman shift spectrum data according to the mask module group to obtain fifth Raman shift spectrum data;
specifically, the background interference removal processing on the fourth raman shift spectrum data is specifically to perform zeroing processing on a tiny value in the fourth raman shift spectrum data, where the tiny value is set according to actual needs. The calculation formula of the fifth raman shift spectrum data is as follows:
zk=zοαk"omicron" is the Hadamard product;
wherein z iskFor the projection of the fourth Raman shift spectrum data z on the vector subspace spanned by the k mask of the standard sample, alphakTo mask array, z is the fourth raman shift spectral data.
The calculation formula of the fifth Raman shift spectrum data represents the fifth Raman shift spectrum data zkEquals fourth raman shift spectral data z and mask array alphakAnd carrying out composite operation to obtain an operation result.
Calculating the energy ratio of the fifth Raman shift spectrum data in the test sample, wherein the calculation formula of the energy ratio is as follows:
Figure BDA0001906229380000101
wherein eta iskN is the data dimension of the fourth Raman shift spectrum data, zkProjection of fourth Raman shift spectrum data on a vector subspace spanned by a k mask of a standard sample is carried out, and z is the fourth Raman shift spectrum data;
and judging the relation between the energy ratio and the second threshold value to obtain a first judgment result.
Specifically, the second threshold is an energy-to-ratio threshold, which is obtained by performing a test on a standard sample. The mask number sequence is used for avoiding the influence of a non-correlation peak value on a calculation result in the correlation coefficient calculation process while peak searching operation is carried out, and the calculation complexity is reduced.
Further as a preferred embodiment, the identifying the components in the test sample according to the raman spectrum database and the fourth raman shift spectrum data of the standard sample further comprises the following steps:
calculating a correlation coefficient of the third Raman shift spectrum data and the fifth Raman shift spectrum data, wherein the calculation formula of the correlation coefficient is as follows:
Figure BDA0001906229380000111
where ρ iskIs the correlation coefficient, ykAs third Raman shift spectral data, zkFifth raman shift spectral data;
and judging the magnitude relation between the correlation coefficient and the third threshold value to obtain a second judgment result.
In particular, the third threshold is a correlation coefficient threshold, which is derived from a large amount of experimental data. By calculating the correlation coefficient of the displacement spectrum data of the test sample and the displacement spectrum of the standard sample, whether the test sample contains the standard sample can be further judged, and the accuracy of the judgment result is improved.
Further as a preferred embodiment, the identifying the components in the test sample according to the raman spectrum database and the fourth raman shift spectrum data of the standard sample further comprises the following steps:
if the first judgment result is that the energy ratio is greater than a second threshold value and the second judgment result is that the correlation coefficient is greater than a third threshold value, determining that the test sample contains the components of the standard sample; otherwise, the test sample is judged to contain no components of the standard sample.
Specifically, whether the components of the standard sample are contained in the test sample or not is judged through the first judgment result and the second judgment result together, and the accuracy of the judgment result is ensured.
Referring to fig. 2, an embodiment of the present invention further provides a system for processing raman spectrum data corresponding to the method in fig. 1, including:
at least one memory for storing a program;
at least one processor for loading the program to implement the method for processing raman spectral data.
The contents in the above method embodiments are all applicable to the embodiment of the present system, the functions specifically implemented by the embodiment of the present system are the same as those in the above method embodiments, and the beneficial effects achieved by the embodiment of the present system are also the same as those achieved by the above method.
In summary, according to the processing method and system for raman spectrum data of the present invention, the background spectrum in the first raman spectrum data of the test sample is removed by performing background removal processing on the first raman spectrum data to obtain the second raman spectrum data, and then the components in the test sample are identified according to the raman spectrum database of the standard sample and the second raman spectrum data to obtain the identification result, and the present invention avoids the influence of the background spectrum on the test result in the identification process by performing the background removal processing, and ensures the accuracy of the identification result; further, a high-pass filter boundary extension matrix is used for eliminating a baseline shift phenomenon caused by the fact that a boundary position differential value is a nonzero value in the prior art; further, the influence of baseline background spectrum and interference noise is removed by performing background removal processing on the initial spectrum data; furthermore, by introducing the mask sequence, the influence of the uncorrelated peak value on the calculation result during peak searching operation is avoided, and the calculation complexity is reduced.
While the preferred embodiments of the present invention have been illustrated and described, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (5)

1. A processing method of Raman spectrum data is characterized in that: the method comprises the following steps:
collecting first Raman spectrum data of a test sample through a Raman spectrometer;
background removing processing is carried out on the first Raman spectrum data to obtain second Raman spectrum data;
identifying components in the test sample according to the Raman spectrum database of the standard sample and the second Raman spectrum data to obtain an identification result;
wherein before background removal processing is performed on the first Raman spectrum data, the method further comprises the following steps:
calculating the minimum resolution of the wave number according to the wavelength resolution of the Raman spectrometer to obtain a wave number division value, wherein the calculation formula of the wave number division value is as follows:
Figure FDA0003247319020000011
wherein k isΔIs the division value of wave number, lambdamaxIs the maximum of the measurable wavelength, λ, in the first Raman spectral dataΔThe wavelength resolution of the Raman spectrometer;
the Raman spectrum database of the standard sample is obtained by the following steps:
acquiring third Raman spectrum data of the standard sample;
background removing processing is carried out on the third Raman spectrum data to obtain fourth Raman spectrum data;
performing interpolation processing on the fourth Raman spectrum data according to the wave number scale values to obtain first Raman shift spectrum data;
taking the maximum value in the first Raman shift spectrum data as a normalization parameter, and carrying out normalization processing on the first Raman shift spectrum data to obtain second Raman shift spectrum data;
obtaining a mask array according to the magnitude relation between the first Raman shift spectrum data and a first threshold;
and performing background interference removal processing on the second Raman shift spectrum data according to the mask module group to obtain third Raman shift spectrum data, wherein a calculation formula of the third Raman shift spectrum data is as follows:
Figure FDA0003247319020000012
wherein, ykAs third Raman shift spectral data, αkAs a mask array, yk0Interpolation of Raman spectrum data with number k in Raman spectrum database using wave number scale division valueAfter processing, obtaining first Raman shift spectrum data with the dimensionality of n;
marking and storing the third Raman shift spectrum data to obtain a Raman spectrum database of the standard sample;
the identifying of the components in the test sample comprises:
performing interpolation processing on the second Raman spectrum data according to the wave number scale values to obtain fourth Raman shift spectrum data;
removing background interference from the fourth Raman shift spectrum data according to the mask module group to obtain fifth Raman shift spectrum data;
calculating the energy ratio of the fifth Raman shift spectrum data in the test sample, wherein the calculation formula of the energy ratio is as follows:
Figure FDA0003247319020000021
wherein eta iskN is the data dimension of the fourth Raman shift spectrum data, zkProjection of fourth Raman shift spectrum data on a vector subspace spanned by a k mask of a standard sample is carried out, and z is the fourth Raman shift spectrum data;
judging the relation between the energy ratio and the second threshold value to obtain a first judgment result;
the method for identifying the components in the test sample further comprises the following steps:
calculating a correlation coefficient of the third Raman shift spectrum data and the fifth Raman shift spectrum data, wherein the calculation formula of the correlation coefficient is as follows:
Figure FDA0003247319020000022
where ρ iskIs the correlation coefficient, ykAs third Raman shift spectral data, zkFifth raman shift spectral data;
and judging the magnitude relation between the correlation coefficient and the third threshold value to obtain a second judgment result.
2. A method of processing raman spectral data according to claim 1, wherein: the background removing processing is carried out on the first Raman spectrum data to obtain second Raman spectrum data, and the method specifically comprises the following steps:
background removing processing is carried out on the first Raman spectrum data through a high-pass filter matrix and a penalty function to obtain second Raman spectrum data, and the calculation formula of the second Raman spectrum data is as follows:
Figure FDA0003247319020000023
wherein,
Figure FDA0003247319020000024
is the second Raman spectrum data, y is the first Raman spectrum data, x is the iterative spectrum data, F (x) is the evaluation function, H is the finite field high pass filter matrix, phi is the penalty function, lambda is the weight coefficient, DiIs the i-th order differential, M is the total order of the differential, NiIs the i-th order differential data length.
3. A method of processing raman spectral data according to claim 2, wherein: the finite field high-pass filter matrix is obtained by the following steps:
acquiring a first differential operator and a truncation operator;
obtaining a first augmentation matrix and a second augmentation matrix according to the first differential operator and the truncation operator;
acquiring an extension mode transformation matrix;
obtaining a truncation matrix according to the first augmentation matrix and the extension mode transformation matrix;
obtaining a differential matrix according to the second augmentation matrix and the extension mode transformation matrix;
and obtaining a finite field high-pass filter matrix according to the truncation matrix and the differential matrix.
4. A method of processing raman spectral data according to claim 1, wherein: the method for identifying the components in the test sample further comprises the following steps:
if the first judgment result is that the energy ratio is greater than a second threshold value and the second judgment result is that the correlation coefficient is greater than a third threshold value, determining that the test sample contains the components of the standard sample; otherwise, the test sample is judged to contain no components of the standard sample.
5. A system for processing raman spectral data, comprising: the method comprises the following steps:
at least one memory for storing a program;
at least one processor for loading the program to perform a method of processing raman spectral data according to any one of claims 1 to 4.
CN201811533229.6A 2018-12-14 2018-12-14 Processing method and system of Raman spectrum data Active CN109632761B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811533229.6A CN109632761B (en) 2018-12-14 2018-12-14 Processing method and system of Raman spectrum data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811533229.6A CN109632761B (en) 2018-12-14 2018-12-14 Processing method and system of Raman spectrum data

Publications (2)

Publication Number Publication Date
CN109632761A CN109632761A (en) 2019-04-16
CN109632761B true CN109632761B (en) 2021-11-09

Family

ID=66074199

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811533229.6A Active CN109632761B (en) 2018-12-14 2018-12-14 Processing method and system of Raman spectrum data

Country Status (1)

Country Link
CN (1) CN109632761B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110532285B (en) * 2019-07-16 2022-06-24 河北凌析科技有限公司 Associated case determining method and system, electronic device and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101852734A (en) * 2010-06-01 2010-10-06 中国人民解放军第二军医大学 Fake medicine discrimination and analysis device, system and method
CN102360502A (en) * 2011-09-07 2012-02-22 中国科学院武汉物理与数学研究所 Automatic baseline correction method
CN103217409A (en) * 2013-03-22 2013-07-24 重庆绿色智能技术研究院 Raman spectral preprocessing method
CN103913764A (en) * 2014-02-24 2014-07-09 东华理工大学 NaI (TI) scintillation detector gamma energy spectrum high-resolution inversion analysis process and method based on gauss response matrix
CN106645091A (en) * 2017-02-15 2017-05-10 西派特(北京)科技有限公司 Raman spectrum based substance qualitative detection method
CN108241845A (en) * 2016-12-26 2018-07-03 同方威视技术股份有限公司 Method for deducting spectrogram background and the method by Raman mass spectrum database substance

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101852734A (en) * 2010-06-01 2010-10-06 中国人民解放军第二军医大学 Fake medicine discrimination and analysis device, system and method
CN102360502A (en) * 2011-09-07 2012-02-22 中国科学院武汉物理与数学研究所 Automatic baseline correction method
CN103217409A (en) * 2013-03-22 2013-07-24 重庆绿色智能技术研究院 Raman spectral preprocessing method
CN103913764A (en) * 2014-02-24 2014-07-09 东华理工大学 NaI (TI) scintillation detector gamma energy spectrum high-resolution inversion analysis process and method based on gauss response matrix
CN108241845A (en) * 2016-12-26 2018-07-03 同方威视技术股份有限公司 Method for deducting spectrogram background and the method by Raman mass spectrum database substance
CN106645091A (en) * 2017-02-15 2017-05-10 西派特(北京)科技有限公司 Raman spectrum based substance qualitative detection method

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
"基于凸优化的激光诱导击穿光谱基线校正方法";柯轲 等;《光谱学与光谱分析》;20180731;第38卷(第7期);第2256-2261页 *
"基于稀疏表述及字典学习遥感图像处理关键技术研究";秦振涛;《中国博士学位论文全文数据库 基础科技辑》;20160415;全文 *
"拉曼光谱仪器标准化的算法研究及其在药品快检中的应用";陈辉;《中国优秀硕士学位论文全文数据库 医药卫生科技辑》;20160115;全文 *
"模型传递及背景扣除的方法研究及其软件实现";詹德坚;《中国优秀硕士学位论文全文数据库 工程科技Ⅰ辑》;20140515;全文 *
"稀疏凸优化理论及其在激光诱导击穿光谱定量分析中的应用";珂轲;《中国优秀硕士学位论文全文数据库 工程科技Ⅰ辑》;20181015;论文正文第34-37页 *

Also Published As

Publication number Publication date
CN109632761A (en) 2019-04-16

Similar Documents

Publication Publication Date Title
EP0156877B1 (en) Multicomponent quantitative analytical method and apparatus
JP6091493B2 (en) Spectroscopic apparatus and spectroscopy for determining the components present in a sample
CN113109317B (en) Raman spectrum quantitative analysis method and system based on background subtraction extraction peak area
CN109738413B (en) Mixture Raman spectrum qualitative analysis method based on sparse nonnegative least square
CN109409350B (en) PCA modeling feedback type load weighting-based wavelength selection method
CN110763913B (en) Derivative spectrum smoothing processing method based on signal segmentation classification
CN104656100A (en) Line-scanning hyperspectral real-time anomaly detection method and system
CN113008805A (en) Radix angelicae decoction piece quality prediction method based on hyperspectral imaging depth analysis
CN109632761B (en) Processing method and system of Raman spectrum data
CN111474117B (en) Method and device for monitoring crop diseases
CN114878544B (en) Method for identifying target component from mixture SERS spectrum
CN108195817B (en) Raman spectrum detection method for removing solvent interference
CN113109318B (en) Raman spectrum quantitative analysis method and system based on spectral peak height direct extraction
CN114002162A (en) Soil organic carbon content estimation method, apparatus, storage medium, and program product
JP2021148776A (en) Peak analysis method and waveform processing device
EP0298398B1 (en) Analyzer of partial molecular structures
JPS60143765A (en) Multichannel simultaneous detection data processor for chromatograph
CN113610817B (en) Characteristic peak identification method, computing device and storage medium
CN109030452A (en) A kind of Raman spectrum data noise-reduction method based on 5 points of smoothing algorithms three times
CN116106289A (en) Complex system substance multicomponent analysis method and system based on Raman signal
CN112683816B (en) Spectrum identification method for spectrum model transmission
CN116399836A (en) Cross-talk fluorescence spectrum decomposition method based on alternating gradient descent algorithm
CN117929347B (en) Quick pesticide residue detection method and system for traditional Chinese medicine safflower
CN112229816A (en) Wood elastic modulus prediction method based on OPLS-SPA-MIX-PLS
CN116660207B (en) Method for determining characteristic spectrum in oil product quick detection and octane content detection system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant