Disclosure of Invention
In order to solve the technical problems, the invention provides a structural damage identification method based on vibration data two-dimension and multi-category augmentation, which utilizes a lightweight MLP architecture only containing a shallow structure and a small amount of parameters to obtain high precision and high timeliness.
In order to solve the technical problems, the technical scheme provided by the invention is as follows:
The method comprises the steps of firstly constructing signals of a plurality of sensors into a two-dimensional multichannel time-frequency spectrogram according to time and space association information among the sensors, and taking the two-dimensional multichannel time-frequency spectrogram as input of a multi-scale feature extraction network; then, generating an countermeasure network by using an auxiliary discriminator to amplify multi-category sample data; then, training a multi-scale feature extraction network by adopting a mixed loss function; and finally, classifying the real-time input samples by utilizing a multi-scale feature extraction network to obtain an evaluation result of the structural state.
The technical scheme is further improved as follows:
Preferably, the signals of the plurality of sensors are constructed as a two-dimensional multichannel time-frequency spectrogram, comprising the following steps:
S1-1, for multiple vibration signals collected by c sensors First to the vibration signalPerforming continuous wavelet transformation to obtain×Time-frequency spectrum coefficient matrix of (a):
(1)
Wherein, Is thatIs used for the transformation of the wavelet coefficients of (a),For the number of degrees of frequency scale,Is thatIs used for the sequence length of (a),Performing modular operation; The calculation formula of (2) is as follows:
(2)
(3)
Wherein, Is Gabor mother waveletIs used for the conjugation of (a),Is thatIs used for the scale-up factor of (a),Is thatA translation factor of (2);
s1-2, will Mapping the numerical range of (C) to the gray value interval of (0, 255), and scaling the size to 224x224 pixels to obtain a time-spectrum gray scale map; Then, time-lapse spectrum gray scale mapGamma correction is performed to obtainThe calculation formula is as follows:
(4)
Wherein, AndRespectively taking maximum value operation and taking minimum value operation,Is thatGamma coefficients of (a); The calculation formula of (2) is as follows:
(5)
Wherein, In order to adjust the gamma coefficient of the gamma,Is thatAn average value of the image entropy and the gray gradient of (c),;
S1-3, splicing the gamma corrected time spectrum gray level diagramObtaining a multichannel time-frequency spectrogram; the spatial information of the plurality of sensor signals is correlated by increasing the spatial dimension.
Preferably, the generating of the countermeasure network by using the auxiliary discriminator performs multi-category sample data augmentation, specifically:
introducing condition generators in generating an countermeasure network Distinguishing classifierForming ADC-GAN, optimized objective function of ADC-GANThe method comprises the following steps:
(8)
Wherein, For joint distribution of the real data and the tag information,To generate a joint distribution of data and tag information; lambda is an adjustable hyper-parameter; to determine the classification probability of the classifier input sample z to the true sample y +, The classification probability of the sample y - is generated for z attribution, and the calculation formulas are respectively as follows:
(9)
(10)
Wherein, A feature extraction layer shared by the classifier and the discriminator; And For the full connection layer, the real sample label and the generated sample label are mapped into a learnable feature vector respectively.
Preferably, the mixing loss function is lost by Softmax 、 Center lossAnd exclusive regularization lossThe three parts of the composite material are formed,Guiding a Softmax layer of the backbone network to form a classification decision boundary; center lossThe function of the method is to measure the distance of the high-dimensional embedded features and reduce the distance of the intra-class sample features extracted by the backbone network; exclusive regularization lossThe function of the method is to measure the distance between high-dimensional embedded feature clusters and increase the distance between sample features extracted by a backbone network.
Preferably, the calculation formula of the mixing loss function L is:
(11)
Wherein, AndRepresenting the input feature vector and class of the last full connection layer of the backbone network of the structural state identification model of the ith sample, wherein d represents feature dimension; Weight matrix representing last full-connection layer of backbone network of structural state identification model Is selected from the group consisting of the (j) th column,The ith column of the weight matrix W of the last full-connection layer of the backbone network of the structural state identification model is represented, and n and m respectively represent the category number and the sample number of the small-batch training samples; Is the learnable first The class-center feature vector is used to determine,Representing taking L2 norm operation; And Is an adjustable super parameter for balancing 、 AndLoss between; k is training round, N is adjustable annealing super parameter,Representing taking the L1 norm operation.
Preferably, the multi-scale feature extraction network is trained by adopting a mixed loss function, firstly, an ADC-GAN is trained by using a damage real sample, and a high-quality pseudo sample is generated by utilizing the trained ADC-GAN and histogram specification; mixing a pseudo sample with a real sample, and establishing an augmented sample library; next, a structural state recognition model is trained using the augmented sample library and the mixed loss function.
Compared with the prior art, the structural damage identification method based on vibration data two-dimension and multi-category augmentation provided by the invention has the following advantages:
According to the structural damage identification method based on vibration data two-dimension and multi-category augmentation, firstly, signals of a plurality of sensors are constructed into a two-dimensional multi-channel time-frequency spectrogram to be used as input of a multi-scale feature extraction network GoogLeNet in order to fully utilize time and space association information among the sensors; then, aiming at the problem that a sufficient number of damaged samples for training the deep learning network are difficult to obtain in actual engineering, an auxiliary discriminator is utilized to generate an antagonistic network ADC-GAN to realize multi-class sample data augmentation; then, training GoogLeNet by adopting the proposed mixed loss function, and improving the training effect of the backbone network; and finally, classifying the real-time input samples by GoogLeNet to obtain an evaluation result of the structural state. The method comprises the following components: the data augmentation model ADC-GAN, the structural state recognition model GoogLeNet and the mixed loss function all have the advantage of improving the sample recognition precision. Compared with a representative signal processing method and a new deep learning method, the method disclosed by the invention has the advantages of higher precision, lower requirement on a damaged sample and good practical value.
Detailed Description
The following describes specific embodiments of the present invention in detail. It should be understood that the detailed description and specific examples, while indicating and illustrating the invention, are not intended to limit the invention.
According to the structural damage identification method based on vibration data two-dimension and multi-category augmentation, firstly, signals of a plurality of sensors are constructed into a two-dimensional multi-channel time-frequency spectrogram to be used as input of a multi-scale feature extraction network GoogLeNet in order to fully utilize time and space association information among the sensors; then, aiming at the problem that a sufficient number of damaged samples for training the deep learning network are difficult to obtain in actual engineering, an auxiliary discriminator is utilized to generate an antagonistic network ADC-GAN to realize multi-class sample data augmentation; then, training GoogLeNet by adopting the proposed mixed loss function, and improving the training effect of the backbone network; and finally, classifying the real-time input samples by GoogLeNet to obtain an evaluation result of the structural state.
The method for identifying the structural damage based on vibration data two-dimension and multi-category augmentation comprises two stages of online and offline, as shown in figure 1. In the off-line stage, firstly, a continuous wavelet transformation is adopted to convert the vibration signal of the sensor into a two-dimensional time-frequency spectrogram, gamma brightness correction is carried out on the time-frequency spectrogram to eliminate brightness difference caused by space position, and a multi-channel time-frequency spectrogram is constructed by splicing the time-frequency spectrograms and is used as the input of a structural state identification model. Then, training the data augmentation model and generating a structural damage pseudo-sample, and performing histogram specification on the pseudo-sample to improve the image quality of the pseudo-sample, thereby realizing multi-category sample data augmentation. And finally, adopting the proposed mixed loss function and the augmented sample library to train the structural state recognition model together, and improving the training effect of the structural state recognition model.
In the online stage, a multichannel time-frequency spectrogram is constructed through online monitoring data, and real-time input is classified by using a structural state recognition model trained in the offline stage, so that a structural state evaluation result is obtained.
The invention discloses a structural damage identification method based on vibration data two-dimension and multi-category augmentation, which comprises the following steps:
Step S1, constructing a multichannel time-frequency spectrogram
In order to fully utilize the time and space correlation information among the sensors, one-dimensional time sequence signals of a plurality of sensors are converted into a multichannel time-frequency spectrogram which is used as the input of a subsequent structural state recognition model. The continuous wavelet transformation can effectively capture the time-frequency characteristics of the vibration signals which are not stable and have noise, and the time-frequency spectrum gray level map of the single vibration signal is obtained by adopting the continuous wavelet transformation in the embodiment.
S1-1, for multiple vibration signals collected by c sensorsFirst to the vibration signalPerforming continuous wavelet transformation to obtain×Time-frequency spectrum coefficient matrix of (a):
(1)
Wherein, Is thatIs used for the transformation of the wavelet coefficients of (a),For the number of degrees of frequency scale,Is thatIs used for the sequence length of (a),Is a modulo operation.The calculation formula of (2) is as follows:
(2)
(3)
Wherein, Is Gabor mother waveletIs used for the conjugation of (a),Is thatIs used for the scale-up factor of (a),Is thatIs a translation factor of (a).
S1-2, for adapting to the input format of the structural state recognition model, time-lapse spectral coefficient matrixAnd (5) performing post-treatment. Will beMapping the numerical range of (C) to the gray value interval of (0, 255), and scaling the size to 224x224 pixels to obtain a time-spectrum gray scale map. Then, to eliminate the luminance difference of the time spectrum caused by the spatial position of the sensor, the gray scale of the time spectrum is displayedGamma correction is performed to obtainThe calculation formula is as follows:
(4)
Wherein, AndRespectively taking maximum value operation and taking minimum value operation,Is thatGamma coefficients of (a); The calculation formula of (2) is as follows:
(5)
Wherein, In order to adjust the gamma coefficient of the gamma,Is thatAn average value of the image entropy and the gray gradient of (c),。
In the present embodiment, there is provided0.5, ThenThe calculation formula of (2) is as follows:
(6)
Wherein, Represents the probability distribution of the gray level of a pixel in the image, H represents the height of the image, W represents the width of the image,Representing the horizontal gradient of the image at pixel coordinates (n, m),Representing the horizontal gradient of the image at pixel coordinates (n, m).
S1-3, splicing the gamma corrected time spectrum gray level diagramA multichannel time-frequency spectrogram is obtained. By increasing the spatial dimension, the spatial information of the plurality of sensor signals is effectively correlated.
Step S2, data augmentation
In practical engineering, the number of structural damage samples is usually limited, so that the structural state recognition model is not sufficiently trained. In response to this problem, the proposed method employs an improved Generation Antagonism Network (GAN) to achieve multi-class sample data augmentation prior to training the structural impairment model.
The standard GAN contains two neural networks, generator G and arbiter D, the overall structure of which is shown in fig. 2. The generator functions to generate realistic samples to fool the discriminant, while the discriminant functions to accurately distinguish between real and pseudo samples. Zero and game are carried out between the generator and the arbiter, and when the generator and the arbiter reach Nash equilibrium, the effect of the generator is optimal. The optimized objective function V (G, D) for GAN is:
(7)
where z is the input sample of the arbiter D, AndThe edge distributions for the real samples and the generated samples respectively,Expressed in real data distributionThe desire for z is lower than that for z,Representing the data distribution being generatedThe following expectation for z, D (z) represents the probabilistic predictive value that the arbiter D is true for z.
The standard GAN is an unsupervised deep learning model, and has the problems of unstable training, mode collapse and the like in the application process. Furthermore, standard GANs can only generate single class pseudo-samples due to lack of supervision of class label information. To solve these problems, the ADC-GAN model is selected in this embodiment. ADC-GAN through introducing condition generatorDistinguishing classifierStandard GAN is improved. Condition generatorThe condition information is embedded into random noise input, so that the condition generation of the specified class sample is realized, and the controllability of the generated sample is improved. Distinguishing classifierThe real classification loss of the generated samples is introduced, so that the training process of the generator is supervised by the class labels, and the diversity of the generated samples is improved.
The overall structure of the ADC-GAN is shown in FIG. 3. Optimized objective function of ADC-GAN compared to the optimized objective function of standard GANOptimizing and addingClassification loss of (c):
(8)
Wherein, For joint distribution of the real data and the tag information,To generate a joint distribution of data and tag information; lambda is an adjustable hyper-parameter.To determine the classification probability of the classifier input sample Z belonging to the real sample y + The classification probability of the sample y - is generated for Z attribution, and the calculation formulas are respectively as follows:
(9)
(10)
Wherein, A feature extraction layer shared by the classifier and the discriminator; And For the full connection layer, the real sample label and the generated sample label are mapped into a learnable feature vector respectively.
Step S3, structural state identification model
In order to fully excavate structural vibration information carried by the multichannel time-frequency spectrogram, the embodiment adopts a two-dimensional deep convolutional neural network to extract time-frequency and spatial characteristic information so as to realize structural state identification.
The structural state recognition model is composed of a backbone network and a Softmax classification layer. The backbone network extracts low-level features and high-level features of input data in a progressive manner, and the process directly determines the performance of structure state identification. Therefore, it is important to select an appropriate backbone network for the recognition model. In general, increasing the number of layers of the backbone network may improve the performance of the network; but the increase of the number of layers also brings new problems such as overfitting, gradient disappearance, gradient explosion, etc. GoogLeNet is a backbone network developed by Google corporation, and the parallel processing is introduced by adopting Inception architecture, so that the number of parameters in the network is reduced, and the training speed and accuracy of the network are improved. Compared with AlexNet, VGGNet and other early backbone networks, the method has the advantages of less network parameters and faster reasoning speed; compared with ResNet appearing later, the multi-scale characteristic information of the input sample can be better extracted. Thus, this embodiment employs GoogLeNet as the backbone network for the structural state recognition model.
Softmax is a widely used deep learning model training loss function. In the method provided by the embodiment, the structural state recognition model is a multi-classification task model; and the difference between the lesion type samples is not large. Under the above conditions, the model trained with Softmax is susceptible to noise interference.
In order to improve the performance of the structural state recognition model, the embodiment proposes a hybrid loss function L as a loss function of the training process on the basis of the Softmax function. Loss of L by Softmax 、 Center lossAnd exclusive regularization lossThree parts.Is a traditional Softmax penalty, and the Softmax layer responsible for guiding the backbone network forms the classification decision boundary. To be compatible withThe embodiment sets the bias of the last full-connection layer of the backbone network to 0, soThe bias of the last fully connected layer is not taken into account. Center lossThe function of the method is to measure the distance of the high-dimensional embedded features and reduce the distance of the intra-class sample features extracted by the backbone network; exclusive regularization lossThe function of the method is to measure the distance between high-dimensional embedded feature clusters and increase the distance between sample features extracted by a backbone network.
The calculation formula of L is:
(11)
Wherein, AndRepresenting the input feature vector and class of the last full connection layer of the backbone network of the structural state identification model of the ith sample, wherein d represents feature dimension; Weight matrix representing last full-connection layer of backbone network of structural state identification model Is selected from the group consisting of the (j) th column,The ith column of the weight matrix W of the last full-connection layer of the backbone network of the structural state identification model is represented, and n and m respectively represent the category number and the sample number of the small-batch training samples; Is the learnable first The class-center feature vector is used to determine,Representing taking L2 norm operation; And Is an adjustable super parameter for balancing 、 AndLoss between; k is training round, N is adjustable annealing super parameter,Representing taking the L1 norm operation.
Step S4, training and monitoring flow
The overall training flow of this embodiment is shown in fig. 4, and the multi-channel time-frequency spectrogram used is divided according to the ratio of 8:2, and is used for the offline stage and the online stage of the simulation respectively.
During the off-line simulation stage, the damage real sample is used for training a data augmentation model, and a trained model and a histogram are utilized for prescribing to generate a high-quality pseudo sample. The pseudo sample is mixed with the real sample, and an augmented sample library is established. Next, a structural state recognition model is trained using the augmented sample library and the mixed loss function. In the on-line stage, classifying the input samples by means of the trained structural state recognition model to finish structural state monitoring.
In training the data augmentation model, ADC-GAN is used as the data augmentation model and equation (8) is used as its loss function. The parameter λ of the formula (8) is set to 1. The optimization algorithms of the ADC-GAN generator, the discriminator and the discrimination optimizer all adopt an Adam optimizer, and the learning rates are respectively set to be 5 multiplied by 10 -5、2×10-4 and 2 multiplied by 10 -4. The maximum training round for ADC-GAN was set to 50 and the Mini-batch size for each round was 8.
In training the structural recognition model, googLeNet is used as the structural recognition model, and equation (11) is used as its loss function. Parameters in equation (11)、And N is set to 1, 2 and 1, respectively. GoogLeNet the optimization algorithm was used as Adam optimizer, and the learning rate was set to 2 x 10 -4. GoogLeNet maximum training runs were set to 50 and Mini-batch size for each run was 52.
Experiment verification
(1) Data source and generation of multi-channel time-frequency spectrogram
The experimental data is a structural vibration test data of a certain disclosure. The research object is a truss tower with the height of 9m, the accelerometer and the structural damage position of the truss tower are shown in the figure 5, the corresponding numbers of ML 1-ML 9 are from 01x to 09x, and the sampling frequency of the accelerometer is 1651Hz; DAM1-DAM6 correspond to the location of structural damage.
The published data records structural response vibration signals experienced by truss towers under natural excitation during the period of time from 8.1 in 2020 to 7.31 in 2021. These signals include health status vibration signals and vibration signals of 6 different damage states. The experiment was performed with a selection of portions of the data collected from accelerometers numbered 01x to 09x, see table 1 for details of these data.
Table 1 experimental data details:
The one-dimensional vibration signals acquired by the multiple sensors are processed in a combined mode and are converted into a multichannel time-frequency spectrogram, the multichannel time-frequency spectrogram is used as an input of a structural state identification model, and the specific process is as follows: (1) Non-overlapping slicing of selected experimental data (accelerometer numbers from 01x to 09 x) using a sliding window of length 2048 to obtain signal samples Wherein n=1, a 11592; (2) And converting the signal sample into a multichannel time-frequency spectrogram. The detailed information of the multichannel time-frequency spectrogram is shown in table 2, wherein the structural damage categories are 6 categories in total, and are respectively: DAM6 (S), DAM4 (S), DAM3 (S), DAM6 (M), DAM4 (M) and DAM3 (M).
Table 2 multichannel time-frequency spectrum details:
(2) Experimental platform and model training details
The overall training flow is shown in fig. 4. At each cross-validation, 11592 samples were used in total, of which 9274 samples were used for the offline phase of the simulation and 2318 samples were used for the online phase of the simulation. The ratio of each damaged sample to the healthy state sample in the samples used in the two stages is kept consistent and is 1:6.
In the off-line phase, the data augmentation model ADC-GAN is first trained using real samples (4626) of all lesion classes and a lesion class pseudo-sample is generated. Mixing the pseudo sample and the real sample to form an augmented sample library (13900), wherein the ratio of each damage sample to each health sample is 2:6. the backbone network of the data augmentation model ADC-GAN is realized by adopting a BigGAN backbone network, and the super parameter lambda of the optimization objective function is set to be 1. In ADC-GAN, a generatorDistinguishing device D and distinguishing classifierThe optimization algorithm of (a) is set as Adam optimizer, and the learning rates are set as 5×10 -5、2×10-4 and 2×10 -4 respectively. The maximum training iteration round for ADC-GAN is set to 50 and Mini-batch for each round is set to 8.
The structural state recognition model is then trained using the augmented sample library (13900). Training with mixed loss function, its super parameter、And N is set to 1, 2 and 1, respectively. The optimization algorithm of the structural state recognition model is set as an Adam optimizer, and the learning rate is set as 2×10 -4; the maximum training iteration round is set to 50 and the Mini-batch for each round is set to 52.
(3) Overall performance assessment
The proposed method is compared with representative signal processing methods and deep learning methods respectively by adopting Accuracy, precision, recall, F-score, false Positive Rate (FPR) and FALSE NEGATIVE RATE (FNR) evaluation indexes, so that the overall performance of the method is evaluated.
The accuracy of identifying whether the structure has defects or not by adopting 5-fold cross validation to obtain the evaluation index of the on-line sample identification result is shown in the table 3. As can be seen from Table 3, the Accuracy and F1-Score, which identify whether the structure is defective, are 98.10% and 97.91%, respectively.
Table 3 the method of the present invention was used to identify the accuracy (%) of the whole sample:
The disclosed data of this experiment adopts a signal processing method with covariance-driven random subspace recognition as a representative, and the natural frequency difference of the structure in different states is analyzed, and the result is shown in fig. 6. As can be seen from the results shown in fig. 6 (a), the difference between the natural frequency (gray) when severe damage occurs to DAM6 (S) and the natural frequency (cyan and blue) when the structure is healthy is obvious, and the two states can be effectively distinguished by using the natural frequency as a criterion; the natural frequencies of DAM3 (S) and DAM4 (S) when severe damage occurs are not obviously different from the natural frequencies (blue, red and green) when the structure is healthy, and the three states are difficult to distinguish by taking the natural frequencies as criteria. As can be seen from the calculation results of the data of the other period shown in fig. 6 (b), the natural frequencies (gray) of the three light lesions of DAM6 (M), DAM4 (M) and DAM3 (M) are very close to the natural frequencies (blue, red and green) of the healthy structure, and cannot be effectively distinguished. Therefore, the method can effectively identify mild and severe structural damage.
Experiments show that the 3DS-CNN network processes the fused multi-vibration signal characteristics to realize structural damage identification, and the method is a representative novel structural damage identification method based on deep learning. Wherein, the ratio of each damaged sample to healthy sample is 1:1. The whole identification precision of the verification set in the experimental method is 81.43%, and effective structural state identification can be realized. However, the method provided by the invention is remarkably improved by 16.67% in the overall recognition accuracy.
In summary, the method of the present invention exhibits superior performance compared to the signal processing method and the 3DS-CNN deep learning method using covariance to drive random subspace recognition. The method not only can effectively identify various structural states, but also has higher overall identification precision. In addition, the experimental data sources of the method and the comparison method are consistent, and the complex structure excited by natural excitation responds to vibration signals. Thus, the proposed method has practical value in engineering applications when dealing with noisy and disturbing vibration signals generated by natural excitation.
The above embodiments are merely preferred embodiments of the present invention, and are not intended to limit the present invention in any way. While the invention has been described with reference to preferred embodiments, it is not intended to be limiting. Therefore, any simple modification, equivalent variation and modification of the above embodiments according to the technical substance of the present invention shall fall within the scope of the technical solution of the present invention.