CN114972906A

CN114972906A - Soil quality type identification method for excavation surface of soil pressure balance shield

Info

Publication number: CN114972906A
Application number: CN202210479274.8A
Authority: CN
Inventors: 张东明; 傅蕾; 黄宏伟
Original assignee: Tongji University
Current assignee: Tongji University
Priority date: 2022-05-05
Filing date: 2022-05-05
Publication date: 2022-08-30

Abstract

The invention relates to the field of earth pressure balance shield tunnel construction. An earth pressure balance shield excavation surface soil type identification method is proposed, which is characterized by comprising the following steps: step 1, establishing a dregs classification system; step 2, establishing a dregs identification database; step 3, dregs identification model 's build. The method of the invention can not only reflect the nature of the soil itself scientifically, but also is suitable for the actual needs of shield construction, and the constructed muck visual identification criterion provides a basis for the production of data set labels; The identification can quickly and accurately obtain the soil type information in front of the excavation, and has the characteristics of real-time and low cost.

Description

A method for identifying soil types of earth pressure balance shield excavation surface

技术领域technical field

本发明涉及土压平衡盾构法隧道施工领域。The invention relates to the field of earth pressure balance shield tunnel construction.

背景技术Background technique

随着中国城市化进程的不断加快，地铁的里程和规模也随之迅速扩张，这对隧道施工的安全及对周围环境影响的控制也带来了极大的挑战。盾构法因其具有施工速度快，对周围环境影响小，机械化程度高的特点被广泛应用于地铁隧道的建设中。但盾构掘进时的施工决策依赖于对前方地层情况的了解，当实际土质与已知情况不符时，会对施工质量与施工安全带来极大的威胁。目前施工人员所掌握的隧道工程沿线的地质信息仅来源于施工前地质勘探提供的地质描述，钻孔间的地质情况仅通过插值法获得，具有很高的不确定性。当实际地层与地勘报告中提供的地层信息不符时，掘进过程可能产生隧道质量问题，严重时甚至导致安全事故的发生。因此，一种利用渣土图像实时进行开挖面土质识别的系统能够保证施工人员及时掌握地层变化，调整施工决策，保证隧道的安全稳定掘进。With the continuous acceleration of urbanization in China, the mileage and scale of subways have also expanded rapidly, which also brings great challenges to the safety of tunnel construction and the control of the impact on the surrounding environment. The shield method is widely used in the construction of subway tunnels because of its fast construction speed, little impact on the surrounding environment, and high degree of mechanization. However, the construction decision during shield tunneling depends on the understanding of the stratum ahead. When the actual soil quality is inconsistent with the known situation, it will bring a great threat to the construction quality and construction safety. At present, the geological information along the tunnel project mastered by the construction personnel only comes from the geological description provided by the geological exploration before construction. When the actual stratum does not match the stratum information provided in the geological survey report, tunnel quality problems may occur during the excavation process, and even lead to safety accidents in severe cases. Therefore, a system that uses muck images to identify the soil quality of the excavation surface in real time can ensure that the construction personnel can grasp the stratum changes in time, adjust the construction decisions, and ensure the safe and stable excavation of the tunnel.

目前针对盾构机开挖面土质实时识别问题主要有以下解决方案：At present, there are mainly the following solutions for the real-time identification of soil quality on the excavation surface of the shield machine:

申请号为202110183343.6的中国专利申请提出一种基于盾构实时掘进参数的地层特性确定方法，先根据隧道施工前的勘察报告对地层情况进行预分类，同时对盾构机实时收集的参数进行变换和处理，对处理后的指数绘制成二维平面图，判断是否产生新的地层类型并更新地层类型数量。最后将标准化后的参数输入到K-Means算法中，输出对应参数确定的地层类型。The Chinese patent application with application number 202110183343.6 proposes a method for determining stratum characteristics based on real-time excavation parameters of shield tunneling. First, the stratum conditions are pre-classified according to the survey report before tunnel construction, and at the same time, the parameters collected in real time by the shield machine are transformed and analyzed. Process, draw the processed index into a two-dimensional plan, judge whether a new formation type is generated and update the number of formation types. Finally, the standardized parameters are input into the K-Means algorithm, and the formation type determined by the corresponding parameters is output.

申请号为201911421797.1的中国专利申请提出一种基于土压平衡盾构机参数数据驱动反演地质的方法。对土压平衡盾构机的历史运行数据进行处理制备数据集，在训练集上学习随机森林模型，测试集上验证模型。提取土压平衡盾构机的实时掘进数据输入随机森林模型中即可得到地质条件信息。The Chinese patent application with the application number of 201911421797.1 proposes a method for driving geological inversion based on the parameter data of the earth pressure balance shield machine. The historical operation data of the earth pressure balance shield machine is processed to prepare the data set, the random forest model is learned on the training set, and the model is verified on the test set. The geological condition information can be obtained by extracting the real-time excavation data of the earth pressure balance shield machine and inputting it into the random forest model.

申请号为202110152053.5的中国专利申请提出一种基于振动信号的实时土体类别识别方法。首先在掘进机构上安装传感器采集开挖产生的振动信号，再根据振动信号分析接触土体的平均剪切模量，最后将分析结果与土体类别数据库中的数据进行对比判断土体类别。The Chinese patent application with the application number of 202110152053.5 proposes a real-time soil type identification method based on vibration signals. Firstly, a sensor is installed on the excavation mechanism to collect the vibration signal generated by the excavation, and then the average shear modulus of the contacting soil is analyzed according to the vibration signal. Finally, the analysis result is compared with the data in the soil type database to determine the soil type.

申请号为202110558201.3的中国专利申请提出一种盾构法隧道施工开挖面土质实时预测系统及方法，预测系统包括相似工程数据获取模块、土质信息处理模块、施工数据处理模块、土质预测器构建模块以及土质预测模块。土质预测器构建模块基于施工数据获取的土质特征灰度图和渣土图像，采用卷积神经网络构建第一土质预测器和第二土质预测器，融合两个预测器的预测结果得到土质实时检测结果。The Chinese patent application with the application number of 202110558201.3 proposes a real-time prediction system and method for soil quality of a shield tunnel construction excavation surface. The prediction system includes a similar engineering data acquisition module, a soil quality information processing module, a construction data processing module, and a soil quality predictor building module. And soil prediction module. The soil quality predictor building module uses the convolutional neural network to construct the first soil quality predictor and the second soil quality predictor based on the grayscale map of the soil quality characteristics and the muck image obtained from the construction data, and combines the prediction results of the two predictors to obtain the real-time soil quality detection. result.

现有技术的缺点Disadvantages of the prior art

盾构施工参数与土质类型、盾构机选型、隧道埋深、渣土改良状态等各方面因素有关，因此利用实时掘进参数反演地质类型的方法建立的地质识别模型只能在施工条件相似的隧道工程中使用，否则识别效果很难保证；利用传感器采集振动信号分析土体参数的方法需要额外布置传感器，且土体只能基于平均剪切模量这一个参数指标进行分类；利用渣土图像进行土质识别的方法中，图像对应的土质特征信息是基于隧道施工前地质勘查结果分析计算得到的，而地质勘查结果本身就存在信息不全面不准确的特点，钻孔间的土层分布情况存在很大的不确定性，标签的不准确直接影响模型的识别效果。Shield construction parameters are related to various factors such as soil type, shield machine selection, tunnel burial depth, and slag improvement state. Therefore, the geological identification model established by using real-time tunneling parameters to invert geological types can only be constructed under similar construction conditions. Otherwise, the identification effect is difficult to guarantee; the method of using sensors to collect vibration signals to analyze soil parameters requires additional sensors, and the soil can only be classified based on the average shear modulus, a parameter index; using slag soil In the method of soil quality identification by image, the soil quality information corresponding to the image is obtained by analysis and calculation based on the geological survey results before tunnel construction, and the geological survey results themselves have the characteristics of incomplete and inaccurate information, and the distribution of soil layers between boreholes. There is a lot of uncertainty, and the inaccuracy of the label directly affects the recognition effect of the model.

发明内容SUMMARY OF THE INVENTION

本发明要解决的技术问题Technical problem to be solved by the present invention

设计一种渣土分类方法及基于渣土监控视频实时进行土质类型识别的系统，旨在解决盾构司机难以获取前方地层信息的问题。通过本发明可以实现实时、准确地识别刀盘前方地层的土质类型，为盾构司机后续调整施工参数及渣土改良方案等施工决策提供信息。A method for classifying muck and a system for real-time identification of soil types based on muck monitoring video are designed to solve the problem that it is difficult for shield drivers to obtain information on the stratum ahead. The invention can realize real-time and accurate identification of the soil type of the stratum in front of the cutter head, and provide information for construction decisions such as subsequent adjustment of construction parameters and slag soil improvement plans by the shield driver.

本发明的目的object of the present invention

本发明的核心，就是研发一种能结合土体自身工程性质及工程经验的渣土分类系统，并根据盾构机监控摄像头拍摄的渣土图像实时识别前方土质类型的方法。使用该方法，能够得到基于盾构施工特性的渣土分类结果，并基于监控摄像头采集的出渣视频对渣土进行实时的定位以及对前方地质类型进行实时判断。特点：低成本、自动化、适用于各种土质地层中掘进的隧道。The core of the present invention is to develop a muck classification system that can combine the engineering properties and engineering experience of the soil itself, and real-time identify the type of soil ahead according to the muck image captured by the shield machine monitoring camera. Using this method, the classification results of the muck based on the construction characteristics of the shield can be obtained, and the muck can be located in real time based on the slag discharge video collected by the surveillance camera, and the geological type ahead can be judged in real time. Features: low cost, automatic, suitable for tunnels in various soil layers.

本发明技术方案Technical scheme of the present invention

一种土压平衡盾构开挖面土质类型识别方法，该方法包括对土压平衡盾构渣土进行基于盾构施工特性的分类，以及基于深度学习算法对出渣土进行图像识别。An earth pressure balance shield excavation surface soil type identification method includes classifying earth pressure balance shield dregs based on shield construction characteristics, and performing image recognition on the excavated dregs based on a deep learning algorithm.

本方法是根据对出渣土的识别来确定开挖面土质类型，因此首先需要基于土质类型的划分来确定渣土类别的划分。首先建立一个同时考虑土体自身工程特性以及盾构施工经验的渣土分类系统。接着收集渣土数据，并对数据进行挑选及处理得到土压平衡盾构出渣土识别数据集。然后构建并训练得到出渣土目标检测模型。最后输入土压平衡盾构施工时的出渣土监控视频即可实现渣土的实时识别，得到开挖面的土质类别信息。This method determines the soil type of the excavation surface based on the identification of the excavated soil. Therefore, it is necessary to determine the classification of the soil type based on the classification of the soil type. Firstly, a slag classification system is established that considers both the engineering characteristics of the soil and the experience of shield construction. Then collect the muck data, select and process the data to obtain the earth pressure balance shield muck identification data set. Then build and train to get the muck target detection model. Finally, input the monitoring video of the muck during the construction of the earth pressure balance shield to realize the real-time identification of the muck, and obtain the soil type information of the excavation surface.

本方法步骤如下：The steps of this method are as follows:

步骤1、建立渣土分类系统Step 1. Establish a muck classification system

为实现盾构掘进时土质类型的识别，对盾构工程所在地区浅部地层工程性质的了解是第一步。静力触探试验(CPT)是一种可以综合反映土体物理力学性质的原位试验方法。通过分析不同地层的贯入阻力统计特征值，即可掌握各地层土体的工程特性，为渣土分类奠定基础。In order to realize the identification of soil types during shield tunneling, it is the first step to understand the engineering properties of the shallow strata in the area where the shield tunnel is located. Static penetration test (CPT) is an in-situ test method that can comprehensively reflect the physical and mechanical properties of soil. By analyzing the statistical eigenvalues of penetration resistance of different strata, the engineering characteristics of soil mass in each stratum can be grasped, which lays a foundation for the classification of slag.

1.1首先收集盾构施工所在地区的地质勘察报告，整理其中的静力触探分层参数表，统计各钻孔经过地层的贯入阻力值，剔除异常值，即明显不合理的数据。1.1 First, collect the geological survey report of the area where the shield construction is located, sort out the static penetration layer parameter table, count the penetration resistance value of each borehole through the stratum, and eliminate the abnormal value, that is, the obviously unreasonable data.

1.2接着计算各地层贯入阻力指标的统计特征值，包括均值、标准差以及变异系数。1.2 Then calculate the statistical characteristic values of the penetration resistance index of each layer, including the mean, standard deviation and coefficient of variation.

首先根据各层的土层名称进行大类划分，如：粘土(CL)、粉土(SI)、砂土(SA)。First, the soil layers are classified into categories according to the name of each layer, such as: clay (CL), silt (SI), and sand (SA).

再根据能够反映土体工程性质的贯入阻力均值进行详细划分，如粘土(CL)划分为软粘土(SC)、普通粘土(NC)、硬粘土(HC)。According to the average penetration resistance, which can reflect the properties of soil engineering, it is divided into details, such as clay (CL) is divided into soft clay (SC), ordinary clay (NC), and hard clay (HC).

依据此方法，能够将某一地区的地层根据其工程性质进行合并分类，获得适用于土压平衡盾构参数调整的土质类型划分。According to this method, the strata in a certain area can be combined and classified according to their engineering properties, and the classification of soil types suitable for the adjustment of EPB shield parameters can be obtained.

1.3对某地区土质类型划分后，在此基础上确定渣土的类别划分。1.3 After classifying the soil types in a certain area, determine the classification of the muck on this basis.

首先考虑是否存在对土压平衡盾构施工影响较大的特殊土或承压水等情况，若存在此类有特殊性质的地层，而在根据贯入阻力值进行分类时又将此类地层与其他地层合并为一类，则应该将这类地层单独划分。除此之外，考虑盾构法施工时常出现开挖面经过多层地层的情况，此时排出的渣土可定为“混合土”这一特殊的类型。至此，在土质类别划分的基础上进行调整即可获得同时考虑土体自身工程性质以及施工经验的渣土类别划分结果。First, consider whether there are special soils or confined water that have a great impact on the earth pressure balance shield construction. If other strata are combined into one category, such strata should be classified separately. In addition, considering that the excavation surface often passes through multi-layer strata during the construction of the shield method, the slag discharged at this time can be designated as a special type of "mixed soil". So far, the classification results of slag soil can be obtained by adjusting on the basis of the classification of soil quality, taking into account the engineering properties of the soil itself and construction experience.

1.4由于在之后建立数据集的过程中，要对每一张渣土图片标记其渣土类型，因此最后再基于渣土的视觉特征来构建目力鉴别渣土类别的准则，便于后续标签的制作。本方法的渣土识别准则采用四种渣土的表观特征来确定渣土类别，分别为渣土形状(MS)、渣土断面形态(CS)、渣土表面平整度(SC)以及渣土颜色(MC)。1.4 In the process of establishing the data set later, each muck image needs to be marked with its muck type, so finally, based on the visual characteristics of the muck, a criterion for visually identifying the muck category is constructed, which is convenient for the production of subsequent labels. The muck identification criterion of this method uses four kinds of muck apparent characteristics to determine the muck category, namely muck shape (MS), muck section shape (CS), muck surface flatness (SC), and muck shape (MS), respectively. Color (MC).

步骤2、建立渣土识别数据库Step 2. Establish a muck identification database

2.1首先广泛收集该方法应用地区的渣土监控视频。2.1 First, the monitoring videos of muck in the area where the method is applied are widely collected.

一般监控视频分辨率为1920*1080，最好不低于1280*720。The general surveillance video resolution is 1920*1080, preferably no less than 1280*720.

渣土监控摄像头应安装于皮带传送机的正上方或斜上方位置，对准螺旋出土机出土口，保证拍摄到完整的出渣过程和渣土形态。在出土口上方安装照明设备，并保持镜头表面洁净，使得拍摄的图像明亮清晰、易于识别。The muck monitoring camera should be installed directly above or diagonally above the belt conveyor, aiming at the excavation port of the screw excavator to ensure that the complete slag process and muck form can be photographed. Install lighting equipment above the excavation opening and keep the lens surface clean, so that the captured images are bright, clear and easy to identify.

2.2收集到视频数据后对数据进行预处理。2.2 Preprocessing the data after collecting the video data.

首先将稳定出土阶段的有效视频截取出来，再以每30帧一张的频率抽取视频帧(具体取帧频率可根据盾构机推进速度进行调整)。依据渣土形态和图像背景尽量变化多样的原则选取图像，剔除镜头污损下拍摄的图像以及由于视频卡顿造成的马赛克不良图像和冗余图像。First, cut out the valid video of the stable unearthed stage, and then extract video frames at a frequency of one frame every 30 frames (the specific frame frequency can be adjusted according to the advancing speed of the shield machine). Images were selected according to the principle of changing the shape of the muck and the image background as much as possible, and the images taken under the contamination of the lens, as well as the poor mosaic images and redundant images caused by video freezes were excluded.

2.3采用LabelImg图像标注软件对渣土图像进行手工标注，用矩形框框出每张图像内的渣土体目标，并输入每个目标框的渣土类别。标签信息为PASCAL VOC格式，每张图片真实的目标框位置信息和相应的渣土类别信息都被保存为XML文件。在训练集和验证集中，每张图像都必须带有标签，里面记录的信息被称为真实值(ground truth，GT)。该真实值提供给3.3.4的训练过程。2.3 Use LabelImg image annotation software to manually mark the muck image, frame the muck target in each image with a rectangular frame, and input the muck category of each target frame. The label information is in PASCAL VOC format, and the real target box position information and corresponding muck category information of each image are saved as XML files. In both training and validation sets, each image must be labeled, and the information recorded in it is called the ground truth (GT). This ground truth is provided to the training process in 3.3.4.

2.4最后获得的渣土图片文件及对应的标签文件即为渣土数据集，将其按8：1：1的比例划分为训练集、验证集和测试集。2.4 The finally obtained muck image file and corresponding label file is the muck data set, which is divided into training set, validation set and test set according to the ratio of 8:1:1.

步骤3、渣土识别模型的构建Step 3. Construction of muck identification model

渣土识别模型的构建流程也是代码的组成部分以及编写顺序。本发明采用卷积神经网络建立渣土识别模型，其任务属于目标检测任务，因此需要搭建目标检测网络。模型在深度学习框架Pytorch上进行搭建，采用python语言实现。The construction process of the muck identification model is also an integral part of the code and the writing order. The present invention uses the convolutional neural network to establish the muck identification model, and its task belongs to the target detection task, so a target detection network needs to be built. The model is built on the deep learning framework Pytorch and implemented in python language.

所述渣土识别模型包括：自定义数据集模块、渣土检测网络搭建模块、构建训练过程模块、模型测试及评估指标模块和渣土识别结果可视化模块。The muck identification model includes: a custom data set module, a muck detection network building module, a construction training process module, a model testing and evaluation index module, and a muck identification result visualization module.

3.1在自定义数据集模块中3.1 In custom dataset module

3.1.1需要从数据集存储路径上载入渣土图像数据以及相应的标签信息，分别保存在列表中。3.1.1 It is necessary to load the muck image data and the corresponding label information from the data set storage path, and save them in the list respectively.

3.1.2接着对图像数据和标签数据进行一定的处理和转换。将图像数据从PIL格式转换成形状为(C,H,W)的Tensor格式，其中C代表图像通道数，H代表图片高度，W代表图片宽度，并将所有像素值除以255归一化到0-1之间。3.1.2 Then perform certain processing and conversion on the image data and label data. Convert image data from PIL format to Tensor format of shape (C,H,W), where C is the number of image channels, H is the picture height, and W is the picture width, and normalize all pixel values by dividing by 255 to between 0-1.

3.1.3接着在图像输入网络之前对其进行一定的随机图像增强操作，以0.5的概率对图像进行水平翻转、网格掩码、随机裁剪、色彩抖动以及模糊五种数据增强，提高数据库图片的多样性，增强模型的鲁棒性。3.1.3 Then, before the image is input into the network, a certain random image enhancement operation is performed, and the image is enhanced by horizontal flipping, grid masking, random cropping, color jittering and blurring with a probability of 0.5, so as to improve the image quality of the database. Diversity to enhance the robustness of the model.

3.1.4计算出整个数据集中图片三通道像素值的均值和标准差，对图片进行标准化处理以便于模型的学习，提供给步骤3.3。3.1.4 Calculate the mean and standard deviation of the three-channel pixel values of the picture in the entire data set, standardize the picture to facilitate the learning of the model, and provide it to step 3.3.

3.2在渣土检测网络搭建模块中，网络主要分为三个部分：骨干网络、颈部网络以及顶部网络，提供给步骤3.3。3.2 In the muck detection network building module, the network is mainly divided into three parts: the backbone network, the neck network and the top network, which are provided to step 3.3.

骨干网络的主要功能为提取图像特征，其基本结构为卷积层、批标准化层以及激活层的组合，将此基本结构进行堆叠即可完成骨干网络的搭建。在骨干网络中加入残差连接能够增加网络的深度，有效提高其特征提取的能力。The main function of the backbone network is to extract image features, and its basic structure is a combination of convolution layer, batch normalization layer and activation layer. The backbone network can be built by stacking this basic structure. Adding residual connections to the backbone network can increase the depth of the network and effectively improve its feature extraction capability.

颈部网络的主要功能为特征增强和融合，举例而非限定，本发明采用的空间金字塔池化(spatial pyramid pooling，SPP)和路径聚合网络(path aggregation network，PANet)为颈部网络常用的模块。其中，SPP模块能够有效扩展感受野并分离出最重要的上下文信息，PANet能够通过融合骨干网络不同层级的参数来增强提取的图像特征。颈部网络对于整个目标检测网络的性能提升有很大的帮助。The main function of the neck network is feature enhancement and fusion. For example, but not limitation, the spatial pyramid pooling (spatial pyramid pooling, SPP) and path aggregation network (path aggregation network, PANet) used in the present invention are commonly used modules of the neck network. . Among them, the SPP module can effectively expand the receptive field and separate the most important contextual information, and PANet can enhance the extracted image features by fusing the parameters of different levels of the backbone network. The neck network is of great help to the performance improvement of the entire object detection network.

顶部网络即为检测器，其主要功能即为对渣土位置及类别信息做最终的回归预测。更高分辨率的输出特征图包含了输入图像更详细的特征，善于小目标的检测，更低分辨率的输出特征图包含输入图像更粗略的特征，善于大目标的检测。The top network is the detector, and its main function is to make the final regression prediction for the location and category information of the muck. The higher-resolution output feature map contains more detailed features of the input image and is good at detecting small objects, while the lower-resolution output feature map contains coarser features of the input image and is good at detecting large objects.

作为实施例，骨干网络采用Darknet53，颈部网络采用SPP和PANet，顶部网络由三个输出尺度的目标检测器组成。图片首先输入骨干网络中，该骨干网络由53层卷积组合构成，其中插入一定数目的残差连接。残差连接是为了解决在训练深度神经网络时，网络达到一定层数后可能会出现网络退化的现象，也就是不仅无法提升模型的表达能力，反而使得模型的效果变差。每个卷积组合可以看作一个函数F，输入观测值即可得到输出预测值y＝F(x)。残差连接分为两条线，一条线是观测值x输入代表函数F的卷积组合中F(x)，另一条线则直接传递观测值x，最终的预测值为两条线的输出结果相加，即F(x)+x。若把整个残差连接看作一个函数H，输入的观测值为x，则预测值y＝H(x)＝F(x)+x。F(x)＝H(x)-x即为残差，也就是预测值y和观测值x之间的差距。这样，经过残差连接的下一层不仅包含上一层经过非线性变化(卷积组合)后的信息，也包含上一层原始的信息，这样处理使得信息只可能逐层递增，模型的性能也不会因为网络深度的增加而降低。As an example, the backbone network adopts Darknet53, the neck network adopts SPP and PANet, and the top network consists of object detectors with three output scales. The picture is first input into the backbone network, which consists of 53 layers of convolutional combinations, in which a certain number of residual connections are inserted. Residual connection is to solve the problem of network degradation when the network reaches a certain number of layers when training a deep neural network, that is, it not only fails to improve the expressive ability of the model, but also makes the effect of the model worse. Each convolution combination can be regarded as a function F, and the output predicted value y=F(x) can be obtained by inputting the observed value. The residual connection is divided into two lines, one line is the observation value x input represents F(x) in the convolution combination of the function F, and the other line directly transmits the observation value x, and the final predicted value is the output result of the two lines Add, that is, F(x)+x. If the entire residual connection is regarded as a function H, and the input observation value is x, the predicted value y=H(x)=F(x)+x. F(x)=H(x)-x is the residual, which is the difference between the predicted value y and the observed value x. In this way, the next layer after the residual connection not only contains the information of the previous layer after nonlinear change (convolution combination), but also contains the original information of the previous layer. This processing makes the information only possible to increase layer by layer, and the performance of the model Nor does it decrease as the depth of the network increases.

经过骨干网络的层层特征提取后，特征图已经蕴含了能够识别渣土的高级语义信息，但不免失去了很多细节特征，不利于小目标的检测。因此，骨干网络训练得到的特征图被输入颈部网络进行进一步的特征融合和增强。After the layer-by-layer feature extraction of the backbone network, the feature map already contains high-level semantic information that can identify the muck, but it inevitably loses many detailed features, which is not conducive to the detection of small targets. Therefore, the feature maps trained by the backbone network are input to the neck network for further feature fusion and enhancement.

骨干网络顶端的特征图首先被输入颈部网络的SPP结构中，SPP网络对特征图进行三种不同尺度的池化操作再在通道维度拼接起来，可解决卷积网络对特征的重复提取问题，大大提高产生检测候选框的速度，节省计算成本。接着，骨干网络最顶端的特征图再与骨干网络中间两个不同层级的特征图输入颈部网络的PANet网络中进行自顶向下和自底向上的双向融合，使其既具有深层的语义信息又具有浅层的纹理、颜色等基本信息，保证特征的完整性和多样性，提高最终的预测效果。The feature map at the top of the backbone network is first input into the SPP structure of the neck network. The SPP network performs three different scale pooling operations on the feature map and then stitches them together in the channel dimension, which can solve the problem of repeated feature extraction by the convolutional network. It greatly improves the speed of generating detection candidate frames and saves computing costs. Then, the top feature map of the backbone network and the feature maps of two different levels in the middle of the backbone network are input into the PANet network of the neck network for top-down and bottom-up bidirectional fusion, so that it has both deep semantic information. It also has shallow texture, color and other basic information to ensure the integrity and diversity of features and improve the final prediction effect.

骨干网络中三个不同层级的特征图(“特征图1、特征图2、特征图3”)经过颈部网络的融合增强后分别输入最终的顶部网络检测器中，各自经过一个简单的卷积组合，回归得到最终的预测结果“预测输出1”“预测输出2”“预测输出3”。三个不同大小的特征图分别包含了不同尺度的特征，也对应预测不同大小的目标。预测输出1特征图较大，包含的细节信息最多，负责预测小目标物体；预测输出3特征图较小，更易于判别整体信息，负责预测大目标物体。上述三个预测输出的结果合并在一起即为整个网络对图片的预测结果。The feature maps of three different levels in the backbone network ("feature map 1, feature map 2, feature map 3") are fused and enhanced by the neck network and then input into the final top network detector respectively, and each undergoes a simple convolution. Combination, regression to get the final prediction result "prediction output 1" "prediction output 2" "prediction output 3". The three feature maps of different sizes contain features of different scales, and also correspond to predicting targets of different sizes. The feature map of prediction output 1 is larger and contains the most detailed information, and is responsible for predicting small target objects; the feature map of prediction output 3 is smaller, which is easier to distinguish the overall information, and is responsible for predicting large target objects. The results of the above three prediction outputs are combined together to be the prediction results of the entire network for the picture.

3.3在构建训练过程模块中3.3 In building the training process module

3.3.1首先导入自定义数据集模块以及渣土检测网络搭建模块，实例化渣土数据集以及渣土检测网络，构建数据加载器以自定义图片及标签数据输入网络的方式。3.3.1 First, import the custom data set module and the muck detection network building module, instantiate the muck data set and the muck detection network, and build a data loader to customize the way of inputting pictures and label data into the network.

3.3.2设计网络的损失函数，本方法所使用目标检测网络的损失函数分为三部分：渣土定位损失(Localization loss)、目标框置信度损失(Confidence loss)以及渣土分类损失(Classification loss)，如公式(1)所示。3.3.2 Design the loss function of the network. The loss function of the target detection network used in this method is divided into three parts: localization loss, target frame confidence loss and classification loss. ), as shown in formula (1).

Loss＝Localization loss+Confidence loss+Classification lossLoss=Localization loss+Confidence loss+Classification loss

(1)(1)

目标框置信度代表目标框是否包含渣土体以及包含时目标框与真实框交并比的大小。The confidence of the target frame represents whether the target frame contains the slag body and the size of the intersection of the target frame and the real frame when it is included.

3.3.3网络训练时需要设置很多超参数，包括学习率初值及其随训练循环次数增加的变化方式、优化器以及优化器的参数、输入图片批大小、训练循环次数、损失函数各项的权值等。设置好损失函数及超参数后即可进入步骤3.3.4开始训练。3.3.3 Many hyperparameters need to be set during network training, including the initial value of the learning rate and how it changes with the increase of the number of training cycles, the optimizer and the parameters of the optimizer, the batch size of input images, the number of training cycles, and the loss function. weight, etc. After setting the loss function and hyperparameters, you can go to step 3.3.4 to start training.

3.3.4每一批图像输入网络中都会得到预测结果，将预测值与步骤2.3标签中的真实值(ground truth，GT)输入损失函数能够算得当前损失值，也就是预测值和真实值之间的距离。计算损失值对于所有网络参数的导数，利用优化器对网络参数进行优化更新，即为一轮训练迭代。当把训练集中所有图片都轮次输入网络中进行训练后，即为一个训练循环。结束一个训练循环后，再将验证集的图片轮次输入网络中计算网络精度，据此观察网络训练情况并作为下次网络训练调整超参数的依据。当损失下降收敛至某个稳定值后，训练可以结束，保存训练后的网络参数，即获得可准确定位渣土目标和判断渣土类型的识别模型。3.3.4 Each batch of images is input to the network, and the prediction result will be obtained. The current loss value can be calculated by inputting the predicted value and the ground truth (GT) in the label of step 2.3 into the loss function, that is, the difference between the predicted value and the real value. the distance. Calculate the derivative of the loss value for all network parameters, and use the optimizer to optimize and update the network parameters, which is a round of training iterations. When all the pictures in the training set are input into the network for training in turn, it is a training loop. After a training cycle is completed, the images of the validation set are input into the network in turn to calculate the network accuracy. Based on this, the network training situation is observed and used as the basis for adjusting the hyperparameters for the next network training. When the loss decreases and converges to a certain stable value, the training can be ended, and the network parameters after training are saved, that is, a recognition model that can accurately locate the muck target and judge the muck type is obtained.

3.4在模型测试及评估指标模块中，加载训练好的模型和参数，输入需要测试的渣土图片即可获得回归出的渣土定位信息以及类别信息。编写评估指标计算代码，对测试集的检测结果进行定量评估，可评价检测模型的效果。3.4 In the model test and evaluation index module, load the trained model and parameters, and enter the muck image to be tested to obtain the regressed muck location information and category information. Write the evaluation index calculation code to quantitatively evaluate the detection results of the test set, and evaluate the effect of the detection model.

3.5在渣土识别结果可视化模块中，将网络回归得到的渣土体目标框定位信息以及类别信息绘制在原图像上进行识别结果可视化。利用网络回归出的目标框中心点位置信息x0、y0以及长宽信息h、w在原图中绘制目标框，将回归得到的渣土类别信息及相应的置信度以文字形式写在每一个目标框的左上角，同时不同渣土类型的目标框和类别信息采用不同颜色进行绘制和书写。3.5 In the visualization module of muck identification results, the positioning information and category information of muck body target frame obtained by network regression are drawn on the original image to visualize the identification results. Use the center point information x0, y0 and the length and width information h, w of the target frame returned by the network to draw the target frame in the original image, and write the returned slag category information and the corresponding confidence in each target frame in text form At the same time, the target boxes and category information of different muck types are drawn and written in different colors.

有益效果beneficial effect

1、本发明提出了基于土体工程性质与盾构施工经验的渣土分类方法以及渣土目力鉴别准则，该方法得到的分类结果既能科学地反映土体自身的性质又适用于盾构施工的实际需要，构建的渣土目力鉴别准则为数据集标签的制作提供了依据。1. The present invention proposes a method for classifying slag based on soil engineering properties and shield construction experience and a criterion for identifying slag vision. The classification result obtained by this method can not only reflect the nature of the soil itself but also be suitable for shield construction. According to the actual needs, the constructed muck visual identification criterion provides a basis for the production of dataset labels.

2、出渣土是反映前方土质类型最直接的资料，其表观特征的决定因素即为前方土质条件，因此利用深度学习算法通过对出渣土图像进行识别能够快速、准确地得到开挖面前方土质类型信息。而且随着渣土图像数据库的扩充而更新的渣土识别模型适用范围也会越来越广，不存在只能用相似工程训练出的模型预测某一工程土质类型的限制。2. The excavated soil is the most direct data to reflect the type of soil in front, and the determinant of its apparent characteristics is the soil condition in front. Therefore, the deep learning algorithm can be used to identify the excavated soil images quickly and accurately. Cubic soil type information. Moreover, with the expansion of the muck image database, the application range of the muck identification model updated will become wider and wider, and there is no limitation that only a model trained from similar projects can be used to predict a certain engineering soil type.

3、本发明提出的方法在数据收集以及实际应用过程中都仅需要利用现场原本就有的渣土监控视频数据，无需额外装置传感器或其他设备。所采用算法本身的推理速度也高于监控视频的帧率，因此该方法具有实时、成本低的特点。3. In the process of data collection and practical application, the method proposed in the present invention only needs to use the original muck monitoring video data on site, and does not require additional sensors or other equipment. The reasoning speed of the algorithm itself is also higher than the frame rate of the surveillance video, so this method has the characteristics of real-time and low cost.

附图说明Description of drawings

图1为本发明方法流程图Fig. 1 is the flow chart of the method of the present invention

图2为本发明渣土分类系统建立流程Fig. 2 is a flow chart for establishing the slag classification system of the present invention

图3为本发明渣土识别数据库建立流程Fig. 3 is the establishment process flow of the muck identification database of the present invention

图4为本发明渣土图像数据采集示意图Fig. 4 is a schematic diagram of data acquisition of muck images in the present invention

图5为本发明渣土识别模型构建流程Fig. 5 is the construction flow of the muck identification model of the present invention

图6为本发明实施例网络结构示意图FIG. 6 is a schematic diagram of a network structure according to an embodiment of the present invention

具体实施方式Detailed ways

下面将结合具体实施例及其附图对本申请提供的技术方案作进一步说明。结合下面说明，本申请的优点和特征将更加清楚。The technical solutions provided by the present application will be further described below with reference to specific embodiments and accompanying drawings. The advantages and features of the present application will become more apparent in conjunction with the following description.

实施例Example

发明的整体流程如图1所示。The overall flow of the invention is shown in Figure 1.

渣土分类系统的建立流程如图2所示。为实现盾构掘进时土质类型的识别，对盾构工程所在地区浅部地层工程性质的了解是第一步。静力触探试验(CPT)是一种可以综合反映土体物理力学性质的原位试验方法。通过分析不同地层的贯入阻力统计特征值，即可掌握各地层土体的工程特性，为渣土分类奠定基础。The establishment process of the muck classification system is shown in Figure 2. In order to realize the identification of soil types during shield tunneling, it is the first step to understand the engineering properties of the shallow strata in the area where the shield tunnel is located. Static penetration test (CPT) is an in-situ test method that can comprehensively reflect the physical and mechanical properties of soil. By analyzing the statistical eigenvalues of penetration resistance of different strata, the engineering characteristics of soil mass in each stratum can be grasped, which lays a foundation for the classification of slag.

1.1首先收集盾构施工所在地区的地质勘察报告，整理其中的静力触探分层参数表，将各钻孔经过地层的贯入阻力值统计在excel中，剔除异常值，即明显不合理的数据。1.1 First, collect the geological survey report of the area where the shield construction is located, sort out the static penetration layer parameter table, count the penetration resistance value of each borehole through the stratum in excel, and eliminate the abnormal value, which is obviously unreasonable. data.

1.2接着计算各地层贯入阻力指标的统计特征值，包括均值、标准差以及变异系数。首先根据各层的土层名称进行大类划分，如：粘土(CL)、粉土(SI)、砂土(SA)。再根据能够反映土体工程性质的贯入阻力均值进行详细划分，如粘土(CL)划分为软粘土(SC)、普通粘土(NC)、硬粘土(HC)。依据此方法，能够将某一地区的地层根据其工程性质进行合并分类，获得适用于土压平衡盾构参数调整的土质类型划分。1.2 Then calculate the statistical characteristic values of the penetration resistance index of each layer, including the mean, standard deviation and coefficient of variation. First, the soil layers are classified into categories according to the name of each layer, such as: clay (CL), silt (SI), and sand (SA). According to the average penetration resistance, which can reflect the properties of soil engineering, it is divided into details, such as clay (CL) is divided into soft clay (SC), ordinary clay (NC), and hard clay (HC). According to this method, the strata in a certain area can be combined and classified according to their engineering properties, and the classification of soil types suitable for the adjustment of EPB shield parameters can be obtained.

1.3对某地区土质类型划分后，在此基础上确定渣土的类别划分。首先考虑是否存在对土压平衡盾构施工影响较大的特殊土或承压水等情况，若存在此类有特殊性质的地层，而在根据贯入阻力值进行分类时又将此类地层与其他地层合并为一类，则应该将这类地层单独划分。除此之外，考虑盾构法施工时常出现开挖面经过多层地层的情况，此时排出的渣土可定为“混合土”这一特殊的类型。至此，在土质类别划分的基础上进行调整即可获得同时考虑土体自身工程性质以及施工经验的渣土类别划分结果。1.3 After classifying the soil types in a certain area, determine the classification of the muck on this basis. First, consider whether there are special soils or confined water that have a great impact on the earth pressure balance shield construction. If other strata are combined into one category, such strata should be classified separately. In addition, considering that the excavation surface often passes through multi-layer strata during the construction of the shield method, the slag discharged at this time can be designated as a special type of "mixed soil". So far, the classification results of slag soil can be obtained by adjusting on the basis of the classification of soil quality, taking into account the engineering properties of the soil itself and construction experience.

渣土识别数据库的建立流程如图3所示。Figure 3 shows the process of establishing the muck identification database.

2.1首先广泛收集该方法应用地区的渣土监控视频，一般监控视频分辨率为1920*1080，最好不低于1280*720。渣土图像数据采集示意图如图4所示，渣土监控摄像头应安装于皮带传送机的正上方或斜上方位置，对准螺旋出土机出土口，保证拍摄到完整的出渣过程和渣土形态。在出土口上方安装照明设备，并保持镜头表面洁净，使得拍摄的图像明亮清晰、易于识别。2.1 First, collect the monitoring video of the muck in the area where the method is applied. Generally, the resolution of the monitoring video is 1920*1080, preferably not less than 1280*720. Figure 4 shows the schematic diagram of muck image data collection. The muck monitoring camera should be installed directly above or diagonally above the belt conveyor, aiming at the excavation port of the screw excavator to ensure that the complete slag process and muck form can be photographed. . Install lighting equipment above the excavation opening and keep the lens surface clean, so that the captured images are bright, clear and easy to identify.

2.2收集到视频数据后对数据进行预处理。首先将稳定出土阶段的有效视频截取出来，再以每30帧一张的频率抽取视频帧(具体取帧频率可根据盾构机推进速度进行调整)。依据渣土形态和图像背景尽量变化多样的原则选取图像，剔除镜头污损下拍摄的图像以及由于视频卡顿造成的马赛克不良图像和冗余图像。2.2 Preprocessing the data after collecting the video data. First, cut out the valid video of the stable unearthed stage, and then extract video frames at a frequency of one frame every 30 frames (the specific frame frequency can be adjusted according to the advancing speed of the shield machine). Images were selected according to the principle of changing the shape of the muck and the image background as much as possible, and the images taken under the contamination of the lens, as well as the poor mosaic images and redundant images caused by video freezes were excluded.

渣土识别模型的构建流程如图5所示，也是代码的组成部分以及编写顺序。本发明采用卷积神经网络建立渣土识别模型，其任务属于目标检测任务，因此需要搭建目标检测网络。模型在深度学习框架Pytorch上进行搭建，采用python语言实现。The construction process of the muck identification model is shown in Figure 5, which is also a part of the code and the writing sequence. The present invention uses the convolutional neural network to establish the muck identification model, and its task belongs to the target detection task, so a target detection network needs to be built. The model is built on the deep learning framework Pytorch and implemented in python language.

3.1在自定义数据集模块中3.1 In custom dataset module

3.1.2接着对图像数据和标签数据进行一定的处理和转换。将图像数据从PIL格式转换成形状为(C,H,W)的Tensor格式，其中并C代表图像通道数，H代表图片高度，W代表图片宽度，并将所有像素值除以255归一化到0-1之间。3.1.2 Then perform certain processing and conversion on the image data and label data. Convert image data from PIL format to Tensor format of shape (C, H, W), where C represents the number of image channels, H represents the image height, W represents the image width, and normalize all pixel values by dividing by 255 to between 0-1.

如图6所示为实施例网络的结构示意图，骨干网络采用Darknet53，颈部网络采用SPP和PANet，顶部网络由三个输出尺度的目标检测器组成。图片首先输入骨干网络中，该骨干网络由53层卷积组合构成，其中插入一定数目的残差连接。残差连接是为了解决在训练深度神经网络时，网络达到一定层数后可能会出现网络退化的现象，也就是不仅无法提升模型的表达能力，反而使得模型的效果变差。每个卷积组合可以看作一个函数F，输入观测值即可得到输出预测值y＝F(x)。残差连接分为两条线，一条线是观测值x输入代表函数F的卷积组合中F(x)，另一条线则直接传递观测值x，最终的预测值为两条线的输出结果相加，即F(x)+x。若把整个残差连接看作一个函数H，输入的观测值为x，则预测值y＝H(x)＝F(x)+x。F(x)＝H(x)-x即为残差，也就是预测值y和观测值x之间的差距。这样，经过残差连接的下一层不仅包含上一层经过非线性变化(卷积组合)后的信息，也包含上一层原始的信息，这样处理使得信息只可能逐层递增，模型的性能也不会因为网络深度的增加而降低。Figure 6 is a schematic diagram of the structure of the embodiment network. The backbone network adopts Darknet53, the neck network adopts SPP and PANet, and the top network consists of target detectors with three output scales. The picture is first input into the backbone network, which consists of 53 layers of convolutional combinations, in which a certain number of residual connections are inserted. Residual connection is to solve the problem of network degradation when the network reaches a certain number of layers when training a deep neural network, that is, it not only fails to improve the expressive ability of the model, but also makes the effect of the model worse. Each convolution combination can be regarded as a function F, and the output predicted value y=F(x) can be obtained by inputting the observed value. The residual connection is divided into two lines, one line is the observation value x input represents F(x) in the convolution combination of the function F, and the other line directly transmits the observation value x, and the final predicted value is the output result of the two lines Add, that is, F(x)+x. If the entire residual connection is regarded as a function H, and the input observation value is x, the predicted value y=H(x)=F(x)+x. F(x)=H(x)-x is the residual, which is the difference between the predicted value y and the observed value x. In this way, the next layer after the residual connection not only contains the information of the previous layer after nonlinear change (convolution combination), but also contains the original information of the previous layer. This processing makes the information only possible to increase layer by layer, and the performance of the model Nor does it decrease as the depth of the network increases.

3.3在构建训练过程的模块中3.3 In the module that builds the training process

3.3.2设计网络的损失函数，本方法所使用目标检测网络的损失函数主要分为三部分：渣土定位损失(Localization loss)、目标框置信度损失(Confidence loss)以及渣土分类损失(Classification loss)，如公式(1)所示。3.3.2 Design the loss function of the network. The loss function of the target detection network used in this method is mainly divided into three parts: localization loss, target frame confidence loss and classification loss. loss), as shown in formula (1).

Loss＝Localization loss+Confidence loss+Classificaltion loss (1)Loss=Localization loss+Confidence loss+Classificaltion loss (1)

3.5在渣土识别结果可视化模块中，将网络回归得到的渣土体目标框定位信息以及类别信息绘制在原图像上进行识别结果可视化。利用网络回归出的目标框中心点位置信息x₀、y₀以及长宽信息h、w在原图中绘制目标框，将回归得到的渣土类别信息及相应的置信度以文字形式写在每一个目标框的左上角，同时不同渣土类型的目标框和类别信息采用不同颜色进行绘制和书写。3.5 In the visualization module of muck identification results, the positioning information and category information of muck body target frame obtained by network regression are drawn on the original image to visualize the identification results. Use the center point information x ₀ , y ₀ and the length and width information h and w of the target frame returned by the network to draw the target frame in the original image, and write the returned slag category information and the corresponding confidence in the form of text in each The upper left corner of the target box, and the target boxes and category information of different muck types are drawn and written in different colors.

用本发明提出的方法得到的渣土分类系统与渣土目力鉴别准则都会随着地勘报告收集范围的不断扩大和渣土数据库的不断扩充而更新，获得适用范围相应更广的渣土分类系统与目力鉴别准则，同时更新训练渣土识别模型即可获得适用范围相应更广的渣土识别模型。因此，本发明提出的方法理论上适用于所有地区土压平衡盾构开挖面前方土质类型的识别。The muck classification system and the muck visual identification criterion obtained by the method proposed by the present invention will be updated with the continuous expansion of the collection range of the geological prospecting report and the continuous expansion of the muck database, and the muck classification system and the muck classification system with a correspondingly wider application range can be obtained. Visual identification criteria, and updating the training muck identification model at the same time can obtain a muck identification model with a wider application range. Therefore, the method proposed in the present invention is theoretically applicable to the identification of the soil type in front of the earth pressure balance shield excavation in all areas.

本发明的创新点Innovative point of the present invention

1、本发明提供一种基于土体工程性质和盾构施工经验的土压平衡盾构渣土分类方法。利用该分类方法得到的渣土类型划分结果既能通过深度学习算法在图像中识别得出，又能满足盾构施工时的实际工程需要，为后续渣土图像识别奠定分类基础。1. The present invention provides a method for classifying earth pressure balance shield slag based on soil engineering properties and shield construction experience. The classification results of the muck types obtained by this classification method can not only be identified in the image through the deep learning algorithm, but also meet the actual engineering needs during shield construction, laying a classification basis for the subsequent identification of muck images.

2、本发明提供一种出渣土目力鉴别方法。提出依据渣土四种重要的外观特征，即渣土形状、渣土断面形态、渣土表面平整度以及渣土颜色来制定不同渣土类别的鉴别准则，为后续制作数据集标签提供鉴别依据。2. The present invention provides a method for visual identification of dregs. According to the four important appearance characteristics of muck, namely muck shape, muck cross-section shape, muck surface flatness and muck color, the identification criteria of different muck types are formulated to provide identification basis for the subsequent production of dataset labels.

3、本发明提供一种识别土压平衡盾构开挖面土质类型的方法，即基于深度学习进行出渣土图像识别来获取前方土质类型信息。该方法利用监控视频数据即可进行模型训练以及应用时的实时识别，具有准确、快速、低成本的特点。3. The present invention provides a method for identifying the soil quality type of an earth pressure balance shield excavation surface, that is, based on deep learning, the image recognition of the slag is carried out to obtain the information of the soil quality type ahead. The method can use monitoring video data to perform model training and real-time recognition during application, and has the characteristics of accuracy, speed and low cost.

上述描述仅是对本申请较佳实施例的描述，并非是对本申请范围的任何限定。任何熟悉该领域的普通技术人员根据上述揭示的技术内容做出的任何变更或修饰均应当视为等同的有效实施例，均属于本申请技术方案保护的范围。The above description is only a description of the preferred embodiments of the present application, and is not intended to limit the scope of the present application. Any changes or modifications made by any person of ordinary skill in the field according to the technical contents disclosed above shall be regarded as equivalent effective embodiments, and all belong to the protection scope of the technical solutions of the present application.

附：术语解释：Attachment: Terminology Explanation:

地层：一切成层岩石的总称，可以是固结的岩石，也可以是没有固结的沉积物，即土，是一层或一组具有某种统一的特征和属性的并和上下层有着明显区别的岩(土)层。Stratigraphy: The general term for all layered rocks, which can be consolidated rocks or unconsolidated sediments, namely soil, which is a layer or a group of uniform features and properties that are distinct from the upper and lower layers. Distinctive rock (soil) layers.

土质：指土壤的构造和性质。也指土壤性质的好坏和结构。Soil quality: refers to the structure and properties of soil. It also refers to the quality and structure of the soil.

地质勘查：运用测绘、地球物理勘探、地球化学探矿、钻探、坑探、采样测试、地质遥感等地质勘查方法，对一定地区内的岩石、地层构造、矿产、地下水、地貌等地质情况进行的调查研究工作。Geological Exploration: The use of geological exploration methods such as surveying and mapping, geophysical exploration, geochemical prospecting, drilling, pit exploration, sampling testing, and geological remote sensing to investigate the geological conditions of rocks, stratigraphic structures, minerals, groundwater, and landforms in a certain area research work.

静力触探试验：静力触探试验是以静压力将圆锥形探头按一定速率匀速压入土中，量测其贯入阻力(包括锥头阻力和侧壁摩阻力或摩阻比)，并按其所受阻力的大小划分土层，确定土的工程性质。Static penetration test: The static penetration test is to press the conical probe into the soil at a constant speed with static pressure to measure the penetration resistance (including the resistance of the cone head and the frictional resistance of the side wall or the frictional resistance ratio), and The soil layers are divided according to the resistance they receive to determine the engineering properties of the soil.

统计特征值：指对统计调查的原始资料进行整理后得到的可以精确描述统计数据分布的、具有代表性的数量特征。Statistical eigenvalues: refers to the representative quantitative characteristics that can accurately describe the distribution of statistical data obtained after sorting out the original data of statistical surveys.

隧道：埋置于地层内的工程建筑物，是人类利用地下空间的一种形式。Tunnel: An engineering building buried in the ground, which is a form of human utilization of underground space.

地铁：在城市中修建的快速、大运量、用电力牵引的轨道交通，修建于隧道中。Subway: fast, large-capacity, electric-driven rail transit built in the city, built in tunnels.

盾构法：一种隧道施工方法，使用盾构机进行地层的开挖，隧道管片的拼装。Shield method: A tunnel construction method, which uses a shield machine to excavate the ground and assemble the tunnel segments.

盾构机：一种施工机械，由外壳、刀盘、顶推设备、拼装设备以及其它配套设备组成，外壳为柱体，起保护作用，其他设备在外壳内部。Shield machine: a kind of construction machinery, which consists of shell, cutter head, pushing equipment, assembly equipment and other supporting equipment. The shell is a cylinder, which plays a protective role, and other equipment is inside the shell.

土压平衡盾构：盾构机的一种，利用盾构推进时前端刀盘旋转切削下来的土体充满土舱，其被动土压与掘削面上的土压、水压基本平衡，使得掘削面与盾构面处于平衡状态。Earth pressure balance shield: a type of shield machine, the soil that is cut by the rotation of the front cutter head when the shield is propelled is filled with the soil chamber, and its passive earth pressure is basically balanced with the earth pressure and water pressure on the excavation surface, which makes the excavation The face and the shield face are in equilibrium.

土压平衡盾构渣土：土压平衡盾构推进时掘削下来的弃土。Earth pressure balance shield muck: spoil excavated when the earth pressure balance shield is pushed forward.

刀盘：盾构机用于切削地层的设备，处于盾构机前端，通过旋转挤压将土切下。Cutter: The equipment used by the shield machine to cut the stratum, it is located at the front end of the shield machine, and the soil is cut off by rotating and extruding.

掌子面：开挖坑道不断向前推进的工作面。Face: The face where the excavation tunnel is continuously advancing.

螺旋出土机：通过螺旋形构件旋转，将刀盘切削下的土体运送至皮带机上的机械，螺旋形构件的旋转速度决定了出土的速度。Spiral excavator: a machine that transports the soil cut by the cutter head to the belt conveyor through the rotation of the helical member. The rotation speed of the helical member determines the speed of excavation.

皮带输送机：将来自螺旋出土机的土体运送到渣土车上的皮带装置。Belt Conveyor: A belt device that transports the soil from the screw excavator to the muck truck.

盾构施工参数：盾构机施工过程中需要设定的各种参数，例如刀盘转速等等，参数设定是否合理决定了盾构法施工的安全和质量。Shield construction parameters: various parameters that need to be set during the construction of the shield machine, such as cutter head speed, etc. Whether the parameter setting is reasonable determines the safety and quality of shield construction.

人工智能：一门研究、开发用于模拟、延伸和扩展人的智能的理论、方法、技术及应用系统的新技术科学。Artificial Intelligence: A new technology science that studies and develops theories, methods, techniques and applied systems for simulating, extending and expanding human intelligence.

机器学习：一门多领域交叉学科，涉及概率论、统计学、逼近论、算法复杂度理论等多门学科，是人工智能的核心，是使计算机具有智能的根本途径。Machine learning: a multi-field interdisciplinary subject involving probability theory, statistics, approximation theory, algorithmic complexity theory and other subjects. It is the core of artificial intelligence and the fundamental way to make computers intelligent.

深度学习：是机器学习领域中一个新的研究方向，学习样本数据的内在规律和表示层次，获得的信息对诸如文字、图像和声音等数据的解释有很大的帮助。Deep learning: It is a new research direction in the field of machine learning. It learns the inherent laws and representation levels of sample data, and the obtained information is very helpful for the interpretation of data such as text, images and sounds.

目标检测：目标检测是深度学习的一个重要应用，就是要将图片中的物体识别出来，并标出物体的位置。Target detection: Target detection is an important application of deep learning, which is to identify objects in pictures and mark the location of objects.

样本：机器学习方法所使用的数据的基本单位，可以是一维矩阵，也可以是高维矩阵。Sample: The basic unit of data used by machine learning methods, which can be a one-dimensional matrix or a high-dimensional matrix.

标签：数据的真实信息。Label: The real information of the data.

LabelImg：目标检测任务给数据集打标签的工具。LabelImg: A tool for labeling datasets for object detection tasks.

数据集：训练集、验证集和测试集的合集称为数据集，包括数据和标签。Dataset: The collection of training, validation, and test sets is called a dataset, including data and labels.

训练集：是模型拟合的数据样本，用于调试神经网络。Training set: is the data sample fitted by the model and used to debug the neural network.

验证集：是模型训练过程中单独留出的样本集，它可以用于调整模型的超参数和用于对模型的能力进行初步评估，用来查看训练效果。Validation set: It is a sample set set aside separately during the model training process. It can be used to adjust the hyperparameters of the model and to perform a preliminary evaluation of the model's ability to check the training effect.

测试集：用来评估最终模型的泛化能力。但不能作为调参、选择特征等算法相关的选择的依据。它用来测试网络的实际学习能力。Test set: used to evaluate the generalization ability of the final model. However, it cannot be used as the basis for algorithm-related selections such as parameter tuning and feature selection. It is used to test the actual learning ability of the network.

卷积神经网络：是一类包含卷积计算且具有深度结构的前馈神经网络，是深度学习的代表算法之一。Convolutional Neural Network: It is a type of feedforward neural network that includes convolutional computation and has a deep structure, and is one of the representative algorithms of deep learning.

图像通道：在RGB色彩模式下指红色、绿色、蓝色三个通道。Image channel: In RGB color mode, it refers to three channels of red, green and blue.

图像增强：通过一定手段对原图像附加一些信息或变换数据，有选择地突出图像中感兴趣的特征或者抑制(掩盖)图像中某些不需要的特征，使图像与视觉响应特性相匹配。Image enhancement: Add some information or transform data to the original image by certain means, selectively highlight interesting features in the image or suppress (mask) some unwanted features in the image, so that the image matches the visual response characteristics.

网格掩码：生成一个和原图相同分辨率的网格，灰色区域值为1，黑色区域值为0，与原图相乘得到增强后的图像，实现了特定区域的信息删除，本质上可以理解为一种正则化方法。Grid mask: Generate a grid with the same resolution as the original image, the gray area value is 1, the black area value is 0, and the enhanced image is multiplied with the original image, which realizes the deletion of information in specific areas, essentially It can be understood as a regularization method.

色彩抖动：是指通过随机调整原始图片的饱和度，亮度，对比度来产生新的图像。Color dithering: refers to generating a new image by randomly adjusting the saturation, brightness, and contrast of the original image.

鲁棒性：指系统在受到扰动或者不确定的情况下，仍然可以维持某些性能的特性。Robustness: refers to the characteristic that the system can still maintain certain performance even when it is disturbed or uncertain.

卷积层：卷积层由若干卷积单元组成，每个卷积单元的参数都是通过反向传播算法最优化得到的。卷积运算的目的是提取输入的不同特征，第一层卷积层可能只能提取一些低级的特征如边缘、线条和角等层级，更多层的网络能从低级特征中迭代提取更复杂的特征。Convolutional layer: The convolutional layer consists of several convolutional units, and the parameters of each convolutional unit are optimized by the back-propagation algorithm. The purpose of the convolution operation is to extract different features of the input. The first convolution layer may only extract some low-level features such as edges, lines, and corners. More layers of the network can iteratively extract more complex features from the low-level features. feature.

批标准化层：批标准化(Batch Normalization，BN)是一种用于改善人工神经网络的性能和稳定性的技术。其能够规范化神经网络中的任何层的输入，固定每层输入信号的均值与方差。Batch Normalization Layer: Batch Normalization (BN) is a technique used to improve the performance and stability of artificial neural networks. It can normalize the input of any layer in the neural network, fixing the mean and variance of the input signal of each layer.

激活层：对输入数据进行激活操作，即非线性变换，将特征映射到高维的非线性区间进行解释,解决线性模型所不能解决的问题。Activation layer: Activate the input data, that is, nonlinear transformation, map the features to high-dimensional nonlinear intervals for interpretation, and solve problems that cannot be solved by linear models.

感受野：某一层特性图中某个位置的特征向量，是由前面某一层固定区域的输入计算出来的，那这个区域就是这个位置的感受野。常用的是某层特征图对应输入图像的感受野。Receptive field: The feature vector of a certain position in the characteristic map of a certain layer is calculated from the input of a fixed area in a previous layer, then this area is the receptive field of this position. Commonly used is the receptive field of the input image corresponding to the feature map of a certain layer.

特征图：图像经过卷积层特征提取后得到的图层。Feature map: The layer obtained after the image is extracted from the convolutional layer.

损失函数：用来评价模型的预测值和真实值不一样的程度。Loss function: used to evaluate the degree to which the predicted value of the model is different from the true value.

超参数：超参数是在开始学习过程之前设置值的参数，而不是通过训练得到的参数数据。通常情况下，需要对超参数进行优化，给模型选择一组最优超参数，以提高学习的性能和效果。Hyperparameters: Hyperparameters are parameters whose values are set before starting the learning process, not parameter data obtained through training. Usually, hyperparameters need to be optimized, and a set of optimal hyperparameters is selected for the model to improve the performance and effect of learning.

学习率：是每一次更新模型参数的步长。Learning rate: is the step size for each update of the model parameters.

批大小：每次训练输入网络的样本数目。Batch size: The number of samples input to the network for each training session.

Claims

1. A method for identifying soil property types of an excavation surface of a soil pressure balance shield is characterized by comprising the following steps:

step 1, establishing a muck classification system

1.1 collecting geological survey reports of the area where shield construction is located, sorting a static sounding layering parameter table in the geological survey reports, counting penetration resistance values of all drill holes passing through a stratum, and eliminating abnormal values, namely obvious and unreasonable data;

1.2 calculating the statistical characteristic values of the injection resistance indexes of each stratum, including a mean value, a standard deviation and a variation coefficient; firstly, carrying out large-class classification according to soil layer names of all layers; dividing in detail according to the average value of the penetration resistance which can reflect the engineering property of the soil body;

according to the method, strata in a certain area are combined and classified according to the engineering properties of the strata, and soil property type division suitable for adjusting the soil pressure balance shield parameters is obtained;

1.3 after the soil property types of a certain area are divided, determining the classification of the muck on the basis;

firstly, considering whether special soil or confined water and other conditions which have great influence on the construction of the earth pressure balance shield exist, if the stratum with special properties exists, and the stratum and other strata are combined into one when the stratum is classified according to the penetration resistance value, the stratum should be divided separately; in addition, considering the condition that the excavation surface passes through a plurality of layers of stratums when shield construction is carried out, the discharged muck can be determined as a special type of 'mixed soil'; therefore, the classification result of the muck classification considering the engineering property of the soil body and the construction experience can be obtained by adjusting on the basis of the classification of the soil property;

1.4 constructing a criterion for identifying the type of the muck with the eyesight based on the visual characteristics of the muck, so as to facilitate the manufacture of subsequent labels;

step 2, establishing a muck identification database

2.1, widely collecting muck monitoring videos of an area to which the method is applied;

2.2 preprocessing the data after collecting the video data;

intercepting effective videos at a stable unearthing stage, and extracting video frames at a frequency of one frame per 30 frames, wherein the specific frame extraction frequency can be adjusted according to the propelling speed of the shield tunneling machine; selecting images according to the principle that the shape of the muck and the image background are changed as much as possible, and removing images shot under the condition that a lens is stained and mosaic poor images and redundant images caused by video blockage;

2.3 manually marking the muck image by adopting LabelImg image marking software, framing out a muck body target in each image by using a rectangular frame, and inputting the muck type of each target frame;

the label information is in a PASCAL VOC format, and the real target frame position information and the corresponding muck type information of each picture are stored as XML files;

in the training set and the verification set, each image must be provided with a label, and the information recorded in the image is called a true value which is provided for the training process of 3.3.4;

2.4, the obtained muck image file and the corresponding label file are the muck data set, and the muck image file and the corresponding label file are calculated according to the following steps of 8: 1: 1, dividing the ratio into a training set, a verification set and a test set;

step 3, construction of muck recognition model

The muck identification model comprises: the system comprises a user-defined data set module, a muck detection network building module, a construction training process module, a model testing and evaluation index module and a muck identification result visualization module;

3.1 in custom data set Module

3.1.1 loading the muck image data and corresponding label information from the data set storage path, and respectively storing the data and the corresponding label information in a list;

3.1.2 then processing and converting the image data and the label data; converting the image data from a PIL format into a Tensor format with the shape of (C, H, W), wherein C represents the number of image channels, H represents the height of the picture, W represents the width of the picture, and dividing all pixel values by 255 to be normalized to be between 0 and 1;

3.1.3, performing certain random image enhancement operation on the image before the image is input into the network, and performing five kinds of data enhancement of horizontal turning, grid mask, random cutting, color dithering and blurring on the image with the probability of 0.5, so as to improve the diversity of database pictures and enhance the robustness of a model;

3.1.4, calculating the mean value and the standard deviation of three-channel pixel values of the picture in the whole data set, standardizing the picture so as to facilitate the learning of the model, and providing the standard deviation and the mean value for the three-channel pixel values of the picture in the whole data set to the step 3.3;

3.2 in the muck detection network building module, the network is divided into three parts: a backbone network, a neck network and a top network, which are provided for step 3.3;

the main function of the backbone network is to extract image features, the basic structure of the backbone network is the combination of a convolutional layer, a batch normalization layer and an activation layer, and the basic structure is stacked to complete the construction of the backbone network; the depth of the network can be increased by adding residual connection in the backbone network, and the capability of extracting the features of the network is effectively improved;

the main function of the neck network is feature enhancement and fusion;

the top network is a detector, and the main function of the detector is to perform final regression prediction on the position and the category information of the residue soil; the output characteristic diagram with higher resolution contains more detailed characteristics of the input image and is good for detecting small targets, and the output characteristic diagram with lower resolution contains coarser characteristics of the input image and is good for detecting large targets;

3.3 in building the training Process Module

3.3.1, firstly, importing a custom data set module and a muck detection network building module, instantiating a muck data set and a muck detection network, and building a data loader to input a custom picture and tag data into a network;

3.3.2 designing the loss function of the network, wherein the loss function of the target detection network used by the method is divided into three parts: the muck positioning loss, the target frame confidence loss and the muck classification loss are shown in formula (1),

Loss＝Localization loss+Confidence loss+Classification loss (1)

the confidence coefficient of the target frame represents whether the target frame contains the slag soil body or not and the size of the intersection ratio of the target frame and the real frame when the target frame contains the slag soil body;

3.3.3 need set up the hyper-parameter when the network trains, include: the initial value of the learning rate and the change mode of the initial value along with the increase of the training cycle number, the parameters of an optimizer and the optimizer, the batch size of input pictures, the training cycle number, the weight of each item of a loss function and the like; after the loss function and the over-parameters are set, the step 3.3.4 can be entered for training;

3.3.4 each batch of images is input into the network to obtain a prediction result, and the current loss value, namely the distance between the prediction value and the true value can be calculated by inputting the prediction value and the true value in the label in the step 2.3 into a loss function; calculating the derivative of the loss value to all the network parameters, and optimizing and updating the network parameters by using an optimizer, namely a round of training iteration; when all pictures in the training set are input into the network for training in turn, a training cycle is formed; after finishing a training cycle, inputting the pictures of the verification set into the network in turn to calculate the network precision, observing the network training condition according to the network precision, and taking the observed network training condition as a basis for adjusting the hyper-parameters in the next network training; when the loss is reduced and converged to a certain stable value, the training can be finished, and the trained network parameters are stored, so that an identification model capable of accurately positioning the muck target and judging the muck type is obtained;

3.4 loading the trained model and parameters in a model testing and evaluating index module, and inputting a muck picture to be tested to obtain regressed muck positioning information and classification information; compiling an evaluation index calculation code, carrying out quantitative evaluation on the detection result of the test set, and evaluating the effect of the detection model;

3.5 in a muck recognition result visualization module, drawing the positioning information and the category information of the muck body target frame obtained by network regression on an original image to visualize the recognition result; and drawing the target frames in the original image by using the position information x0 and y0 of the central points of the target frames and the length and width information h and w obtained by network regression, writing the regressed muck type information and the corresponding confidence coefficient in the upper left corner of each target frame in a character form, and simultaneously drawing and writing the target frames and the type information of different muck types by adopting different colors.

2. The method for identifying the soil property type of the excavation face of the earth pressure balance shield as claimed in claim 1, wherein the muck identification criterion determines the type of muck by using the apparent characteristics of four types of muck, which are respectively the shape of muck, the shape of the cross section of muck, the surface flatness of muck and the color of muck.

3. The method for identifying the soil property type of the excavation face of the earth pressure balance shield as claimed in claim 1, wherein the resolution of the muck monitoring video is 1920 x 1080 and is not lower than 1280 x 720;

the muck monitoring camera is arranged right above or obliquely above the belt conveyor and is aligned with the muck outlet of the spiral muck remover, so that the complete muck removing process and muck form are ensured to be shot; and the lighting equipment is arranged above the soil outlet, and the surface of the lens is kept clean, so that the shot image is bright and clear and is easy to identify.

4. The method for identifying the soil property of the excavation surface of the earth pressure balance shield as claimed in claim 1, wherein the muck identification model is built on a deep learning frame Pythrch and is implemented by adopting python language.

5. The method for identifying the soil property type of the excavation surface of the earth pressure balance shield as claimed in claim 1, wherein in the step 3.2, a space pyramid pooling and path aggregation network adopted by the residue soil detection network building module is a neck network module; the SPP module can effectively expand the receptive field and separate the most important context information, and the PANet can enhance the extracted image characteristics by fusing parameters of different levels of a backbone network; the neck network is greatly helpful for improving the performance of the whole target detection network;

the backbone network adopts Darknet53, the neck network adopts SPP and PANet, and the top network consists of three target detectors with output scales; firstly, inputting pictures into a backbone network, wherein the backbone network is formed by 53 layers of convolution combination, and a certain number of residual error connections are inserted; the residual error connection is used for solving the problem that when the deep neural network is trained, the network is degraded after a certain number of layers of the network is reached, namely, the expression capability of the model cannot be improved, and the effect of the model is poor; each convolution combination can be regarded as a function F, and an output predicted value y is obtained by inputting an observation value; the residual error is connected and divided into two lines, one line is F (x) in the convolution combination of the observation value x input representation function F, the other line directly transmits the observation value x, and the final predicted value is the addition of the output results of the two lines, namely F (x) + x; if the whole residual join is regarded as a function H and the input observation value is x, the predicted value y ═ H (x) ═ f (x) + x; f (x) h (x) -x is the residual, i.e. the difference between the predicted value y and the observed value x; therefore, the next layer connected by the residual errors not only contains the information of the previous layer after nonlinear change (convolution combination), but also contains the original information of the previous layer, so that the information can only increase gradually layer by processing, and the performance of the model cannot be reduced due to the increase of the network depth;

after layer-by-layer feature extraction of a backbone network, the feature map already contains high-level semantic information capable of identifying the muck, but many detailed features are not lost, so that the detection of small targets is not facilitated; therefore, the feature map obtained by backbone network training is input into the neck network for further feature fusion and enhancement;

the feature graph at the top end of the backbone network is firstly input into an SPP structure of a neck network, the SPP network performs three kinds of pooling operations with different scales on the feature graph and then splices the feature graph with channel dimensions, so that the problem of repeated extraction of features by a convolutional network can be solved, the speed of generating a detection candidate frame is greatly improved, and the calculation cost is saved; secondly, inputting the feature map at the topmost end of the backbone network and the feature maps at two different levels in the middle of the backbone network into a PANet network of a neck network for bidirectional fusion from top to bottom and from bottom to top, so that the feature maps have not only deep semantic information but also basic information such as shallow texture and color, the integrity and diversity of features are ensured, and the final prediction effect is improved;

three feature graphs of different levels in a backbone network, namely a feature graph 1, a feature graph 2 and a feature graph 3, are respectively input into a final top network detector after fusion enhancement of a neck network, and are respectively subjected to simple convolution combination and regression to obtain final prediction results, namely a prediction output 1, a prediction output 2 and a prediction output 3; the three feature maps with different sizes respectively contain features with different scales, and the targets with different sizes are correspondingly predicted; the feature map of the prediction output 1 is large, contains most detail information and is responsible for predicting small target objects; the feature map of the prediction output 3 is smaller, so that the overall information is easier to distinguish, and the prediction output 3 is responsible for predicting a large target object; the three results output by prediction are merged together to obtain the prediction result of the whole network on the picture.