CN114743201A

CN114743201A - A multimeter reading recognition method and system based on rotating target detection

Info

Publication number: CN114743201A
Application number: CN202210410911.6A
Authority: CN
Inventors: 彭键清; 周威
Original assignee: Sun Yat Sen University
Current assignee: Sun Yat Sen University
Priority date: 2022-04-19
Filing date: 2022-04-19
Publication date: 2022-07-12

Abstract

The invention discloses a multimeter reading recognition method and system based on rotating target detection. The method includes: processing an image to be tested based on an improved YOLOv5 model, and outputting a reading area rotation frame, a reading area rotation angle and a switch rotation angle; The rotation angle of the reading area and the rotation angle of the transfer switch are used to calculate the actual rotation angle of the transfer switch; the actual rotation angle of the transfer switch is matched with the unit matching information to obtain the reading unit result; the rotation frame of the reading area is subjected to affine transformation and identification to obtain the reading number Result; Combine the reading digital result with the reading unit result to get the complete reading result. The system includes: a detection module, an actual rotation angle calculation module, a unit matching module, a digital identification module and a reading integration module. By using the present invention, the complete reading result under any rotation angle of the multimeter with a changeover switch can be detected and identified.

Description

A multimeter reading recognition method and system based on rotating target detection

技术领域technical field

本发明涉及中国智能制造领域，尤其涉及一种基于旋转目标检测的万用表读数识别方法及系统。The invention relates to the field of intelligent manufacturing in China, in particular to a multimeter reading recognition method and system based on rotating target detection.

背景技术Background technique

工业仪器仪表读数识别自动化、智能化是大势所趋。基于计算机视觉的仪器仪表读数识别技术，可对采集到的仪器仪表数值信息自动识别，并快速录入到业务系统中，有效解决人工抄录过程中抄错、抄漏等问题，提升抄录效率，减少人工录入工作量，降低企业人力成本，实现仪器仪表数据录入的自动化。The automation and intelligence of industrial instrument reading recognition is the general trend. The instrument reading recognition technology based on computer vision can automatically identify the collected instrument and meter numerical information, and quickly enter it into the business system, effectively solve the problems of copying errors and omissions in the manual transcription process, improve transcription efficiency and reduce labor Entry workload, reduce enterprise labor costs, and automate instrument data entry.

现有的万用表读数识别方法主要是首先将检测读数区域并裁剪，进而对裁剪读数图像进行读数识别，但它们大多只关注于水平读数的检测和识别，且往往忽略了读数单位的识别，导致应用价值和应用范围受限。基于传统图像处理的方法要求环境背景单一，且受光照等环境变化干扰较大，因此普适性较差。相比之下，YOLOv5是不含RPN结构的One-stage深度学习目标检测算法，它能很好地检测出读数区域，且具有小型骨干结构如tiny-darknet，能通过一系列优化措施使模型轻量化，适合作为工程算法在移动端部署，具有较高的实际应用价值。但YOLOv5作为通用目标检测算法，不适用于具有大倾角的旋转目标检测，更无法确定目标朝向和角度。The existing multimeter reading recognition methods mainly firstly detect the reading area and crop it, and then perform reading recognition on the cropped reading image, but most of them only focus on the detection and recognition of horizontal readings, and often ignore the recognition of reading units, which leads to application The value and scope of application are limited. The method based on traditional image processing requires a single environmental background and is greatly disturbed by environmental changes such as illumination, so the universality is poor. In contrast, YOLOv5 is a one-stage deep learning target detection algorithm without RPN structure, which can detect the read region well, and has a small backbone structure such as tiny-darknet, which can make the model lighter through a series of optimization measures. Quantization is suitable for deployment on mobile terminals as engineering algorithms, and has high practical application value. However, as a general target detection algorithm, YOLOv5 is not suitable for rotating target detection with a large inclination angle, and it is impossible to determine the target orientation and angle.

发明内容SUMMARY OF THE INVENTION

为了解决上述技术问题，本发明的目的是提供一种基于旋转目标检测的万用表读数识别方法及系统，可以检测并识别带转换开关万用表任意旋转角度下的完整读数结果。In order to solve the above technical problems, the purpose of the present invention is to provide a multimeter reading identification method and system based on rotating target detection, which can detect and identify the complete reading results of a multimeter with a switch at any rotation angle.

本发明所采用的第一技术方案是：一种基于旋转目标检测的万用表读数识别方法，包括以下步骤：The first technical solution adopted by the present invention is: a multimeter reading identification method based on rotating target detection, comprising the following steps:

基于改进的YOLOv5模型对待测图像进行处理，输出读数区域旋转框、读数区域旋转角度和转换开关旋转角度；Based on the improved YOLOv5 model, the image to be tested is processed, and the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch are output;

根据读数区域旋转角度和转换开关旋转角度，计算转换开关实际旋转角度；Calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch;

将转换开关实际旋转角度与单位匹配信息进行匹配，得到读数单位结果；Match the actual rotation angle of the transfer switch with the unit matching information to get the reading unit result;

对读数区域旋转框进行仿射变换并识别，得到读数数字结果；Perform affine transformation on the rotation frame of the reading area and identify it to obtain the digital result of the reading;

整合读数数字结果与读数单位结果，得到完整读数结果。Integrate the reading numerical result with the reading unit result to obtain the complete reading result.

进一步，所述改进的YOLOv5模型的训练步骤具体包括：Further, the training steps of the improved YOLOv5 model specifically include:

获取训练图像并进行旋转框标注，将旋转框四个角的坐标编码为几何要素，得到标注后图像；Obtain the training image and label the rotating frame, encode the coordinates of the four corners of the rotating frame as geometric elements, and obtain the labeled image;

对标注后图像进行数据增强，得到训练集；Data enhancement is performed on the labeled images to obtain a training set;

将训练集输入至YOLOv5模型；Input the training set to the YOLOv5 model;

依次进行特征提取、特征融合和边框回归得到旋转框信息；Perform feature extraction, feature fusion, and frame regression in sequence to obtain rotation frame information;

结合真实值标签计算损失对模型参数进行更新，得到改进的YOLOv5模型。The model parameters are updated by combining the ground truth label calculation loss to obtain an improved YOLOv5 model.

进一步，所述旋转框信息包括旋转框的中心点坐标、宽高、角度、置信度、偏置信息、各类别概率和各朝向类别概率。Further, the rotation frame information includes the center point coordinates, width and height, angle, confidence, bias information, probability of each category and probability of each orientation category of the rotation frame.

进一步，所述改进的YOLOv5模型的检测头输出公式表示如下：Further, the detection head output formula of the improved YOLOv5 model is expressed as follows:

上式中，n_h为检测头数量，b_s为输入模型的待测图像数量大小，n_a为网格单元的预设框锚点数量，H_i为第i个检测头输出的高，W_i为第i个检测头输出的宽，(x,y)为预测框的中心坐标，(w,h)为预测框的宽和高，conf为预测目标的置信度，C为预测框属于各个类别的概率，n_c为预测的目标类别数量，r_j为旋转框回归参数，O_k为目标各朝向类别概率。In the above formula, n _h is the number of detection heads, _b _s is the number of images to be tested in the input model, na is the number of preset frame anchor points of the grid unit, H _i is the output height of the ith detection head, W _i is the width of the output of the i-th detection head, (x, y) is the center coordinate of the prediction frame, (w, h) is the width and height of the prediction frame, conf is the confidence level of the prediction target, and C is the prediction frame belonging to each The probability of the category, n _c is the number of predicted target categories, r _j is the rotation box regression parameter, and O _k is the probability of each orientation category of the target.

进一步，所述将转换开关实际旋转角度与单位匹配信息进行匹配，得到读数单位结果这一步骤，具体包括：Further, the step of matching the actual rotation angle of the transfer switch with the unit matching information to obtain the result of the reading unit specifically includes:

获取单位匹配信息，得到单位信息数据集和角度信息数据集；Obtain unit matching information, and obtain a unit information data set and an angle information data set;

将转换开关实际旋转角度与角度信息数据集进行匹配，得到角度索引值；Match the actual rotation angle of the transfer switch with the angle information data set to obtain the angle index value;

将角度索引值与单位信息数据集进行匹配，得到读数单位结果。Match the angle index value to the unit info dataset to get the reading unit result.

进一步，所述对读数区域旋转框进行仿射变换并识别，得到读数数字结果这一步骤，具体包括：Further, the step of performing affine transformation and identification on the rotating frame of the reading area, and obtaining the digital result of the reading, specifically includes:

将读数区域旋转框进行仿射变换，得到水平读数区域图；Perform affine transformation on the rotating frame of the reading area to obtain a horizontal reading area map;

所述仿射变换包括对读数区域旋转框进行平移、缩放、旋转、翻转和错切组合变换；The affine transformation includes the combined transformation of translation, scaling, rotation, flipping and staggering of the rotation frame of the reading area;

基于CRNN读数识别模型对水平读数区域图依次进行CNN特征提取、RNN序列建模和CTC转录，得到读数数字结果。Based on the CRNN read identification model, CNN feature extraction, RNN sequence modeling and CTC transcription were performed on the horizontal read region map in turn, and the read number results were obtained.

进一步，所述仿射变换的矩阵公式如下：Further, the matrix formula of the affine transformation is as follows:

上式中，(x,y)为读数区域旋转框点坐标，(u,v)为读数区域旋转框点经仿射变换后得到的旋转框点，a₀、a₁、a₂、b₀、b₁和b₂为M参数，M为仿射变换矩阵。In the above formula, (x, y) is the coordinates of the rotation frame point in the reading area, (u, v) is the rotation frame point obtained by the affine transformation of the rotation frame point in the reading area, a ₀ , a ₁ , a ₂ , b ₀ , b ₁ and b ₂ are M parameters, and M is an affine transformation matrix.

本发明所采用的第二技术方案是：一种基于旋转目标检测的万用表读数识别系统，包括：The second technical solution adopted by the present invention is: a multimeter reading identification system based on rotating target detection, comprising:

检测模块，基于改进的YOLOv5模型对待测图像进行处理，输出读数区域旋转框、读数区域旋转角度和转换开关旋转角度；The detection module, based on the improved YOLOv5 model, processes the image to be tested, and outputs the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch;

实际旋转角度计算模块，用于根据读数区域旋转角度和转换开关旋转角度，计算转换开关实际旋转角度；The actual rotation angle calculation module is used to calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch;

单位匹配模块，用于将转换开关实际旋转角度与单位匹配信息进行匹配，得到读数单位结果；The unit matching module is used to match the actual rotation angle of the transfer switch with the unit matching information to obtain the reading unit result;

数字识别模块，用于对读数区域旋转框进行仿射变换并识别，得到读数数字结果；The digital recognition module is used to perform affine transformation and recognition on the rotating frame of the reading area, and obtain the digital result of the reading;

读数整合模块，用于整合读数数字结果与读数单位结果，得到完整读数结果。The reading integration module is used to integrate the reading digital result and the reading unit result to obtain the complete reading result.

本发明方法及系统的有益效果是：通过改进YOLOv5算法解决了任意旋转角度下的万用表读数识别难点，并针对一系列带转换开关的万用表提出了读数单位的识别方法，使得读数信息更加完整，且该方法适用于一类带转换开关的数字万用表，扩大了应用范围和应用价值。The beneficial effects of the method and system of the present invention are: by improving the YOLOv5 algorithm, the difficulty in identifying the reading of a multimeter under any rotation angle is solved, and a method for identifying the reading unit is proposed for a series of multimeters with a switch, so that the reading information is more complete, and The method is suitable for a class of digital multimeters with transfer switches, which expands the application scope and application value.

附图说明Description of drawings

图1是本发明一种基于旋转目标检测的万用表读数识别方法的步骤流程图；Fig. 1 is the step flow chart of a kind of multimeter reading identification method based on rotating target detection of the present invention;

图2是本发明一种基于旋转目标检测的万用表读数识别系统的结构框图；Fig. 2 is a kind of structural block diagram of the multimeter reading identification system based on rotating target detection of the present invention;

图3是本发明具体实施例改进YOLOv5旋转区域检测模型示意图；3 is a schematic diagram of a specific embodiment of the present invention to improve the YOLOv5 rotation area detection model;

图4是本发明具体实施例改进YOLOv5输出旋转框示意图；4 is a schematic diagram of a specific embodiment of the present invention to improve the output rotation frame of YOLOv5;

图5是本发明具体实施例转换开关示意图；5 is a schematic diagram of a transfer switch according to a specific embodiment of the present invention;

图6是本发明具体实施例读数区域矫正示意图；6 is a schematic diagram of a reading area correction according to a specific embodiment of the present invention;

图7是本发明具体实施例CRNN读数识别模型的结构框图。FIG. 7 is a structural block diagram of a CRNN reading recognition model according to a specific embodiment of the present invention.

具体实施方式Detailed ways

下面结合附图和具体实施例对本发明做进一步的详细说明。对于以下实施例中的步骤编号，其仅为了便于阐述说明而设置，对步骤之间的顺序不做任何限定，实施例中的各步骤的执行顺序均可根据本领域技术人员的理解来进行适应性调整。The present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments. The numbers of the steps in the following embodiments are only set for the convenience of description, and the sequence between the steps is not limited in any way, and the execution sequence of each step in the embodiments can be adapted according to the understanding of those skilled in the art Sexual adjustment.

参照图1，本发明提供了一种基于旋转目标检测的万用表读数识别方法，该方法包括以下步骤：1, the present invention provides a multimeter reading recognition method based on rotating target detection, the method includes the following steps:

S1、参照图3，对YOLOv5模型进行训练，得到改进的YOLOv5模型；S1. Referring to Figure 3, train the YOLOv5 model to obtain an improved YOLOv5 model;

S1.1、获取训练图像并进行旋转框标注，将旋转框四个角的坐标编码为几何要素，得到标注后图像；S1.1. Obtain the training image and label the rotating frame, encode the coordinates of the four corners of the rotating frame as geometric elements, and obtain the labeled image;

具体地，参照图4的方式将旋转框的四个角的坐标编码为几何要素，具体操作如下：Specifically, the coordinates of the four corners of the rotating frame are encoded as geometric elements with reference to the mode of FIG. 4 , and the specific operations are as follows:

||R_j||＝w×Sigmoid(r_j)，j∈{0,1}；||R _j ||=w×Sigmoid(r _j ), j∈{0,1};

上式中，R_j为预测旋转框j点的位置与预测水平框j点的位置之间的距离；w为网格单元预测的水平框的宽度，Sigmoid(r_j)为边框回归参数的归一化值，r_j为边框偏移回归参数。In the above formula, R _j is the distance between the position of the predicted rotation frame j and the predicted horizontal frame j; w is the width of the horizontal frame predicted by the grid unit, and Sigmoid(r _j ) is the regression parameter of the frame. The normalized value, r _j is the border offset regression parameter.

S1.2、对标注后图像进行数据增强，得到训练集；S1.2, perform data enhancement on the labeled images to obtain a training set;

S1.3、将训练集输入至YOLOv5模型；S1.3. Input the training set to the YOLOv5 model;

S1.4、依次进行特征提取、特征融合和边框回归得到旋转框信息；S1.4, perform feature extraction, feature fusion and frame regression in sequence to obtain the rotation frame information;

具体地，旋转框信息包括旋转框的中心点坐标、宽高、角度、置信度、偏置信息、各类别概率和各朝向类别概率。Specifically, the rotation frame information includes the center point coordinates, width and height, angle, confidence, bias information, probability of each category, and probability of each orientation category of the rotation frame.

其中，旋转框角度的计算操作如下：Among them, the calculation operation of the rotation frame angle is as follows:

当r_j≠0，即存在旋转框时，从竖直向上方向开始，以顺时针方向计算旋转角度

When r _j ≠ 0, that is, when there is a rotation frame, start from the vertical upward direction, and calculate the rotation angle in a clockwise direction

上式中，

为标注后图像的四个顶点的坐标，

为旋转框角度。In the above formula,

are the coordinates of the four vertices of the labeled image,

is the rotation frame angle.

当r_j＝0，即仅存在水平框，则可以通过预测朝向

计算出目标的旋转角度

When r _j = 0, that is, there is only a horizontal frame, then the direction can be predicted by

Calculate the rotation angle of the target

其中，

为各朝向类别概率中最大概率的朝向，其具体计算操作如下：in,

is the orientation with the largest probability among the probabilities of each orientation category, and its specific calculation operations are as follows:

上式中，P(O_k)为各朝向类别概率，其具体计算操作如下：In the above formula, P(O _k ) is the probability of each orientation category, and its specific calculation operation is as follows:

P(O_k)＝Sigmoid(O_k),k∈{0,1,2,3}。P(O _k )=Sigmoid(O _k ),k∈{0,1,2,3}.

S1.5、结合真实值标签计算损失对模型参数进行更新，得到改进的YOLOv5模型。S1.5, update the model parameters by calculating the loss of the true value label, and obtain an improved YOLOv5 model.

具体地，该模型的损失函数计算操作如下：Specifically, the loss function calculation operation of the model is as follows:

L＝λ_boxL_box+λ_objL_obj+λ_clsL_cls+λ_rboxL_rbox+λ_rotL_rot；L=λ _box L _box +λ _obj L _obj +λ _cls L _cls +λ _rbox L _rbox +λ _rot L _rot ;

上式中，λ_box、λ_obj、λ_cls、λ_rbox和λ_rot分别为L_box、L_obj、L_cls、L_rbox和L_rot的权重系数；L_box为水平框bbox的损失函数，使用CIOU Loss损失函数；L_obj为目标置信度损失函数，使用Focal Loss损失函数；L_cls为预测类别损失函数，使用Focal Loss损失函数；L_rbox为旋转框偏移回归损失函数，使用Smooth_L1损失函数；L_rot为各朝向分类损失函数，使用Focal Loss损失函数。In the above formula, λ _box , λ _obj , λ _cls , λ _rbox and λ _rot are the weight coefficients of L _box , L _obj , L _cls , L _rbox and L _rot respectively; L _box is the loss function of the horizontal box bbox, using CIOU Loss loss function; L _obj is the target confidence loss function, using the Focal Loss loss function; L _cls is the prediction category loss function, using the Focal Loss loss function; L _rbox is the rotation frame offset regression loss function, using the Smooth _L1 loss function; L _rot is the classification loss function for each orientation, and the Focal Loss loss function is used.

其中，旋转框偏移回归损失函数L_rbox表示如下：Among them, the rotation box offset regression loss function L _rbox is expressed as follows:

上式中，r_j′为r_j的归一化值，

为r_j的真实值，r_j为旋转框偏移回归参数，Smooth_L1为计算r_j′和

之间的回归误差损失的损失函数，其具体操作如下：In the above formula, r _j ′ is the normalized value of r _j ,

is the real value of r _j , r _j is the rotation frame offset regression parameter, Smooth _L1 is the calculation of r _j ′ and

The loss function between the regression error loss and the specific operation is as follows:

各朝向分类损失函数L_rot表示如下：The classification loss function L _rot of each orientation is expressed as follows:

上式中，α_k为用于平衡正负样本的权重系数，γ为用于平衡简单困难样本。In the above formula, α _k is the weight coefficient used to balance positive and negative samples, and γ is used to balance simple and difficult samples.

S2、基于改进的YOLOv5模型对待测图像进行处理，输出读数区域旋转框、读数区域旋转角度和转换开关旋转角度；S2. Process the image to be tested based on the improved YOLOv5 model, and output the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch;

具体地，改进的YOLOv5模型由输入端、Backbone特征提取主干网络、Neck特征增强网络和Head检测头构成。Specifically, the improved YOLOv5 model consists of an input end, a Backbone feature extraction backbone network, a Neck feature enhancement network, and a Head detection head.

其中，改进的YOLOv5模型检测头输出具体计算操作如下：Among them, the specific calculation operation of the output of the improved YOLOv5 model detection head is as follows:

S3、根据读数区域旋转角度和转换开关旋转角度，计算转换开关实际旋转角度；S3. Calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch;

具体地，转换开关实际旋转角度的具体计算操作为：Specifically, the specific calculation operation of the actual rotation angle of the transfer switch is:

上式中，θ_d为读数区域旋转角度，θ_g为转换开关旋转角度，θ_r为转换开关实际旋转角度。In the above formula, θ _d is the rotation angle of the reading area, θ _g is the rotation angle of the transfer switch, and θ _r is the actual rotation angle of the transfer switch.

其中，θ_d和θ_g就是

where _θd and _θg are

S4、参考图5，将转换开关实际旋转角度与单位匹配信息进行匹配，得到读数单位结果；S4. Referring to Figure 5, match the actual rotation angle of the transfer switch with the unit matching information to obtain the unit result of the reading;

S4.1、获取单位匹配信息，得到单位信息数据集和角度信息数据集；S4.1. Obtain unit matching information, and obtain a unit information data set and an angle information data set;

具体地，将单位信息存储于数组U,U＝{U_ll∈{1,2,...,n_u}}；Specifically, the unit information is stored in the array U, U={U _l l∈{1,2,...,n _u }};

上式中，U_l表示第l个单位，n_u表示转换开关单位种类数；In the above formula, U _l represents the lth unit, and n _u represents the number of types of transfer switch units;

各U_l对应的角度信息储存于数组θ，θ＝{θ_ll∈{1,2,...,n_u}}：The angle information corresponding to each U _l is stored in the array θ, θ={θ _l l∈{1,2,...,n _u }}:

上式中，θ_l表示转换开关中单位U_l对应的旋转角度。In the above formula, θ _l represents the rotation angle corresponding to the unit U _l in the transfer switch.

S4.2、将转换开关实际旋转角度与角度信息数据集进行匹配，得到角度索引值；S4.2. Match the actual rotation angle of the transfer switch with the angle information data set to obtain the angle index value;

具体地，转换开关实际旋转角度与角度信息数据集的匹配操作公式如下：Specifically, the matching operation formula between the actual rotation angle of the switch and the angle information data set is as follows:

m＝argmin_l|θ_r-θ_l|；m = argmin _l |θ _r -θ _l |;

上式中，m为θ中与θ_r最接近的角度的索引值。In the above formula, m is the index value of the angle closest to θ _r in θ.

S4.3、将角度索引值与单位信息数据集进行匹配，得到读数单位结果。S4.3. Match the angle index value with the unit information data set to obtain the reading unit result.

具体地，角度索引值与单位信息数据集的匹配操作公式如下：Specifically, the matching operation formula between the angle index value and the unit information data set is as follows:

S5、参照图6，对读数区域旋转框进行仿射变换并识别，得到读数数字结果；S5, referring to Fig. 6, carry out affine transformation and identification to the rotating frame of the reading area, and obtain the digital result of reading;

S5.1、将读数区域旋转框进行仿射变换，得到水平读数区域图；S5.1. Perform affine transformation on the rotating frame of the reading area to obtain a horizontal reading area map;

具体地，将原旋转框点坐标(x,y)通过仿射变换到坐标(u,v)，其具体操作如下：Specifically, the original rotation frame point coordinates (x, y) are transformed into coordinates (u, v) by affine transformation, and the specific operations are as follows:

其矩阵表示形式如下：Its matrix representation is as follows:

上式中，M为仿射变换矩阵，可以对读数区域旋转框进行平移、缩放、旋转、翻转和错切组合变换，a₀、a₁、a₂、b₀、b₁和b₂为M参数。In the above formula, M is the affine transformation matrix, which can perform translation, scaling, rotation, flipping and staggered transformation on the rotation frame of the reading area, a ₀ , a ₁ , a ₂ , b ₀ , b ₁ and b ₂ are M parameter.

S5.2、参照图7，基于CRNN读数识别模型对水平读数区域图依次进行CNN特征提取、RNN序列建模和CTC转录，得到读数数字结果；S5.2. Referring to Figure 7, based on the CRNN read identification model, CNN feature extraction, RNN sequence modeling and CTC transcription are sequentially performed on the horizontal read area map to obtain the read number result;

S5.2.1、将水平读数区域图输入至CRNN读数识别模型；S5.2.1. Input the horizontal reading area map into the CRNN reading recognition model;

具体地，CRNN读数识别模型由CNN特征提取模块、RNN序列建模模块和CTC转录模块构成，是一种稳定的端到端文本识别框架。Specifically, the CRNN read recognition model consists of a CNN feature extraction module, an RNN sequence modeling module, and a CTC transcription module, and is a stable end-to-end text recognition framework.

S5.2.2、基于CNN特征提取模块对水平读数区域图进行特征提取，得到卷积特征图；S5.2.2. Perform feature extraction on the horizontal reading area map based on the CNN feature extraction module to obtain a convolution feature map;

具体地，将水平读数区域图像输入MobileNetv3轻量级卷积神经网络网络提取整体特征信息，并输出卷积特征图。Specifically, the horizontal reading area image is input into the MobileNetv3 lightweight convolutional neural network network to extract the overall feature information, and the convolutional feature map is output.

S5.2.3、基于RNN序列建模模块对卷积特征图进行序列建模，得到后验概率矩阵；S5.2.3. Perform sequence modeling on the convolution feature map based on the RNN sequence modeling module to obtain a posterior probability matrix;

具体地，为了增强RNN网络的前后序列特征提取能力，本方法使用深度双向LSTM网络提取万用表读数前后序列特征；输入Bi-LSTM的特征图尺寸为(1，T,D)，其高度为1，最大时间长度为T，每个输入向量维度为D，Bi-LSTM输出y是一个后验概率矩阵。Specifically, in order to enhance the feature extraction capability of the RNN network before and after the sequence, this method uses the deep bidirectional LSTM network to extract the sequence features before and after the multimeter reading; the feature map size of the input Bi-LSTM is (1, T, D), and its height is 1, The maximum time length is T, each input vector has dimension D, and the Bi-LSTM output y is a posterior probability matrix.

S5.2.4、基于CTC转录模块对后验概率矩阵进行转录，得到读数数字结果。S5.2.4, transcribe the posterior probability matrix based on the CTC transcription module, and obtain the digital result of reading.

具体地，CTC转录模块把后验概率矩阵y的每一个时间片接入softmax以获取预测序列，并去除序列中的重复标签，再去除序列中的无效空格标签，从而将含有空格的预测序列翻译为文本信息；假设双向LSTM网络输出的预测序列为“--11-2-22-3--”，经CTC转录后变为最终预测结果“1223”。Specifically, the CTC transcription module inserts each time slice of the posterior probability matrix y into softmax to obtain the predicted sequence, removes the repeated tags in the sequence, and then removes the invalid space tags in the sequence, so as to translate the predicted sequence containing spaces. It is text information; it is assumed that the prediction sequence output by the bidirectional LSTM network is "--11-2-22-3--", which becomes the final prediction result "1223" after being transcribed by CTC.

S6、整合读数数字结果与读数单位结果，得到完整读数结果。S6. Integrate the reading digital result and the reading unit result to obtain a complete reading result.

参照图2，一种基于旋转目标检测的万用表读数识别系统，包括：2, a multimeter reading recognition system based on rotating target detection, including:

上述方法实施例中的内容均适用于本系统实施例中，本系统实施例所具体实现的功能与上述方法实施例相同，并且达到的有益效果与上述方法实施例所达到的有益效果也相同。The contents in the above method embodiments are all applicable to the present system embodiments, the specific functions implemented by the present system embodiments are the same as the above method embodiments, and the beneficial effects achieved are also the same as those achieved by the above method embodiments.

本发明的有益效果具体包括：The beneficial effects of the present invention specifically include:

1)能够检测并识别万用表任意旋转角度下的读数单位结果：基于YOLOv5进行扩展，对其检测头增加了目标旋转框偏移回归参数和目标各朝向概率参数的输出，仅用一个模型便可完成目标旋转框和角度的检测；基于万用表读数区域和转换开关位姿关系特性，结合读数区域角度和转换开关角度便可计算转换开关实际角度获取读数单位。1) Able to detect and identify the reading unit result at any rotation angle of the multimeter: based on YOLOv5 expansion, the detection head adds the output of the target rotation frame offset regression parameter and the target orientation probability parameter, which can be completed with only one model. Detection of target rotation frame and angle; based on the multimeter reading area and the relationship between the position and attitude of the transfer switch, combined with the angle of the reading area and the angle of the transfer switch, the actual angle of the transfer switch can be calculated to obtain the reading unit.

2)提高了万用表读数数字结果识别精度且简化了读数识别步骤：将检测的读数区域旋转框进行仿射变换成为水平框以便进行读数识别。常用的读数识别方法由读数字符分割和识别两步组成，而CRNN算法无需进行读数字符分割，可以端到端的完成读数识别。2) The recognition accuracy of the multimeter reading digital result is improved and the reading recognition steps are simplified: the rotating frame of the detected reading area is affine transformed into a horizontal frame for reading recognition. The commonly used reading recognition method consists of two steps of reading character segmentation and recognition, while the CRNN algorithm does not need to perform reading character segmentation, and can complete the reading recognition end-to-end.

以上是对本发明的较佳实施进行了具体说明，但本发明创造并不限于所述实施例，熟悉本领域的技术人员在不违背本发明精神的前提下还可做作出种种的等同变形或替换，这些等同的变形或替换均包含在本申请权利要求所限定的范围内。The above is a specific description of the preferred implementation of the present invention, but the present invention is not limited to the described embodiments, and those skilled in the art can make various equivalent deformations or replacements without departing from the spirit of the present invention. , these equivalent modifications or substitutions are all included within the scope defined by the claims of the present application.

Claims

1. a multimeter reading identification method based on rotating target detection, is characterized in that, comprises the following steps:

Based on the improved YOLOv5 model, the image to be tested is processed, and the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch are output;

Calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch;

Match the actual rotation angle of the transfer switch with the unit matching information to get the reading unit result;

Perform affine transformation on the rotation frame of the reading area and identify it to obtain the digital result of the reading;

Integrate the reading numerical result with the reading unit result to obtain the complete reading result.

2. a kind of multimeter reading identification method based on rotating target detection according to claim 1, is characterized in that, the training step of described improved YOLOv5 model specifically comprises:

Obtain the training image and label the rotating frame, encode the coordinates of the four corners of the rotating frame as geometric elements, and obtain the labeled image;

Data enhancement is performed on the labeled images to obtain a training set;

Input the training set to the YOLOv5 model;

Perform feature extraction, feature fusion, and frame regression in sequence to obtain rotation frame information;

The model parameters are updated by combining the ground truth label calculation loss to obtain an improved YOLOv5 model.

3. a kind of multimeter reading recognition method based on rotating target detection according to claim 2, is characterized in that, described rotating frame information comprises the center point coordinates, width and height, angle, confidence, offset information, each Class probabilities and class probabilities for each orientation.

4. a kind of multimeter reading identification method based on rotating target detection according to claim 1 is characterized in that, the detection head output formula of described improved YOLOv5 model is expressed as follows:

In the above formula, n _h is the number of detection heads, _b _s is the number of images to be tested in the input model, na is the number of preset frame anchor points of the grid unit, H _i is the output height of the ith detection head, W _i is the width of the output of the i-th detection head, (x, y) is the center coordinate of the prediction frame, (w, h) is the width and height of the prediction frame, conf is the confidence level of the prediction target, and C is the prediction frame belonging to each The probability of the category, n _c is the number of predicted target categories, r _j is the rotation box regression parameter, and O _k is the probability of each orientation category of the target.

5. a kind of multimeter reading identification method based on rotating target detection according to claim 1, is characterized in that, described with actual rotation angle of transfer switch and unit matching information are matched, obtain this step of reading unit result, specifically comprises:

Obtain unit matching information, and obtain a unit information data set and an angle information data set;

Match the actual rotation angle of the transfer switch with the angle information data set to obtain the angle index value;

Match the angle index value to the unit info dataset to get the reading unit result.

6. a kind of multimeter reading identification method based on rotating target detection according to claim 1, is characterized in that, described to reading area rotating frame is carried out affine transformation and identification, obtains this step of reading digital result, specifically comprises:

Perform affine transformation on the rotation frame of the reading area to obtain a horizontal reading area map;

The affine transformation includes the combined transformation of translation, scaling, rotation, flipping and staggering of the rotation frame of the reading area;

Based on the CRNN read identification model, CNN feature extraction, RNN sequence modeling and CTC transcription were performed on the horizontal read region map in turn, and the read number results were obtained.

7. a kind of multimeter reading identification method based on rotating target detection according to claim 6 is characterized in that, the matrix formula of described affine transformation is as follows:

In the above formula, (x, y) is the coordinates of the rotation frame point in the reading area, (u, v) is the rotation frame point obtained by the affine transformation of the rotation frame point in the reading area, a ₀ , a ₁ , a ₂ , b ₀ , b ₁ and b ₂ are M parameters, and M is an affine transformation matrix.

8. A multimeter reading recognition system based on rotating target detection is characterized in that, comprising:

The detection module, based on the improved YOLOv5 model, processes the image to be tested, and outputs the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch;

The actual rotation angle calculation module is used to calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch;

The unit matching module is used to match the actual rotation angle of the transfer switch with the unit matching information to obtain the reading unit result;

The digital recognition module is used to perform affine transformation and recognition on the rotation frame of the reading area, and obtain the digital result of the reading;

The reading integration module is used to integrate the reading digital result and the reading unit result to obtain the complete reading result.