[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN114743201A - A multimeter reading recognition method and system based on rotating target detection - Google Patents

A multimeter reading recognition method and system based on rotating target detection Download PDF

Info

Publication number
CN114743201A
CN114743201A CN202210410911.6A CN202210410911A CN114743201A CN 114743201 A CN114743201 A CN 114743201A CN 202210410911 A CN202210410911 A CN 202210410911A CN 114743201 A CN114743201 A CN 114743201A
Authority
CN
China
Prior art keywords
reading
rotation angle
frame
result
rotation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210410911.6A
Other languages
Chinese (zh)
Inventor
彭键清
周威
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sun Yat Sen University
Original Assignee
Sun Yat Sen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sun Yat Sen University filed Critical Sun Yat Sen University
Priority to CN202210410911.6A priority Critical patent/CN114743201A/en
Publication of CN114743201A publication Critical patent/CN114743201A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformations in the plane of the image
    • G06T3/02Affine transformations

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)

Abstract

本发明公开了一种基于旋转目标检测的万用表读数识别方法及系统,该方法包括:基于改进的YOLOv5模型对待测图像进行处理,输出读数区域旋转框、读数区域旋转角度和转换开关旋转角度;根据读数区域旋转角度和转换开关旋转角度,计算转换开关实际旋转角度;将转换开关实际旋转角度与单位匹配信息进行匹配,得到读数单位结果;对读数区域旋转框进行仿射变换并识别,得到读数数字结果;整合读数数字结果与读数单位结果,得到完整读数结果。该系统包括:检测模块、实际旋转角度计算模块、单位匹配模块、数字识别模块和读数整合模块。通过使用本发明,可以检测并识别带转换开关万用表任意旋转角度下的完整读数结果。

Figure 202210410911

The invention discloses a multimeter reading recognition method and system based on rotating target detection. The method includes: processing an image to be tested based on an improved YOLOv5 model, and outputting a reading area rotation frame, a reading area rotation angle and a switch rotation angle; The rotation angle of the reading area and the rotation angle of the transfer switch are used to calculate the actual rotation angle of the transfer switch; the actual rotation angle of the transfer switch is matched with the unit matching information to obtain the reading unit result; the rotation frame of the reading area is subjected to affine transformation and identification to obtain the reading number Result; Combine the reading digital result with the reading unit result to get the complete reading result. The system includes: a detection module, an actual rotation angle calculation module, a unit matching module, a digital identification module and a reading integration module. By using the present invention, the complete reading result under any rotation angle of the multimeter with a changeover switch can be detected and identified.

Figure 202210410911

Description

一种基于旋转目标检测的万用表读数识别方法及系统A multimeter reading recognition method and system based on rotating target detection

技术领域technical field

本发明涉及中国智能制造领域,尤其涉及一种基于旋转目标检测的万用表读数识别方法及系统。The invention relates to the field of intelligent manufacturing in China, in particular to a multimeter reading recognition method and system based on rotating target detection.

背景技术Background technique

工业仪器仪表读数识别自动化、智能化是大势所趋。基于计算机视觉的仪器仪表读数识别技术,可对采集到的仪器仪表数值信息自动识别,并快速录入到业务系统中,有效解决人工抄录过程中抄错、抄漏等问题,提升抄录效率,减少人工录入工作量,降低企业人力成本,实现仪器仪表数据录入的自动化。The automation and intelligence of industrial instrument reading recognition is the general trend. The instrument reading recognition technology based on computer vision can automatically identify the collected instrument and meter numerical information, and quickly enter it into the business system, effectively solve the problems of copying errors and omissions in the manual transcription process, improve transcription efficiency and reduce labor Entry workload, reduce enterprise labor costs, and automate instrument data entry.

现有的万用表读数识别方法主要是首先将检测读数区域并裁剪,进而对裁剪读数图像进行读数识别,但它们大多只关注于水平读数的检测和识别,且往往忽略了读数单位的识别,导致应用价值和应用范围受限。基于传统图像处理的方法要求环境背景单一,且受光照等环境变化干扰较大,因此普适性较差。相比之下,YOLOv5是不含RPN结构的One-stage深度学习目标检测算法,它能很好地检测出读数区域,且具有小型骨干结构如tiny-darknet,能通过一系列优化措施使模型轻量化,适合作为工程算法在移动端部署,具有较高的实际应用价值。但YOLOv5作为通用目标检测算法,不适用于具有大倾角的旋转目标检测,更无法确定目标朝向和角度。The existing multimeter reading recognition methods mainly firstly detect the reading area and crop it, and then perform reading recognition on the cropped reading image, but most of them only focus on the detection and recognition of horizontal readings, and often ignore the recognition of reading units, which leads to application The value and scope of application are limited. The method based on traditional image processing requires a single environmental background and is greatly disturbed by environmental changes such as illumination, so the universality is poor. In contrast, YOLOv5 is a one-stage deep learning target detection algorithm without RPN structure, which can detect the read region well, and has a small backbone structure such as tiny-darknet, which can make the model lighter through a series of optimization measures. Quantization is suitable for deployment on mobile terminals as engineering algorithms, and has high practical application value. However, as a general target detection algorithm, YOLOv5 is not suitable for rotating target detection with a large inclination angle, and it is impossible to determine the target orientation and angle.

发明内容SUMMARY OF THE INVENTION

为了解决上述技术问题,本发明的目的是提供一种基于旋转目标检测的万用表读数识别方法及系统,可以检测并识别带转换开关万用表任意旋转角度下的完整读数结果。In order to solve the above technical problems, the purpose of the present invention is to provide a multimeter reading identification method and system based on rotating target detection, which can detect and identify the complete reading results of a multimeter with a switch at any rotation angle.

本发明所采用的第一技术方案是:一种基于旋转目标检测的万用表读数识别方法,包括以下步骤:The first technical solution adopted by the present invention is: a multimeter reading identification method based on rotating target detection, comprising the following steps:

基于改进的YOLOv5模型对待测图像进行处理,输出读数区域旋转框、读数区域旋转角度和转换开关旋转角度;Based on the improved YOLOv5 model, the image to be tested is processed, and the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch are output;

根据读数区域旋转角度和转换开关旋转角度,计算转换开关实际旋转角度;Calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch;

将转换开关实际旋转角度与单位匹配信息进行匹配,得到读数单位结果;Match the actual rotation angle of the transfer switch with the unit matching information to get the reading unit result;

对读数区域旋转框进行仿射变换并识别,得到读数数字结果;Perform affine transformation on the rotation frame of the reading area and identify it to obtain the digital result of the reading;

整合读数数字结果与读数单位结果,得到完整读数结果。Integrate the reading numerical result with the reading unit result to obtain the complete reading result.

进一步,所述改进的YOLOv5模型的训练步骤具体包括:Further, the training steps of the improved YOLOv5 model specifically include:

获取训练图像并进行旋转框标注,将旋转框四个角的坐标编码为几何要素,得到标注后图像;Obtain the training image and label the rotating frame, encode the coordinates of the four corners of the rotating frame as geometric elements, and obtain the labeled image;

对标注后图像进行数据增强,得到训练集;Data enhancement is performed on the labeled images to obtain a training set;

将训练集输入至YOLOv5模型;Input the training set to the YOLOv5 model;

依次进行特征提取、特征融合和边框回归得到旋转框信息;Perform feature extraction, feature fusion, and frame regression in sequence to obtain rotation frame information;

结合真实值标签计算损失对模型参数进行更新,得到改进的YOLOv5模型。The model parameters are updated by combining the ground truth label calculation loss to obtain an improved YOLOv5 model.

进一步,所述旋转框信息包括旋转框的中心点坐标、宽高、角度、置信度、偏置信息、各类别概率和各朝向类别概率。Further, the rotation frame information includes the center point coordinates, width and height, angle, confidence, bias information, probability of each category and probability of each orientation category of the rotation frame.

进一步,所述改进的YOLOv5模型的检测头输出公式表示如下:Further, the detection head output formula of the improved YOLOv5 model is expressed as follows:

Figure BDA0003603642260000021
Figure BDA0003603642260000021

上式中,nh为检测头数量,bs为输入模型的待测图像数量大小,na为网格单元的预设框锚点数量,Hi为第i个检测头输出的高,Wi为第i个检测头输出的宽,(x,y)为预测框的中心坐标,(w,h)为预测框的宽和高,conf为预测目标的置信度,C为预测框属于各个类别的概率,nc为预测的目标类别数量,rj为旋转框回归参数,Ok为目标各朝向类别概率。In the above formula, n h is the number of detection heads, b s is the number of images to be tested in the input model, na is the number of preset frame anchor points of the grid unit, H i is the output height of the ith detection head, W i is the width of the output of the i-th detection head, (x, y) is the center coordinate of the prediction frame, (w, h) is the width and height of the prediction frame, conf is the confidence level of the prediction target, and C is the prediction frame belonging to each The probability of the category, n c is the number of predicted target categories, r j is the rotation box regression parameter, and O k is the probability of each orientation category of the target.

进一步,所述将转换开关实际旋转角度与单位匹配信息进行匹配,得到读数单位结果这一步骤,具体包括:Further, the step of matching the actual rotation angle of the transfer switch with the unit matching information to obtain the result of the reading unit specifically includes:

获取单位匹配信息,得到单位信息数据集和角度信息数据集;Obtain unit matching information, and obtain a unit information data set and an angle information data set;

将转换开关实际旋转角度与角度信息数据集进行匹配,得到角度索引值;Match the actual rotation angle of the transfer switch with the angle information data set to obtain the angle index value;

将角度索引值与单位信息数据集进行匹配,得到读数单位结果。Match the angle index value to the unit info dataset to get the reading unit result.

进一步,所述对读数区域旋转框进行仿射变换并识别,得到读数数字结果这一步骤,具体包括:Further, the step of performing affine transformation and identification on the rotating frame of the reading area, and obtaining the digital result of the reading, specifically includes:

将读数区域旋转框进行仿射变换,得到水平读数区域图;Perform affine transformation on the rotating frame of the reading area to obtain a horizontal reading area map;

所述仿射变换包括对读数区域旋转框进行平移、缩放、旋转、翻转和错切组合变换;The affine transformation includes the combined transformation of translation, scaling, rotation, flipping and staggering of the rotation frame of the reading area;

基于CRNN读数识别模型对水平读数区域图依次进行CNN特征提取、RNN序列建模和CTC转录,得到读数数字结果。Based on the CRNN read identification model, CNN feature extraction, RNN sequence modeling and CTC transcription were performed on the horizontal read region map in turn, and the read number results were obtained.

进一步,所述仿射变换的矩阵公式如下:Further, the matrix formula of the affine transformation is as follows:

Figure BDA0003603642260000031
Figure BDA0003603642260000031

上式中,(x,y)为读数区域旋转框点坐标,(u,v)为读数区域旋转框点经仿射变换后得到的旋转框点,a0、a1、a2、b0、b1和b2为M参数,M为仿射变换矩阵。In the above formula, (x, y) is the coordinates of the rotation frame point in the reading area, (u, v) is the rotation frame point obtained by the affine transformation of the rotation frame point in the reading area, a 0 , a 1 , a 2 , b 0 , b 1 and b 2 are M parameters, and M is an affine transformation matrix.

本发明所采用的第二技术方案是:一种基于旋转目标检测的万用表读数识别系统,包括:The second technical solution adopted by the present invention is: a multimeter reading identification system based on rotating target detection, comprising:

检测模块,基于改进的YOLOv5模型对待测图像进行处理,输出读数区域旋转框、读数区域旋转角度和转换开关旋转角度;The detection module, based on the improved YOLOv5 model, processes the image to be tested, and outputs the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch;

实际旋转角度计算模块,用于根据读数区域旋转角度和转换开关旋转角度,计算转换开关实际旋转角度;The actual rotation angle calculation module is used to calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch;

单位匹配模块,用于将转换开关实际旋转角度与单位匹配信息进行匹配,得到读数单位结果;The unit matching module is used to match the actual rotation angle of the transfer switch with the unit matching information to obtain the reading unit result;

数字识别模块,用于对读数区域旋转框进行仿射变换并识别,得到读数数字结果;The digital recognition module is used to perform affine transformation and recognition on the rotating frame of the reading area, and obtain the digital result of the reading;

读数整合模块,用于整合读数数字结果与读数单位结果,得到完整读数结果。The reading integration module is used to integrate the reading digital result and the reading unit result to obtain the complete reading result.

本发明方法及系统的有益效果是:通过改进YOLOv5算法解决了任意旋转角度下的万用表读数识别难点,并针对一系列带转换开关的万用表提出了读数单位的识别方法,使得读数信息更加完整,且该方法适用于一类带转换开关的数字万用表,扩大了应用范围和应用价值。The beneficial effects of the method and system of the present invention are: by improving the YOLOv5 algorithm, the difficulty in identifying the reading of a multimeter under any rotation angle is solved, and a method for identifying the reading unit is proposed for a series of multimeters with a switch, so that the reading information is more complete, and The method is suitable for a class of digital multimeters with transfer switches, which expands the application scope and application value.

附图说明Description of drawings

图1是本发明一种基于旋转目标检测的万用表读数识别方法的步骤流程图;Fig. 1 is the step flow chart of a kind of multimeter reading identification method based on rotating target detection of the present invention;

图2是本发明一种基于旋转目标检测的万用表读数识别系统的结构框图;Fig. 2 is a kind of structural block diagram of the multimeter reading identification system based on rotating target detection of the present invention;

图3是本发明具体实施例改进YOLOv5旋转区域检测模型示意图;3 is a schematic diagram of a specific embodiment of the present invention to improve the YOLOv5 rotation area detection model;

图4是本发明具体实施例改进YOLOv5输出旋转框示意图;4 is a schematic diagram of a specific embodiment of the present invention to improve the output rotation frame of YOLOv5;

图5是本发明具体实施例转换开关示意图;5 is a schematic diagram of a transfer switch according to a specific embodiment of the present invention;

图6是本发明具体实施例读数区域矫正示意图;6 is a schematic diagram of a reading area correction according to a specific embodiment of the present invention;

图7是本发明具体实施例CRNN读数识别模型的结构框图。FIG. 7 is a structural block diagram of a CRNN reading recognition model according to a specific embodiment of the present invention.

具体实施方式Detailed ways

下面结合附图和具体实施例对本发明做进一步的详细说明。对于以下实施例中的步骤编号,其仅为了便于阐述说明而设置,对步骤之间的顺序不做任何限定,实施例中的各步骤的执行顺序均可根据本领域技术人员的理解来进行适应性调整。The present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments. The numbers of the steps in the following embodiments are only set for the convenience of description, and the sequence between the steps is not limited in any way, and the execution sequence of each step in the embodiments can be adapted according to the understanding of those skilled in the art Sexual adjustment.

参照图1,本发明提供了一种基于旋转目标检测的万用表读数识别方法,该方法包括以下步骤:1, the present invention provides a multimeter reading recognition method based on rotating target detection, the method includes the following steps:

S1、参照图3,对YOLOv5模型进行训练,得到改进的YOLOv5模型;S1. Referring to Figure 3, train the YOLOv5 model to obtain an improved YOLOv5 model;

S1.1、获取训练图像并进行旋转框标注,将旋转框四个角的坐标编码为几何要素,得到标注后图像;S1.1. Obtain the training image and label the rotating frame, encode the coordinates of the four corners of the rotating frame as geometric elements, and obtain the labeled image;

具体地,参照图4的方式将旋转框的四个角的坐标编码为几何要素,具体操作如下:Specifically, the coordinates of the four corners of the rotating frame are encoded as geometric elements with reference to the mode of FIG. 4 , and the specific operations are as follows:

||Rj||=w×Sigmoid(rj),j∈{0,1};||R j ||=w×Sigmoid(r j ), j∈{0,1};

上式中,Rj为预测旋转框j点的位置与预测水平框j点的位置之间的距离;w为网格单元预测的水平框的宽度,Sigmoid(rj)为边框回归参数的归一化值,rj为边框偏移回归参数。In the above formula, R j is the distance between the position of the predicted rotation frame j and the predicted horizontal frame j; w is the width of the horizontal frame predicted by the grid unit, and Sigmoid(r j ) is the regression parameter of the frame. The normalized value, r j is the border offset regression parameter.

S1.2、对标注后图像进行数据增强,得到训练集;S1.2, perform data enhancement on the labeled images to obtain a training set;

S1.3、将训练集输入至YOLOv5模型;S1.3. Input the training set to the YOLOv5 model;

S1.4、依次进行特征提取、特征融合和边框回归得到旋转框信息;S1.4, perform feature extraction, feature fusion and frame regression in sequence to obtain the rotation frame information;

具体地,旋转框信息包括旋转框的中心点坐标、宽高、角度、置信度、偏置信息、各类别概率和各朝向类别概率。Specifically, the rotation frame information includes the center point coordinates, width and height, angle, confidence, bias information, probability of each category, and probability of each orientation category of the rotation frame.

其中,旋转框角度的计算操作如下:Among them, the calculation operation of the rotation frame angle is as follows:

当rj≠0,即存在旋转框时,从竖直向上方向开始,以顺时针方向计算旋转角度

Figure BDA0003603642260000041
When r j ≠ 0, that is, when there is a rotation frame, start from the vertical upward direction, and calculate the rotation angle in a clockwise direction
Figure BDA0003603642260000041

Figure BDA0003603642260000042
Figure BDA0003603642260000042

上式中,

Figure BDA0003603642260000043
为标注后图像的四个顶点的坐标,
Figure BDA0003603642260000044
为旋转框角度。In the above formula,
Figure BDA0003603642260000043
are the coordinates of the four vertices of the labeled image,
Figure BDA0003603642260000044
is the rotation frame angle.

当rj=0,即仅存在水平框,则可以通过预测朝向

Figure BDA0003603642260000045
计算出目标的旋转角度
Figure BDA0003603642260000046
When r j = 0, that is, there is only a horizontal frame, then the direction can be predicted by
Figure BDA0003603642260000045
Calculate the rotation angle of the target
Figure BDA0003603642260000046

Figure BDA0003603642260000047
Figure BDA0003603642260000047

其中,

Figure BDA0003603642260000048
为各朝向类别概率中最大概率的朝向,其具体计算操作如下:in,
Figure BDA0003603642260000048
is the orientation with the largest probability among the probabilities of each orientation category, and its specific calculation operations are as follows:

Figure BDA0003603642260000049
Figure BDA0003603642260000049

上式中,P(Ok)为各朝向类别概率,其具体计算操作如下:In the above formula, P(O k ) is the probability of each orientation category, and its specific calculation operation is as follows:

P(Ok)=Sigmoid(Ok),k∈{0,1,2,3}。P(O k )=Sigmoid(O k ),k∈{0,1,2,3}.

S1.5、结合真实值标签计算损失对模型参数进行更新,得到改进的YOLOv5模型。S1.5, update the model parameters by calculating the loss of the true value label, and obtain an improved YOLOv5 model.

具体地,该模型的损失函数计算操作如下:Specifically, the loss function calculation operation of the model is as follows:

L=λboxLboxobjLobjclsLclsrboxLrboxrotLrotL=λ box L boxobj L objcls L clsrbox L rboxrot L rot ;

上式中,λbox、λobj、λcls、λrbox和λrot分别为Lbox、Lobj、Lcls、Lrbox和Lrot的权重系数;Lbox为水平框bbox的损失函数,使用CIOU Loss损失函数;Lobj为目标置信度损失函数,使用Focal Loss损失函数;Lcls为预测类别损失函数,使用Focal Loss损失函数;Lrbox为旋转框偏移回归损失函数,使用SmoothL1损失函数;Lrot为各朝向分类损失函数,使用Focal Loss损失函数。In the above formula, λ box , λ obj , λ cls , λ rbox and λ rot are the weight coefficients of L box , L obj , L cls , L rbox and L rot respectively; L box is the loss function of the horizontal box bbox, using CIOU Loss loss function; L obj is the target confidence loss function, using the Focal Loss loss function; L cls is the prediction category loss function, using the Focal Loss loss function; L rbox is the rotation frame offset regression loss function, using the Smooth L1 loss function; L rot is the classification loss function for each orientation, and the Focal Loss loss function is used.

其中,旋转框偏移回归损失函数Lrbox表示如下:Among them, the rotation box offset regression loss function L rbox is expressed as follows:

Figure BDA0003603642260000051
Figure BDA0003603642260000051

上式中,rj′为rj的归一化值,

Figure BDA0003603642260000052
为rj的真实值,rj为旋转框偏移回归参数,SmoothL1为计算rj′和
Figure BDA0003603642260000053
之间的回归误差损失的损失函数,其具体操作如下:In the above formula, r j ′ is the normalized value of r j ,
Figure BDA0003603642260000052
is the real value of r j , r j is the rotation frame offset regression parameter, Smooth L1 is the calculation of r j ′ and
Figure BDA0003603642260000053
The loss function between the regression error loss and the specific operation is as follows:

Figure BDA0003603642260000054
Figure BDA0003603642260000054

各朝向分类损失函数Lrot表示如下:The classification loss function L rot of each orientation is expressed as follows:

Figure BDA0003603642260000055
Figure BDA0003603642260000055

上式中,αk为用于平衡正负样本的权重系数,γ为用于平衡简单困难样本。In the above formula, α k is the weight coefficient used to balance positive and negative samples, and γ is used to balance simple and difficult samples.

S2、基于改进的YOLOv5模型对待测图像进行处理,输出读数区域旋转框、读数区域旋转角度和转换开关旋转角度;S2. Process the image to be tested based on the improved YOLOv5 model, and output the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch;

具体地,改进的YOLOv5模型由输入端、Backbone特征提取主干网络、Neck特征增强网络和Head检测头构成。Specifically, the improved YOLOv5 model consists of an input end, a Backbone feature extraction backbone network, a Neck feature enhancement network, and a Head detection head.

其中,改进的YOLOv5模型检测头输出具体计算操作如下:Among them, the specific calculation operation of the output of the improved YOLOv5 model detection head is as follows:

Figure BDA0003603642260000056
Figure BDA0003603642260000056

上式中,nh为检测头数量,bs为输入模型的待测图像数量大小,na为网格单元的预设框锚点数量,Hi为第i个检测头输出的高,Wi为第i个检测头输出的宽,(x,y)为预测框的中心坐标,(w,h)为预测框的宽和高,conf为预测目标的置信度,C为预测框属于各个类别的概率,nc为预测的目标类别数量,rj为旋转框回归参数,Ok为目标各朝向类别概率。In the above formula, n h is the number of detection heads, b s is the number of images to be tested in the input model, na is the number of preset frame anchor points of the grid unit, H i is the output height of the ith detection head, W i is the width of the output of the i-th detection head, (x, y) is the center coordinate of the prediction frame, (w, h) is the width and height of the prediction frame, conf is the confidence level of the prediction target, and C is the prediction frame belonging to each The probability of the category, n c is the number of predicted target categories, r j is the rotation box regression parameter, and O k is the probability of each orientation category of the target.

S3、根据读数区域旋转角度和转换开关旋转角度,计算转换开关实际旋转角度;S3. Calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch;

具体地,转换开关实际旋转角度的具体计算操作为:Specifically, the specific calculation operation of the actual rotation angle of the transfer switch is:

Figure BDA0003603642260000061
Figure BDA0003603642260000061

上式中,θd为读数区域旋转角度,θg为转换开关旋转角度,θr为转换开关实际旋转角度。In the above formula, θ d is the rotation angle of the reading area, θ g is the rotation angle of the transfer switch, and θ r is the actual rotation angle of the transfer switch.

其中,θd和θg就是

Figure BDA0003603642260000062
where θd and θg are
Figure BDA0003603642260000062

S4、参考图5,将转换开关实际旋转角度与单位匹配信息进行匹配,得到读数单位结果;S4. Referring to Figure 5, match the actual rotation angle of the transfer switch with the unit matching information to obtain the unit result of the reading;

S4.1、获取单位匹配信息,得到单位信息数据集和角度信息数据集;S4.1. Obtain unit matching information, and obtain a unit information data set and an angle information data set;

具体地,将单位信息存储于数组U,U={Ull∈{1,2,...,nu}};Specifically, the unit information is stored in the array U, U={U l l∈{1,2,...,n u }};

上式中,Ul表示第l个单位,nu表示转换开关单位种类数;In the above formula, U l represents the lth unit, and n u represents the number of types of transfer switch units;

各Ul对应的角度信息储存于数组θ,θ={θll∈{1,2,...,nu}}:The angle information corresponding to each U l is stored in the array θ, θ={θ l l∈{1,2,...,n u }}:

上式中,θl表示转换开关中单位Ul对应的旋转角度。In the above formula, θ l represents the rotation angle corresponding to the unit U l in the transfer switch.

S4.2、将转换开关实际旋转角度与角度信息数据集进行匹配,得到角度索引值;S4.2. Match the actual rotation angle of the transfer switch with the angle information data set to obtain the angle index value;

具体地,转换开关实际旋转角度与角度信息数据集的匹配操作公式如下:Specifically, the matching operation formula between the actual rotation angle of the switch and the angle information data set is as follows:

m=argminlrl|;m = argmin lrl |;

上式中,m为θ中与θr最接近的角度的索引值。In the above formula, m is the index value of the angle closest to θ r in θ.

S4.3、将角度索引值与单位信息数据集进行匹配,得到读数单位结果。S4.3. Match the angle index value with the unit information data set to obtain the reading unit result.

具体地,角度索引值与单位信息数据集的匹配操作公式如下:Specifically, the matching operation formula between the angle index value and the unit information data set is as follows:

Figure BDA0003603642260000063
Figure BDA0003603642260000063

S5、参照图6,对读数区域旋转框进行仿射变换并识别,得到读数数字结果;S5, referring to Fig. 6, carry out affine transformation and identification to the rotating frame of the reading area, and obtain the digital result of reading;

S5.1、将读数区域旋转框进行仿射变换,得到水平读数区域图;S5.1. Perform affine transformation on the rotating frame of the reading area to obtain a horizontal reading area map;

具体地,将原旋转框点坐标(x,y)通过仿射变换到坐标(u,v),其具体操作如下:Specifically, the original rotation frame point coordinates (x, y) are transformed into coordinates (u, v) by affine transformation, and the specific operations are as follows:

Figure BDA0003603642260000071
Figure BDA0003603642260000071

其矩阵表示形式如下:Its matrix representation is as follows:

Figure BDA0003603642260000072
Figure BDA0003603642260000072

上式中,M为仿射变换矩阵,可以对读数区域旋转框进行平移、缩放、旋转、翻转和错切组合变换,a0、a1、a2、b0、b1和b2为M参数。In the above formula, M is the affine transformation matrix, which can perform translation, scaling, rotation, flipping and staggered transformation on the rotation frame of the reading area, a 0 , a 1 , a 2 , b 0 , b 1 and b 2 are M parameter.

S5.2、参照图7,基于CRNN读数识别模型对水平读数区域图依次进行CNN特征提取、RNN序列建模和CTC转录,得到读数数字结果;S5.2. Referring to Figure 7, based on the CRNN read identification model, CNN feature extraction, RNN sequence modeling and CTC transcription are sequentially performed on the horizontal read area map to obtain the read number result;

S5.2.1、将水平读数区域图输入至CRNN读数识别模型;S5.2.1. Input the horizontal reading area map into the CRNN reading recognition model;

具体地,CRNN读数识别模型由CNN特征提取模块、RNN序列建模模块和CTC转录模块构成,是一种稳定的端到端文本识别框架。Specifically, the CRNN read recognition model consists of a CNN feature extraction module, an RNN sequence modeling module, and a CTC transcription module, and is a stable end-to-end text recognition framework.

S5.2.2、基于CNN特征提取模块对水平读数区域图进行特征提取,得到卷积特征图;S5.2.2. Perform feature extraction on the horizontal reading area map based on the CNN feature extraction module to obtain a convolution feature map;

具体地,将水平读数区域图像输入MobileNetv3轻量级卷积神经网络网络提取整体特征信息,并输出卷积特征图。Specifically, the horizontal reading area image is input into the MobileNetv3 lightweight convolutional neural network network to extract the overall feature information, and the convolutional feature map is output.

S5.2.3、基于RNN序列建模模块对卷积特征图进行序列建模,得到后验概率矩阵;S5.2.3. Perform sequence modeling on the convolution feature map based on the RNN sequence modeling module to obtain a posterior probability matrix;

具体地,为了增强RNN网络的前后序列特征提取能力,本方法使用深度双向LSTM网络提取万用表读数前后序列特征;输入Bi-LSTM的特征图尺寸为(1,T,D),其高度为1,最大时间长度为T,每个输入向量维度为D,Bi-LSTM输出y是一个后验概率矩阵。Specifically, in order to enhance the feature extraction capability of the RNN network before and after the sequence, this method uses the deep bidirectional LSTM network to extract the sequence features before and after the multimeter reading; the feature map size of the input Bi-LSTM is (1, T, D), and its height is 1, The maximum time length is T, each input vector has dimension D, and the Bi-LSTM output y is a posterior probability matrix.

S5.2.4、基于CTC转录模块对后验概率矩阵进行转录,得到读数数字结果。S5.2.4, transcribe the posterior probability matrix based on the CTC transcription module, and obtain the digital result of reading.

具体地,CTC转录模块把后验概率矩阵y的每一个时间片接入softmax以获取预测序列,并去除序列中的重复标签,再去除序列中的无效空格标签,从而将含有空格的预测序列翻译为文本信息;假设双向LSTM网络输出的预测序列为“--11-2-22-3--”,经CTC转录后变为最终预测结果“1223”。Specifically, the CTC transcription module inserts each time slice of the posterior probability matrix y into softmax to obtain the predicted sequence, removes the repeated tags in the sequence, and then removes the invalid space tags in the sequence, so as to translate the predicted sequence containing spaces. It is text information; it is assumed that the prediction sequence output by the bidirectional LSTM network is "--11-2-22-3--", which becomes the final prediction result "1223" after being transcribed by CTC.

S6、整合读数数字结果与读数单位结果,得到完整读数结果。S6. Integrate the reading digital result and the reading unit result to obtain a complete reading result.

参照图2,一种基于旋转目标检测的万用表读数识别系统,包括:2, a multimeter reading recognition system based on rotating target detection, including:

检测模块,基于改进的YOLOv5模型对待测图像进行处理,输出读数区域旋转框、读数区域旋转角度和转换开关旋转角度;The detection module, based on the improved YOLOv5 model, processes the image to be tested, and outputs the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch;

实际旋转角度计算模块,用于根据读数区域旋转角度和转换开关旋转角度,计算转换开关实际旋转角度;The actual rotation angle calculation module is used to calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch;

单位匹配模块,用于将转换开关实际旋转角度与单位匹配信息进行匹配,得到读数单位结果;The unit matching module is used to match the actual rotation angle of the transfer switch with the unit matching information to obtain the reading unit result;

数字识别模块,用于对读数区域旋转框进行仿射变换并识别,得到读数数字结果;The digital recognition module is used to perform affine transformation and recognition on the rotating frame of the reading area, and obtain the digital result of the reading;

读数整合模块,用于整合读数数字结果与读数单位结果,得到完整读数结果。The reading integration module is used to integrate the reading digital result and the reading unit result to obtain the complete reading result.

上述方法实施例中的内容均适用于本系统实施例中,本系统实施例所具体实现的功能与上述方法实施例相同,并且达到的有益效果与上述方法实施例所达到的有益效果也相同。The contents in the above method embodiments are all applicable to the present system embodiments, the specific functions implemented by the present system embodiments are the same as the above method embodiments, and the beneficial effects achieved are also the same as those achieved by the above method embodiments.

本发明的有益效果具体包括:The beneficial effects of the present invention specifically include:

1)能够检测并识别万用表任意旋转角度下的读数单位结果:基于YOLOv5进行扩展,对其检测头增加了目标旋转框偏移回归参数和目标各朝向概率参数的输出,仅用一个模型便可完成目标旋转框和角度的检测;基于万用表读数区域和转换开关位姿关系特性,结合读数区域角度和转换开关角度便可计算转换开关实际角度获取读数单位。1) Able to detect and identify the reading unit result at any rotation angle of the multimeter: based on YOLOv5 expansion, the detection head adds the output of the target rotation frame offset regression parameter and the target orientation probability parameter, which can be completed with only one model. Detection of target rotation frame and angle; based on the multimeter reading area and the relationship between the position and attitude of the transfer switch, combined with the angle of the reading area and the angle of the transfer switch, the actual angle of the transfer switch can be calculated to obtain the reading unit.

2)提高了万用表读数数字结果识别精度且简化了读数识别步骤:将检测的读数区域旋转框进行仿射变换成为水平框以便进行读数识别。常用的读数识别方法由读数字符分割和识别两步组成,而CRNN算法无需进行读数字符分割,可以端到端的完成读数识别。2) The recognition accuracy of the multimeter reading digital result is improved and the reading recognition steps are simplified: the rotating frame of the detected reading area is affine transformed into a horizontal frame for reading recognition. The commonly used reading recognition method consists of two steps of reading character segmentation and recognition, while the CRNN algorithm does not need to perform reading character segmentation, and can complete the reading recognition end-to-end.

以上是对本发明的较佳实施进行了具体说明,但本发明创造并不限于所述实施例,熟悉本领域的技术人员在不违背本发明精神的前提下还可做作出种种的等同变形或替换,这些等同的变形或替换均包含在本申请权利要求所限定的范围内。The above is a specific description of the preferred implementation of the present invention, but the present invention is not limited to the described embodiments, and those skilled in the art can make various equivalent deformations or replacements without departing from the spirit of the present invention. , these equivalent modifications or substitutions are all included within the scope defined by the claims of the present application.

Claims (8)

1.一种基于旋转目标检测的万用表读数识别方法,其特征在于,包括以下步骤:1. a multimeter reading identification method based on rotating target detection, is characterized in that, comprises the following steps: 基于改进的YOLOv5模型对待测图像进行处理,输出读数区域旋转框、读数区域旋转角度和转换开关旋转角度;Based on the improved YOLOv5 model, the image to be tested is processed, and the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch are output; 根据读数区域旋转角度和转换开关旋转角度,计算转换开关实际旋转角度;Calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch; 将转换开关实际旋转角度与单位匹配信息进行匹配,得到读数单位结果;Match the actual rotation angle of the transfer switch with the unit matching information to get the reading unit result; 对读数区域旋转框进行仿射变换并识别,得到读数数字结果;Perform affine transformation on the rotation frame of the reading area and identify it to obtain the digital result of the reading; 整合读数数字结果与读数单位结果,得到完整读数结果。Integrate the reading numerical result with the reading unit result to obtain the complete reading result. 2.根据权利要求1所述一种基于旋转目标检测的万用表读数识别方法,其特征在于,所述改进的YOLOv5模型的训练步骤具体包括:2. a kind of multimeter reading identification method based on rotating target detection according to claim 1, is characterized in that, the training step of described improved YOLOv5 model specifically comprises: 获取训练图像并进行旋转框标注,将旋转框四个角的坐标编码为几何要素,得到标注后图像;Obtain the training image and label the rotating frame, encode the coordinates of the four corners of the rotating frame as geometric elements, and obtain the labeled image; 对标注后图像进行数据增强,得到训练集;Data enhancement is performed on the labeled images to obtain a training set; 将训练集输入至YOLOv5模型;Input the training set to the YOLOv5 model; 依次进行特征提取、特征融合和边框回归得到旋转框信息;Perform feature extraction, feature fusion, and frame regression in sequence to obtain rotation frame information; 结合真实值标签计算损失对模型参数进行更新,得到改进的YOLOv5模型。The model parameters are updated by combining the ground truth label calculation loss to obtain an improved YOLOv5 model. 3.根据权利要求2所述一种基于旋转目标检测的万用表读数识别方法,其特征在于,所述旋转框信息包括旋转框的中心点坐标、宽高、角度、置信度、偏置信息、各类别概率和各朝向类别概率。3. a kind of multimeter reading recognition method based on rotating target detection according to claim 2, is characterized in that, described rotating frame information comprises the center point coordinates, width and height, angle, confidence, offset information, each Class probabilities and class probabilities for each orientation. 4.根据权利要求1所述一种基于旋转目标检测的万用表读数识别方法,其特征在于,所述改进的YOLOv5模型的检测头输出公式表示如下:4. a kind of multimeter reading identification method based on rotating target detection according to claim 1 is characterized in that, the detection head output formula of described improved YOLOv5 model is expressed as follows:
Figure FDA0003603642250000011
Figure FDA0003603642250000011
上式中,nh为检测头数量,bs为输入模型的待测图像数量大小,na为网格单元的预设框锚点数量,Hi为第i个检测头输出的高,Wi为第i个检测头输出的宽,(x,y)为预测框的中心坐标,(w,h)为预测框的宽和高,conf为预测目标的置信度,C为预测框属于各个类别的概率,nc为预测的目标类别数量,rj为旋转框回归参数,Ok为目标各朝向类别概率。In the above formula, n h is the number of detection heads, b s is the number of images to be tested in the input model, na is the number of preset frame anchor points of the grid unit, H i is the output height of the ith detection head, W i is the width of the output of the i-th detection head, (x, y) is the center coordinate of the prediction frame, (w, h) is the width and height of the prediction frame, conf is the confidence level of the prediction target, and C is the prediction frame belonging to each The probability of the category, n c is the number of predicted target categories, r j is the rotation box regression parameter, and O k is the probability of each orientation category of the target.
5.根据权利要求1所述一种基于旋转目标检测的万用表读数识别方法,其特征在于,所述将转换开关实际旋转角度与单位匹配信息进行匹配,得到读数单位结果这一步骤,具体包括:5. a kind of multimeter reading identification method based on rotating target detection according to claim 1, is characterized in that, described with actual rotation angle of transfer switch and unit matching information are matched, obtain this step of reading unit result, specifically comprises: 获取单位匹配信息,得到单位信息数据集和角度信息数据集;Obtain unit matching information, and obtain a unit information data set and an angle information data set; 将转换开关实际旋转角度与角度信息数据集进行匹配,得到角度索引值;Match the actual rotation angle of the transfer switch with the angle information data set to obtain the angle index value; 将角度索引值与单位信息数据集进行匹配,得到读数单位结果。Match the angle index value to the unit info dataset to get the reading unit result. 6.根据权利要求1所述一种基于旋转目标检测的万用表读数识别方法,其特征在于,所述对读数区域旋转框进行仿射变换并识别,得到读数数字结果这一步骤,具体包括:6. a kind of multimeter reading identification method based on rotating target detection according to claim 1, is characterized in that, described to reading area rotating frame is carried out affine transformation and identification, obtains this step of reading digital result, specifically comprises: 将读数区域旋转框进行仿射变换,得到水平读数区域图;Perform affine transformation on the rotation frame of the reading area to obtain a horizontal reading area map; 所述仿射变换包括对读数区域旋转框进行平移、缩放、旋转、翻转和错切组合变换;The affine transformation includes the combined transformation of translation, scaling, rotation, flipping and staggering of the rotation frame of the reading area; 基于CRNN读数识别模型对水平读数区域图依次进行CNN特征提取、RNN序列建模和CTC转录,得到读数数字结果。Based on the CRNN read identification model, CNN feature extraction, RNN sequence modeling and CTC transcription were performed on the horizontal read region map in turn, and the read number results were obtained. 7.根据权利要求6所述一种基于旋转目标检测的万用表读数识别方法,其特征在于,所述仿射变换的矩阵公式如下:7. a kind of multimeter reading identification method based on rotating target detection according to claim 6 is characterized in that, the matrix formula of described affine transformation is as follows:
Figure FDA0003603642250000021
Figure FDA0003603642250000021
上式中,(x,y)为读数区域旋转框点坐标,(u,v)为读数区域旋转框点经仿射变换后得到的旋转框点,a0、a1、a2、b0、b1和b2为M参数,M为仿射变换矩阵。In the above formula, (x, y) is the coordinates of the rotation frame point in the reading area, (u, v) is the rotation frame point obtained by the affine transformation of the rotation frame point in the reading area, a 0 , a 1 , a 2 , b 0 , b 1 and b 2 are M parameters, and M is an affine transformation matrix.
8.一种基于旋转目标检测的万用表读数识别系统,其特征在于,包括:8. A multimeter reading recognition system based on rotating target detection is characterized in that, comprising: 检测模块,基于改进的YOLOv5模型对待测图像进行处理,输出读数区域旋转框、读数区域旋转角度和转换开关旋转角度;The detection module, based on the improved YOLOv5 model, processes the image to be tested, and outputs the rotation frame of the reading area, the rotation angle of the reading area and the rotation angle of the switch; 实际旋转角度计算模块,用于根据读数区域旋转角度和转换开关旋转角度,计算转换开关实际旋转角度;The actual rotation angle calculation module is used to calculate the actual rotation angle of the transfer switch according to the rotation angle of the reading area and the rotation angle of the transfer switch; 单位匹配模块,用于将转换开关实际旋转角度与单位匹配信息进行匹配,得到读数单位结果;The unit matching module is used to match the actual rotation angle of the transfer switch with the unit matching information to obtain the reading unit result; 数字识别模块,用于对读数区域旋转框进行仿射变换并识别,得到读数数字结果;The digital recognition module is used to perform affine transformation and recognition on the rotation frame of the reading area, and obtain the digital result of the reading; 读数整合模块,用于整合读数数字结果与读数单位结果,得到完整读数结果。The reading integration module is used to integrate the reading digital result and the reading unit result to obtain the complete reading result.
CN202210410911.6A 2022-04-19 2022-04-19 A multimeter reading recognition method and system based on rotating target detection Pending CN114743201A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210410911.6A CN114743201A (en) 2022-04-19 2022-04-19 A multimeter reading recognition method and system based on rotating target detection

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210410911.6A CN114743201A (en) 2022-04-19 2022-04-19 A multimeter reading recognition method and system based on rotating target detection

Publications (1)

Publication Number Publication Date
CN114743201A true CN114743201A (en) 2022-07-12

Family

ID=82280642

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210410911.6A Pending CN114743201A (en) 2022-04-19 2022-04-19 A multimeter reading recognition method and system based on rotating target detection

Country Status (1)

Country Link
CN (1) CN114743201A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115082922A (en) * 2022-08-24 2022-09-20 济南瑞泉电子有限公司 Water meter digital picture processing method and system based on deep learning
CN115909347A (en) * 2022-10-18 2023-04-04 五邑大学 Meter indication identification method, device, equipment and medium
CN116740704A (en) * 2023-06-16 2023-09-12 安徽农业大学 Method and device for monitoring the change rate of wheat leaf phenotypic parameters based on deep learning

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111062282A (en) * 2019-12-05 2020-04-24 武汉科技大学 Identification method of substation pointer meter based on improved YOLOV3 model
CN111353502A (en) * 2020-02-25 2020-06-30 北京眸视科技有限公司 Digital table identification method and device and electronic equipment
CN112906694A (en) * 2021-03-25 2021-06-04 中国长江三峡集团有限公司 Reading correction system and method for inclined pointer instrument image of transformer substation
CN113627427A (en) * 2021-08-04 2021-11-09 中国兵器装备集团自动化研究所有限公司 Instrument and meter reading method and system based on image detection technology
CN114067304A (en) * 2021-11-17 2022-02-18 国网智能科技股份有限公司 Digital instrument intelligent identification method and system based on deep learning
CN114120138A (en) * 2021-11-02 2022-03-01 中山大学·深圳 Method, device, equipment and medium for detecting and identifying remote sensing image target

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111062282A (en) * 2019-12-05 2020-04-24 武汉科技大学 Identification method of substation pointer meter based on improved YOLOV3 model
CN111353502A (en) * 2020-02-25 2020-06-30 北京眸视科技有限公司 Digital table identification method and device and electronic equipment
CN112906694A (en) * 2021-03-25 2021-06-04 中国长江三峡集团有限公司 Reading correction system and method for inclined pointer instrument image of transformer substation
CN113627427A (en) * 2021-08-04 2021-11-09 中国兵器装备集团自动化研究所有限公司 Instrument and meter reading method and system based on image detection technology
CN114120138A (en) * 2021-11-02 2022-03-01 中山大学·深圳 Method, device, equipment and medium for detecting and identifying remote sensing image target
CN114067304A (en) * 2021-11-17 2022-02-18 国网智能科技股份有限公司 Digital instrument intelligent identification method and system based on deep learning

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115082922A (en) * 2022-08-24 2022-09-20 济南瑞泉电子有限公司 Water meter digital picture processing method and system based on deep learning
CN115909347A (en) * 2022-10-18 2023-04-04 五邑大学 Meter indication identification method, device, equipment and medium
CN116740704A (en) * 2023-06-16 2023-09-12 安徽农业大学 Method and device for monitoring the change rate of wheat leaf phenotypic parameters based on deep learning
CN116740704B (en) * 2023-06-16 2024-02-27 安徽农业大学 Wheat leaf phenotype parameter change rate monitoring method and device based on deep learning

Similar Documents

Publication Publication Date Title
CN109977918B (en) An Optimization Method for Object Detection and Localization Based on Unsupervised Domain Adaptation
CN114743201A (en) A multimeter reading recognition method and system based on rotating target detection
CN111462120B (en) Defect detection method, device, medium and equipment based on semantic segmentation model
CN110570396A (en) A method of industrial product defect detection based on deep learning
CN112836650B (en) Semantic analysis method and system for quality inspection report scanning image table
CN110598698B (en) Natural scene text detection method and system based on adaptive regional suggestion network
CN113160291B (en) Change detection method based on image registration
CN111079847A (en) Remote sensing image automatic labeling method based on deep learning
CN110363196B (en) Method for accurately recognizing characters of inclined text
CN111985325A (en) Aerial small target rapid identification method in extra-high voltage environment evaluation
CN111723789A (en) A deep learning-based image text coordinate positioning method
CN113159215A (en) Small target detection and identification method based on fast Rcnn
CN114863407B (en) A multi-task cold-start target detection method based on deep fusion of visual language
CN112036260A (en) An expression recognition method and system for multi-scale sub-block aggregation in natural environment
CN110659637A (en) Electric energy meter number and label automatic identification method combining deep neural network and SIFT features
CN116612478A (en) Off-line handwritten Chinese character scoring method, device and storage medium
CN114581905A (en) A method and system for scene text recognition based on semantic enhancement mechanism
CN113569650A (en) Unmanned aerial vehicle autonomous inspection positioning method based on electric power tower label identification
CN114298159B (en) Image similarity detection method based on text fusion under label-free sample
CN116912670A (en) Deep sea fish identification method based on improved YOLO model
Li et al. Few-shot meta-learning on point cloud for semantic segmentation
CN111144469B (en) An end-to-end multi-sequence text recognition method based on multi-dimensional correlation temporal classification neural network
CN117671693A (en) Multi-service-domain text recognition method based on countermeasure generation and migration learning
CN118351388A (en) Training method and device for image panoramic segmentation model based on open vocabulary segmentation
CN111578956A (en) Visual SLAM positioning method based on deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination