[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN110544302A - Human motion reconstruction system, method and motion training system based on multi-eye vision - Google Patents

Human motion reconstruction system, method and motion training system based on multi-eye vision Download PDF

Info

Publication number
CN110544302A
CN110544302A CN201910844266.7A CN201910844266A CN110544302A CN 110544302 A CN110544302 A CN 110544302A CN 201910844266 A CN201910844266 A CN 201910844266A CN 110544302 A CN110544302 A CN 110544302A
Authority
CN
China
Prior art keywords
dimensional
motion
human body
module
action
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910844266.7A
Other languages
Chinese (zh)
Inventor
张炜乐
樊奕良
梁鑫
杜钦涛
李培杰
姚伟聪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong University of Technology
Original Assignee
Guangdong University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong University of Technology filed Critical Guangdong University of Technology
Priority to CN201910844266.7A priority Critical patent/CN110544302A/en
Publication of CN110544302A publication Critical patent/CN110544302A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/80Analysis of captured images to determine intrinsic or extrinsic camera parameters, i.e. camera calibration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Geometry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Graphics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Psychiatry (AREA)
  • Social Psychology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

本申请公开了一种基于多目视觉的人体动作重建系统、方法和动作训练系统,通过校正后的基于至少两个单目相机捕获目标人体的动作图像,对于动作图像进行二维图像上的人体二维关节点识别,然后基于人体二维关节点重建出目标人体的三维动作,无需用户穿戴特制的光学捕捉服或者装配相应的传感器,解决了现有的深度摄像机人体三维动作识别方法,对应用场景要求较高,在复杂的应用场景下,识别准确率较低的技术问题。

The present application discloses a human motion reconstruction system, method and motion training system based on multi-eye vision. The corrected motion image of a target human body is captured based on at least two monocular cameras, and the motion image is subjected to a two-dimensional image of the human body. Two-dimensional joint point recognition, and then reconstruct the three-dimensional action of the target human body based on the two-dimensional joint points of the human body, without the need for the user to wear a special optical capture suit or assemble the corresponding sensor, which solves the existing depth camera human body three-dimensional action recognition method, suitable for applications Scenario requirements are high, and in complex application scenarios, technical problems with low recognition accuracy are recognized.

Description

基于多目视觉的人体动作重建系统、方法和动作训练系统Human motion reconstruction system, method and motion training system based on multi-eye vision

技术领域technical field

本申请涉及动作识别技术领域,特别涉及一种基于多目视觉的人体动作重建系统、方法和动作训练系统。The present application relates to the technical field of motion recognition, and in particular, to a system, method and motion training system for human motion reconstruction based on multi-eye vision.

背景技术Background technique

随着人工智能的发展,人体姿势识别技术在大数据时代可获得的更加庞大的数据集与强大计算能力的基础下取得了重要的突破。With the development of artificial intelligence, human gesture recognition technology has made important breakthroughs based on the larger data sets and powerful computing power available in the era of big data.

现有的三维人体动作识别方法是利用光学捕捉技术捕捉目标人体三维动作进行动作识别,用户穿戴有光学标记的特制光学捕捉服或者相应的检测传感器来获取相应的标记位置生成空间相对位置,从而构建人体三维模型,进行动作识别。利用光学捕捉技术捕捉目标人体三维动作进行动作识别的方法,需要用户穿戴特制的光学捕捉服或者装配相应的传感器,穿戴麻烦,且用户负重较大,对动作训练造成不便,影响用户体验。The existing 3D human action recognition method is to use optical capture technology to capture the 3D action of the target human body for action recognition. The user wears a special optical capture suit with optical markings or a corresponding detection sensor to obtain the corresponding marked position to generate the relative spatial position, thereby constructing a 3D model of human body for action recognition. The method of using optical capture technology to capture the three-dimensional movement of the target human body for action recognition requires the user to wear a special optical capture suit or assemble the corresponding sensor, which is troublesome to wear, and the user has a large load, which is inconvenient for movement training and affects the user experience.

发明内容SUMMARY OF THE INVENTION

本申请的目的是提供一种基于多目视觉的人体动作重建系统、方法和动作训练系统,用于解决现有的三维人体动作识别方法需要用户穿戴特制的光学捕捉服或者装配相应的传感器,穿戴麻烦,且用户负重较大,对动作训练造成不便,影响用户体验的技术问题。The purpose of this application is to provide a human motion reconstruction system, method and motion training system based on multi-eye vision, which is used to solve the problem that the existing three-dimensional human motion recognition method requires the user to wear a special optical capture suit or assemble a corresponding sensor. It is troublesome, and the user has a large load, which causes inconvenience to the movement training and technical problems that affect the user experience.

本申请第一方面提供了一种基于多目视觉的人体动作重建系统,包括:A first aspect of the present application provides a multi-vision-based human motion reconstruction system, including:

相机标定模块,用于根据采集到的标定点数据进行单目相机标定,所述单目相机的数量至少两个;a camera calibration module, configured to perform monocular camera calibration according to the collected calibration point data, and the number of the monocular cameras is at least two;

动作采集模块,用于获取所有所述单目相机采集到的目标人体的动作图像序列,将所述动作图像序列发送至二维人体动作识别模块;a motion acquisition module, configured to acquire all motion image sequences of the target human body collected by the monocular camera, and send the motion image sequences to the two-dimensional human motion recognition module;

所述二维人体动作识别模块,用于识别所述动作图像序列中的人体关键肢体关节部位,提取所述动作图像序列的每一帧中属于同一目标人体的二维关节点,以便于重建所述目标人体的人体骨架,将每一帧的所述人体骨架信息存储并发送至三维动作重建模块;The two-dimensional human action recognition module is used to identify the key limb joint parts of the human body in the action image sequence, and extract the two-dimensional joint points belonging to the same target human body in each frame of the action image sequence, so as to facilitate the reconstruction of all body parts. The human skeleton of the target human body is stored, and the human skeleton information of each frame is stored and sent to the three-dimensional motion reconstruction module;

所述三维动作重建模块,用于基于每一帧中的所述人体骨架信息还原所述目标人体的所述二维关节点在三维空间中的真实位置,重建出所述目标人体的三维动作。The three-dimensional motion reconstruction module is configured to restore the real positions of the two-dimensional joint points of the target human body in the three-dimensional space based on the human skeleton information in each frame, and reconstruct the three-dimensional motion of the target human body.

可选的,还包括:成像点误差纠正模块;Optionally, it also includes: an imaging point error correction module;

所述成像点误差纠正模块,用于在所述单目相机发生偏移时,纠正所述二维人体动作识别模块中的所述二维关节点的像素坐标,以便于所述二维人体动作识别模块根据纠正像素坐标后的所述二维关节点重建所述目标人体的人体骨架,并将每一帧的所述人体骨架信息存储并发送至三维动作重建模块。The imaging point error correction module is used to correct the pixel coordinates of the two-dimensional joint points in the two-dimensional human motion recognition module when the monocular camera is offset, so as to facilitate the two-dimensional human motion The recognition module reconstructs the human skeleton of the target human body according to the two-dimensional joint points after correcting the pixel coordinates, and stores and sends the human skeleton information of each frame to the three-dimensional motion reconstruction module.

可选的,所述三维动作重建模块具体包括:Optionally, the three-dimensional motion reconstruction module specifically includes:

第一求解子模块,用于基于所述单目相机标定后的单目相机参数,求解与所述单目相机的焦点垂直投影于所述单目相机的成像平面上的投影成像点在预置三维坐标系中的第一真实三维坐标;The first solving sub-module is used to solve, based on the monocular camera parameters calibrated by the monocular camera, the projection imaging point perpendicular to the focus of the monocular camera projected on the imaging plane of the monocular camera at the preset value. the first real three-dimensional coordinate in the three-dimensional coordinate system;

第二求解子模块,用于基于所述第一真实三维坐标和所述单目相机的成像平面参数,求解所述第一真实三维坐标的旋转矩阵和平移向量;a second solving sub-module, configured to solve the rotation matrix and translation vector of the first real three-dimensional coordinates based on the first real three-dimensional coordinates and the imaging plane parameters of the monocular camera;

第三求解子模块,用于基于所述旋转矩阵和平移向量求解所述成像平面上的所述二维关节点在所述预置三维坐标系中的第二真实三维坐标;a third solving submodule, configured to solve the second real three-dimensional coordinates of the two-dimensional joint points on the imaging plane in the preset three-dimensional coordinate system based on the rotation matrix and the translation vector;

关节点重建子模块,用于将所述预置三维坐标系中,使得所有所述单目相机的光心坐标与对应的所述第二真实三维坐标直线取得最小距离的三维坐标点,作为所述二维关节点在所述预置三维坐标系中的真实三维关节点,重建出所述目标人体的三维动作。The joint point reconstruction sub-module is used to make the three-dimensional coordinate point at which the minimum distance is obtained between the optical center coordinates of all the monocular cameras and the corresponding second real three-dimensional coordinate line in the preset three-dimensional coordinate system, as the The real three-dimensional joint points of the two-dimensional joint points in the preset three-dimensional coordinate system are used to reconstruct the three-dimensional motion of the target human body.

可选的,所述关节点子模块具体用于:Optionally, the joint submodule is specifically used for:

基于超定方程组最小二乘法求解所有所述单目相机的光心坐标与对应的所述第二真实三维坐标所成的直线取得最小距离的三维坐标点,作为所述二维关节点在所述预置三维坐标系中的真实三维关节点,重建出所述目标人体的三维动作。Based on the least squares method of overdetermined equations, solve the straight line formed by the optical center coordinates of all the monocular cameras and the corresponding second real three-dimensional coordinates to obtain the three-dimensional coordinate point with the minimum distance, as the two-dimensional joint point in the The real three-dimensional joint points in the preset three-dimensional coordinate system are used to reconstruct the three-dimensional motion of the target human body.

可选的,所述二维人体动作识别模块具体用于:Optionally, the two-dimensional human motion recognition module is specifically used for:

基于预置卷积神经网络对所述动作图像序列中的人体关键肢体关节部位进行识别,提取所述动作图像序列的每一帧中属于同一目标人体的25个关节点,以便于重建所述目标人体的人体骨架,将每一帧的所述人体骨架信息存储并发送至三维动作重建模块。Based on the preset convolutional neural network, the key limb joint parts of the human body in the action image sequence are identified, and 25 joint points belonging to the same target human body in each frame of the action image sequence are extracted to facilitate the reconstruction of the target. The human body skeleton of the human body, the information of the human body skeleton of each frame is stored and sent to the three-dimensional motion reconstruction module.

本申请第二方面还提供了一种三维人体动作重建方法,包括:A second aspect of the present application also provides a three-dimensional human motion reconstruction method, including:

基于采集到的标定点数据进行单目相机标定,所述单目相机的数量至少两个;Perform monocular camera calibration based on the collected calibration point data, and the number of the monocular cameras is at least two;

获取至少两个所述单目相机拍摄的目标人体的动作图像序列;Acquiring at least two action image sequences of the target human body captured by the monocular camera;

识别所述动作图像序列中的人体关键肢体关节部位,提取所述动作图像序列的每一帧中属于同一目标人体的二维关节点,以便于重建所述目标人体的人体骨架,得到每一帧的所述人体骨架信息;Identify the key limb joint parts of the human body in the action image sequence, extract the two-dimensional joint points belonging to the same target human body in each frame of the action image sequence, so as to reconstruct the human skeleton of the target human body, and obtain each frame of the human skeleton information;

基于每一帧中的所述人体骨架信息还原所述目标人体在关节点在三维空间中的真实位置,重建出所述目标人体的三维动作。Based on the human skeleton information in each frame, the real position of the target human body in the joint point in the three-dimensional space is restored, and the three-dimensional action of the target human body is reconstructed.

可选的,所述基于每一帧中的所述人体骨架信息还原所述目标人体在关节点在三维空间中的真实位置,重建出所述目标人体的三维动作,具体包括:Optionally, restoring the real position of the target human body in the three-dimensional space of the joint points based on the human skeleton information in each frame, and reconstructing the three-dimensional action of the target human body, specifically includes:

基于所述单目相机标定后的单目相机参数,求解与所述单目相机的焦点垂直投影于所述单目相机的成像平面上的投影成像点在预置三维坐标系中的第一真实三维坐标;Based on the monocular camera parameters calibrated by the monocular camera, the first real value of the projected imaging point perpendicular to the focal point of the monocular camera on the imaging plane of the monocular camera in the preset three-dimensional coordinate system is obtained. three-dimensional coordinates;

基于所述第一真实三维坐标和所述单目相机的成像平面参数,求解所述第一真实三维坐标的旋转矩阵和平移向量;Based on the first real three-dimensional coordinates and the imaging plane parameters of the monocular camera, solve the rotation matrix and translation vector of the first real three-dimensional coordinates;

基于所述旋转矩阵和平移向量求解所述成像平面上的所述二维关节点在所述预置三维坐标系中的第二真实三维坐标;Solving the second real three-dimensional coordinates of the two-dimensional joint points on the imaging plane in the preset three-dimensional coordinate system based on the rotation matrix and the translation vector;

将所述预置三维坐标系中,使得所有所述单目相机的光心坐标与对应的所述第二真实三维坐标直线取得最小距离的三维坐标点,作为所述二维关节点在所述预置三维坐标系中的真实三维关节点,重建出所述目标人体的三维动作。In the preset three-dimensional coordinate system, the three-dimensional coordinate points that make the optical center coordinates of all the monocular cameras and the corresponding second real three-dimensional coordinate lines obtain the minimum distance are taken as the two-dimensional joint points in the The real three-dimensional joint points in the three-dimensional coordinate system are preset to reconstruct the three-dimensional motion of the target human body.

本申请第三方面提供了一种动作训练系统,包括第一方面所述的任一种三维人体动作重建系统,还包括动作评估模块;A third aspect of the present application provides an action training system, including any of the three-dimensional human action reconstruction systems described in the first aspect, and further including an action evaluation module;

所述动作评估模块,用于将从所述三维动作重建模块获取到的目标人体的三维动作与预置标准动作进行差异比对,输出与所述差异比对的结果对应的动作评估结果。The motion evaluation module is configured to perform a difference comparison between the three-dimensional motion of the target human body obtained from the three-dimensional motion reconstruction module and a preset standard motion, and output a motion evaluation result corresponding to the difference comparison result.

可选的,所述动作评估模块具体用于:Optionally, the action evaluation module is specifically used for:

将从所述三维动作重建模块获取到的目标人体的三维动作与预置标准动作进行关节点组合角度相似性判断、关节点组合运动轨迹的平均曲率对比和人体关节点组合运动量对比;The three-dimensional motion of the target human body obtained from the three-dimensional motion reconstruction module and the preset standard motion are judged on the similarity of the joint point combination angle, the average curvature comparison of the joint point combined motion trajectory, and the human body joint point combined motion amount comparison;

获取所述关节点组合角度相似性判断的结果对应的第一动作评估结果、所述关节点组合运动轨迹的平均曲率对比的结果对应的第二动作评估结果和所述人体关节点组合运动量对比的结果对应的第三动作评估结果;Obtain the first action evaluation result corresponding to the result of the joint point combination angle similarity judgment, the second action evaluation result corresponding to the result of the average curvature comparison of the joint point combination motion trajectory, and the human body joint point combination exercise amount comparison. The third action evaluation result corresponding to the result;

对所述第一动作评估结果、所述第二动作评估结果和所述第三动作评估结果进行加权处理,输出加权处理后得到的动作评估结果。Perform weighting processing on the first action evaluation result, the second action evaluation result and the third action evaluation result, and output the action evaluation result obtained after the weighting process.

可选的,还包括:音乐节奏契合度模块;Optionally, it also includes: a music rhythm fit module;

所述音乐节奏契合度模块,用于基于音频提取算法提取在播音乐的音乐特征,以节拍为单位,判断所述目标人体在连续帧的人体动作与预置标准动作在匹配节拍的系列动作的匹配程度,输出节拍匹配结果。The music rhythm fit module is used for extracting the music features of the music being broadcast based on the audio extraction algorithm, and in units of beats, it is judged that the human body movements of the target human body in continuous frames and the preset standard movements are in the series of movements that match the rhythm. Matching degree, output beat matching result.

本申请提供一种基于多目视觉的人体动作重建系统,包括:相机标定模块,用于根据采集到的标定点数据进行单目相机标定,单目相机的数量至少两个;动作采集模块,用于获取所有单目相机采集到的目标人体的动作图像序列,将动作图像序列发送至二维人体动作识别模块;二维人体动作识别模块,用于识别所述动作图像序列中的人体关键肢体关节部位,提取动作图像序列的每一帧中属于同一目标人体的二维关节点,以便于重建目标人体的人体骨架,将每一帧的人体骨架信息存储并发送至三维动作重建模块;三维动作重建模块,用于基于每一帧中的人体骨架信息还原目标人体的二维关节点在三维空间中的真实位置,重建出目标人体的三维动作。The present application provides a human motion reconstruction system based on multi-eye vision, including: a camera calibration module for performing monocular camera calibration according to the collected calibration point data, and the number of monocular cameras is at least two; In order to obtain the action image sequence of the target human body collected by all monocular cameras, the action image sequence is sent to the two-dimensional human action recognition module; the two-dimensional human action recognition module is used to identify the key human limb joints in the action image sequence Parts, extract the two-dimensional joint points belonging to the same target human body in each frame of the action image sequence, so as to reconstruct the human skeleton of the target human body, store and send the human skeleton information of each frame to the three-dimensional action reconstruction module; three-dimensional action reconstruction The module is used to restore the real position of the two-dimensional joint points of the target human body in the three-dimensional space based on the human skeleton information in each frame, and reconstruct the three-dimensional action of the target human body.

本申请提供的基于多目视觉的人体动作重建,通过校正后的基于至少两个单目相机捕获目标人体的动作图像,对于动作图像进行二维图像上的人体二维关节点识别,然后基于人体二维关节点重建出目标人体的三维动作,无需用户穿戴特制的光学捕捉服或者装配相应的传感器,解决了现有的深度摄像机人体三维动作识别方法,对应用场景要求较高,在复杂的应用场景下,识别准确率较低的技术问题。The human motion reconstruction based on multi-eye vision provided by the present application captures the motion image of the target human body based on at least two monocular cameras after correction, and performs two-dimensional joint point recognition of the human body on the two-dimensional image for the motion image, and then based on the human body The 2D joint points reconstruct the 3D action of the target human body, without the need for the user to wear a special optical capture suit or assemble the corresponding sensor, which solves the existing 3D action recognition method of the human body by the depth camera, which has high requirements for application scenarios and is used in complex applications. In scenarios, identify technical problems with low accuracy.

附图说明Description of drawings

为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据提供的附图获得其他的附图。In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the following briefly introduces the accompanying drawings required for the description of the embodiments or the prior art. Obviously, the drawings in the following description are only It is an embodiment of the present application. For those of ordinary skill in the art, other drawings can also be obtained according to the provided drawings without any creative effort.

图1为本申请实施例所提供的一种基于多目视觉的人体动作重建系统的一个实施例的结构示意图;FIG. 1 is a schematic structural diagram of an embodiment of a multi-vision-based human motion reconstruction system provided by an embodiment of the application;

图2为本申请实施例所提供的一种基于多目视觉的人体动作重建系统的另一个结构示意图;FIG. 2 is another schematic structural diagram of a multi-vision-based human motion reconstruction system provided by an embodiment of the present application;

图3为本申请实施例所提供的一种三维人体动作重建方法的一个实施例的流程示意图;FIG. 3 is a schematic flowchart of an embodiment of a three-dimensional human motion reconstruction method provided by an embodiment of the present application;

图4为本申请实施例所提供的一种三维人体动作重建方法的一个实施例的另一个流程示意图FIG. 4 is another schematic flowchart of an embodiment of a three-dimensional human motion reconstruction method provided by an embodiment of the present application

图5为本申请实施例中提供的一种动作训练系统的结构示意图;5 is a schematic structural diagram of an action training system provided in an embodiment of the application;

图6为本申请实施例中提供的成像点误差纠正模块的成像点误差纠正示意图;6 is a schematic diagram of an imaging point error correction of an imaging point error correction module provided in an embodiment of the present application;

图7为本申请实施例中提供的三个单目相机(图中的A、B、C)的三维空间理论布置图。FIG. 7 is a three-dimensional space theoretical layout diagram of three monocular cameras (A, B, and C in the figure) provided in an embodiment of the application.

具体实施方式Detailed ways

为使本申请实施例的目的、技术方案和优点更加清楚,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be described clearly and completely below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.

请参考图1,图1为本申请实施例所提供的一种基于多目视觉的人体动作重建系统的结构示意图,本申请实施例中提供的基于多目视觉的人体动作重建系统,包括:Please refer to FIG. 1. FIG. 1 is a schematic structural diagram of a multi-vision-based human motion reconstruction system provided by an embodiment of the present application. The multi-vision-based human motion reconstruction system provided in the embodiment of the present application includes:

相机标定模块101,用于根据采集到的标定点数据进行单目相机标定,单目相机的数量至少两个。The camera calibration module 101 is configured to perform monocular camera calibration according to the collected calibration point data, and the number of monocular cameras is at least two.

需要说明的是,本申请实施例中通过至少两个单目相机来获取目标人体的动作图像,在系统中用于获取目标人体动作图像的单目相机需要进行标定处理,将所有单目相机进行标定点的采集,可以是棋盘格标定点,也可以是二维码标定点,此处不对标定点的标定方法进行限定。标定点采集只需要在单目相机第一次被使用时采集即可,后续无需再重复进行标定点采集,将采集到的标定点数据存储到存储模块,以便后续再次调用。根据标定点数据对单目相机进行标定后,可得到单目相机的标定参数。It should be noted that, in the embodiment of the present application, at least two monocular cameras are used to obtain the action images of the target human body, and the monocular cameras used to obtain the action images of the target human body in the system need to be calibrated, and all monocular cameras are used for calibration. The collection of calibration points may be checkerboard calibration points or two-dimensional code calibration points, and the calibration method of the calibration points is not limited here. The calibration point collection only needs to be collected when the monocular camera is used for the first time, and there is no need to repeat the calibration point collection in the future. The collected calibration point data is stored in the storage module for subsequent recall. After calibrating the monocular camera according to the calibration point data, the calibration parameters of the monocular camera can be obtained.

本申请实施例中,采集标定点数据时每个单目相机采集的图像数目为25张标定点图像,能够为每个单目相机进行标定提高标定精度和稳定性。In the embodiment of the present application, when the calibration point data is collected, the number of images collected by each monocular camera is 25 calibration point images, which can improve calibration accuracy and stability for each monocular camera.

动作采集模块102,用于获取所有单目相机采集到的目标人体的动作图像序列,将动作图像序列发送至二维人体动作识别模块103。The motion acquisition module 102 is configured to acquire the motion image sequence of the target human body collected by all monocular cameras, and send the motion image sequence to the two-dimensional human motion recognition module 103 .

需要说明的是,通过标定好的单目相机来采集目标人体的动作图像序列,将动作图像序列发送至二维人体动作识别模块103进行二维图像处理。It should be noted that the action image sequence of the target human body is collected by the calibrated monocular camera, and the action image sequence is sent to the two-dimensional human action recognition module 103 for two-dimensional image processing.

二维人体动作识别模块103,用于识别动作图像序列中的人体关键肢体关节部位,提取动作图像序列的每一帧中属于同一目标人体的二维关节点,以便于重建目标人体的人体骨架,将每一帧的人体骨架信息存储并发送至三维动作重建模块104。The two-dimensional human action recognition module 103 is used to identify the key limb joint parts of the human body in the action image sequence, and extract the two-dimensional joint points belonging to the same target human body in each frame of the action image sequence, so as to reconstruct the human skeleton of the target human body, The human skeleton information of each frame is stored and sent to the three-dimensional motion reconstruction module 104 .

需要说明的是,将动作图像序列每一帧中属于同一个人的关节点提取出来构成一个完整的人体骨架,并将每一帧的骨架信息用json格式的文件缓存至数据库,进一步提供给三维动作重建模块104使用。本申请实施例中二维人体动作识别模块103采用开源的多人二维姿态神经网络openpose来实现二维人体动作识别来重建目标人体的人体骨架。It should be noted that the joint points belonging to the same person in each frame of the action image sequence are extracted to form a complete human skeleton, and the skeleton information of each frame is cached in a json format file to the database, and further provided to the 3D action Rebuild module 104 is used. In the embodiment of the present application, the two-dimensional human motion recognition module 103 uses the open-source multi-person two-dimensional pose neural network openpose to realize two-dimensional human motion recognition and reconstruct the human skeleton of the target human body.

三维动作重建模块104,用于基于每一帧中的人体骨架信息还原目标人体在关节点在三维空间中的真实位置,重建出目标人体的三维动作。The three-dimensional motion reconstruction module 104 is configured to restore the real position of the target human body in the three-dimensional space of the joint points based on the human skeleton information in each frame, and reconstruct the three-dimensional motion of the target human body.

需要说明的是,将校正后的单目相机拍摄到的目标人体的动作图像序列输入上述二维人体动作识别模块103得出以二维关节点信息表示的骨架信息,通过三维动作重建模块104将每一帧中的人体骨架信息还原目标人体在关节点在三维空间中的真实位置,重建出目标人体的三维动作。It should be noted that the motion image sequence of the target human body captured by the corrected monocular camera is input into the above-mentioned two-dimensional human motion recognition module 103 to obtain the skeleton information represented by the two-dimensional joint point information. The human skeleton information in each frame restores the real position of the target human body in the three-dimensional space of the joint points, and reconstructs the three-dimensional action of the target human body.

本申请实施例中提供的三维人体动作重建系统,通过校正后的基于至少两个单目相机捕获目标人体的动作图像,对于动作图像进行二维图像上的人体二维关节点识别,然后基于人体二维关节点重建出目标人体的三维动作,无需用户穿戴特制的光学捕捉服或者装配相应的传感器,解决了现有的深度摄像机人体三维动作识别方法,对应用场景要求较高,在复杂的应用场景下,识别准确率较低的技术问题。The three-dimensional human motion reconstruction system provided in the embodiment of the present application captures the corrected motion image of the target human body based on at least two monocular cameras, performs 2D human body joint point recognition on the two-dimensional image for the motion image, and The 2D joint points reconstruct the 3D action of the target human body, without the need for the user to wear a special optical capture suit or assemble the corresponding sensor, which solves the existing 3D action recognition method of the human body by the depth camera, which has high requirements for application scenarios and is used in complex applications. In scenarios, identify technical problems with low accuracy.

作为对本申请实施例中的基于多目视觉的人体动作重建系统的进一步改进,如图2所示,本申请实施例中的基于多目视觉的人体动作重建系统还包括:成像点误差纠正模块105;As a further improvement to the human motion reconstruction system based on polycular vision in the embodiment of the present application, as shown in FIG. 2 , the human motion reconstruction system based on polyocular vision in the embodiment of the present application further includes: an imaging point error correction module 105 ;

成像点误差纠正模块105,用于在单目相机发生偏移时,纠正二维人体动作识别模块中的二维关节点的像素坐标,以便于二维人体动作识别模块根据纠正像素坐标后的二维关节点重建目标人体的人体骨架,并将每一帧的人体骨架信息存储并发送至三维动作重建模块。The imaging point error correction module 105 is used to correct the pixel coordinates of the two-dimensional joint points in the two-dimensional human motion recognition module when the monocular camera is offset, so that the two-dimensional human motion recognition module can correct the pixel coordinates according to the two-dimensional The 3D joint points reconstruct the human skeleton of the target human body, and store and send the human skeleton information of each frame to the 3D motion reconstruction module.

需要说明的是,如图6所示,图6为成像点误差纠正模块的成像点误差纠正示意图,在实际应用场景中,单目相机的安装使用过程中,有很大的概率会出现单目相机抖动或偏移的情况,会造成成像点误差,从而影响识别结果,为解决此技术问题,本申请实施例中设置由成像点误差纠正模块105,在单目相机发生偏移时,纠正二维人体动作识别模块103中的二维关节点的像素坐标,以便于二维人体动作识别模块103根据纠正像素坐标后的二维关节点重建目标人体的人体骨架,并将每一帧的人体骨架信息存储并发送至三维动作重建模块104。以下结合图6对成像点误差纠正模块105的工作原理进行解释:It should be noted that, as shown in Figure 6, Figure 6 is a schematic diagram of the imaging point error correction of the imaging point error correction module. In the actual application scenario, during the installation and use of the monocular camera, there is a high probability that a monocular camera will appear. The camera shakes or shifts, which will cause imaging point errors, thereby affecting the recognition results. To solve this technical problem, an imaging point error correction module 105 is provided in this embodiment of the present application. The pixel coordinates of the two-dimensional joint points in the two-dimensional human motion recognition module 103, so that the two-dimensional human motion recognition module 103 reconstructs the human skeleton of the target human body according to the two-dimensional joint points after correcting the pixel coordinates, and converts the human skeleton of each frame The information is stored and sent to the 3D motion reconstruction module 104 . The working principle of the imaging point error correction module 105 is explained below with reference to FIG. 6 :

成像面EFHG为单目相机A的原成像面(即抖动或偏移前的正常成像面),成像面BCJK为单目相机A的抖动或偏移后的成像面,已知标志点D,通过opencv自带的图像识别算法能够识别得到标志点D在偏移前后的图像像素坐标分别为(ui,vi)与(ui',vi'),根据标定后的单目相机的焦距f进行以下的运算能够把误差成像点的像素坐标(uw',vw')修正为(uw,vw)。运算公式为:The imaging plane EFHG is the original imaging plane of the monocular camera A (that is, the normal imaging plane before shaking or offset), and the imaging plane BCJK is the imaging plane of the monocular camera A after shaking or offset. The image recognition algorithm that comes with opencv can recognize that the image pixel coordinates of the marker point D before and after the offset are (u i ,v i ) and ( u i ',vi '), respectively. According to the focal length of the calibrated monocular camera f performs the following operations to correct the pixel coordinates (u w ', v w ') of the error imaging point to (u w , v w ). The operation formula is:

其中,Δθ与分别为成像面偏移前后,标志点D的成像点在球坐标系上的仰角差与方向角差;θ'与分别为成像面发生偏移后,任意误差成像点在球坐标系上的仰角与方向角。where Δθ and are the elevation angle difference and the direction angle difference of the imaging point of the marker point D on the spherical coordinate system before and after the imaging plane is shifted; θ' and are the elevation angle and direction angle of any error imaging point on the spherical coordinate system after the imaging plane is shifted.

根据以上公式,能够对单目相机A内任意成像点的像素坐标进行误差纠正,避免了因单目相机发生抖动或偏移带来的成像点误差造成的识别结果变差的技术问题。According to the above formula, the error correction can be performed on the pixel coordinates of any imaging point in the monocular camera A, which avoids the technical problem of poor recognition results caused by imaging point errors caused by the shaking or offset of the monocular camera.

作为进一步的改进,本申请实施例中的二维人体动作识别模块103具体用于:As a further improvement, the two-dimensional human motion recognition module 103 in the embodiment of the present application is specifically used for:

基于预置卷积神经网络对所述动作图像序列中的人体关键肢体关节部位进行识别,提取动作图像序列的每一帧中属于同一目标人体的25个关节点,以便于重建目标人体的人体骨架,将每一帧的人体骨架信息存储并发送至三维动作重建模块104。Based on the preset convolutional neural network, the key limb joint parts of the human body in the action image sequence are identified, and 25 joint points belonging to the same target human body in each frame of the action image sequence are extracted, so as to reconstruct the human skeleton of the target human body , and the human skeleton information of each frame is stored and sent to the three-dimensional motion reconstruction module 104 .

需要说明的是,预置卷积神经网络采用非参数表征方法Part AffinityFields来将二维图像中的身体关键肢体关节部位与对应个体匹配并重建人体关节骨架。主要思想是利用贪心算法自下而上的解析步骤从而达到高准确率和实时性,身体部位定位和关联是在两个分支上同时进行的。可以识别二维图像人体的25个关节点的位置,具有较高的精度和实时速率。It should be noted that the preset convolutional neural network adopts the non-parametric representation method Part AffinityFields to match the key limb joint parts in the two-dimensional image with the corresponding individuals and reconstruct the human joint skeleton. The main idea is to use the bottom-up parsing steps of the greedy algorithm to achieve high accuracy and real-time performance. Body part localization and association are performed simultaneously on two branches. It can identify the positions of 25 joint points of the human body in two-dimensional images with high accuracy and real-time speed.

采用预置卷积神经网络识别二维人体姿态的方法流程为:将像素规模为w×h的二维人体图像视频序列输入,对图像视频序列每一帧中出现的个体进行关键点定位,然后利用双分支多阶段体系结构的CNN神经网络模型进行预测,CNN第一个分支预测一组人体身体部分的置信map S和一组二维肢体矢量场J(每一个身体部位有对应的置信map,每一个人体肢体对应一个矢量),通过贪心推理解析置信map和PAF后输出图像视频序列每一帧上每个人的2D骨架关键点信息,一共有25个人体关节点The process of using a preset convolutional neural network to identify a two-dimensional human pose is as follows: input a two-dimensional human image video sequence with a pixel scale of w×h, locate the key points of the individuals appearing in each frame of the image video sequence, and then Using a CNN neural network model with a two-branch multi-stage architecture for prediction, the first branch of CNN predicts a set of confidence maps S of human body parts and a set of two-dimensional limb vector fields J (each body part has a corresponding confidence map, Each human limb corresponds to a vector), and after parsing the confidence map and PAF through greedy reasoning, the 2D skeleton key point information of each person on each frame of the image video sequence is output, and there are a total of 25 human body joint points.

作为进一步的改进,本申请实施例中的三维动作重建模块104具体包括:As a further improvement, the three-dimensional motion reconstruction module 104 in this embodiment of the present application specifically includes:

第一求解子模块1041,用于基于单目相机标定后的单目相机参数,求解与单目相机的焦点垂直投影于单目相机的成像平面上的投影成像点在预置三维坐标系中的第一真实三维坐标。The first solving sub-module 1041 is used to solve the projection imaging point in the preset three-dimensional coordinate system that is perpendicular to the focus of the monocular camera on the imaging plane of the monocular camera based on the monocular camera parameters calibrated by the monocular camera. The first real three-dimensional coordinates.

第二求解子模块1042,用于基于第一真实三维坐标和单目相机的成像平面参数,求解第一真实三维坐标的旋转矩阵和平移向量。The second solving sub-module 1042 is configured to solve the rotation matrix and translation vector of the first real three-dimensional coordinates based on the first real three-dimensional coordinates and the imaging plane parameters of the monocular camera.

第三求解子模块1043,用于基于旋转矩阵和平移向量求解成像平面上的所述二维关节点在预置三维坐标系中的第二真实三维坐标。The third solving sub-module 1043 is configured to solve the second real three-dimensional coordinates of the two-dimensional joint points on the imaging plane in the preset three-dimensional coordinate system based on the rotation matrix and the translation vector.

关节点重建子模块1044,用于将预置三维坐标系中,使得所有单目相机的光心坐标与对应的第二真实三维坐标直线取得最小距离的三维坐标点,作为二维关节点在预置三维坐标系中的真实三维关节点,重建出目标人体的三维动作。The joint point reconstruction sub-module 1044 is used to set the three-dimensional coordinate point with the minimum distance between the optical center coordinates of all monocular cameras and the corresponding second real three-dimensional coordinate line in the preset three-dimensional coordinate system, as the two-dimensional joint point in the preset three-dimensional coordinate point. Set the real 3D joint points in the 3D coordinate system to reconstruct the 3D action of the target body.

具体的,关节点子模块1044具体用于:Specifically, the joint sub-module 1044 is specifically used for:

基于超定方程组最小二乘法求解所有单目相机的光心坐标与对应的第二真实三维坐标所成的直线取得最小距离的三维坐标点,作为二维关节点在预置三维坐标系中的真实三维关节点,重建出目标人体的三维动作。Based on the least squares method of overdetermined equations, solve the straight line formed by the optical center coordinates of all monocular cameras and the corresponding second real three-dimensional coordinates to obtain the three-dimensional coordinate point with the minimum distance, as the two-dimensional joint point in the preset three-dimensional coordinate system. Real 3D joint points to reconstruct the 3D action of the target body.

需要说明的是,图7为本申请实施例中提供的三个单目相机(图中的A、B、C)的三维空间理论布置图,为简化说明,本申请实施例中仅以A、B两个单目相机作为研究对象,通过单目相机标定,可以知道A、B两个单目相机的焦距f(厘米为单位),以及两个焦点(I、N)的二维图像坐标(以像素为单位),A、B两个单目相机的焦点I、N的坐标分别为I(ui,vi),N(un,vn)。It should be noted that FIG. 7 is a three-dimensional space theoretical layout diagram of three monocular cameras (A, B, and C in the figure) provided in the embodiment of the application. To simplify the description, only A, The two monocular cameras B are used as the research objects. Through the calibration of the monocular cameras, the focal length f (in centimeters) of the two monocular cameras A and B, and the two-dimensional image coordinates of the two focal points (I, N) can be known ( In pixels), the coordinates of the focal points I and N of the two monocular cameras A and B are I(u i , v i ) and N(u n , v n ), respectively.

以单目相机A为例,求解焦点I的真实三维坐标为(xi,yi,zi),D为A点到焦点I的延长线与B点到焦点N的延长线的交点,那么A点到D点的距离可表示为:Taking the monocular camera A as an example, the real three-dimensional coordinates of the solution focus I are (x i , y i , z i ), and D is the intersection of the extension line from point A to focus I and the extension line from point B to focus N, then The distance from point A to point D can be expressed as:

A点与D点成的向量与A点与I点成的向量的关系为:A vector of points A and D vector with points A and I The relationship is:

上式可变形为:The above formula can be transformed into:

由此可以求得点I的真实三维坐标(xi,yi,zi)以及根据以上关系可以相应求得另外两个标志点在A成像面EFHG上的成像点E',F'的真实三维坐标(xe',ye',ze')与(xf',yf',zf'),同理也可求得单目相机B中的焦点N的真实三维坐标(xn,yn,zn)。From this, the real three-dimensional coordinates (x i , y i , z i ) of the point I can be obtained, and the real three-dimensional coordinates of the imaging points E' and F' of the other two marker points on the A imaging plane EFHG can be obtained correspondingly according to the above relationship. Coordinates (x e ', y e ', z e ') and (x f ', y f ', z f '), in the same way, the real three-dimensional coordinates of the focus N in the monocular camera B can be obtained (x n ,y n ,z n ).

求解单目相机的旋转矩阵a和平移向量t,根据以上确定的I点坐标、A点坐标、E'点坐标和F'点坐标,可以求得旋转矩阵a与平移向量T,可表示为:To solve the rotation matrix a and translation vector t of the monocular camera, according to the coordinates of point I, point A, point E' and point F' determined above, the rotation matrix a and translation vector T can be obtained, which can be expressed as:

旋转矩阵a和平移向量t分别为:The rotation matrix a and the translation vector t are respectively:

求解A、B成像面上的成像点W、V的真实三维坐标:Solve the real three-dimensional coordinates of the imaging points W and V on the A and B imaging surfaces:

代入点W的单目相机坐标系坐标(uw,vw,f),通过以下等式可求得点W的真实三维坐标系下的坐标(xw,yw,zw):Substitute the coordinates (u w , v w , f) of the monocular camera coordinate system of the point W, and the coordinates (x w , y w , z w ) in the real three-dimensional coordinate system of the point W can be obtained by the following equation:

同理,对于B单目相机也可以求得相应的转换矩阵B,以及得到点V的真实三维坐标(xv,yv,zv)。求得两个单目相机各自的旋转矩阵a与平移向量t后,空间中任意的一点在对应的成像面上的成像点的单目相机坐标中的像素坐标均可以转换为真实三维坐标。Similarly, for the B monocular camera, the corresponding transformation matrix B can also be obtained, and the real three-dimensional coordinates (x v , y v , z v ) of the point V can be obtained. After obtaining the respective rotation matrix a and translation vector t of the two monocular cameras, the pixel coordinates of the monocular camera coordinates of the imaging point of any point in the space on the corresponding imaging plane can be converted into real three-dimensional coordinates.

以A,B两摄像机为研究对象,通过直线AT,BT相交于空间中的一点T,通过以下的等式可以求解空间中任意一点T的真实三维坐标(x,y,z):Taking the two cameras A and B as the research object, the straight lines AT and BT intersect at a point T in the space, and the real three-dimensional coordinates (x, y, z) of any point T in the space can be solved by the following equation:

已知两个单目相机能够求得空间中的一点的三维坐标。但在应该过程中由于单目相机制作的问题,无法保证BT于AT交于点T。为了减小误差,可以采取增加单目相机的方法提高求解的准确性。空间中多条不相交直线,可以运用超定方程组最小二乘法求解各直线之间的最小二乘解,该点距离各条直线的距离之和取得最小值对应的三维坐标点,可作为二维关节点在预置三维坐标系中的真实三维关节点,重建出目标人体的三维动作。It is known that two monocular cameras can obtain the three-dimensional coordinates of a point in space. However, due to the production problem of the monocular camera, it cannot be guaranteed that BT and AT intersect at point T in the process. In order to reduce the error, the method of adding a monocular camera can be taken to improve the accuracy of the solution. There are many disjoint straight lines in the space, and the least squares method of the overdetermined equation system can be used to solve the least squares solution between the straight lines. The real 3D joint points of the 3D joint points in the preset 3D coordinate system are used to reconstruct the 3D movements of the target human body.

为了便于理解,请参阅图3,本申请中还提供了一种三维人体动作重建方法的实施例,包括:For ease of understanding, please refer to FIG. 3 , an embodiment of a three-dimensional human motion reconstruction method is also provided in this application, including:

步骤S1、基于采集到的标定点数据进行单目相机标定,单目相机的数量至少两个。Step S1 , perform monocular camera calibration based on the collected calibration point data, and the number of monocular cameras is at least two.

步骤S2、获取至少两个单目相机拍摄的目标人体的动作图像序列。Step S2, acquiring a sequence of motion images of the target human body captured by at least two monocular cameras.

步骤S3、识别动作图像序列中的人体关键肢体关节部位,提取动作图像序列的每一帧中属于同一目标人体的二维关节点,以便于重建目标人体的人体骨架,得到每一帧的人体骨架信息。Step S3: Identify the key limb joints of the human body in the action image sequence, and extract the two-dimensional joint points belonging to the same target human body in each frame of the action image sequence, so as to reconstruct the human skeleton of the target human body and obtain the human skeleton of each frame. information.

步骤S4、基于每一帧中的人体骨架信息还原目标人体在关节点在三维空间中的真实位置,重建出目标人体的三维动作。Step S4, based on the human skeleton information in each frame, restore the real position of the target human body at the joint point in the three-dimensional space, and reconstruct the three-dimensional action of the target human body.

作为进一步的改进,请参与图4,步骤S4具体包括以下步骤:As a further improvement, please refer to Figure 4. Step S4 specifically includes the following steps:

步骤S41、基于单目相机标定后的单目相机参数,求解与单目相机的焦点垂直投影于单目相机的成像平面上的投影成像点在预置三维坐标系中的第一真实三维坐标。Step S41 , based on the monocular camera parameters calibrated by the monocular camera, obtain the first real three-dimensional coordinates in the preset three-dimensional coordinate system of the projected imaging point perpendicular to the focus of the monocular camera on the imaging plane of the monocular camera.

步骤S42、基于第一真实三维坐标和单目相机的成像平面参数,求解第一真实三维坐标的旋转矩阵和平移向量。Step S42 , based on the first real three-dimensional coordinates and the imaging plane parameters of the monocular camera, solve the rotation matrix and translation vector of the first real three-dimensional coordinates.

步骤S43、基于旋转矩阵和平移向量求解成像平面上的二维关节点在预置三维坐标系中的第二真实三维坐标。Step S43 , solving the second real three-dimensional coordinates of the two-dimensional joint points on the imaging plane in the preset three-dimensional coordinate system based on the rotation matrix and the translation vector.

步骤S44、将预置三维坐标系中,使得所有单目相机的光心坐标与对应的第二真实三维坐标直线取得最小距离的三维坐标点,作为二维关节点在预置三维坐标系中的真实三维关节点,重建出目标人体的三维动作。Step S44, in the preset three-dimensional coordinate system, the optical center coordinates of all monocular cameras and the corresponding second real three-dimensional coordinate line obtain the three-dimensional coordinate point of the minimum distance, as the two-dimensional joint point in the preset three-dimensional coordinate system. Real 3D joint points to reconstruct the 3D action of the target body.

为了便于理解,请参阅图3,本申请中提供了一种动作训练系统的实施例,包括前述实施例中的相机标定模块101、动作采集模块102、二维人体动作识别模块103、三维动作重建模块104和成像点误差纠正模块105,还包括动作评估模块106;For ease of understanding, please refer to FIG. 3 , an embodiment of an action training system is provided in this application, including the camera calibration module 101 , the action acquisition module 102 , the two-dimensional human action recognition module 103 , the three-dimensional action reconstruction in the foregoing embodiment module 104 and imaging point error correction module 105, and also includes an action evaluation module 106;

动作评估模块106,用于将从三维动作重建模块获取到的目标人体的三维动作与预置标准动作进行差异比对,输出与差异比对的结果对应的动作评估结果。The action evaluation module 106 is configured to compare the difference between the three-dimensional action of the target human body obtained from the three-dimensional action reconstruction module and the preset standard action, and output an action evaluation result corresponding to the difference comparison result.

具体的,动作评估模块106用于:Specifically, the action evaluation module 106 is used for:

将从三维动作重建模块获取到的目标人体的三维动作与预置标准动作进行关节点组合角度相似性判断、关节点组合运动轨迹的平均曲率对比和人体关节点组合运动量对比;The three-dimensional motion of the target body obtained from the three-dimensional motion reconstruction module and the preset standard motion are used to judge the similarity of the joint point combination angle, the average curvature of the joint point combined motion trajectory and the human body joint point combined motion amount comparison;

获取关节点组合角度相似性判断的结果对应的第一动作评估结果、关节点组合运动轨迹的平均曲率对比的结果对应的第二动作评估结果和人体关节点组合运动量对比的结果对应的第三动作评估结果;Obtain the first action evaluation result corresponding to the result of the joint point combination angle similarity judgment, the second action evaluation result corresponding to the result of the average curvature comparison of the joint point combination motion trajectory and the third action corresponding to the human body joint point combination motion amount comparison result evaluation result;

对第一动作评估结果、所述第二动作评估结果和第三动作评估结果进行加权处理,输出加权处理后得到的动作评估结果。Perform weighting processing on the first action evaluation result, the second action evaluation result and the third action evaluation result, and output the action evaluation result obtained after the weighting process.

需要说明的是,在得到用户的人体三维舞蹈动作后,对其动作与标准动作进行差异评估,差异评估的依据分为以下几个部分:It should be noted that after obtaining the three-dimensional dance movements of the user's human body, the differences between the movements and the standard movements are evaluated. The basis for the difference evaluation is divided into the following parts:

(1)关节点组合角度相似性判断。角度相似性分析的评估方法通过使用欧几里德点积公式(A,B为三个关节点构成的两向量,即三个关节点组合成一组关节点组合)可以导出对应关节角度的余弦后再转化为角度θ,方便使用者更直观地看出差异 (1) Judgment of the similarity of the joint point combination angle. The evaluation method of angle similarity analysis can derive the cosine of the corresponding joint angle by using the Euclidean dot product formula (A and B are two vectors composed of three joint points, that is, three joint points are combined into a group of joint point combinations). Then convert it into an angle θ, which is convenient for users to see the difference more intuitively

而一组连续动作中以一组关节点组合的平均差异可以表示为:And the average difference between a set of joint points combined in a set of continuous actions It can be expressed as:

与标准动作数据进行对比,一般对应关节角度在小于10°时可视为良好,大于10°小于20°可视为一般,有些许误差,超过20°判别为动作错误。Compared with the standard motion data, generally the corresponding joint angle can be regarded as good when it is less than 10°, and it can be regarded as normal when it is greater than 10° and less than 20°.

(2)关节点组合运动轨迹的平均曲率对比。人体动作是动态变化的运动,关节的整体移动程度会影响舞蹈的美观性。通过对人体各部位关节点组合运动轨迹的平均曲率进行计算并和标准动作关节点组合运动轨迹的平均曲率对比,判断运动轨迹的相似性。首先对关节点运动轨迹拟合成一条曲线,定义两帧之间关节点移动的空间距离为Δs,对应转过的角度为Δα,表示各个关节点运动的平均曲率,而关节点组合平均曲率可以表示为(2) The average curvature comparison of the combined motion trajectories of joint points. Human movements are dynamically changing movements, and the overall degree of movement of the joints will affect the aesthetics of the dance. The similarity of the motion trajectories is judged by calculating the average curvature of the combined motion trajectories of the joint points of each part of the human body and comparing it with the average curvature of the combined motion trajectories of the standard action joint points. First, fit the motion trajectory of the joint point into a curve, define the spatial distance of the joint point movement between the two frames as Δs, and the corresponding rotated angle as Δα, Represents the average curvature of the motion of each joint point, and the joint point combined average curvature It can be expressed as

通过判断的大小可以比较两者对应关节点组合运动轨迹的相似度。by judgment The size of the two can compare the similarity of the combined motion trajectory of the corresponding joint points of the two.

(3)人体关节点组合运动量对比。目标人体动作(比如舞蹈动作)是人体整体的运动,除了单关节情况评估外,对整个人体的摆动幅度进行判断是有必要的。由于各人体有所不同,为统一化标准,先采取用人体两肩的距离对人体各节点进行归一化处理,之后对归一化后的关节前后移动量相加得出人体在某一时刻的人体运动量后再与标准动作对比。(3) Comparison of combined motion of human joint points. The target human movement (such as dance movement) is the movement of the whole human body. In addition to the evaluation of the single joint situation, it is necessary to judge the swing range of the whole human body. Since each human body is different, in order to unify the standard, first normalize each node of the human body with the distance between the two shoulders of the human body, and then add the normalized forward and backward movement of the joints to obtain the human body at a certain moment. The amount of human exercise and then compared with the standard action.

①因为人体存在不同程度的差异,需采用人体双肩的三维空间坐标距离对人体各关节点坐标进行归一化数据处理①Because there are different degrees of differences in the human body, it is necessary to use the three-dimensional space coordinate distance of the human body's shoulders to normalize the coordinates of each joint point of the human body for data processing

x*=(x-xmin)/(xmax-xmin)x*=(xx min )/(x max -x min )

y*=(y-ymin)/(ymax-ymin)y*=(yy min )/(y max -y min )

z*=(z-zmin)/(zmax-zmin)z*=(zz min )/(z max -z min )

②归一化后,人体在某一对应时间间隔计算间隔前后各关节点的归一化后相对坐标位移量的总和为Wp,以关节点构成的关节点组合的总位移量W作为该关节点组合的运动量② After normalization, the sum of the normalized relative coordinate displacement of each joint point before and after a certain corresponding time interval calculation interval is Wp, and the total displacement W of the joint point combination composed of joint points is used as the joint point. Combined amount of exercise

③把人体在某一时刻的人体运动量后再与标准动作对比,通过ΔW判断两者差异的大小。③ Compare the amount of human body movement at a certain moment with the standard action, and judge the difference between the two by ΔW.

将三部分得到的数值进行加权处理,但同时每个身体部位决定的质量水平取决于运动的难度。要设置适合质量等级结果的百分比范围,得出最后的舞蹈动作评分,并将上述三部分数据通过中央处理器在显示屏可视化一并提供给用户。The values obtained from the three parts are weighted, but at the same time the quality level determined by each body part depends on the difficulty of the exercise. To set the percentage range suitable for the quality level results, get the final dance movement score, and provide the above three parts of data to the user through the central processing unit and visualized on the display screen.

作为进一步的改进,本申请实施例中的动作训练系统还包括:音乐节奏契合度模块107;As a further improvement, the action training system in the embodiment of the present application further includes: a music rhythm fit module 107;

音乐节奏契合度模块107,用于基于音频提取算法提取在播音乐的音乐特征,以节拍为单位,判断目标人体在连续帧的人体动作与预置标准动作在匹配节拍的系列动作的匹配程度,输出节拍匹配结果。The music rhythm fit module 107 is used for extracting the music features of the music being broadcast based on the audio extraction algorithm, and using the beat as a unit, to determine the matching degree of the human body movements of the target human body in continuous frames and the preset standard movements in the series of movements matching the rhythm, Output beat matching results.

需要说明的是,利用音频提取算法提取舞曲的音乐特征,以节拍为单位,判断用户在连续帧的动作是否与标准动作在该节拍的系列动作一致,从而判断是否合拍。如发生抢拍或慢拍情况下,系统将前半部分合拍评分记录后重置节拍对齐,以配合用户的实际动作,为上述动作评估模块106提供准确的动作数据对比。通过动作评估模块106和音乐节奏契合度模块107结合生成各节拍评分与训练报告直观地展示给用户。It should be noted that an audio extraction algorithm is used to extract the musical features of the dance music, and the beat is used as the unit to determine whether the user's actions in consecutive frames are consistent with the series of actions of the standard action in the beat, so as to determine whether it is in tune. In the event of a snap shot or a slow shot, the system records the first half of the in-beat score and then resets the beat alignment to match the actual motion of the user and provide the motion evaluation module 106 with accurate motion data comparison. Through the combination of the action evaluation module 106 and the music rhythm fit module 107, each beat score and training report are generated and visually displayed to the user.

还可以设置数据存储模块109,在数据存储模块109内,用户可选择将评分与训练报告以及训练录制与对比视频数据保存至硬盘,或部分通过网络上传至云平台以供推荐算法匹配更合适用户个人的舞种与训练模式。同时数据存储模块109还存储着相机参数,用于人体三维动作重建。A data storage module 109 can also be set. In the data storage module 109, the user can choose to save the score and training report and training recording and comparison video data to the hard disk, or partially upload it to the cloud platform through the network for the recommendation algorithm to match more suitable users. Individual dance styles and training patterns. At the same time, the data storage module 109 also stores camera parameters, which are used for three-dimensional motion reconstruction of the human body.

作为进一步的改进,本申请实施例中的动作训练系统还包括:健康守护模块108;As a further improvement, the action training system in the embodiment of the present application further includes: a health guard module 108;

健康守护模块108,用于根据目标人体的用户信息,通过云平台算法计算用户的身体素质程度、合适舞种和当前训练运动量,根据计算结果个性化高质量推荐用户最适合的动作集,避免用户训练强度过弱或过高的动作,同时记录用户每次的训练时间与训练强度,通过语音播报和/或显示弹窗的方式提示用户进行休息。The health guard module 108 is used to calculate the user's physical fitness level, suitable dance style and current training exercise volume through the cloud platform algorithm according to the user information of the target human body, and recommend the most suitable action set for the user according to the calculation result with high quality, so as to avoid the user The training intensity is too weak or too high, and the user's training time and training intensity are recorded at the same time, and the user is prompted to take a break through voice broadcast and/or display of a pop-up window.

需要说明的是,还可以设置模式切换模块110:模块提供舞蹈训练的模式选择。该模块下可选择三种功能,分别为学习模块、评分模式、特训模式。其中,学习模块、评分模式可以多人同时进行,特训模式默认为单人模式It should be noted that a mode switching module 110 can also be set: the module provides mode selection for dance training. Three functions can be selected under this module, namely learning module, scoring mode and special training mode. Among them, the learning module and scoring mode can be performed by multiple people at the same time, and the special training mode is single-player mode by default.

1)学习模式可从云平台选择下载由专业舞蹈人士预先录制完毕的标准舞蹈视频进行模仿学习,且标准视频会提供正面版本以及镜面反转版本可供用户选择显示至显示器,同时将系统会根据标准动作或音乐节奏的差异性,划分学习步骤并将重难点动作标识出来给用户注意以强化学习。1) In the learning mode, you can choose to download the standard dance video pre-recorded by professional dancers from the cloud platform for imitation learning, and the standard video will provide a positive version and a mirror inversion version for users to choose to display on the monitor, and the system will Differences in standard movements or music rhythms, divide learning steps and identify difficult and difficult movements for users' attention to strengthen learning.

2)评分模式则会调用舞蹈评估模块,包括对用户舞蹈动作与音乐节拍契合度进行评估,通过文字特效的方式可视化直观显示给用户。2) The scoring mode will call the dance evaluation module, including evaluating the fit between the user's dance movements and the music rhythm, and visually displaying it to the user through text special effects.

3)特训模块会由云平台根据用户的系列课程学习评分存储数据判断用户舞蹈缺陷,并根据该缺陷提供相应的局部特训。3) In the special training module, the cloud platform will judge the user's dance defects according to the user's series of course learning scores and store data, and provide corresponding local special training according to the defects.

本申请实施例中提供的动作训练系统,基于双目立体视觉原理,单目相机无需添加其余昂贵的传感器,只需经过相机标定后即可达到较高精度,基本接近Kinect深度摄像机的识别精度,实现成本较低,同时不用担心相机抖动、复杂应用场景的问题,用户可以根据需求动态改变摄像机的位置以获得更好的人体观测角度,基于视差三角测量的双目成像即使在杂乱的房间里也不会影响其精度,适应性较强。同时舞蹈学习者无需穿戴任何用于采集人体动作的传感设备或者特制服装,更方便舞蹈学习者舒展舞姿,减少负重。同时也可拓展适应多人舞蹈动作的识别。The motion training system provided in the embodiment of the present application is based on the principle of binocular stereo vision. The monocular camera does not need to add other expensive sensors, and only needs to be calibrated to achieve high accuracy, which is basically close to the recognition accuracy of the Kinect depth camera. The implementation cost is low, and there is no need to worry about camera shake and complex application scenarios. Users can dynamically change the position of the camera according to their needs to obtain a better human observation angle. The binocular imaging based on parallax triangulation can be used even in a cluttered room. It will not affect its accuracy and has strong adaptability. At the same time, dance learners do not need to wear any sensing equipment or special clothing for collecting human movements, which is more convenient for dance learners to stretch their dance postures and reduce weight. At the same time, it can also be expanded to adapt to the recognition of multi-person dance movements.

以上所述,以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。As mentioned above, the above embodiments are only used to illustrate the technical solutions of the present application, but not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand: The technical solutions described in the embodiments are modified, or some technical features thereof are equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions in the embodiments of the present application.

Claims (10)

1.一种基于多目视觉的人体动作重建系统,其特征在于,包括:1. a human action reconstruction system based on multi-eye vision, is characterized in that, comprises: 相机标定模块,用于根据采集到的标定点数据进行单目相机标定,所述单目相机的数量至少两个;a camera calibration module, configured to perform monocular camera calibration according to the collected calibration point data, and the number of the monocular cameras is at least two; 动作采集模块,用于获取所有所述单目相机采集到的目标人体的动作图像序列,将所述动作图像序列发送至二维人体动作识别模块;a motion acquisition module, configured to acquire all motion image sequences of the target human body collected by the monocular camera, and send the motion image sequences to the two-dimensional human motion recognition module; 所述二维人体动作识别模块,用于识别所述动作图像序列中的人体关键肢体关节部位,提取所述动作图像序列的每一帧中属于同一目标人体的二维关节点,以便于重建所述目标人体的人体骨架,将每一帧的所述人体骨架信息存储并发送至三维动作重建模块;The two-dimensional human action recognition module is used to identify the key limb joint parts of the human body in the action image sequence, and extract the two-dimensional joint points belonging to the same target human body in each frame of the action image sequence, so as to facilitate the reconstruction of all body parts. The human skeleton of the target human body is stored, and the human skeleton information of each frame is stored and sent to the three-dimensional motion reconstruction module; 所述三维动作重建模块,用于基于每一帧中的所述人体骨架信息还原所述目标人体的所述二维关节点在三维空间中的真实位置,重建出所述目标人体的三维动作。The three-dimensional motion reconstruction module is configured to restore the real positions of the two-dimensional joint points of the target human body in the three-dimensional space based on the human skeleton information in each frame, and reconstruct the three-dimensional motion of the target human body. 2.根据权利要求1所述的基于多目视觉的人体动作重建系统,其特征在于,还包括:成像点误差纠正模块;2. The multi-vision based human motion reconstruction system according to claim 1, further comprising: an imaging point error correction module; 所述成像点误差纠正模块,用于在所述单目相机发生偏移时,纠正所述二维人体动作识别模块中的所述二维关节点的像素坐标,以便于所述二维人体动作识别模块根据纠正像素坐标后的所述二维关节点重建所述目标人体的人体骨架,并将每一帧的所述人体骨架信息存储并发送至三维动作重建模块。The imaging point error correction module is used to correct the pixel coordinates of the two-dimensional joint points in the two-dimensional human motion recognition module when the monocular camera is offset, so as to facilitate the two-dimensional human motion The recognition module reconstructs the human skeleton of the target human body according to the two-dimensional joint points after correcting the pixel coordinates, and stores and sends the human skeleton information of each frame to the three-dimensional motion reconstruction module. 3.根据权利要求1所述的基于多目视觉的人体动作重建系统,其特征在于,所述三维动作重建模块具体包括:3. The multi-vision based human motion reconstruction system according to claim 1, wherein the three-dimensional motion reconstruction module specifically comprises: 第一求解子模块,用于基于所述单目相机标定后的单目相机参数,求解与所述单目相机的焦点垂直投影于所述单目相机的成像平面上的投影成像点在预置三维坐标系中的第一真实三维坐标;The first solving sub-module is used to solve, based on the monocular camera parameters calibrated by the monocular camera, the projection imaging point perpendicular to the focus of the monocular camera projected on the imaging plane of the monocular camera at the preset value. the first real three-dimensional coordinate in the three-dimensional coordinate system; 第二求解子模块,用于基于所述第一真实三维坐标和所述单目相机的成像平面参数,求解所述第一真实三维坐标的旋转矩阵和平移向量;a second solving sub-module, configured to solve the rotation matrix and translation vector of the first real three-dimensional coordinates based on the first real three-dimensional coordinates and the imaging plane parameters of the monocular camera; 第三求解子模块,用于基于所述旋转矩阵和平移向量求解所述成像平面上的所述二维关节点在所述预置三维坐标系中的第二真实三维坐标;a third solving submodule, configured to solve the second real three-dimensional coordinates of the two-dimensional joint points on the imaging plane in the preset three-dimensional coordinate system based on the rotation matrix and the translation vector; 关节点重建子模块,用于将所述预置三维坐标系中,使得所有所述单目相机的光心坐标与对应的所述第二真实三维坐标直线取得最小距离的三维坐标点,作为所述二维关节点在所述预置三维坐标系中的真实三维关节点,重建出所述目标人体的三维动作。The joint point reconstruction sub-module is used to set the three-dimensional coordinate point at which the minimum distance is obtained between the optical center coordinates of all the monocular cameras and the corresponding second real three-dimensional coordinate line in the preset three-dimensional coordinate system, as the The real three-dimensional joint points of the two-dimensional joint points in the preset three-dimensional coordinate system are used to reconstruct the three-dimensional motion of the target human body. 4.根据权利要求3所述的基于多目视觉的人体动作重建系统,其特征在于,所述关节点子模块具体用于:4. The human motion reconstruction system based on multi-eye vision according to claim 3, wherein the joint sub-module is specifically used for: 基于超定方程组最小二乘法求解所有所述单目相机的光心坐标与对应的所述第二真实三维坐标所成的直线取得最小距离的三维坐标点,作为所述二维关节点在所述预置三维坐标系中的真实三维关节点,重建出所述目标人体的三维动作。Based on the least squares method of overdetermined equations, solve the straight line formed by the optical center coordinates of all the monocular cameras and the corresponding second real three-dimensional coordinates to obtain the three-dimensional coordinate point with the minimum distance, as the two-dimensional joint point in the The real three-dimensional joint points in the preset three-dimensional coordinate system are used to reconstruct the three-dimensional motion of the target human body. 5.根据权利要求1所述的基于多目视觉的人体动作重建系统,其特征在于,所述二维人体动作识别模块具体用于:5. The human motion reconstruction system based on multi-eye vision according to claim 1, wherein the two-dimensional human motion recognition module is specifically used for: 基于预置卷积神经网络对所述动作图像序列中的人体关键肢体关节部位进行识别,提取所述动作图像序列的每一帧中属于同一目标人体的25个关节点,以便于重建所述目标人体的人体骨架,将每一帧的所述人体骨架信息存储并发送至三维动作重建模块。Based on the preset convolutional neural network, the key limb joint parts of the human body in the action image sequence are identified, and 25 joint points belonging to the same target human body in each frame of the action image sequence are extracted to facilitate the reconstruction of the target. The human body skeleton of the human body, the information of the human body skeleton of each frame is stored and sent to the three-dimensional motion reconstruction module. 6.一种基于多目视觉的人体动作重建方法,其特征在于,包括:6. A human action reconstruction method based on multi-eye vision, is characterized in that, comprises: 基于采集到的标定点数据进行单目相机标定,所述单目相机的数量至少两个;Perform monocular camera calibration based on the collected calibration point data, and the number of the monocular cameras is at least two; 获取至少两个所述单目相机拍摄的目标人体的动作图像序列;Acquiring at least two action image sequences of the target human body captured by the monocular camera; 识别所述动作图像序列中的人体关键肢体关节部位,提取所述动作图像序列的每一帧中属于同一目标人体的二维关节点,以便于重建所述目标人体的人体骨架,得到每一帧的所述人体骨架信息;Identify the key limb joint parts of the human body in the action image sequence, extract the two-dimensional joint points belonging to the same target human body in each frame of the action image sequence, so as to reconstruct the human skeleton of the target human body, and obtain each frame of the human skeleton information; 基于每一帧中的所述人体骨架信息还原所述目标人体在关节点在三维空间中的真实位置,重建出所述目标人体的三维动作。Based on the human skeleton information in each frame, the real position of the target human body in the joint point in the three-dimensional space is restored, and the three-dimensional action of the target human body is reconstructed. 7.根据权利要求6所述的基于多目视觉的人体动作重建方法,其特征在于,所述基于每一帧中的所述人体骨架信息还原所述目标人体在关节点在三维空间中的真实位置,重建出所述目标人体的三维动作,具体包括:7 . The method for reconstructing human motions based on multi-eye vision according to claim 6 , wherein the restoration of the reality of the target human body at joint points in three-dimensional space based on the human skeleton information in each frame. 8 . position to reconstruct the three-dimensional motion of the target human body, including: 基于所述单目相机标定后的单目相机参数,求解与所述单目相机的焦点垂直投影于所述单目相机的成像平面上的投影成像点在预置三维坐标系中的第一真实三维坐标;Based on the monocular camera parameters calibrated by the monocular camera, the first real value of the projected imaging point perpendicular to the focal point of the monocular camera on the imaging plane of the monocular camera in the preset three-dimensional coordinate system is obtained. three-dimensional coordinates; 基于所述第一真实三维坐标和所述单目相机的成像平面参数,求解所述第一真实三维坐标的旋转矩阵和平移向量;Based on the first real three-dimensional coordinates and the imaging plane parameters of the monocular camera, solve the rotation matrix and translation vector of the first real three-dimensional coordinates; 基于所述旋转矩阵和平移向量求解所述成像平面上的所述二维关节点在所述预置三维坐标系中的第二真实三维坐标;Solving the second real three-dimensional coordinates of the two-dimensional joint points on the imaging plane in the preset three-dimensional coordinate system based on the rotation matrix and the translation vector; 将所述预置三维坐标系中,使得所有所述单目相机的光心坐标与对应的所述第二真实三维坐标直线取得最小距离的三维坐标点,作为所述二维关节点在所述预置三维坐标系中的真实三维关节点,重建出所述目标人体的三维动作。In the preset three-dimensional coordinate system, the three-dimensional coordinate points that make the optical center coordinates of all the monocular cameras and the corresponding second real three-dimensional coordinate lines obtain the minimum distance are taken as the two-dimensional joint points in the The real three-dimensional joint points in the three-dimensional coordinate system are preset to reconstruct the three-dimensional motion of the target human body. 8.一种动作训练系统,其特征在于,包括权利要求1-5中任一项所述的基于多目视觉的人体动作重建系统,还包括动作评估模块;8. An action training system, characterized in that, comprising the multi-vision based human action reconstruction system according to any one of claims 1-5, and also comprising an action evaluation module; 所述动作评估模块,用于将从所述三维动作重建模块获取到的目标人体的三维动作与预置标准动作进行差异比对,输出与所述差异比对的结果对应的动作评估结果。The motion evaluation module is configured to perform a difference comparison between the three-dimensional motion of the target human body obtained from the three-dimensional motion reconstruction module and a preset standard motion, and output a motion evaluation result corresponding to the difference comparison result. 9.根据权利要求8所述的动作训练系统,其特征在于,所述动作评估模块具体用于:9. The motion training system according to claim 8, wherein the motion evaluation module is specifically used for: 将从所述三维动作重建模块获取到的目标人体的三维动作与预置标准动作进行关节点组合角度相似性判断、关节点组合运动轨迹的平均曲率对比和人体关节点组合运动量对比;The three-dimensional motion of the target human body obtained from the three-dimensional motion reconstruction module and the preset standard motion are judged on the similarity of the joint point combination angle, the average curvature comparison of the joint point combined motion trajectory, and the human body joint point combined motion amount comparison; 获取所述关节点组合角度相似性判断的结果对应的第一动作评估结果、所述关节点组合运动轨迹的平均曲率对比的结果对应的第二动作评估结果和所述人体关节点组合运动量对比的结果对应的第三动作评估结果;Obtain the first action evaluation result corresponding to the result of the joint point combination angle similarity judgment, the second action evaluation result corresponding to the result of the average curvature comparison of the joint point combination motion trajectory, and the human body joint point combination exercise amount comparison. The third action evaluation result corresponding to the result; 对所述第一动作评估结果、所述第二动作评估结果和所述第三动作评估结果进行加权处理,输出加权处理后得到的动作评估结果。Perform weighting processing on the first action evaluation result, the second action evaluation result and the third action evaluation result, and output the action evaluation result obtained after the weighting process. 10.根据权利要求9所述的动作训练系统,其特征在于,还包括:音乐节奏契合度模块;10. The motion training system according to claim 9, further comprising: a music rhythm fit module; 所述音乐节奏契合度模块,用于基于音频提取算法提取在播音乐的音乐特征,以节拍为单位,判断所述目标人体在连续帧的人体动作与预置标准动作在匹配节拍的系列动作的匹配程度,输出节拍匹配结果。The music rhythm fit module is used for extracting the music features of the music being broadcast based on the audio extraction algorithm, and in units of beats, it is judged that the human body movements of the target human body in continuous frames and the preset standard movements are in the series of movements that match the beats. Matching degree, output beat matching result.
CN201910844266.7A 2019-09-06 2019-09-06 Human motion reconstruction system, method and motion training system based on multi-eye vision Pending CN110544302A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910844266.7A CN110544302A (en) 2019-09-06 2019-09-06 Human motion reconstruction system, method and motion training system based on multi-eye vision

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910844266.7A CN110544302A (en) 2019-09-06 2019-09-06 Human motion reconstruction system, method and motion training system based on multi-eye vision

Publications (1)

Publication Number Publication Date
CN110544302A true CN110544302A (en) 2019-12-06

Family

ID=68712830

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910844266.7A Pending CN110544302A (en) 2019-09-06 2019-09-06 Human motion reconstruction system, method and motion training system based on multi-eye vision

Country Status (1)

Country Link
CN (1) CN110544302A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111028912A (en) * 2019-12-23 2020-04-17 北京深测科技有限公司 Environment-assisted life monitoring method and system
CN111027522A (en) * 2019-12-30 2020-04-17 华通科技有限公司 Bird detection positioning system based on deep learning
CN111105486A (en) * 2019-12-07 2020-05-05 东南大学 Multi-view-angle dynamic three-dimensional reconstruction method for mice
CN112200041A (en) * 2020-09-29 2021-01-08 Oppo(重庆)智能科技有限公司 Video motion recognition method and device, storage medium and electronic equipment
CN112365578A (en) * 2020-11-11 2021-02-12 杭州勾勾科技有限公司 Three-dimensional human body model reconstruction system and method based on double cameras
CN113421286A (en) * 2021-07-12 2021-09-21 北京未来天远科技开发有限公司 Motion capture system and method
CN113936123A (en) * 2021-10-21 2022-01-14 江苏大学 Operation state identification method and system based on virtual-real fusion
CN113989928A (en) * 2021-10-27 2022-01-28 南京硅基智能科技有限公司 Motion capturing and redirecting method
CN114463850A (en) * 2022-02-08 2022-05-10 南京科源视觉技术有限公司 Human body action recognition system suitable for multiple application scenes
WO2022241583A1 (en) * 2021-05-15 2022-11-24 电子科技大学 Family scenario motion capture method based on multi-target video

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104346829A (en) * 2013-07-29 2015-02-11 中国农业机械化科学研究院 Three-dimensional color reconstruction system and method based on PMD (photonic mixer device) cameras and photographing head
CN106204656A (en) * 2016-07-21 2016-12-07 中国科学院遥感与数字地球研究所 Target based on video and three-dimensional spatial information location and tracking system and method
CN106503605A (en) * 2015-09-01 2017-03-15 南京理工大学 Human body target recognition methods based on stereovision technique
CN109003301A (en) * 2018-07-06 2018-12-14 东南大学 A kind of estimation method of human posture and rehabilitation training system based on OpenPose and Kinect

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104346829A (en) * 2013-07-29 2015-02-11 中国农业机械化科学研究院 Three-dimensional color reconstruction system and method based on PMD (photonic mixer device) cameras and photographing head
CN106503605A (en) * 2015-09-01 2017-03-15 南京理工大学 Human body target recognition methods based on stereovision technique
CN106204656A (en) * 2016-07-21 2016-12-07 中国科学院遥感与数字地球研究所 Target based on video and three-dimensional spatial information location and tracking system and method
CN109003301A (en) * 2018-07-06 2018-12-14 东南大学 A kind of estimation method of human posture and rehabilitation training system based on OpenPose and Kinect

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
李琳等: "二维和三维视觉传感集成系统联合标定方法", 《仪器仪表学报》 *
陈学梅: "基于人体三维姿态的动作评价系统", 《中国优秀硕士学位论文全文数据库信息科技辑》 *
魏玮等: "人体运动分析综述", 《计算机系统应用》 *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111105486A (en) * 2019-12-07 2020-05-05 东南大学 Multi-view-angle dynamic three-dimensional reconstruction method for mice
CN111028912A (en) * 2019-12-23 2020-04-17 北京深测科技有限公司 Environment-assisted life monitoring method and system
CN111027522A (en) * 2019-12-30 2020-04-17 华通科技有限公司 Bird detection positioning system based on deep learning
CN111027522B (en) * 2019-12-30 2023-09-01 华通科技有限公司 Bird detection positioning system based on deep learning
CN112200041B (en) * 2020-09-29 2022-08-02 Oppo(重庆)智能科技有限公司 Video motion recognition method and device, storage medium and electronic equipment
CN112200041A (en) * 2020-09-29 2021-01-08 Oppo(重庆)智能科技有限公司 Video motion recognition method and device, storage medium and electronic equipment
CN112365578A (en) * 2020-11-11 2021-02-12 杭州勾勾科技有限公司 Three-dimensional human body model reconstruction system and method based on double cameras
WO2022241583A1 (en) * 2021-05-15 2022-11-24 电子科技大学 Family scenario motion capture method based on multi-target video
CN113421286A (en) * 2021-07-12 2021-09-21 北京未来天远科技开发有限公司 Motion capture system and method
CN113421286B (en) * 2021-07-12 2024-01-02 北京未来天远科技开发有限公司 Motion capturing system and method
CN113936123A (en) * 2021-10-21 2022-01-14 江苏大学 Operation state identification method and system based on virtual-real fusion
CN113989928A (en) * 2021-10-27 2022-01-28 南京硅基智能科技有限公司 Motion capturing and redirecting method
CN113989928B (en) * 2021-10-27 2023-09-05 南京硅基智能科技有限公司 Motion capturing and redirecting method
CN114463850A (en) * 2022-02-08 2022-05-10 南京科源视觉技术有限公司 Human body action recognition system suitable for multiple application scenes

Similar Documents

Publication Publication Date Title
CN110544302A (en) Human motion reconstruction system, method and motion training system based on multi-eye vision
CN110544301A (en) A three-dimensional human motion reconstruction system, method and motion training system
CN111460875B (en) Image processing method and apparatus, image device, and storage medium
US11069144B2 (en) Systems and methods for augmented reality body movement guidance and measurement
JP7427188B2 (en) 3D pose acquisition method and device
CN113706699B (en) Data processing method and device, electronic equipment and computer readable storage medium
JP2020042476A (en) Method and apparatus for acquiring joint position, method and apparatus for acquiring motion
CN112911393B (en) Method, device, terminal and storage medium for identifying part
WO2014042121A1 (en) Movement evaluation device and program therefor
CN114219878A (en) Animation generation method and device for virtual character, storage medium and terminal
JP2019096113A (en) Processing device, method and program relating to keypoint data
US12062245B2 (en) System and method for real-time creation and execution of a human digital twin
CN113989928B (en) Motion capturing and redirecting method
EP4036863A1 (en) Human body model reconstruction method and reconstruction system, and storage medium
CN110298279A (en) A kind of limb rehabilitation training householder method and system, medium, equipment
JP2013120556A (en) Object attribute estimation device and video plotting device
CN109448105A (en) Three-dimensional human skeleton generation method and system based on more depth image sensors
CN113947811B (en) A Tai Chi movement correction method and system based on generative adversarial network
Chen et al. Camera networks for healthcare, teleimmersion, and surveillance
CN111401340B (en) Method and device for detecting motion of target object
CN105225270B (en) A kind of information processing method and electronic equipment
CN114926762A (en) Motion scoring method, system, terminal and storage medium
CN113902845A (en) An action video generation method, apparatus, electronic device and readable storage medium
KR20200030947A (en) 4d rig reconstructing device and a method thereof
CN113345552A (en) Method and system for intelligently assisting in guiding dance exercises and mobile terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20191206