
CN108093188B - A method for large-field video panorama stitching based on hybrid projection transformation model

Info

Publication number: CN108093188B
Application number: CN201711419418.6A
Authority: CN (China)
Other versions: CN108093188A (Chinese (zh))
Prior art keywords: image, area, registration, pixel, regions
Inventors: 袁丁 (Yuan Ding), 胡晓辉 (Hu Xiaohui), 张弘 (Zhang Hong)
Assignee (current and original): Beihang University
Filing date: 2017-12-25
Publication of application CN108093188A: 2018-05-29
Publication of grant CN108093188B: 2019-01-25
Legal status: Active (granted)

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/265Mixing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/698Control of cameras or camera modules for achieving an enlarged field of view, e.g. panoramic image capture


Abstract

The present invention provides a method for stitching a large-field-of-view video panorama based on a hybrid projection transformation model, comprising five main steps. Step 1: downsample the video to reduce the computational load. Step 2: detect and extract SIFT feature points. Step 3: cluster the feature points, segment the image into regions with an SVM, and match the feature points. Step 4: compute the hybrid projection matrix. Step 5: iterate to register and fuse the subsequent image sequence, finally producing the stitched panorama. The invention effectively avoids both the large errors of traditional single-projection-matrix methods and the long running time of multi-projection-matrix methods, and has broad application prospects.

Description

A method for large-field video panorama stitching based on hybrid projection transformation model

Technical Field

The invention relates to a method for stitching a large-field-of-view video panorama based on a hybrid projection transformation model. The method achieves accurate and fast stitching of a video image sequence into a panorama, can be applied in technologies such as virtual reality, and belongs to the field of computer vision.

Background Art

Large-field-of-view video panorama stitching is a hot research topic in video processing. It stitches the image sequence of a video into a single wide-angle image containing the complete scene, overcoming the narrow shooting angle imposed by camera hardware that prevents panoramic observation. The technique is widely used in remote sensing, seabed exploration, virtual reality, and other fields. In remote sensing, acquiring high-resolution imagery requires stitching to build large-area panoramas; in seabed exploration, narrow-lens video likewise has to be stitched into panoramas for observation and processing; and virtual reality relies on panoramas to give users a more intuitive, immersive experience.

In recent years, although panorama stitching has been studied extensively and many mature methods exist, many challenging problems remain. The biggest one is that most current panorama stitching methods rest on an assumption: either the scene is far from the camera, so everything in the image can be treated as lying on a single plane, or the images are captured by rotating the camera about its center of projection. Under either assumption, a single homography matrix can serve as the projection transformation model for image registration.

Such methods, however, are not robust: real video capture is highly unconstrained, and the above assumptions rarely hold. If a single homography is nevertheless used for stitching, the resulting panorama contains errors. Most current algorithms compensate for registration errors with image fusion in post-processing, but this does not solve the problem at its root.

For this reason, stitching video panoramas with multiple homographies has begun to attract research attention. A typical example is the Dual-Homography Warping (DHW) method published at CVPR 2011, which registers and stitches the whole image by combining the homographies of two planes. Its weight computation, however, requires Euclidean distances, and the number of such computations grows rapidly with the number of feature points, so it is very time-consuming.

Summary of the Invention

The technical problem solved by the invention: overcoming the deficiencies of the prior art by providing a method for stitching a large-field-of-view video panorama based on a hybrid projection transformation model that is faster and more accurate, imposes no constraints on how the video is shot, and therefore has broad applicability.

Technical solution: the invention aims to stitch a large-field-of-view video panorama. The input is a video acquired by a camera and the output is a panorama. The invention assumes that the processed video images can be roughly divided into two planes (a background plane and a foreground plane). The technical solution is as follows:

(1) Downsample the video image sequence, ensuring that every two adjacent images overlap by at least 1/3;

(2) Detect SIFT feature points in the overlapping area of the first two sampled images to obtain the feature point sets, and extract a feature vector for each point;

(3) Cluster the feature point sets, divide the image into three regions using the support vector machine (SVM) method, denoted S_b, S_u, and S_f, and match the feature points;

(4) Apply the Random Sample Consensus (RANSAC) method to the feature points of S_b and S_f to remove mismatched points, and compute the homography matrices, denoted H_b and H_f respectively;

(5) Register the first two sampled images; during registration, warp the second image to the first using the hybrid projection transformation model for the pixels of the three regions S_b, S_u, and S_f;

(6) Register the third image to the second by iterating the homography matrices, repeat this step to register the subsequent images of the sequence, and finally fuse the images to complete the stitching of the whole panorama.

Compared with the prior art, the invention has the following advantages:

(1) After point clustering, the invention uses the SVM method to divide the image into regions, preparing for the subsequent image registration step;

(2) During image registration, a novel hybrid projection transformation model is used, which achieves better stitching than a single projection matrix;

(3) This hybrid projection transformation model is not only closer to the real scene geometry than traditional hybrid models such as DHW, but also greatly reduces registration time, making the whole stitching process more efficient.

Brief Description of the Drawings

Figure 1 is a schematic diagram of the SVM method dividing an image into three regions;

Figure 2 is the flow chart of the method of the invention;

Figure 3 shows experimental results of the invention.

Detailed Description of the Embodiments

The invention is described in detail below with reference to the drawings and embodiments.

The video sequences were captured with an ordinary single-lens reflex camera, and the panorama stitching was implemented in Matlab R2016b. The flow chart of the method is shown in Figure 2. To aid understanding of the technical solution, the specific embodiments are described below:

Step 1: Read in the video sequence and downsample it in Matlab to reduce the computational load of image registration. When sampling, it suffices to ensure that adjacent sampled images overlap by about 1/3.
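
For illustration, this downsampling step can be sketched in Python with OpenCV (the patent's own implementation is in Matlab; the function name and the stride value are hypothetical, and the stride must be chosen so that consecutive kept frames still overlap by about 1/3):

```python
import cv2

def sample_frames(video_path, stride=15):
    """Keep every stride-th frame of the video; stride is a tuning knob
    chosen so that consecutive kept frames overlap by roughly 1/3."""
    cap = cv2.VideoCapture(video_path)
    frames = []
    idx = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        if idx % stride == 0:
            frames.append(frame)
        idx += 1
    cap.release()
    return frames
```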

Step 2: Use the SIFT operator to detect feature points in the two adjacent images I_1 and I_2 and compute their SIFT feature vectors.
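
A minimal sketch of this step using OpenCV's SIFT in place of the Matlab implementation (I1 and I2 stand for two consecutive sampled frames from step 1):

```python
import cv2

def sift_features(image):
    """Detect SIFT keypoints and compute their 128-D descriptors."""
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    sift = cv2.SIFT_create()
    keypoints, descriptors = sift.detectAndCompute(gray, None)
    return keypoints, descriptors

# kp1, des1 = sift_features(I1)
# kp2, des2 = sift_features(I2)
```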

Step 3: Treat the two-dimensional image plane as a two-dimensional feature space, where the u and v coordinates of a pixel are its two feature values. Cluster the obtained feature point set with the K-means method, with K = 2. Then divide the image plane with the support vector machine (SVM) method, as shown in Figure 1, where each small circle represents a SIFT feature point. The equations of the two lines tangent to the support vectors can be written as:

w^T x + b_1 = 0    (1)

w^T x + b_2 = 0    (2)

where x = (u, v)^T.

Next, the pixel region satisfying w^T x + b_1 < 0 is denoted S_b; the pixel region satisfying w^T x + b_2 > 0 is denoted S_f; and the pixel region satisfying w^T x + b_1 > 0 and w^T x + b_2 < 0 is denoted S_u.
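
This step can be sketched with scikit-learn in place of Matlab. In the canonical SVM form f(x) = w·x + b, the two tangent lines (1) and (2) are the margin lines f(x) = -1 and f(x) = +1 (i.e. b_1 = b + 1 and b_2 = b - 1); which K-means cluster corresponds to the background is arbitrary here and would have to be disambiguated in practice:

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import SVC

def split_regions(keypoints, image_shape):
    """Cluster feature points into 2 groups (K-means, K=2), fit a linear SVM,
    and carve the image into Sb / Su / Sf via the margin lines f = -1, +1."""
    pts = np.array([kp.pt for kp in keypoints])      # (u, v) per feature point
    labels = KMeans(n_clusters=2, n_init=10).fit_predict(pts)

    svm = SVC(kernel="linear", C=1.0).fit(pts, labels)

    height, width = image_shape[:2]
    uu, vv = np.meshgrid(np.arange(width), np.arange(height))
    grid = np.stack([uu.ravel(), vv.ravel()], axis=1)
    f = svm.decision_function(grid).reshape(height, width)   # f(x) = w.x + b

    region = np.full((height, width), "Su", dtype=object)    # undefined pixels
    region[f < -1] = "Sb"                                    # w.x + b1 < 0
    region[f > 1] = "Sf"                                     # w.x + b2 > 0
    return region
```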

Match the SIFT feature points of regions S_b and S_f. To improve matching accuracy, for any point p_1 to be matched in region S_b (or S_f) of I_1, let d_1 denote the distance to the nearest feature point in the corresponding region of I_2, and d_2 the distance to the second-nearest. The distance ratio is then

Ratio = d_2 / d_1    (3)

If Ratio > ε (where ε is an empirical threshold determined through extensive trials to satisfy ε ≥ 5), the point p_1 is considered to have found a match.

Traversing all SIFT feature points of I_1 yields the initial sets of matched point pairs, denoted P_b and P_f.
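
The ratio test might look as follows (a sketch; des1_region and des2_region stand for the SIFT descriptors restricted to region S_b, or S_f, of I_1 and I_2). Note that the patent's ratio is second-nearest over nearest, so a match is accepted when the ratio exceeds ε, the opposite direction of Lowe's usual d_1/d_2 < 0.8 test:

```python
import cv2

def ratio_match(des1_region, des2_region, eps=5.0):
    """Keep matches whose second-nearest / nearest distance ratio exceeds eps."""
    bf = cv2.BFMatcher(cv2.NORM_L2)
    good = []
    for pair in bf.knnMatch(des1_region, des2_region, k=2):
        if len(pair) < 2:
            continue
        m, n = pair                                  # nearest, second-nearest
        if n.distance / max(m.distance, 1e-12) > eps:
            good.append(m)
    return good
```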

Step 4: Use the RANSAC method to compute a homography matrix H for each of the regions S_b and S_f. Corresponding points of two two-dimensional images are related by a homography (projective) transformation, as shown in (4), where (x, y, 1)^T and (x', y', 1)^T are the homogeneous coordinates of the projections of a space point X onto the two images. For region S_b, four matched point pairs are drawn at random from P_b each time to compute a candidate H_b, and the fraction of pairs in P_b satisfying (4) is recorded as Inlier_Ratio. The candidate that maximizes Inlier_Ratio is kept as the final H_b.

s (x', y', 1)^T = H (x, y, 1)^T    (4)

where

H = [ h_11  h_12  h_13
      h_21  h_22  h_23
      h_31  h_32  h_33 ]    (5)

and s is a non-zero scale factor. Similarly, the homography matrix of region S_f is computed and denoted H_f.
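
OpenCV's findHomography implements essentially this 4-point-sample / inlier-count RANSAC loop, so the step can be sketched as below (matches_b and matches_f are the per-region match lists from the ratio test, hypothetical names):

```python
import cv2
import numpy as np

def ransac_homography(kp1, kp2, matches):
    """Estimate the homography mapping I2 points onto I1 with RANSAC."""
    dst = np.float32([kp1[m.queryIdx].pt for m in matches]).reshape(-1, 1, 2)
    src = np.float32([kp2[m.trainIdx].pt for m in matches]).reshape(-1, 1, 2)
    H, mask = cv2.findHomography(src, dst, cv2.RANSAC, ransacReprojThreshold=3.0)
    return H, mask          # mask flags the inliers, i.e. pairs satisfying (4)

# Hb, _ = ransac_homography(kp1, kp2, matches_b)    # region Sb
# Hf, _ = ransac_homography(kp1, kp2, matches_f)    # region Sf
```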

Step 5: With the two homography matrices in hand, the images can be registered. The invention uses three registration models for the three regions. The pixels of region S_b can all be regarded as lying on the plane of S_b, so they are registered with the homography matrix H_b, as shown in (6), where (x, y, 1)^T is the homogeneous coordinate of a pixel of I_2 and (x', y', 1)^T is the corresponding point of I_1 obtained by the projective transformation:

s (x', y', 1)^T = H_b (x, y, 1)^T    (6)

For the pixels of region S_f, the transformation (7) is applied analogously:

s (x', y', 1)^T = H_f (x, y, 1)^T    (7)

The invention calls the pixels of S_u "undefined pixels": it cannot be reliably determined which plane these points belong to. The invention solves this problem as follows.

(1) For all points of S_u, define the homography matrix:

H_ij = w_b H_b + w_f H_f    (8)

where H_ij is the homography of the pixel at (i, j), H_b and H_f are the homographies of the two image regions, and w_b and w_f are the weight coefficients of the homographies of the two planes.

(2) Compute the distance from each pixel of S_u to the nearest feature point of the feature point sets of regions S_b and S_f, denoted d_b and d_f respectively. The invention uses the Manhattan distance in place of the traditional Euclidean distance, which saves a great deal of time. The weights are computed as:

w_b = d_f / (d_b + d_f)    (9)

w_f = d_b / (d_b + d_f)    (10)

In summary, the hybrid projection transformation model of each pixel is obtained:

H_ij = H_b,                 (i, j) in background (S_b)
H_ij = w_b H_b + w_f H_f,   (i, j) in undefined area (S_u)    (11)
H_ij = H_f,                 (i, j) in foreground (S_f)

where background denotes the background region, undefined area the ambiguous region, and foreground the foreground region.
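
Equations (8)-(11) translate almost directly into code; a sketch for a single undefined pixel (pts_b and pts_f are the (u, v) arrays of the feature points of S_b and S_f, hypothetical names):

```python
import numpy as np

def hybrid_homography(p, Hb, Hf, pts_b, pts_f):
    """Equation (11) for an undefined pixel p = (u, v): blend Hb and Hf with
    weights derived from Manhattan distances to the nearest feature points."""
    p = np.asarray(p, dtype=np.float64)
    db = np.abs(pts_b - p).sum(axis=1).min()   # L1 distance to nearest Sb point
    df = np.abs(pts_f - p).sum(axis=1).min()   # L1 distance to nearest Sf point
    wb = df / (db + df)                        # eq. (9)
    wf = db / (db + df)                        # eq. (10)
    return wb * Hb + wf * Hf                   # eq. (8)
```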

After the above steps, the registration of the two images is complete.

Step 6: The above five steps register the first two images of the sequence. Next, the third image I_3 and the subsequent images are registered and stitched. The invention achieves this by iterating the homography matrices. For clarity, the subscripts of H_ij are omitted in this step and the matrix is written H.

(1) For the region where I_2 and I_3 overlap, the iterated projective transformation (12) can be applied directly:

H_3→1 = H_2→1(H_3→2(I_3))    (12)

(2) For the region of I_3 that does not overlap I_2, the weighted iterated projective transformation (13) is used:

H_3→1(p) = Σ_{q∈K} λ_q H_2→1(q)(H_3→2(p))    (13)

where K is defined as the set of SIFT feature points in the overlapping region of I_2 and I_3, and the weight λ_q is defined as λ_q = 1/d_Manh, with d_Manh the Manhattan distance between H_3→2(p) and q; the weights λ_q are then normalized so that Σ λ_q = 1.

Combining (12) and (13) gives the complete transformation of I_3:

H_3→1(p) = H_2→1(H_3→2(p)),                  p in the overlap of I_2 and I_3
H_3→1(p) = Σ_{q∈K} λ_q H_2→1(q)(H_3→2(p)),   otherwise    (14)

The above steps yield the projection matrix of the third image relative to the first; applying the projective transformation then completes the registration of the third image. The same procedure is repeated iteratively for the subsequent images of the sequence.
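
In the overlap, the iteration of (12) is plain composition of 3x3 homographies; a sketch (H2_to_1 and H3_to_2 are the previously estimated matrices, hypothetical names):

```python
import numpy as np

def warp_point(H, p):
    """Apply a 3x3 homography to a 2-D point and dehomogenize the result."""
    q = H @ np.array([p[0], p[1], 1.0])
    return q[:2] / q[2]

# Eq. (12): for a pixel p of I3 inside the overlap with I2, the warp into the
# panorama frame is the composition of the two pairwise homographies.
# p_pano = warp_point(H2_to_1 @ H3_to_2, p)
```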

Finally, the registered images are fused. The invention uses the traditional fade-in/fade-out fusion method (i.e. the linear interpolation fusion algorithm), applying a linear weighting function to smooth the transition near the boundary of the overlapping region. The weighting function is given by (15):

I(x, y) = α I_1(x, y) + (1 - α) I_2(x, y)    (15)

where I_1 and I_2 denote the two adjacent images and α is a linear weight that decreases from 1 to 0 across the overlapping region.
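
A sketch of this fade-in/fade-out blend, assuming a horizontal overlap between x_left and x_right (hypothetical names) and two aligned images of equal size:

```python
import numpy as np

def feather_blend(I1, I2, x_left, x_right):
    """Linear cross-fade over the horizontal overlap [x_left, x_right]."""
    width = I1.shape[1]
    x = np.arange(width, dtype=np.float64)
    alpha = np.clip((x_right - x) / float(x_right - x_left), 0.0, 1.0)
    # alpha is 1 left of the overlap and falls linearly to 0 across it
    return alpha[None, :, None] * I1 + (1.0 - alpha[None, :, None]) * I2
```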

The effectiveness and accuracy of the invention have been verified experimentally, with good stitching results. The experimental results are shown in Figure 3: (a) shows four images of a video sequence and (b) the panorama computed from them. The main advantage of the invention is that it divides the image into regions with the SVM method and, on that basis, performs the projective (homography) transformation per region, effectively avoiding both the large errors of the traditional single-projection-matrix method and the long running time of multi-projection-matrix methods. It has good prospects in technical fields such as virtual reality.

Claims (3)

1. A method for stitching a large-field-of-view video panorama based on a hybrid projection transformation model, characterized by comprising the following steps:

(1) Downsample the video image sequence, ensuring that every two adjacent images overlap by at least 1/3;

(2) Detect SIFT feature points in the overlapping area of the first two sampled images to obtain the feature point sets, and extract a feature vector for each point;

(3) Cluster the feature point sets, divide the image into three regions using the support vector machine (SVM) method, denoted S_b, S_u, and S_f, and match the feature points of the corresponding regions S_b and S_f of I_1 and I_2;

(4) Apply the Random Sample Consensus (RANSAC) method to the feature points of S_b and S_f to remove mismatched points, and compute the relation between the two-dimensional coordinates of the two images, i.e. the homography matrices, denoted H_b and H_f respectively;

(5) Register the first two sampled images; during registration, warp the second image to the first using the hybrid projection transformation model for the pixels of the three regions S_b, S_u, and S_f;

(6) Register the third image to the first by iterating the homography matrices, repeat this step to register the subsequent images of the sequence, and finally fuse the images to complete the stitching of the whole panorama;

in step (5), the hybrid projection transformation model is implemented as follows:

(51) The pixels of region S_b are registered with the homography matrix H_b as in (3), where I_1 and I_2 denote the first and second images respectively, (x, y, 1)^T is the homogeneous coordinate of a pixel of I_2, and (x', y', 1)^T is the corresponding point of I_1 obtained by the projective transformation:

s (x', y', 1)^T = H_b (x, y, 1)^T    (3)

(52) The pixels of region S_f are projectively transformed with the transformation shown in (4):

s (x', y', 1)^T = H_f (x, y, 1)^T    (4)

(53) For the pixels of S_u, define the homography matrix

H_ij = w_b H_b + w_f H_f    (5)

where H_ij is the homography of the pixel at (i, j), H_b and H_f are the homographies of the image regions S_b and S_f respectively, and w_b and w_f are the weight coefficients of the homographies of the regions S_b and S_f;

compute the Manhattan distance from each pixel of S_u to the nearest feature point of the feature point sets of regions S_b and S_f, denoted d_b and d_f respectively; the weights are then

w_b = d_f / (d_b + d_f)    (6)

w_f = d_b / (d_b + d_f)    (7)

and the hybrid projection model is expressed as

H_ij = H_f,                 (i, j) in foreground (S_f)
H_ij = w_b H_b + w_f H_f,   (i, j) in undefined area (S_u)    (8)
H_ij = H_b,                 (i, j) in background (S_b)

where foreground denotes the foreground region S_f, undefined area the ambiguous region S_u, and background the background region S_b.

2. The method for stitching a large-field-of-view video panorama based on a hybrid projection transformation model according to claim 1, characterized in that in step (3), when dividing the image into regions:

(1) After clustering the feature points with the K-means method, the image is divided with the SVM method; the equations of the two lines tangent to the support vectors are

w^T x + b_1 = 0    (1)

w^T x + b_2 = 0    (2)

where x = (u, v)^T, w, b_1, and b_2 are the coefficients of the line equations, u is the abscissa of a pixel, and v is its ordinate;

(2) The pixel region satisfying w^T x + b_1 < 0 is denoted S_b; the pixel region satisfying w^T x + b_2 > 0 is denoted S_f; and the pixel region satisfying w^T x + b_1 > 0 and w^T x + b_2 < 0 is denoted S_u, completing the division of the image into regions.

3. The method for stitching a large-field-of-view video panorama based on a hybrid projection transformation model according to claim 1, characterized in that in step (6), the third and subsequent images are registered as follows: in the region where I_3 does not overlap I_2, the weighted homography iteration

H_3→1(p) = Σ_{q∈K} λ_q H_2→1(q)(H_3→2(p))

is used to register the image, where K is defined as the set of SIFT feature points in the overlapping region of I_2 and I_3, the weight λ_q is defined as λ_q = 1/d_Manh, with d_Manh the Manhattan distance between H_3→2(p) and the SIFT feature point q, and the weights λ_q are normalized so that Σ λ_q = 1.
Priority Applications (1)

Application number: CN201711419418.6A
Priority / filing date: 2017-12-25
Title: A method for large-field video panorama stitching based on hybrid projection transformation model
Status: Active, granted as CN108093188B (en)

Publications (2)

CN108093188A (en): 2018-05-29
CN108093188B (en): 2019-01-25

Family

ID=62179005

Family Applications (1)

Application number: CN201711419418.6A
Priority / filing date: 2017-12-25
Status: Active
Title: A method for large-field video panorama stitching based on hybrid projection transformation model

Country Status (1)

CN: CN108093188B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party

  • CN109741289B * (BOE Technology Group Co., Ltd.; priority 2019-01-25, published 2021-12-21): Image fusion method and VR equipment
  • CN110544202B * (Yanshan University; priority 2019-05-13, published 2022-06-07): A method and system for disparity image stitching based on template matching and feature clustering
  • CN110223250B * (Xidian University; priority 2019-06-02, published 2021-11-30): SAR geometric correction method based on homography transformation
  • CN113593426B * (Colorlight Cloud Technology Co., Ltd.; priority 2021-09-28, published 2021-12-03): Module division method for LED spliced screen image and LED screen correction method

Citations (2)

* Cited by examiner, † Cited by third party

  • CN105678687A * (Tianjin University; priority 2015-12-29, published 2016-06-15): Stereo image stitching method based on content of images
  • CN107423008A * (Beijing Zhongshidian Digital Technology Co., Ltd.; priority 2017-03-10, published 2017-12-01): Real-time multi-camera image fusion method and scene display device

Family Cites Families (2)

* Cited by examiner, † Cited by third party

  • CN106886979B * (Shenzhen Future Media Technology Research Institute; priority 2017-03-30, published 2020-10-20): Image splicing device and image splicing method
  • CN107154022B * (Beijing Institute of Technology; priority 2017-05-10, published 2019-08-27): A dynamic panorama stitching method suitable for trailers


Non-Patent Citations (1)

* Cited by examiner, † Cited by third party

JunHong Gao et al., "Constructing Image Panoramas using Dual-Homography Warping", CVPR 2011, 2011-08-22 (entire document).

Also Published As

CN108093188A (en): 2018-05-29


Legal Events

PB01: Publication
SE01: Entry into force of request for substantive examination
GR01: Patent grant