Article

Line-Based Registration of Panoramic Images and LiDAR Point Clouds for Mobile Mapping

1 State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China
2 Research Center of Remote Sensing in Public Security, People’s Public Security University of China, Beijing 100038, China
3 School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China
4 RSISE, Australian National University, Canberra, ACT 2600, Australia
5 Collaborative Innovation Center of Geospatial Technology, Wuhan University, Wuhan 430079, China
6 Lyles School of Civil Engineering, Purdue University, West Lafayette, IN 47907, USA
* Authors to whom correspondence should be addressed.
Sensors 2017, 17(1), 70; https://doi.org/10.3390/s17010070
Submission received: 20 October 2016 / Revised: 18 December 2016 / Accepted: 26 December 2016 / Published: 31 December 2016
(This article belongs to the Section Remote Sensors)

Abstract

For multi-sensor integrated systems, such as the mobile mapping system (MMS), data fusion at sensor-level, i.e., the 2D-3D registration between an optical camera and LiDAR, is a prerequisite for higher level fusion and further applications. This paper proposes a line-based registration method for panoramic images and a LiDAR point cloud collected by a MMS. We first introduce the system configuration and specification, including the coordinate systems of the MMS, the 3D LiDAR scanners, and the two panoramic camera models. We then establish the line-based transformation model for the panoramic camera. Finally, the proposed registration method is evaluated for two types of camera models by visual inspection and quantitative comparison. The results demonstrate that the line-based registration method can significantly improve the alignment of the panoramic image and the LiDAR datasets under either the ideal spherical or the rigorous panoramic camera model, with the latter being more reliable.

1. Introduction

Motivated by applications such as vehicle navigation [1], urban planning [2], and autonomous driving [3], the need for detailed photorealistic 3D models of urban areas has increased dramatically in recent years. A mobile mapping system (MMS), which collects 3D and/or 2D photographic data while the vehicle moves at a regular speed, has been widely used as an efficient data acquisition technology at street level [4]. Short-range laser scanners (e.g., 100–200 m) and electro-optical cameras are the two primary sensors on a MMS, each of which has its own characteristics. Laser scanners acquire 3D information directly but offer relatively low resolution and a short range of use, whereas a camera captures 2D images with high-resolution textures but no depth information. These complementary characteristics of ranging and imaging sensors have been broadly studied through so-called camera/LiDAR fusion [5,6,7,8].
A prerequisite of data fusion is to transform the different datasets into a common coordinate system, which is often called 2D-to-3D registration. Although the camera and LiDAR sensors are usually calibrated [9,10,11,12] in advance, considerable misalignment may still exist between the two datasets. Possible reasons for such misalignment are as follows: (1) System calibration errors. The relative orientations and position offsets between all the sensors on a MMS (GPS, Inertial Measurement Unit (IMU), LiDAR, and camera) may lack precise measurement by the manufacturer. Furthermore, the offsets and relative orientations determined prior to data collection may change due to mechanical vibrations [13]; (2) Different acquisition times. The image is collected at a certain exposure time, while the corresponding point cloud is actually collected by continuous scanning over a period of time, meaning that the two sensors are not strictly synchronized; (3) Unreliable GPS signals make this even worse. At street level, GPS signals often suffer from ambiguity and loss due to multipath and canyon effects. Even when an IMU, Distance Measuring Indicator (DMI), or Differential GPS (DGPS) is onboard, noticeable co-registration errors may still exist. All of these unavoidable factors call for data-driven registration methods in the subsequent data processing procedure.
There have been a number of related studies pertaining to 2D-3D registration, and the reader is referred to the review in [14] for a comprehensive discussion of them. The registration framework for LiDAR and optical images [15], as outlined in the concept of image registration [16], includes four components: registration primitives, similarity measure, registration transformation, and matching strategy.
As to the first component, we use straight lines as the registration primitives in this paper for two reasons. First, linear features can be reliably, accurately, and automatically extracted from both 2D images and 3D point clouds. While the former has been well studied [17,18,19], the latter has received relatively less attention. Early studies on 3D line segments usually extracted plane intersection lines [20,21], and more recent work also considered depth discontinuity lines [22,23]. By applying a 2D line detection algorithm to a set of shaded images rendered from a point cloud with different views, reference [24] obtained reasonable 3D line segments from an unorganized point cloud. More specifically, some studies also focused on linear objects such as poles [25,26] and curbs [27,28]. Second, methods using linear features outperform those using point features in registration accuracy, as demonstrated in the technical report of the “Registration Quality—Towards Integration of Laser Scanning and Photogrammetry” project sponsored by European Spatial Data Research (EuroSDR), 2008–2011 [29].
The principles and concepts of line-based registration originated from line photogrammetry, a highly researched topic in the late 1980s and early 1990s, whereby linear features, especially straight lines, were regarded as entities as basic as traditional point features. Reference [30] described the concepts, mathematical formulations, and algorithms of line photogrammetry, which is based on the straight lines extracted from digital images. In [31], a collinearity model was proposed to establish the relationship between lower-level features, such as edge pixels instead of fitted lines in the image space, and the control lines in the object space. Reference [32] proposed the concept of “generalized points”, a series of connected points representing a linear feature. With this concept, the traditional collinearity model can accommodate more complicated features, such as circles and rectangles, rather than only straight lines. The authors in [20,21] utilized linear features to determine camera orientation and position relative to a 3D model; vanishing points extracted from 2D images and principal directions derived from a 3D model were also used to estimate camera orientation. In addition, there have been many studies using statistical similarity measures for the registration of LiDAR points and images. Reference [33] developed a registration strategy based on global mutual information, exploiting the statistical dependency between image intensity and measured LiDAR elevation, while [34] investigated the effectiveness of both local and global mutual information.
All of the above studies were based on popular frame cameras. However, a panoramic camera rather than a frame camera is used in our MMS, which means a new panoramic sensor model must be taken into account. The greatest advantage of a panoramic camera is its 360° view angle, which has essentially made it a standard component of recently produced MMSs. The online street-view maps provided by Google, Microsoft, Baidu, and Tencent were mostly generated from geo-registered panoramic imagery. Unfortunately, none of the limited studies on LiDAR and image registration involving panoramic cameras considered a rigorous panoramic camera model. References [8,35] proposed an automatic registration of mobile LiDAR and spherical panoramas; however, only the part of the panoramic image within a limited viewport was used, and the registration was based on a conventional frame camera model.
Besides registration, many applications based on panoramic images have been reported in recent years. In [36], a structure-from-motion (SfM) pipeline for 3D modeling from Google Street View imagery with an ideal spherical camera model was presented, while [37] presented a piecewise planar model to reconstruct 3D scenes from panoramic image sequences, in which a quadrangular prism panoramic camera model was used for improved image matching. Reference [38] explained both the ideal spherical camera model and the rigorous panoramic sensor model for a multi-camera rig, together with their corresponding epipolar geometry. In [39], the two models were further compared with respect to their effects on the localization quality in object space and the quality of space resection. In this paper, we utilize the rigorous panoramic sensor model for image and LiDAR registration. Since many studies have been based on the ideal spherical camera model, we also attempt to demonstrate its limitations by comparing it with the rigorous model.
This paper proposes a rigorous line-based registration method for precise alignment of LiDAR point clouds and panoramic images. The remainder of this paper is structured as follows: Section 2 introduces a MMS and its sensor frames for the LiDAR and the panoramic camera. Section 3 presents registration models based on straight-line features for panoramic cameras. Section 4 addresses 3D line extraction from LiDAR point clouds. Section 5 introduces the datasets and analyzes the registration results based on our model. Finally, the conclusions and future work recommendations are presented in Section 6.

2. The Mobile Mapping System and Sensor Configuration

The MMS used in this paper was jointly developed by Wuhan University and Leador Spatial Information Technology Corporation, Ltd. (Wuhan, China), configured with a Ladybug 3 camera [40], three low-cost SICK laser scanners [41] (one for ground and two for facades) and a GPS/IMU. The system and its sensor configuration are shown in Figure 1.
This section first introduces several key coordinate systems of the MMS and their geometric relationships, followed by a description of the geo-referenced LiDAR and the rigorous camera model for a multi-camera rig.

2.1. Coordinate Systems

As shown in Figure 2, there are three coordinate systems in our MMS: (1) a world coordinate system; (2) a vehicle platform coordinate system; and (3) a camera-centered coordinate system. The world coordinate system is the reference for data management and organization. In the proposed method, the GPS/IMU records the translation and orientation from the world coordinate system to the vehicle platform coordinate system denoted as M1(R1, T1). The LiDAR points are geo-referenced to the world coordinate system (the left-bottom dotted line) according to M1 and the calibration parameters M3 between the LiDAR sensor and the platform. M2(R2, T2) is the transformation from the camera to the vehicle platform, which is also achieved through prior calibration.
The goal of registration is to determine the transformation M from the LiDAR points to the panoramic image. Unlike a static calibration, which concerns only M2 and M3, the time series of localization information M1 is also considered here. For simplicity, the possible errors of M3 are ignored and the transformation is constructed directly between the georeferenced LiDAR and the camera (the bottom solid line). It is assumed that there exists a correction ΔM(ΔR, ΔT) such that:
M = \begin{bmatrix} \Delta R & \Delta T \\ 0 & 1 \end{bmatrix} \begin{bmatrix} R_2 & T_2 \\ 0 & 1 \end{bmatrix} \begin{bmatrix} R_1 & T_1 \\ 0 & 1 \end{bmatrix}    (1)
ΔM compensates for several error sources, including those of M3 (as discussed in Section 1). The line features are extracted from both the images and the LiDAR points and are then utilized to determine the optimal ΔM for an accurate registration. The solution procedure is based on the standard least squares technique embedded with RANSAC for the removal of possible gross errors in M1.
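For illustration, the composition in Equation (1) can be sketched with homogeneous 4 × 4 matrices, as below. This is a minimal example and not the system's actual software; the function and variable names (homogeneous, compose_M, dR, dT) are ours.

```python
# Minimal sketch (not the authors' code) of the transformation chain in Equation (1),
# assuming R1, R2 and the correction dR are 3x3 rotation matrices and T1, T2, dT are
# 3-vectors; all names here are illustrative.
import numpy as np

def homogeneous(R, T):
    """Build a 4x4 homogeneous transform [R T; 0 1] from a rotation and a translation."""
    M = np.eye(4)
    M[:3, :3] = R
    M[:3, 3] = T
    return M

def compose_M(dR, dT, R2, T2, R1, T1):
    """M = [dR dT; 0 1] [R2 T2; 0 1] [R1 T1; 0 1], mapping a world point toward the camera frame."""
    return homogeneous(dR, dT) @ homogeneous(R2, T2) @ homogeneous(R1, T1)

def transform_point(M, P_world):
    """Apply the composed transform to a 3D LiDAR point given in world coordinates."""
    p = np.append(np.asarray(P_world, dtype=float), 1.0)
    return (M @ p)[:3]
```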

2.2. Geo-Referenced LiDAR

The LiDAR points are geo-referenced to the world coordinate system using the rotation values interpolated from the GPS/IMU integrated navigation data M1 at the corresponding positions and the calibration parameters M3 between the LiDAR and the IMU [42]. In the proposed system, three low-cost SICK laser scanners (all linear-array lasers) are equipped to acquire a 3D point cloud of the object’s facade. The angular resolution (0.25°–1.0°) and scan frequency (25–75 Hz) are fixed during data acquisition. The density of the LiDAR points is uneven: the closer the measured surface is to the scanner, the higher the point density. For instance, the points on the ground are much denser than those on the upper facade. In addition, the point density in the horizontal direction depends on the velocity of the MMS vehicle.

2.3. Multi-Camera Rig Models

The panoramic image in Figure 3a covers a 360° view of the surrounding scene, captured by the Ladybug 3 system composed of six fisheye cameras. Straight lines in reality are no longer straight in a panoramic image, in contrast to a common frame image (Figure 3b).
Generally, the panoramic imaging process can be approximated by an ideal spherical camera model. However, since the entire image is technically stitched from six individual images through blending, stitching errors cannot be avoided. This section first introduces the traditional ideal spherical camera model and then extends it to the rigorous multi-camera rig panoramic model. The spherical camera model is referred to as the ideal one and the panoramic camera model as the rigorous one.

2.3.1. Spherical Camera Model

Under this model, the imaging surface is regarded as a sphere whose center is the projection center. Figure 4a presents a schematic diagram of the spherical projection, where the sphere center S, the 3D point P in plane π, and the panoramic image point u are collinear [39]. The pixels in a panoramic image are typically expressed in polar coordinates. Assuming that the width and height of the panoramic image are W and H, respectively, the horizontal 360° view is mapped to [0, W] and the vertical 180° view is mapped to [0, H]. Thus, each pixel (u, v) can be transformed to polar coordinates (θ, φ) by Equation (2):
\theta = \frac{(2u - W)\,\pi}{W}, \qquad \varphi = \left(1 - \frac{2v}{H}\right)\frac{\pi}{2}    (2)
where θ is the horizontal angle between −π and π, and φ is the vertical angle between −π/2 and π/2. Letting r be the radius of the projection sphere, Equation (3) gives the corresponding Cartesian coordinates. In most cases, r = 20.0 m yields the best stitching accuracy [43].
x = r\cos\varphi\sin\theta, \quad y = r\cos\varphi\cos\theta, \quad z = r\sin\varphi    (3)
The sphere center S, 3D point P, and edge pixel u are collinear. The relationship between X and P is established by perspective transformation in Equation (4):
X = \lambda^{-1} R^{T} (P - T)    (4)
where P is the coordinate of the object point, and X(x, y, z) is the Cartesian coordinate of image point u; R and T are respectively the rotation matrix and translation vector between the object space and the panoramic camera space; and λ is the scale factor.
Unlike the traditional camera model, the z coordinate of the image point is not equal to −f, where f is the focal length in the traditional camera model. In the widely used spherical camera model, the image point u is under the spherical geometry constraint as Equation (5):
x^2 + y^2 + z^2 = r^2    (5)
As a result, Equation (4) actually has two degrees of freedom, i.e., two independent equations.
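A minimal sketch of the ideal spherical model of Equations (2)–(5) is given below, assuming the point has already been transformed into the panoramic camera frame by Equation (4); W, H, and r follow the definitions above, and the function names are illustrative.

```python
# Minimal sketch of the ideal spherical model in Equations (2)-(5).
# W, H are the panorama size in pixels and r is the projection-sphere radius.
import numpy as np

def pixel_to_sphere(u, v, W, H, r=20.0):
    """Equations (2)-(3): panoramic pixel (u, v) -> Cartesian point on the projection sphere."""
    theta = (2.0 * u - W) * np.pi / W          # horizontal angle in [-pi, pi]
    phi = (1.0 - 2.0 * v / H) * np.pi / 2.0    # vertical angle in [-pi/2, pi/2]
    x = r * np.cos(phi) * np.sin(theta)
    y = r * np.cos(phi) * np.cos(theta)
    z = r * np.sin(phi)
    return np.array([x, y, z])

def camera_point_to_pixel(X_cam, W, H):
    """Inverse mapping for a point already in the camera frame (collinearity of S, u, P).
    arctan2 is used here as a robust form of the arctangent in Equation (13)."""
    x, y, z = X_cam
    theta = np.arctan2(x, y)
    phi = np.arcsin(z / np.linalg.norm(X_cam))
    u = (theta * W / np.pi + W) / 2.0
    v = (1.0 - 2.0 * phi / np.pi) * H / 2.0
    return u, v
```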

2.3.2. Panoramic Camera Model

A multi-camera rig consists of several separate and fixed fish-eye lenses. Independent images are captured by each lens and then stitched to form the entire panoramic image. As shown in Figure 4b, each lens has its own projection center C, which cannot be located precisely at the sphere center S due to manufacturing constraints. The mono-lens center C (instead of the sphere center S), the panoramic image point u, and the 3D point P′ in object space are collinear.
The panoramic camera used in this paper is composed of six separate fish-eye lenses. Figure 5a shows an example of the six raw fisheye images, and Figure 5b shows the corresponding undistorted images rectified from the raw ones. Rectification from a fisheye to an ideal plane image only depends on the known camera calibration parameters Kr, including the projection model and the radial and tangential distortion. The index r means that every fisheye camera has its own calibration parameters. Since the straight lines, such as the boundaries of buildings, are distorted in the raw fisheye image, rectified images are used for line extraction.
As shown in Figure 6, a global camera coordinate system is defined for the whole multi-camera rig, and six local coordinate systems are defined for the individual lenses. The global coordinate system of the panoramic camera (see Figure 6a) is defined by three main directions: the X-axis is typically along the driving direction; the Z-axis is the zenith direction; and the Y-axis is orthogonal to both the X-axis and the Z-axis. Each of the six lenses has its own local coordinate system (see Figure 6b): (1) the origin is the optical centre of the lens; (2) the Z-axis is the optical axis and points towards the scene; and (3) the X-axis and the Y-axis are parallel to the corresponding image coordinate axes. For each lens, there are three interior orientation elements (focal length f and image centre (x0, y0)) and six exterior orientation parameters (Tx, Ty, Tz, Rx, Ry, Rz) relative to the global coordinate system (the offsets between C and S in Figure 4b under the spherical view). Both were acquired in advance by careful calibration by the manufacturer.
Although each lens could be handled with its own camera model, doing so would forfeit the advantages of panoramic imaging. To address this issue, all six images are projected onto the global spherical imaging surface to obtain uniform coordinates. First, the coordinates of the rectified image of each lens are transformed to the global coordinate system. Each pixel p(x, y) in a rectified image forms a 3D ray in the global coordinate system by Equation (6):
X' = m R_r X_r + T_r    (6)
where X_r = (x − x_0, y − y_0, f) is the mono-camera coordinate of pixel p; the translation vector T_r = (T_x, T_y, T_z) and the local rotation matrix R_r are known, the latter being calculated by Equation (7):
R_r = \begin{bmatrix}
\cos R_z \cos R_y & \cos R_z \sin R_y \sin R_x - \sin R_z \cos R_x & \cos R_z \sin R_y \cos R_x + \sin R_z \sin R_x \\
\sin R_z \cos R_y & \sin R_z \sin R_y \sin R_x + \cos R_z \cos R_x & \sin R_z \sin R_y \cos R_x - \cos R_z \sin R_x \\
-\sin R_y & \cos R_y \sin R_x & \cos R_y \cos R_x
\end{bmatrix}    (7)
X′(x′, y′, z′) is the coordinate transformed into the global panoramic coordinate system, and the scale factor m defines the distance from the rectified image plane to the projection surface (typically a sphere or a cylinder). By combining Equations (5) and (6), we can resolve m and X′ for a sphere projection.
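The sketch below illustrates how a rectified mono-camera pixel can be lifted to the global panoramic sphere by combining Equations (5)–(7). It is an illustration under the parameter conventions of Table 2 (Rx, Ry, Rz, Tx, Ty, Tz, x0, y0, f), not the authors' code.

```python
# Minimal sketch of Equations (5)-(7): lift a rectified mono-camera pixel to the
# global panoramic sphere.
import numpy as np

def lens_rotation(Rx, Ry, Rz):
    """Equation (7): local-to-global rotation, equivalent to Rz(Rz) * Ry(Ry) * Rx(Rx)."""
    cx, sx = np.cos(Rx), np.sin(Rx)
    cy, sy = np.cos(Ry), np.sin(Ry)
    cz, sz = np.cos(Rz), np.sin(Rz)
    return np.array([
        [cz * cy, cz * sy * sx - sz * cx, cz * sy * cx + sz * sx],
        [sz * cy, sz * sy * sx + cz * cx, sz * sy * cx - cz * sx],
        [-sy,     cy * sx,                cy * cx],
    ])

def rectified_pixel_to_sphere(x, y, x0, y0, f, Rr, Tr, r=20.0):
    """Equation (6) combined with the sphere constraint (5): X' = m*Rr*Xr + Tr with |X'| = r."""
    Tr = np.asarray(Tr, dtype=float)
    Xr = np.array([x - x0, y - y0, f])       # pixel in the local lens frame
    d = Rr @ Xr                              # ray direction in the global frame
    # Solve |m*d + Tr|^2 = r^2 for the positive scale m (quadratic in m).
    a = d @ d
    b = 2.0 * (d @ Tr)
    c = Tr @ Tr - r * r
    m = (-b + np.sqrt(b * b - 4.0 * a * c)) / (2.0 * a)
    return m * d + Tr                        # X' on the panoramic sphere
```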
In the next step, the collinearity equation for the multi-camera rig is established. As shown in Figure 4b, the real 3D ray passes through C–u–P′ instead of S–u–P and can be vectorized as (X′ − T_r). Translating the vector to the global camera coordinate system yields:
T_r + \lambda (X' - T_r) = R^{T} (P - T)    (8)
Equation (8) reduces to the sphere projection of Equation (4) when T_r is small enough to be neglected. However, for a self-assembled panoramic camera whose T_r is too large to ignore, the panoramic camera model is the better choice.

3. Line-Based Registration Method

To simplify the transformation in Equation (1), an auxiliary coordinate system is introduced, which is close to the camera-centered coordinate system but still carries the ΔM bias. Using M1 and M2 in Figure 2, a LiDAR point P_w in the world coordinate system is transformed into the auxiliary coordinate P, as defined in Equation (9), which is further discussed below:
P = \begin{bmatrix} R_2 & T_2 \\ 0 & 1 \end{bmatrix} \begin{bmatrix} R_1 & T_1 \\ 0 & 1 \end{bmatrix} P_w    (9)

3.1. Transformation Model

Suppose a known line segment AB is given in the world coordinate system (actually the auxiliary coordinate system in this paper). Its corresponding line in the panoramic image is detected as edge pixels. The projection ray through the perspective panoramic camera center C and an edge pixel p on the panoramic image intersects AB at point P, as illustrated in Figure 7. Letting the line be represented by its two endpoints X_A and X_B, an arbitrary point P on it is defined by Equation (10):
P = X_A + t\,(X_B - X_A)    (10)
with t a scale factor along the line.
Substituting the object point P of Equation (10) into Equation (4) yields the line-based sphere camera model:
\lambda X = R^{-1} \left[ (X_A - T) + t\,(X_B - X_A) \right]    (11)
Further, combining Equations (2) and (3) and substituting the point P of Equation (10) into Equation (8) yields the line-based panoramic camera model:
T_r + \lambda (X' - T_r) = R^{-1} \left[ (X_A - T) + t\,(X_B - X_A) \right]    (12)
where X′ can be obtained from Equation (6). The scale factor λ and the line parameter t are unknown; what we seek to resolve are the rotation matrix R and the translation T.

3.2. Solution

To obtain the best alignment between the two datasets, the non-linear least squares method is used to solve for the unknowns iteratively. The Euclidean distance in the panoramic image coordinate system is used as the similarity metric. Denoting the right-hand side of Equation (11) as [\bar{X}\ \bar{Y}\ \bar{Z}]^T and combining Equations (2) and (4) yields Equation (13):
u = \frac{1}{2}\left[ \tan^{-1}\!\left(\frac{\bar{X}}{\bar{Y}}\right)\cdot\frac{W}{\pi} + W \right], \qquad
v = \frac{H}{2}\left[ 1 - \sin^{-1}\!\left(\frac{\bar{Z}}{\sqrt{\bar{X}^2 + \bar{Y}^2 + \bar{Z}^2}}\right)\cdot\frac{2}{\pi} \right]    (13)
where (u, v) is the coordinates in the panoramic image coordinate system and (W,H) is the panoramic image size.
In addition to the six orientation and translation unknowns (X, Y, Z, φ, ω, κ), the line parameter t must be estimated. The right-hand terms are the multivariate composite functions f_u(R, T, t) and f_v(R, T, t). Each pixel on a corresponding line forms the two equations of Equation (13) and introduces one line parameter t. In order to solve the six unknowns, at least six points are needed: if one point per line is used, six pairs of corresponding lines are needed; if two points per line are used, three pairs of corresponding lines are needed. More than two points on a line do not reduce the rank deficiency but only increase the redundancy.
The equations of the i-th pair of corresponding lines can be written as u_i = f_u(R, T, t) and v_i = f_v(R, T, t). Defining the parameter vector X = (X, Y, Z, φ, ω, κ, t_1, …, t_n)^T for n pairs of corresponding lines, Equation (13) is expanded as Equation (14) after linearization by the Taylor series:
u = u_0 + \frac{\partial f_u}{\partial X}\,\Delta X + \frac{\partial f_u}{\partial Y}\,\Delta Y + \cdots + \frac{\partial f_u}{\partial t_n}\,\Delta t_n, \qquad
v = v_0 + \frac{\partial f_v}{\partial X}\,\Delta X + \frac{\partial f_v}{\partial Y}\,\Delta Y + \cdots + \frac{\partial f_v}{\partial t_n}\,\Delta t_n    (14)
The above equation is expressed in matrix form as Equation (15):
V = A\,\Delta - L    (15)
where Δ = (ΔX, ΔY, ΔZ, Δφ, Δω, Δκ, Δt_1, …, Δt_n)^T is the correction vector and L = (L_1; …; L_n) with L_i = (u_i − u_{i0}, v_i − v_{i0})^T. The coefficient matrix A = (A_1; …; A_n) is defined by the partial derivatives of the functions f_u and f_v:
A_i = \begin{bmatrix}
\dfrac{\partial f_u}{\partial X} & \dfrac{\partial f_u}{\partial Y} & \dfrac{\partial f_u}{\partial Z} & \dfrac{\partial f_u}{\partial \varphi} & \dfrac{\partial f_u}{\partial \omega} & \dfrac{\partial f_u}{\partial \kappa} & \dfrac{\partial f_u}{\partial t_1} & \cdots & \dfrac{\partial f_u}{\partial t_n} \\
\dfrac{\partial f_v}{\partial X} & \dfrac{\partial f_v}{\partial Y} & \dfrac{\partial f_v}{\partial Z} & \dfrac{\partial f_v}{\partial \varphi} & \dfrac{\partial f_v}{\partial \omega} & \dfrac{\partial f_v}{\partial \kappa} & \dfrac{\partial f_v}{\partial t_1} & \cdots & \dfrac{\partial f_v}{\partial t_n}
\end{bmatrix}    (16)
The corrections are obtained by solving the normal equation \Delta = (A^T A)^{-1} A^T L. The unknowns X are updated iteratively through X \leftarrow X + \Delta until every element of Δ is smaller than a given threshold. To assess the accuracy of the results, the standard deviation is calculated by Equation (17):
m_0 = \pm\sqrt{\dfrac{V^T V}{r}}    (17)
Here r is the number of redundant observations, r = 2n − (n + 6), where n is the number of pixels involved in the transformation; usually n = 2m for m pairs of corresponding lines. To handle mismatches between the LiDAR lines and image lines, as well as occasional large biases in the GPS/IMU records, a RANSAC paradigm [29] is applied in the iteration to remove outliers among the corresponding line segments from the LiDAR and the camera.
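A minimal sketch of the iterative solution of Equations (14)–(17) is shown below. It uses a numerical Jacobian in place of the analytic partial derivatives, assumes a user-supplied projection function standing in for f_u and f_v of Equation (13), and omits the RANSAC outer loop for brevity.

```python
# Minimal Gauss-Newton sketch of Equations (14)-(17); `project` maps the parameter
# vector x = (X, Y, Z, phi, omega, kappa, t_1, ..., t_n) to predicted (u, v) pixels
# and is assumed to be defined elsewhere.
import numpy as np

def gauss_newton(project, x0, observations, max_iter=20, tol=1e-8, eps=1e-6):
    x = np.asarray(x0, dtype=float)
    L_obs = np.asarray(observations, dtype=float).ravel()     # measured pixels
    for _ in range(max_iter):
        L0 = np.asarray(project(x)).ravel()                   # predicted pixels
        dl = L_obs - L0                                       # misclosure vector
        A = np.zeros((dl.size, x.size))                       # numerical Jacobian
        for j in range(x.size):
            xp = x.copy()
            xp[j] += eps
            A[:, j] = (np.asarray(project(xp)).ravel() - L0) / eps
        delta = np.linalg.solve(A.T @ A, A.T @ dl)            # normal equations
        x += delta
        if np.max(np.abs(delta)) < tol:
            break
    V = np.asarray(project(x)).ravel() - L_obs                # residuals
    r = L_obs.size - x.size                                   # redundancy: 2n - (n + 6)
    m0 = np.sqrt(V @ V / r) if r > 0 else np.nan              # Equation (17)
    return x, m0
```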

4. Line Feature Extraction from LiDAR

The insufficient density of LiDAR points in a low-cost configuration usually makes 3D line fitting a challenge. Previous works on line extraction tend to use the intersections of neighboring non-parallel planar patches. However, such methods do not work well in our case: there are only a few intersections of planar patches in the dataset, due to both the point cloud density and the flat building facades. Hence we fit linear features directly from the point cloud. Three types of objects contain abundant linear features in common street-view scenes: buildings, pole-like objects, and curbs. In this section, we introduce the methods used to fit 3D straight lines from the points belonging to these three object types. Note that the line fitting is based on a well-classified 3D point cloud obtained by existing algorithms or software.

4.1. Buildings

Buildings provide the most reliable straight-line features in street view. Given a mobile LiDAR point cloud P = {p_i | p_i = (x_i, y_i, z_i)} of buildings, a group of 3D line segments L = {l_j | l_j = (x_{0j}, y_{0j}, z_{0j}; x_{1j}, y_{1j}, z_{1j})} is obtained. The detection procedure consists of three steps: (1) apply region growing segmentation [44] to P and obtain a set of planar segments S = {s_k | s_k = (i_{k1}, i_{k2}, …, i_{kn})}; (2) project the points onto the 3D plane model of each segment s_k (Figure 8a) and detect the boundary points of each segment (Figure 8b); and (3) fit the boundary points into 3D straight border lines with RANSAC (Figure 8c).
To overcome the weaknesses of the traditional least-squares regression [45], certain constraints are introduced: (1) lines must pass through the outermost points instead of the centroid; (2) only lines close to vertical or horizontal are considered; and (3) only lines supported by sufficient points are considered (Figure 8d). Compared to the line segments detected by [33] (Figure 8c), the lines fitted with these constraints are more reliable.
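As an illustration of Section 4.1, the sketch below fits one dominant 2D border line to boundary points with RANSAC and applies the regularity constraints (near-vertical or near-horizontal direction, minimum support, endpoints at the outermost inliers); the thresholds are assumed values, not those used in the experiments.

```python
# Illustrative RANSAC line fitting on 2D boundary points of a planar building patch,
# with the regularity constraints of Section 4.1. Not the authors' implementation.
import numpy as np

def ransac_line_2d(points, n_iter=500, dist_thresh=0.05, min_inliers=30, angle_tol_deg=10.0):
    pts = np.asarray(points, dtype=float)
    rng = np.random.default_rng(0)
    best = None                                   # (inlier indices, unit direction)
    for _ in range(n_iter):
        i, j = rng.choice(len(pts), size=2, replace=False)
        d = pts[j] - pts[i]
        norm = np.linalg.norm(d)
        if norm < 1e-9:
            continue
        d = d / norm
        angle = np.degrees(np.arctan2(abs(d[1]), abs(d[0])))   # 0 = horizontal, 90 = vertical
        if min(angle, 90.0 - angle) > angle_tol_deg:
            continue                                            # regularity constraint
        normal = np.array([-d[1], d[0]])
        dist = np.abs((pts - pts[i]) @ normal)                  # point-to-line distances
        inliers = np.flatnonzero(dist < dist_thresh)
        if best is None or len(inliers) > len(best[0]):
            best = (inliers, d)
    if best is None or len(best[0]) < min_inliers:
        return None                                             # not enough support
    sub = pts[best[0]]
    t = (sub - sub.mean(axis=0)) @ best[1]                      # position along the line
    return sub[np.argmin(t)], sub[np.argmax(t)]                 # outermost inliers as endpoints
```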

4.2. Street Light Poles

Pole-like segments are labeled and divided into separate objects by spatial connectivity and are represented by one or two arrays of points. A percentile-based pole recognition algorithm [46] is adopted to extract the pole objects, which excludes disrupting structures, such as a flowerbed at the bottom of a light pole, and non-pole elements, such as lamps. The main steps of the algorithm are as follows: (1) the segment is first sliced into subparts, for which 2D enclosing rectangles and centroids are derived; (2) the deviation of the centroids between neighboring subparts is checked; (3) the diagonal length of each rectangle is checked; and (4) the neighboring subparts with the maximum length are kept. The final fitted line segment is defined by the 2D centroids and the minimum and maximum Z values. The fitting results are shown in Figure 9a.
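The sketch below illustrates the slicing idea behind this kind of percentile-based extraction: a candidate segment is cut into height slices, the 2D enclosing rectangles and centroid drifts are checked, and a vertical line segment is output. It is a simplified interpretation of [46] with assumed thresholds, not the published algorithm itself.

```python
# Simplified, illustrative slicing check for a pole-like segment; thresholds are assumptions.
import numpy as np

def fit_pole_line(points, slice_height=0.5, max_centroid_shift=0.10, max_diagonal=0.40):
    pts = np.asarray(points, dtype=float)
    z_min, z_max = pts[:, 2].min(), pts[:, 2].max()
    edges = np.arange(z_min, z_max + slice_height, slice_height)
    centroids = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        s = pts[(pts[:, 2] >= lo) & (pts[:, 2] < hi)]
        if len(s) == 0:
            continue
        extent = s[:, :2].max(axis=0) - s[:, :2].min(axis=0)    # 2D enclosing rectangle
        if np.hypot(*extent) > max_diagonal:
            continue                                            # slice too wide to be a pole
        centroids.append(s[:, :2].mean(axis=0))
    if len(centroids) < 2:
        return None
    centroids = np.array(centroids)
    shifts = np.linalg.norm(np.diff(centroids, axis=0), axis=1)
    if shifts.max() > max_centroid_shift:
        return None                                             # centroids drift: not a straight pole
    c = centroids.mean(axis=0)                                  # pole axis position in XY
    return np.array([c[0], c[1], z_min]), np.array([c[0], c[1], z_max])
```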

4.3. Curbs

Curbs are usually located 10–20 cm above the road surface and are designed to separate the road from the sidewalk. The density of the points on the ground is relatively high, and the points on curbs present as narrow stripes perpendicular to the road surface [28]. A curb line can be approximated as the intersection of the vertical curb with the ground surface. In this paper, the curb lines are fitted in the following steps: (1) the points classified as curbs are fitted to a plane parallel to the Z-axis using RANSAC with a direction constraint, and the noise is filtered out as outliers; (2) the remaining points are fitted to a 2D line segment in the OXY plane; and (3) the height of the ground is used as the Z value of the 2D line segment. The fitting results are shown in Figure 9b.
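A minimal sketch of the curb-line fitting is given below; for brevity it replaces the constrained RANSAC plane fit with a principal-direction fit plus a distance filter on the already classified curb points, so it is an approximation of the procedure rather than the exact implementation.

```python
# Illustrative curb-line fit: project curb points to the OXY plane, fit a 2D line,
# and assign the local ground height as the Z value.
import numpy as np

def fit_curb_line(curb_points, ground_z, dist_thresh=0.05):
    pts = np.asarray(curb_points, dtype=float)[:, :2]            # work in the OXY plane
    center = pts.mean(axis=0)
    # Principal direction of the curb points serves as the line direction.
    _, _, vt = np.linalg.svd(pts - center, full_matrices=False)
    d = vt[0]
    normal = np.array([-d[1], d[0]])
    dist = np.abs((pts - center) @ normal)
    inliers = pts[dist < dist_thresh]                            # filter stray points as outliers
    t = (inliers - center) @ d                                   # parameter along the line
    p0 = center + t.min() * d
    p1 = center + t.max() * d
    return np.array([p0[0], p0[1], ground_z]), np.array([p1[0], p1[1], ground_z])
```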

5. Experiments and Results

5.1. Datasets

The test data were collected on Hankou North Street in the northern part of Wuhan City and included buildings, trees, poles, streets, and moving cars. Figure 10a shows the test area in Google Earth, and Figure 10b shows the 3D point cloud of the test area containing about 1.2 million points, which were previously classified and rendered by classification.
In both subfigures, the red dots mark the driving path; each dot is a location where the panoramic camera was exposed, at an approximate spacing of 7.5 m. The GPS observations were post-processed with RTK [47] technology and can reach an accuracy of up to 0.1 m.
We first extracted a number of the three typical types of line features from the LiDAR dataset. Second, we projected the lines onto the rectified mono-images to obtain the corresponding 2D lines, followed by a manual check to eliminate possible one-to-many ambiguities. Then, the proposed registration approach based on the panoramic camera model was applied. A linear feature from LiDAR was defined by two 3D endpoints, while a linear feature from an image was defined as a sequence of pixels, of which only two were used in the transformation. Finally, the registration results were assessed by 2D and 3D visual comparison before and after registration, quantitative evaluation of check points, and statistical evaluation of edge pixels and 3D boundary points.
As mentioned in Section 2.1, the MMS recorded the POS data of the vehicle when it captured an image, while the exterior orientation parameters (EOP) of the camera relative to the vehicle platform coordinate system were acquired in advance through system calibration. The EOP defined the position and rotation of the camera at the instant of exposure with six parameters: three Euclidean coordinates (X, Y, Z) of the projection center and three angles of rotation ( φ , ω , κ ) . Table 1 shows an example of the POS data and the EOP, which correspond to M1(R1, T1) and M2(R2, T2), respectively, in Equation (9).
Table 2 shows the known parameters of the six cameras in the panoramic camera model. Rx, Ry, Rz are the rotation angles about the X, Y and Z axes, and Tx, Ty, Tz are the translation along the X, Y and Z axes. x0, y0 indicate the pixel location of the camera center, and f is the focal length. These parameters and definitions of the coordinate systems are discussed in detail in Section 2. These parameters are used in the line-based panoramic camera model (12).

5.2. Registration Results

This section analyzes the registration results based on the panoramic camera model in the following steps: registration, visual inspection, and quantitative and statistical evaluation. For comparison purposes, the results from the spherical camera model are presented as well. Table 3 lists the registration results based on the spherical and panoramic camera models. Here the deltas are the corrections obtained after registration, i.e., the correction terms Δ in Equation (15). The root mean square error (RMSE) is the measure of registration accuracy defined in Equation (17). Both RMSEs were below five pixels; for an object point 20 m away from the camera center, this corresponds to an error of about 6 cm in object space.
Figure 11a,c are the panoramic images and the labeled LiDAR point cloud before registration, and Figure 11b,d are those after registration with the panoramic camera model, which visually proved that the proposed method effectively removed the displacement between the panoramic image and the LiDAR point cloud (see the borders of buildings in Lens ID 0–2, the two poles marked with yellow pointers in Lens ID 3, and the windows in Lens ID 4).
To quantitatively evaluate the registration results based on the panoramic camera model, we manually selected check points both in the LiDAR points and the images according to the following rules: (1) the check points must be correspondent and recognizable in both datasets; (2) the check points should be selected from stationary objects with sufficiently dense LiDAR points; and (3) the check points should be evenly distributed horizontally. As a result, 20 check points were selected (see Figure 12) in the 3D LiDAR point cloud and the panoramic image, whereas 28 corresponding points were selected on the rectified mono-camera images. Please note that there are more check points for images than for LiDAR points because the check points for the panoramic model in the overlapping areas appear twice in adjacent cameras.
We projected all the check points to the images to determine their 2D coordinates before and after registration separately. Then, we calculated the Euclidean distances between the projected 2D points and the image check points as residuals. According to Figure 13, both the spherical and panoramic camera models reduced the residuals significantly, with the latter showing a slight advantage. The average residuals decreased from 12.0 to 2.9 pixels with the panoramic camera model (Figure 13b), and from 20.5 to 6.5 pixels with the spherical camera model (Figure 13a) after registration. The residuals of most check points decreased substantially after registration, while a few showed minimal change. The latter occurred for the check points whose initial residuals were already small before registration, such as points 1, 3, and 20.
To further evaluate the overall effect of registration, we also calculated a statistic termed the “overlap rate” of the linear features from the two datasets. For the panoramic image, we adopted the EDISON edge detector [48] to extract edge pixels, as shown in Figure 14a, in which 8 × windows and a minimum line length of 20 pixels were used to remove noise. For the LiDAR points, we extracted boundary points using 30 k-nearest neighbors, and the results are shown in Figure 14b. As the figures show, most of the geometric linear features in the two datasets correspond. At the same time, some disturbing elements are present, such as edges caused by color differences in the image and points caused by missing data in the LiDAR point cloud.
Figure 15 illustrates the “overlap rate” of the linear features from the two datasets. First, we projected the 3D boundary points onto an image and determined the binary image (see Figure 15b). Second, we performed the Overlap and Union operation on the two binary images (see Figure 15a,b) to determine the overlap binary image (see Figure 15c) and the union binary image (see Figure 15d). We then counted the number of non-zero pixels in both binary images: the number in the overlap binary image is no, and the number in the union binary image is nu. Finally, we defined the overlap rate as follows:
r = n_o / n_u    (18)
If the alignment of the image and the LiDAR point cloud improves, there will be more overlapping linear features: n_o increases while n_u decreases, so the overlap rate increases after an effective registration. The results in Table 4 show that all of the overlap rates increased slightly, by up to 2%, after registration. Among the five lenses, Lens ID 1 showed the most significant improvement, mainly because part of the image captured by Lens ID 1 covers a facade containing many linear elements (see Figure 5).
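For reference, the overlap rate of Equation (18) can be computed from the two binary masks as in the sketch below; the mask names are illustrative.

```python
# Minimal sketch of the overlap-rate statistic in Equation (18), assuming two binary
# masks of equal size: one from the EDISON edge pixels and one from the projected
# LiDAR boundary points.
import numpy as np

def overlap_rate(edge_mask, boundary_mask):
    """r = n_o / n_u, with n_o the overlapping and n_u the union of non-zero pixels."""
    edge = np.asarray(edge_mask, dtype=bool)
    boundary = np.asarray(boundary_mask, dtype=bool)
    n_o = np.count_nonzero(edge & boundary)
    n_u = np.count_nonzero(edge | boundary)
    return n_o / n_u if n_u else 0.0
```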

6. Conclusions

This paper proposed a line-based registration approach for panoramic images and LiDAR point clouds collected by a MMS. We first established the transformation model between the primitives from the two datasets in the camera-centered coordinate system. Then, we extracted the primitives (three typical linear features) in street view from LiDAR automatically and from panoramic images through a semi-automation process. Using the extracted features, we resolved the relative orientations and translations between the camera and LiDAR.
Compared with other related works, the main contribution of this study is its focus on registration between LiDAR and a panoramic camera, which is widely used in MMSs in place of a conventional frame camera. Two types of camera models (spherical and panoramic) were utilized in our registration. The experimental results show that both models were able to remove the obvious misalignment between the LiDAR point cloud and the panoramic image; however, the panoramic model achieved better registration accuracy. This suggests that a suitable camera model should be chosen for a given data fusion task. For example, for rendering a LiDAR point cloud with acceptable misalignment, the spherical camera model would be adequate, while the panoramic camera model may be necessary for higher-level fusion tasks such as facade modeling.
There are several ways to further improve the registration accuracy and automation of the proposed method in future work. First, the errors of the LiDAR point cloud itself cannot be overlooked; in our case, the LiDAR points were collected by three laser scanners whose calibration errors also influenced the registration accuracy. Moreover, finding reliable correspondences between different datasets may require local statistical similarity measures such as mutual information. In addition to geometrical features, utilizing the physical attributes of LiDAR, such as intensity, is also a future research topic.

Acknowledgments

The authors thank Yuchun Huang and the staff in the Mobile Mapping Laboratory, School of Remote Sensing and Information Engineering, Wuhan University, for their assistance with data collection and preprocessing. This work was partially supported by the National Natural Science Foundation of China (41271431, 41471288 and 61403285) and the National High-Tech R&D Program of China (2015AA124001).

Author Contributions

Tingting Cui performed the experiments and wrote the draft manuscript; Shunping Ji conceived and designed the experiments; Jie Shan advised the study and contributed to writing in all phases of the work; and Jianya Gong and Kejian Liu revised the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cornelis, N.; Leibe, B.; Cornelis, K.; Van Gool, L. 3D urban scene modeling integrating recognition and reconstruction. Int. J. Comput. Vis. 2008, 78, 121–141. [Google Scholar] [CrossRef]
  2. Wonka, P.; Muller, P.; Watson, B.; Fuller, A. Urban design and procedural modeling. In ACM SIGGRAPH 2007 Courses; ACM: San Diego, CA, USA, 2007. [Google Scholar]
  3. Zhuang, Y.; He, G.; Hu, H.; Wu, Z. A novel outdoor scene-understanding framework for unmanned ground vehicles with 3d laser scanners. Trans. Inst. Meas. Control 2015, 37, 435–445. [Google Scholar] [CrossRef]
  4. Li, D. Mobile mapping technology and its applications. Geospat. Inf. 2006, 4, 125. [Google Scholar]
  5. Pu, S.; Vosselman, G. Building facade reconstruction by fusing terrestrial laser points and images. Sensors 2009, 9, 4525–4542. [Google Scholar] [PubMed]
  6. Brenner, C. Building reconstruction from images and laser scanning. Int. J. Appl. Earth Obs. Geoinf. 2005, 6, 187–198. [Google Scholar] [CrossRef]
  7. Pu, S. Knowledge Based Building Facade Reconstruction from Laser Point Clouds and Images; University of Twente: Enschede, The Netherlands, 2010. [Google Scholar]
  8. Wang, R. Towards Urban 3d Modeling Using Mobile Lidar and Images; McGill University: Montreal, QC, Canada, 2011. [Google Scholar]
  9. Park, Y.; Yun, S.; Won, C.S.; Cho, K.; Um, K.; Sim, S. Calibration between color camera and 3d lidar instruments with a polygonal planar board. Sensors 2014, 14, 5333–5353. [Google Scholar] [CrossRef] [PubMed]
  10. Naroditsky, O.; Patterson, A.; Daniilidis, K. Automatic alignment of a camera with a line scan lidar system. In Proceedings of the 2011 IEEE International Conference on Robotics and Automation (ICRA), Shanghai, China, 9–13 May 2011; pp. 3429–3434.
  11. Gong, X.; Lin, Y.; Liu, J. 3d lidar-camera extrinsic calibration using an arbitrary trihedron. Sensors 2013, 13, 1902–1918. [Google Scholar] [CrossRef] [PubMed]
  12. Zhuang, Y.; Yan, F.; Hu, H. Automatic extrinsic self-calibration for fusing data from monocular vision and 3-d laser scanner. IEEE Trans. Instrum. Meas. 2014, 63, 1874–1876. [Google Scholar] [CrossRef]
  13. Levinson, J.; Thrun, S. Automatic Online Calibration of Cameras and Lasers. Robot. Sci. Syst. 2013, 2013, 24–28. [Google Scholar]
  14. Mishra, R.; Zhang, Y. A review of optical imagery and airborne lidar data registration methods. Open Remote Sens. J. 2012, 5, 54–63. [Google Scholar] [CrossRef]
  15. Habib, A.; Ghanma, M.; Morgan, M.; Al-Ruzouq, R. Photogrammetric and lidar data registration using linear features. Photogramm. Eng. Remote Sens. 2005, 71, 699–707. [Google Scholar] [CrossRef]
  16. Brown, L. A Survey of Image Registration Techniques. ACM Comput. Surv. 1992, 24, 325–376. [Google Scholar] [CrossRef]
  17. Canny, J. A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 1986, 8, 679–698. [Google Scholar] [CrossRef] [PubMed]
  18. Ballard, D.H. Generalizing the hough transform to detect arbitrary shapes. Pattern Recognit. 1981, 13, 111–122. [Google Scholar] [CrossRef]
  19. Von Gioi, R.G.; Jakubowicz, J.; Morel, J.-M.; Randall, G. LSD: A line segment detector. Image Process. Line 2012, 2, 35–55. [Google Scholar] [CrossRef]
  20. Liu, L.; Stamos, I. Automatic 3d to 2d registration for the photorealistic rendering of urban scenes. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), San Diego, CA, USA, 20–25 June 2005; pp. 137–143.
  21. Liu, L.; Stamos, I. A systematic approach for 2d-image to 3d-range registration in urban environments. Comput. Vis. Image Underst. 2012, 116, 25–37. [Google Scholar] [CrossRef]
  22. Moghadam, P.; Bosse, M.; Zlot, R. Line-based extrinsic calibration of range and image sensors. In Proceedings of the 2013 IEEE International Conference on Robotics and Automation, Karlsruhe, Germany, 6–10 May 2013; pp. 3685–3691.
  23. Borges, P.; Zlot, R.; Bosse, M.; Nuske, S.; Tews, A. Vision-based localization using an edge map extracted from 3d laser range data. In Proceedings of the 2010 IEEE International Conference on Robotics and Automation, Anchorage, AK, USA, 3–7 May 2010; pp. 4902–4909.
  24. Lin, Y.; Wang, C.; Cheng, J.; Chen, B.; Jia, F.; Chen, Z.; Li, J. Line segment extraction for large scale unorganized point clouds. ISPRS J. Photogramm. Remote Sens. 2015, 102, 172–183. [Google Scholar] [CrossRef]
  25. Yu, Y.; Li, J.; Guan, H.; Wang, C.; Yu, J. Semiautomated extraction of street light poles from mobile lidar point-clouds. IEEE Trans. Geosci. Remote Sens. 2015, 53, 1374–1386. [Google Scholar] [CrossRef]
  26. Yokoyama, H.; Date, H.; Kanai, S.; Takeda, H. Pole-Like Objects Recognition from Mobile Laser Scanning Data Using Smoothing and Principal Component Analysis. Int. Archives Photogramm. Remote Sens. Spat. Inf. Sci. 2011, 38, 115–120. [Google Scholar] [CrossRef]
  27. El-Halawany, S.; Moussa, A.; Lichti, D.D.; El-Sheimy, N. Detection of Road Curb from Mobile Terrestrial Laser Scanner Point Cloud. In Proceedings of the 2011 ISPRS Workshop on Laser Scanning, Calgary, AB, Canada, 29–31 August 2011.
  28. Tan, J.; Li, J.; An, X.; He, H. Robust curb detection with fusion of 3d-lidar and camera data. Sensors 2014, 14, 9046–9073. [Google Scholar] [CrossRef] [PubMed]
  29. Ronnholm, P. Registration Quality-Towards Integration of Laser Scanning and Photogrammetry; European Spatial Data Research Network: Leuven, Belgium, 2011. [Google Scholar]
  30. Patias, P.; Petsa, E.; Streilein, A. Digital Line Photogrammetry: Concepts, Formulations, Degeneracies, Simulations, Algorithms, Practical Examples; ETH Zürich: Zürich, Switzerland, 1995. [Google Scholar]
  31. Schenk, T. From point-based to feature-based aerial triangulation. ISPRS J. Photogramm. Remote Sens. 2004, 58, 315–329. [Google Scholar] [CrossRef]
  32. Zhang, Z.; Zhang, Y.; Zhang, J.; Zhang, H. Photogrammetric modeling of linear features with generalized point photogrammetry. Photogramm. Eng. Remote Sens. 2008, 74, 1119–1127. [Google Scholar] [CrossRef]
  33. Mastin, A.; Kepner, J.; Fisher, J. Automatic Registration of Lidar and Optical Images of Urban Scenes. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; pp. 2639–2646.
  34. Parmehr, E.G.; Fraser, C.S.; Zhang, C.; Leach, J. Automatic registration of optical imagery with 3d lidar data using statistical similarity. ISPRS J. Photogramm. Remote Sen. 2014, 88, 28–40. [Google Scholar] [CrossRef]
  35. Wang, R.; Ferrie, F.P.; Macfarlane, J. Automatic registration of mobile lidar and spherical panoramas. In Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Providence, RI, USA, 18–20 June 2012; pp. 33–40.
  36. Torii, A.; Havlena, M.; Pajdla, T. From google street view to 3d city models. In Proceedings of the IEEE 12th International Conference on Computer Vision Workshops (ICCV Workshops), Kyoto, Japan, 29 September–2 October 2009; pp. 2188–2195.
  37. Micusik, B.; Kosecka, J. Piecewise planar city 3d modeling from street view panoramic sequences. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; pp. 2906–2912.
  38. Shi, Y.; Ji, S.; Shi, Z.; Duan, Y.; Shibasaki, R. GPS-supported visual slam with a rigorous sensor model for a panoramic camera in outdoor environments. Sensors 2013, 13, 119–136. [Google Scholar] [CrossRef] [PubMed]
  39. Ji, S.; Shi, Y.; Shi, Z.; Bao, A.; Li, J.; Yuan, X.; Duan, Y.; Shibasaki, R. Comparison of two panoramic sensor models for precise 3d measurements. Photogramm. Eng. Remote Sens. 2014, 80, 229–238. [Google Scholar] [CrossRef]
  40. PointGrey. Ladybug 3. Available online: https://www.ptgrey.com/ladybug3-360-degree-firewire-spherical-camera-systems (accessed on 28 December 2016).
  41. SICK. Lms5xx. Available online: https://www.sick.com/de/en/product-portfolio/detection-and-ranging-solutions/2d-laser-scanners/lms5xx/c/g179651 (accessed on 28 December 2016).
  42. Sairam, N.; Nagarajan, S.; Ornitz, S. Development of mobile mapping system for 3d road asset inventory. Sensors 2016, 16, 367. [Google Scholar] [CrossRef] [PubMed]
  43. Point Grey Research, I. Geometric Vision Using Ladybug Cameras. Available online: http://www.ptgrey.com/tan/10621 (accessed on 28 December 2016).
  44. Rabbani, T.; van Den Heuvel, F.; Vosselmann, G. Segmentation of point clouds using smoothness constraint. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2006, 36, 248–253. [Google Scholar]
  45. Schnabel, R.; Wahl, R.; Klein, R. Efficient Ransac for Point-Cloud Shape Detection. Comput. Graph. Forum 2007, 24, 214–226. [Google Scholar] [CrossRef]
  46. Pu, S.; Rutzinger, M.; Vosselman, G.; Oude Elberink, S. Recognizing basic structures from mobile laser scanning data for road inventory studies. ISPRS J. Photogramm. Remote Sens. 2011, 66, S28–S39. [Google Scholar] [CrossRef]
  47. Meguro, J.; Hashizume, T.; Takiguchi, J.; Kurosaki, R. Development of an autonomous mobile surveillance system using a network-based rtk-GPS. In Proceedings of the International Conference on Robotics and Automation, Barcelona, Spain, 18–22 April 2005.
  48. Comaniciu, D.; Meer, P. Mean shift: A robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 603–619. [Google Scholar] [CrossRef]
Figure 1. The MMS used in this study: (a) the vehicle; (b) the panoramic camera, laser scanners, and GPS receiver.
Figure 2. Coordinate systems of the Mobile Mapping System.
Figure 3. Comparison between (a) a panoramic image and (b) a frame image. (The meaning of the Chinese characters on the building is Supermarket for Logistics in Central China).
Figure 4. Differences between the spherical and panoramic camera models. (a) The dashed line shows the ray through 3D point P, panoramic image point u, and sphere center S; (b) the solid line shows the ray through 3D point P′, mono-camera image point uc, panoramic image point u, and the mono-camera projection center C.
Figure 5. Images of Camera 0–5: (a) 6 fish-eye images; (b) 6 rectified images. (The meaning of the Chinese characters on the building is Supermarket for Logistics in Central China).
Figure 6. Global and local coordinate systems of multi-camera rig under a cylindrical projection. (a) the global panoramic camera coordinate system and (b) six local coordinate systems of the rectified cameras.
Figure 7. Line-based transformation model on panoramic image.
Figure 8. Line segments fitting for building patch: (a) projected points; (b) boundary points; (c) fitting lines using conventional least square method; and (d) fitting lines using regularity constraints.
Figure 9. Line segments fitting for (a) pole-like objects and (b) curbs.
Figure 10. Overview of the test data: (a) the test area in Google Earth; (b) 3D point cloud of the test area.
Figure 11. Alignments of two datasets before and after registration based on the panoramic camera model with lens id 0–4. (a,b) are the LiDAR points projected to a panoramic image before and after registration respectively; (c,d) are the 3D point cloud rendered by the corresponding panoramic image pixels respectively.
Figure 12. Check points distribution shown on panoramic image. (The meaning of the Chinese characters on the building is Supermarket for Logistics in Central China).
Figure 13. The residuals of the check points before and after registration based on the panoramic camera model. The vertical axis is the residual in pixels; the horizontal axis is (a) the ID of the check points and (b) the ID of the lens and check points (lens ID—check point ID).
Figure 14. Linear features of the two datasets. (a) EDISON edge pixels in the panoramic image; and (b) boundary points in LiDAR point cloud.
Figure 15. Definition of overlap rate: (a) image edge pixels; (b) LiDAR projected boundary points; (c) union of (a,b); (d) overlap of (a,b); (e) composition of (c) and highlighted (d).
Table 1. POS of the vehicle platform and EOP of the camera aboard.

           POS              EOP
X (m)      38,535,802.519   −0.3350
Y (m)      3,400,240.762    −0.8870
Z (m)      762,11.089       0.4390
φ (°)      0.2483           −1.3489
ω (°)      0.4344           0.6250
κ (°)      87.5076          1.2000
Table 2. Parameters of mono-cameras in the panoramic camera model (image size is 1616 × 1232 in pixels).

Lens ID  Rx (Radians)  Ry (Radians)  Rz (Radians)  Tx (m)    Ty (m)    Tz (m)    x0 (Pixels)  y0 (Pixels)  f (Pixels)
0        2.1625        1.5675        2.1581        0.0416    −0.0020   −0.0002   806.484      639.546      400.038
1        1.0490        1.5620        −0.2572       0.0114    −0.0400   0.0002    794.553      614.885      402.208
2        0.6134        1.5625        −1.9058       −0.0350   −0.0229   0.0006    783.593      630.813      401.557
3        1.7005        1.5633        −2.0733       −0.0328   0.0261    −0.0003   790.296      625.776      400.521
4        −2.2253       1.5625        −0.9974       0.0148    0.0388    −0.0003   806.926      621.216      406.115
5        −0.0028       0.0052        0.0043        0.0010    −0.0006   0.06202   776.909      589.499      394.588
Table 3. Registration results based on the spherical and panoramic camera models.

                 Spherical                        Panoramic
                 Deltas           Errors          Deltas           Errors
X (m)            −3.4372 × 10⁻²   1.1369 × 10⁻³   3.4328 × 10⁻²    1.0373 × 10⁻³
Y (m)            1.0653           1.2142 × 10⁻³   1.0929           1.0579 × 10⁻³
Z (m)            1.9511 × 10⁻¹    9.9237 × 10⁻⁴   2.2075 × 10⁻¹    8.0585 × 10⁻⁴
φ (°)            −1.2852 × 10⁻²   1.4211 × 10⁻³   −1.4731 × 10⁻²   1.0920 × 10⁻³
ω (°)            5.8824 × 10⁻⁴    1.4489 × 10⁻⁴   1.5866 × 10⁻³    1.2430 × 10⁻³
κ (°)            −7.9019 × 10⁻³   8.4789 × 10⁻⁴   −6.7691 × 10⁻³   7.7509 × 10⁻⁴
RMSE (pixels)    4.718                            4.244
Table 4. Overlap rate based on the panoramic camera model before and after registration.

Lens ID  Before (%)  After (%)
0        7.80        8.29
1        8.31        10.30
2        11.32       11.83
3        9.84        9.90
4        7.42        7.54
