WO2014020823A1 - 画像処理システムおよび画像処理方法 - Google Patents
画像処理システムおよび画像処理方法 Download PDFInfo
- Publication number
- WO2014020823A1 WO2014020823A1 PCT/JP2013/004059 JP2013004059W WO2014020823A1 WO 2014020823 A1 WO2014020823 A1 WO 2014020823A1 JP 2013004059 W JP2013004059 W JP 2013004059W WO 2014020823 A1 WO2014020823 A1 WO 2014020823A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- pattern
- projection
- image processing
- image
- projection pattern
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
- G06T7/73—Determining position or orientation of objects or cameras using feature-based methods
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01B—MEASURING LENGTH, THICKNESS OR SIMILAR LINEAR DIMENSIONS; MEASURING ANGLES; MEASURING AREAS; MEASURING IRREGULARITIES OF SURFACES OR CONTOURS
- G01B11/00—Measuring arrangements characterised by the use of optical techniques
- G01B11/24—Measuring arrangements characterised by the use of optical techniques for measuring contours or curvatures
- G01B11/25—Measuring arrangements characterised by the use of optical techniques for measuring contours or curvatures by projecting a pattern, e.g. one or more lines, moiré fringes on the object
- G01B11/2513—Measuring arrangements characterised by the use of optical techniques for measuring contours or curvatures by projecting a pattern, e.g. one or more lines, moiré fringes on the object with several lines being projected in more than one direction, e.g. grids, patterns
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01B—MEASURING LENGTH, THICKNESS OR SIMILAR LINEAR DIMENSIONS; MEASURING ANGLES; MEASURING AREAS; MEASURING IRREGULARITIES OF SURFACES OR CONTOURS
- G01B11/00—Measuring arrangements characterised by the use of optical techniques
- G01B11/24—Measuring arrangements characterised by the use of optical techniques for measuring contours or curvatures
- G01B11/25—Measuring arrangements characterised by the use of optical techniques for measuring contours or curvatures by projecting a pattern, e.g. one or more lines, moiré fringes on the object
- G01B11/2545—Measuring arrangements characterised by the use of optical techniques for measuring contours or curvatures by projecting a pattern, e.g. one or more lines, moiré fringes on the object with one projection direction and several detection directions, e.g. stereo
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/0007—Image acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
- G06T7/521—Depth or shape recovery from laser ranging, e.g. using interferometry; from the projection of structured light
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10004—Still image; Photographic image
- G06T2207/10012—Stereo images
Definitions
- the present invention relates to an image processing system and an image processing method, and more particularly to an image processing system and an image processing method for performing high-density shape restoration by one-shot three-dimensional measurement using a single-color pattern.
- Non-Patent Document 1 a game product that realizes an interface that does not require mounting of a device by measuring a human body in real time and analyzing its movement has achieved great success.
- Currently available moving body scanners cannot measure as accurately and densely as 3D scanners that measure static scenes.
- the application range such as medical application and fluid analysis will be greatly expanded.
- the method of projecting structured rays is roughly classified into a temporal encoding method and a spatial encoding method.
- the spatial coding method is suitable for measuring a moving object at a high frame rate because it is a technique for reconstructing a shape from a single image (one-shot scan). For this reason, it has been actively studied.
- corresponding point information that can be uniquely identified from the entire projection pattern is directly embedded in the two-dimensional pattern. For this purpose, since a large area is required to some extent, the restoration density tends to be low. Decoding errors due to pattern distortion due to surface shape changes are also likely to occur.
- color information As a method for efficiently embedding corresponding point information in a two-dimensional pattern.
- a method of embedding a plurality of bit information at each point using a plurality of colors has been widely used (for example, see Non-Patent Documents 3 and 5-8).
- the surface of the target object needs to sufficiently reflect each color element of RGB.
- the spectral distribution between the color elements interferes, so that an error is likely to occur in determining the color of each pixel.
- a method using a dotted line pattern, a grid pattern, or the like has been proposed as a spatial encoding method that does not use color.
- the problem of ambiguity and density reduction has not been completely solved.
- a system using a TOF scanner or active stereo is generally well known.
- Various methods for active measurement of moving objects have been studied.
- a point laser beam is irradiated toward an object to be measured, and the time until the laser beam returns to the detector is measured. For this reason, only one point is measured at a time, and it is not suitable for measuring a wide range in a short time. Therefore, in order to measure an animal body, a device that realizes measurement of a distance image by irradiating a temporally modulated light over a wide area and observing the modulation at each pixel of a two-dimensional sensor. (For example, see Non-Patent Documents 9 and 10).
- it is currently vulnerable to disturbances caused by other light sources, and the resolution is lower than that of a normal camera.
- Non-Patent Document 11 a method of switching patterns at high speed (see, for example, Non-Patent Document 11), a method of reducing the number of sheets required using a phase pattern (see, for example, Non-Patent Document 12), or a method of using a DMD pattern (for example, Non-Patent Document) Reference 13).
- Non-Patent Document 14 a space-time stereo method that irradiates a time-varying pattern using two or more cameras.
- a space-time stereo method for example, see Non-Patent Document 14
- examples of successful measurement at about 100 fps by combining motion estimation have been introduced.
- information of a plurality of frames is required, it is essentially not suitable for high-speed object measurement.
- the spatial coding method is suitable for measuring a moving object because a shape is restored from only one input image using a static pattern.
- the resolution is low due to the need to embed information in the spatial area of the pattern.
- the pattern is distorted by the texture and shape of the target object surface, it tends to be unstable. For this reason, many methods for reducing this have been proposed.
- Non-Patent Documents 15 and 16 For example, a method using a plurality of color bands so as not to have the same combination (for example, see Non-Patent Documents 15 and 16), a method using a specific dotted line (for example, refer to Non-Patent Documents 17 and 18), two-dimensional There is a method of embedding information in a simple pattern (for example, see Non-Patent Documents 1 and 19). However, there is still no method that satisfies sufficient performance in all of accuracy, resolution, and stability.
- Kiyota “Dense one-shot 3d reconstruction by detecting continuous regions with parallel line projection,” ICCV, 2011. Canesta, Inc., “Canesta Vision, EP Development, Kit,” 2010. http://www.canesta.com/devkit.htm. Mesa Imaging AG., “Swiss Ranger SR-4000,” 2011. http://www.swissranger.ch/index.php. S. Rusinkiewicz, O. Hall-Holt and M. Levoy, “Realtime 3D model acquisition,” Proc. SIGGRAPH, pp.438-446, 2002. T. Weise, B. Leibe and L. V. Gool, “Fast 3D scanning with automatic motion compensation,” CVPR, 2007. S. G.
- An object of the present invention is to provide an image processing system and an image processing method that can perform shape restoration by one-shot three-dimensional measurement using a monochromatic pattern and can perform high-density shape restoration even in measurement of an animal body. .
- an image obtained by photographing a projection pattern projected onto an observation target by a projection device by an imaging device and connected to the projection device and the imaging device.
- the image processing device captures an input image photographed by the photographing device, and a line of the projection pattern projected by the projection device.
- a projection pattern projected onto an observation object by a projection device is photographed by a photographing device, and the projection device and the image processing device connected to the photographing device include the projection pattern.
- An image processing method for performing shape restoration from an input image wherein the image processing device captures an input image captured by the imaging device and performs line detection of a projection pattern projected by the projection device.
- the projected pattern is a grid pattern of wavy lines, and the image processing apparatus restores the shape by associating the intersection of the vertical and horizontal lines extracted by the line detection with the projected pattern, thereby restoring the restored image. And a step of displaying.
- the shape restoration by the one-shot three-dimensional measurement by the monochromatic pattern is performed using the grid pattern by the wavy line, so that the high-density shape restoration can be performed even in the measurement of the moving object. It becomes possible.
- FIG. 1 is a configuration diagram illustrating an image processing system according to a first embodiment of the present invention. It is a flowchart which shows the shape restoration algorithm concerning the 1st Embodiment of this invention. It is a figure which shows an example of the grid pattern by a wavy line. It is a figure which shows the static pattern which a projector projects. It is a figure which shows the image acquired by projecting the grid pattern by a wavy line with respect to an observation object. It is a figure which shows the result of having performed line detection from the grid pattern by a wavy line. It is a figure which shows the patch approximated by the tangent plane of the lattice point. It is a figure for demonstrating the depth estimation of a quasi-pixel unit.
- the present embodiment is a technique based on spatial coding using grid pattern connectivity.
- This method is known to have the problem of ambiguity of corresponding points and erroneous restoration due to an error in the determination of connection of detected lines (for example, see Non-Patent Documents 2-4).
- use of a grid pattern using a plurality of colors has been proposed in order to solve this problem.
- measurement is not stable because it is affected by the reflectance and texture of the target object surface.
- both the grid pattern and multicolor pattern problems can be solved simultaneously.
- FIG. 1 shows an image processing system according to the first embodiment of the present invention.
- One camera 101 imaging device
- one projector 102 projection device
- the projector 102 projects a grid pattern with wavy lines onto the observation target 103. Since the projection pattern is static, synchronization is not required for shooting. Therefore, it is possible to measure at a very high FPS (Flames Per Second).
- the camera 101 and the projector 102 are connected to an image processing apparatus 104 that is a personal computer.
- the image processing apparatus 104 stores a projection pattern such as a grid pattern using wavy lines in a storage medium in advance, and can transmit projection pattern data to the projector 102 and project it onto the observation target 103. Further, the image processing apparatus 104 captures an input image taken by the camera 101, stores it in a storage medium, and executes image processing for restoring the shape from the input image.
- a projection pattern such as a grid pattern using wavy lines in a storage medium in advance
- the image processing apparatus 104 captures an input image taken by the camera 101, stores it in a storage medium, and executes image processing for restoring the shape from the input image.
- FIG. 2 shows a shape restoration algorithm according to the first embodiment of the present invention.
- an image is acquired by projecting a grid pattern with wavy lines onto an observation target (S202).
- line detection is performed from the captured image using the technique described in Non-Patent Document 3.
- BP Belief Propagation
- An intersection is calculated from the detected vertical and horizontal lines, and a graph with the intersection as a node is created (S204).
- the position of the epipolar line corresponding to each node is calculated on the projector pattern, and if there is a grid intersection on the line, this is determined as a corresponding candidate (S206).
- the optimum combination of the correspondence candidates at each point is obtained using BP (S208). Since the restoration result is sparse as it is, the depth at each pixel is obtained using interpolation and matching of the pattern and the observed image in units of pixels (S210) to obtain a dense three-dimensional restored shape ( S212).
- the conventional method In order to obtain a unique association between a camera image (image captured on the camera image plane) and a projector image (pattern projected from the projector image plane) by spatial coding, the conventional method has a large window size. A complicated pattern was required. Furthermore, although a wide baseline length is desirable for improving accuracy, the observed pattern is greatly distorted, so that decoding becomes difficult in practice. Therefore, a simple but highly unique pattern that is easy to detect and decode is desirable.
- a pattern that gives information related to priority of correspondence is used instead of a pattern that is uniquely determined by image processing. Specifically, a grid pattern composed of vertical and horizontal wavy lines is used.
- FIG. 3A shows an example of a grid pattern using wavy lines. Since the wavy line pattern is a simple pattern, it can be easily detected as a curve in the image, and the position can be obtained with subpixel accuracy by calculating the peak of the luminance value.
- Both the longitudinal wave line and the transverse wave line use a wavy curve having a certain periodicity, for example, a periodic sine wave pattern.
- Both the longitudinal wave line and the transverse wave line are composed of a plurality of wave lines arranged at regular intervals, and the grid pattern of the wave lines is composed of a set of wave lines in two directions intersecting each other.
- the grid pattern with wavy lines has information useful for detecting corresponding points.
- the intersection of vertical and horizontal wavy lines is used as a feature point.
- the arrangement of the intersection points is determined by the interval between the wavy lines and the wavelength.
- the local pattern around the intersection is not unique among the entire projection pattern.
- Sx and Sy are intervals between adjacent wavy lines
- Wx and Wy are wavelengths.
- Nx 1 cm (Sx, Wx)
- Sx Ny 1cm (Sy, Wy) / Sy
- lcm (a, b) is the least common multiple of a and b
- the subscripts x and y represent values along the vertical and horizontal axes, respectively.
- the local pattern is a pattern that can be identified in each cycle.
- FIG. 3B shows a static pattern projected by the projector 102.
- This pattern is a single color and includes sinusoidal vertical and horizontal curves arranged in a grid.
- the corresponding point candidates are limited to points on the epipolar line.
- the intersection of the projector image is selected as one of corresponding point candidates.
- the number of candidates depends on the intersection position of the camera image. Since the corresponding point candidates are sparsely distributed in the projector image, the number of corresponding candidates is dramatically reduced as compared with normal stereo vision in which candidate points are searched for in pixel units.
- FIG. 4A is an image obtained by projecting a grid pattern with wavy lines onto an observation target.
- FIG. 4B shows the result of line detection.
- the intersection of the vertical and horizontal lines in the grid pattern by the wavy lines on the camera image is hereinafter referred to as “lattice point”. If a plurality of grid points are connected by grid lines, they are on the same wavy line on the projector image. This is used for regularization when determining the corresponding points.
- the connectivity of grid points is obtained by line detection. However, the determination of connectivity by line detection may be erroneous. In particular, such errors increase in the vicinity of the discontinuous boundary of the shape. Therefore, in order to assign a corresponding point to each grid point, the energy minimization defined on the grid is used.
- the matching cost is calculated for all corresponding candidates, and this is used as a data term in energy minimization.
- the cost is calculated as SSD (Sum of Squared Difference) between the camera image and the projector image (pattern image).
- SSD Standard of Squared Difference
- the image observed by the camera is distorted depending on the shape of the target object. Therefore, a simple SSD for a square area is not suitable for a data term. Therefore, consider a patch that approximates the surrounding area of the lattice point of the target object by the tangent plane of the lattice point. Thereby, it is possible to calculate the matching cost with high accuracy and to calculate the corresponding points with sub-pixel accuracy.
- FIG. 5 shows a patch approximated by a tangent plane of a lattice point. It is assumed that the shape around the lattice point (square patch 513) on the surface 503 to be observed is locally a plane.
- a, b, and c are plane parameters. This parameter is obtained by SSD minimization taking image distortion into consideration.
- a square patch R (p) 511 around the lattice point p in the camera image 501 is projected onto a three-dimensional tangent plane and reprojected onto the projector image 502.
- the brightness value SSD between the reprojected square patch 512 and the projector image 502 is calculated.
- (4) Repeat the above steps several times.
- the initial values of a, b, and c are set so that the tangent plane includes the three-dimensional position of the lattice point calculated by the parallax and is parallel to the image plane of the camera.
- the SSD is expressed by the following equation. Is done.
- R (p) is a square patch around p
- H a, b, c (p ′) represents a deformation when p ′ is reprojected on the projector image plane.
- I c (•) and I p (•) represent luminance values in the camera image and the projector image, respectively.
- the grid pattern is composed of a node p ⁇ V composed of lattice points and an edge (p, q) ⁇ U representing connection of lattice points.
- p and q are lattice points
- V is a set of lattice points
- U is an edge set of the grid graph.
- the lattice point p has a corresponding point candidate tp ⁇ T p .
- T p is a set of corresponding point candidates for the lattice point p.
- the stereo matching energy is defined as follows using the corresponding set as a parameter.
- T ⁇ t p
- D p (t p ) is a data term when a corresponding point of p is allocated to t p
- W pq (t p, t q ) is a normalization term for allocating the lattice points adjacent to t p and t q.
- the above data term takes the SSD value calculated by the method described above.
- the normalization term is defined by the following equation.
- ⁇ is a constant determined by the user.
- the defined energy is minimized by BP.
- Non-Patent Document 3 it is necessary to eliminate erroneous connections at the line detection stage before 3D reconstruction, but in this embodiment, removal of erroneous connections and 3D reconstruction are performed simultaneously. Therefore, it is possible to restore with higher density and higher accuracy.
- Correspondence at sparse grid points can be obtained by grid-based stereo method.
- close correspondence is obtained using information of all pixels.
- the depth of the sample pixels resampled at a high density is obtained by interpolation of grid points.
- the difference in luminance between the camera image and the projector image is minimized by using the depth of these sample pixels as a variable.
- Non-Patent Document 8 describes a method based on interpolation between detected grid lines.
- depth estimation independent of each (quasi-) pixel is realized by optimization based on photo-consistency.
- a x , b x , and c x are calculated for each pixel.
- a x at each pixel is interpolated as follows.
- p is a lattice point
- G (•) is a Gaussian function
- is the distance between p and x.
- b x and c x are obtained by a similar weighted average.
- the quasi-pixel unit depth estimation will be described.
- a triangular mesh is created with three sample pixels as vertices from the image to be observed. Depths at pixels other than the sample pixel are linearly interpolated. For optimization iterative calculations, the depth approximates the minute displacement [Delta] d x of d x as a variable.
- the depth of the pixel x in FIG. 6 is expressed as follows.
- w x2 and w x3 are weights for linear interpolation.
- D + ⁇ D is a vector obtained by collecting d x + ⁇ d x for all sample pixels.
- the reprojection error for the projector image (pattern image) is obtained by the following equation for all pixels, not just the sample pixels.
- P D + ⁇ D (x) represents a reprojection position on the projector image.
- a part of D + ⁇ D is used for reprojection of each pixel.
- x and x ′ are adjacent vertices.
- ⁇ is a normalization parameter determined by the user. The parameter ⁇ D is determined so as to minimize the error. D is determined by repeating reprojection and minimization alternately until convergence of the solution.
- FIG. 7 shows an image processing system according to the second embodiment of the present invention.
- Two cameras 1101 and 1102 imaging device
- one projector 1103 projection device
- the projector 1103 projects a grid pattern with wavy lines on the observation target 1104. Since the projection pattern is static, synchronization is not required for shooting. Therefore, it is possible to measure at a very high FPS (Flames Per Second).
- the cameras 1101 and 1102 and the projector 1103 are connected to an image processing apparatus 1105 that is a personal computer.
- the image processing apparatus 1105 stores a projection pattern such as a grid pattern using wavy lines in a storage medium in advance, and can transmit projection pattern data to the projector 1103 and project it onto the observation target 1104.
- the image processing apparatus 1105 also captures input images captured by the cameras 1101 and 1102, stores them in a storage medium, and executes image processing for restoring the shape from the input images.
- a constraint condition between two cameras is used as additional information for finding correspondence candidates.
- a method of assigning corresponding points based on energy minimization on the grid will be described.
- An additional constraint is derived as an edge connecting the graphs between the two cameras. With reference to FIG. 8, how to generate an edge between two grid graphs will be described. First, an image is acquired by projecting a grid pattern of wavy lines onto an observation target. Next, line detection is performed from the captured image, intersections are calculated from the detected vertical and horizontal lines, and a grid graph is created with the intersections as nodes.
- the corresponding point candidate t p0 ⁇ T p0 is an intersection point on the epipolar line 1211 of the lattice point p 0 on the projector pattern 1204, and T p0 is a set of corresponding point candidates of the lattice point p 0 .
- T p0 is a set of corresponding point candidates of the lattice point p 0 .
- the projection of the coordinates P 3D (t P0 ) onto the grid pattern 1202 of the camera 1102 is P 1 (t P0 ).
- the lattice point p 1 of the camera 1102 satisfies the following expression
- the lattice point p 0 and the lattice point p 1 correspond (straight line L1).
- D (a, b) is the distance between the two points ab
- ⁇ is the radius of the search range of the lattice points close to P 1 (t P0 )
- T p1 is the set of corresponding point candidates t p1 is there.
- FIG. 8 four P 3D (t P0 ) are projected, and the leftmost P 3D (t P0 ) 1221 has no grid point in the search range on the grid pattern 1202 and no corresponding point candidate.
- the rightmost P 3D (t P0 ) 1222 has a grid point p 1 in the search range on the grid pattern 1202, but has the same corresponding point candidate t as the corresponding point candidate set T p1 of the epipolar line 1212 of the grid point p 1.
- the central two P 3D (t P0 ) satisfy the above condition and are connected to the lattice point p 0 . When the edges between the two cameras are connected on the graph (straight line L1), the result is a single graph that facilitates the search for the corresponding points of the two cameras.
- an erroneous edge may be generated (straight line L2).
- the second projection 1223 in FIG. 8 is an erroneous edge and is not on the surface 1203 of the observation target 1104.
- the grid point has both a correct edge and an incorrect edge, the total cost of the BP is not affected by the incorrect edge.
- the grid point has only erroneous edges, if the number of erroneous edges is small, it is determined that the corresponding point candidate is not a BP process.
- p 1 and q 1 are lattice points
- V 1 is a set of lattice points
- U 1 is a set of edges
- S is a set of edges between cameras.
- the grid point p 0 has a corresponding point candidate tp 0 ⁇ T p0 on the projector pattern.
- X p0, p1 (t p0 , t p1 ) is a normalized term of the edge between cameras (p 0 , p 1 ).
- ⁇ is a constant determined by the user. If the grid point p is a edge between cameras, assignment of t p for the grid points are selected from one of the edges between the cameras. This is because the energy increases when allocation other than the edge between the cameras is selected.
- a high-density image is obtained by interpolating the grid graph of the camera image.
- the 2-camera / 1-projector of this embodiment there are two sets of grid graphs.
- a graph is created on a camera image, a part of the graph may not be visible from the other camera, and it is not possible to integrate and restore the grid graph using the coordinate system of one camera. Therefore, the pixel information is combined in the projector coordinate system by reprojecting the graphs from both cameras.
- Figure 9 shows a case where the lattice point t p of the projector pattern, correspond to the grid points p 0, p 1 of both cameras.
- the grid pattern 1304 of the projector 1103 is introduced between the grid pattern 1301 of the camera 1101 and the grid pattern 1302 of the camera 1102, and the coordinate P 3D on the surface 1303 of the observation target 1104 is obtained.
- the two coordinate points p 3D0 and p 3D1 are calculated from the two corresponding points, they usually do not match due to an error in image processing. Therefore, the depth d 0, d 1 from the viewpoint of the projector, when the pixel r is in the range of near (R) of the lattice point t p, integrating the average depth d 0, d 1.
- defining the depth d r in the pixel r as follows.
- d (t p , p) is the depth of the coordinate system calculated from t p , p.
- D (r, t pk ) is a distance between the two points r and t pk
- ⁇ is a parameter determined by the user, and determines the vicinity of the lattice point. Since the coordinate point p 3D can be seen from the projector, the depth information can be combined.
- a calculation method of d (t p , p) for example, linear interpolation (for example, bilinear interpolation) according to the distance to p from a set of the lattice point t p and its neighboring lattice points is considered.
- a weighted average may be used when calculating the average of equation (9). As the weight, for example, an angle formed by the camera and the projector can be used.
- FIG. 10 shows an image processing system according to the third embodiment of the present invention.
- Six cameras 2101-2106 imaging device
- six projectors 2201-2206 projection device
- the projectors 2201-2206 project a grid pattern with wavy lines onto the observation object 2301. Since the projection pattern is static, synchronization is not required for shooting. Therefore, it is possible to measure at a very high FPS (Flames Per Second).
- the cameras 2101-2106 and the projectors 2201-2206 are connected to an image processing device 2401 composed of a personal computer.
- the image processing apparatus 2401 stores in advance a projection pattern such as a grid pattern using wavy lines in a storage medium, and can transmit projection pattern data to the projector 2201-2206 to project it onto the observation object 2301.
- the image processing apparatus 2401 captures an input image taken by the cameras 2101-2106, stores it in a storage medium, and executes image processing for restoring the shape from the input image.
- an image obtained from a camera since an image obtained from a camera includes a plurality of patterns, it is necessary to identify which pattern is projected from which projector. Therefore, the color is used for identification.
- the three primary colors of light, red, green and blue are each projected from two projectors onto the observation target.
- FIG. 11A shows an image obtained by projecting a grid pattern of wavy lines of the three primary colors.
- FIG. 11B shows the result of detecting a red pattern from this image
- FIG. 11C shows the result of detecting a blue pattern.
- the corresponding points are found without using the green pattern.
- line detection is performed using a red pattern and a blue pattern, it is affected by the green pattern.
- FIG. 11D there is a case where a green pattern is detected in the blue pattern (the head side surface in FIG. 11D). Therefore, before line detection, the color is converted to a saturated color (pure color) as follows.
- RGB2HSV RGB2HSV (r, g, b)
- R ′, g ′, b ′) HSV2RGB (h, 1, v) (11)
- RGB2HSV and HSV2RGB represent conversion between color spaces, respectively, and colors are represented in the range [0, 1].
- the method of finding corresponding point candidates in each of the red pattern and the blue pattern may be performed in the same manner as in the case of the two cameras + 1 projector shown in the second embodiment.
- corresponding points between the projectors are detected using camera images.
- FIG. 12 shows a camera image in which a plurality of grid patterns are overlapped.
- D (a, b) is the distance between the two points ab
- ⁇ is the radius of the search range of p ik .
- corresponding points in the two graphs are connected by a dotted line (between points p ik and p il in the figure).
- the corresponding point assignment is optimized by energy minimization.
- the edge energy between the corresponding projectors is
- d i (P 3D ) is the depth of the coordinate point P 3D of the camera i
- ⁇ is a weight determined by the user.
- the total energy of multiple cameras and multiple projectors is defined as follows:
- a p (i) is a set of projectors sharing the viewpoint of camera i
- a c (k) is a set of cameras sharing the viewpoint of projector k.
- S ijk is a set of edges between the cameras i and j given by the pattern of the projector k
- Q ikl is a set of edges between the projectors k and l in the image of the camera i.
- image density can be increased by using the method described with reference to FIG. 9 in the second embodiment.
- energy is divided into a data term and a normalization term.
- the data term is calculated from the intensity difference between the camera and the projector, and the normalization term is defined using the curvature of each vertex of the grid graph.
- FIG. 13 shows a state in which images of two ranges of two projectors are superimposed.
- Coordinate point p 3D k is calculated from the point r k projector k (2503). If the projection of the coordinate point p 3Dk is the mask of the camera (2501), point r k overlaps with the projector l (2502).
- coordinate points p 3D k to the image projector l when in the inside of a triangle coordinate point p 3D k is composed of three points r l0, r l1, r l2 , these are the corresponding points.
- the depth of the point r and d r when a minute change [Delta] d r of d r, to minimize repeated using [Delta] d r, updates the depth. It is defined as follows energy using [Delta] d r.
- [Delta] D is the set of [Delta] d r
- E I data section is a normalization term.
- E P represents the constraint between the two ranges of images.
- G (r k ) is a function for finding the corresponding point r ln of the point r k .
- P 3D ( ⁇ d r ) indicates that the coordinate point p 3D has been moved by ⁇ d r along the line of sight.
- D r of each pixel, by adding [Delta] d r that minimizes error E ([Delta] D) by nonlinear minimization are updated iteratively.
- the third embodiment for example, consider a case where 6-camera / 6-projectors are alternately arranged on the circumference. Since there are two cameras on both sides of one projector, six sets of 2-camera / 1-projector described in the second embodiment can be obtained. If the color of the pattern projected by each projector is set so that adjacent colors are not the same as in RGBRGB, two types of patterns are projected on one camera from the two projectors on both sides. Therefore, one of the two color combinations of RG, GB, and BR is distinguished by the method described above.
- information between the camera and projector is added in the first embodiment
- information between the camera and camera is added in the second embodiment
- information between the projector and projector is added in the third embodiment.
- the matching cost is calculated as the SSD between the camera image and the projector image (pattern image). Since a simple SSD for a rectangular area is not suitable for a data term, a patch that approximates the surrounding area by a tangent plane of the lattice point is used at a lattice point in the target object. In the fourth embodiment, the line detection results are compared rather than comparing the images themselves.
- FIG. 14 another embodiment of the intersection comparison method will be described.
- a local line detection error (referred to as a line feature) around the intersection is used.
- the solid line in FIG. 14 is the result of line detection, and the broken line is the projector pattern.
- the error is small, there is a high possibility that the lattice point corresponds. According to this method, the amount of calculation can be reduced as compared with the SSD calculation described in the first embodiment.
- the corresponding point 2602 of the projector image corresponding to the lattice point 2603 of a certain camera image exists on the epipolar line (straight line) 2601. Intersections riding on the same epipolar line may be erroneous corresponding point candidates (incorrect correspondence) (for example, intersections 2602 and 2604 in FIG. 15B). Therefore, the line feature described in the SSD or the fourth embodiment is compared with the intersection on the same epipolar line. The parameters may be selected so that the difference is as large as possible. In order to make a comparison including information on adjacent intersections, the energy represented by Equation 2 is repeatedly calculated by BP.
- an evaluation value of the intersection corresponding to the smallest energy calculated by BP is set as the evaluation value of the intersection, and an evaluation value is calculated in consideration of all the intersections.
- the parameter having the smallest overall evaluation value is determined as the optimum parameter.
- the optimum parameters only the projector image is used, and the intersections on the epipolar line of the projector image are compared with each other. Assuming that the camera and the projector have been calibrated, the epipolar line is invariant even if the grid pattern parameters are changed, and the intersection point on the same epipolar line changes. The parameter with the smallest evaluation value calculated using the intersection point on the same epipolar line is selected.
- the parameters may include wavy line thickness, color (wavelength), and the like.
- FIG. 17 to 18 show the simulation results of the first embodiment.
- bunny data from the shape data collection of Stanford University was used as the target shape.
- FIG. 16A is an image to be observed without a texture
- FIG. 17A is an image to which a lattice pattern is mapped. Images generated by the ray tracing software described in Non-Patent Document 22 for these input images are shown in FIGS. 16B and 17B, respectively.
- FIG. 16C and FIG. 17C are grid detection results of the first embodiment in the head. There are places where the connectivity of the grid is erroneously detected at the boundary of the head, ears, and torso, but these are cut correctly in the stereo matching process.
- FIG. 18A is an input image of a technique using two colors described in Non-Patent Document 8, and is an image to be observed without a texture.
- FIG. 19A is an image to be observed with a texture. In each case, eight periods of local ID information are encoded using three two-color lines. This technique succeeds as shown in FIG. 18B when the observation target has no texture. However, as shown in FIG. 19B, when there is a texture, the color information of the pattern is disturbed, and decoding of ID information and three-dimensional restoration have failed.
- FIGS. 16B, 17B, and 18A were calculated. Since the coordinates of the projector image corresponding to each pixel of the camera image are known, the error between the corresponding point estimated in the restoration result and the correct corresponding point was calculated by the distance on the image plane. 20A to 20C are images obtained by imaging the errors of FIGS. 16B, 17B, and 18A, respectively, in order. Bright pixels indicate a large error.
- RMSE square root of the square error
- RMSE1 calculated for all restored corresponding points and RMSE2 calculated for corresponding points excluding outliers exceeding one pixel. From this table, when there is no texture, RMSE1 for all pixels is better than the method of Non-Patent Document 8 for the method of the first embodiment, whereas RMSE2 excluding outliers is Non-Patent Document 8 It can be seen that this method is better than the method of the first embodiment.
- 21A and 21B show the polygon mesh restored according to the first embodiment.
- 21A corresponds to the input image of FIG. 16B
- FIG. 21B corresponds to the input image of FIG. 17B.
- the shapes in FIGS. 21A and 21B are the result of dense restoration by interpolation.
- the baseline between the camera and the projector is large, for example, there is a parallax of about 100 pixels, but in stereo restoration, a correct correspondence can be obtained without specifying the search range.
- dense corresponding points can be obtained by interpolation and optimization.
- FIGS. 22A to 22D show the results reconstructed by a wavy grid pattern.
- FIG. 3B is a three-dimensional reconstruction result restored with the wavy pattern shown in FIG. 3B.
- FIG. FIG. 22A shows an input image
- FIG. 22B shows a restoration result by the projector / camera system stereo method. Grid lines at the discontinuity of the object (the boundary between the mannequin head and neck) are cut correctly during the stereo process.
- FIG. 22C shows the result of the three-dimensional restoration according to the present embodiment.
- the number of grid points was 943, and the average number of corresponding point candidates at each grid point was 41.
- the time required for the stereo processing was 0.22 seconds.
- the search range is set for the entire image, the calculation cost is small because the grid pattern is sparse compared to the number of pixels.
- FIG. 22D shows a dense shape generated by the interpolation method described above.
- the number of vertices of the three-dimensional model was 25,938.
- the number of iterations in optimization was 5, and the time taken for the interpolation process was 0.59 seconds.
- the total calculation time including line detection was 4.87 seconds.
- 23A to 23C show the results of evaluating the accuracy of the first embodiment.
- FIG. 23A shows an input image
- FIG. 23B shows a shape generated by the above-described interpolation method
- FIG. 23C shows an error imaged. Evaluate by measuring cube shape.
- the size of the cube is 0.2 m on a side and the distance from the camera is 1.0 m.
- FIGS. 24A to 24C are diagrams showing the result of reconfiguration under the influence of ambient light.
- An important advantage of a monochromatic static pattern is that it allows a wider choice of device to illuminate the pattern. Therefore, a restoration experiment using a laser projector that projects light of a single wavelength was performed. Since the energy of the projected light is concentrated in a narrow band, the projection pattern can be observed even under the influence of ambient light by using an appropriate bandpass filter.
- FIG. 24A shows an experimental environment, and it can be seen that the target is strongly illuminated from an external light source. However, as shown in FIG. 24B, the irradiated pattern is clarified by the bandpass filter, and three-dimensional reconstruction can be correctly performed as shown in FIG. 24C.
- FIGS. 25 and 26 show the measurement results of hand opening and closing movements. Movements with the palm closed were measured in the order of FIGS. 25 (a)-(d). FIGS. 26A to 26D show the respective measurement results. According to the first embodiment, since one-shot restoration is performed, three-dimensional restoration can be performed independently for each frame even when the target object moves at high speed.
- FIGS. 27A to 27D show the measurement results of the movement of the human body during the punching operation.
- the movement in which the right hand was extended in the order of FIGS. 27A to 27D was measured.
- 28A to 28D show the respective measurement results.
- three-dimensional restoration can be performed independently for each frame even when the target object moves at high speed.
- the 3D shape restoration (one-shot restoration) method from a single image by irradiating a monochromatic and static pattern. Corresponding information is expressed implicitly by the difference in the pattern of each intersection in the wavy grid pattern. Therefore, by disturbing the regularity of the pattern, the pattern specificity increases and a stable solution can be obtained.
- the method of extending the shape restoration by the stereo method to the projector / camera system in consideration of the connectivity of the grid was described.
- depth estimation in pixel units is performed by interpolating restoration by the grid. Compared with the conventional method, more stable results can be obtained and the measurement of moving objects has been shown to be effective.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Theoretical Computer Science (AREA)
- Optics & Photonics (AREA)
- Length Measuring Devices By Optical Means (AREA)
- Image Processing (AREA)
Abstract
Description
図1に、本発明の第1の実施形態にかかる画像処理システムを示す。1台のカメラ101(撮影装置)と1台のプロジェクタ102(投影装置)を用いる。プロジェクタ102は、観測対象103に対して、波線によるグリッドパターンを投影する。投影パターンは静的であるため、撮影に同期は必要ない。そのため、非常に高いFPS(Flames Per Second)での計測が可能となる。カメラ101とプロジェクタ102とは、パーソナルコンピュータからなる画像処理装置104に接続されている。
Nx=lcm(Sx,Wx)/Sx
Ny=lcm(Sy,Wy)/Sy
を用いて、同一パターンが、縦・横軸に沿ってそれぞれNx,Ny本の波線ごとに起こる。ここでlcm(a,b)はaとbの最小公倍数であり、添字x,yはそれぞれ縦・横軸に沿った値を表すものとする。しかしながら、局所パターンは、各サイクルにおいて識別可能なパターンである。
Sx=10,Sy=11,Wx=Wy=14,Ax=Ay=1
からなるパターンの例である(単位は画素)。この例では、1周期は縦線7本、横線14本となる。よって、98(=7×14)種類の交点が1周期で構成される矩形の中に存在する。
ax+by+cz+1=0
と表される。ただし、a,b,cは平面パラメータである。このパラメータを画像のひずみを考慮してSSD最小化により求める。
(1)カメラ画像501における格子点p周辺の四角パッチR(p)511を、3次元接平面に投影し、プロジェクタ画像502に再投影する。
(2)再投影された四角パッチ512とプロジェクタ画像502との輝度値SSDを計算する。
(3)a,b,cを変数としてSSDを最小化する。
(4)上記のステップを数回繰り返す。
a,b,cの初期値は、接平面が、視差によって計算される格子点の三次元位置を含み、カメラの画像面に対して平行になるように設定され、SSDは以下の式で表される。
図7に、本発明の第2の実施形態にかかる画像処理システムを示す。2台のカメラ1101,1102(撮影装置)と1台のプロジェクタ1103(投影装置)を用いる。プロジェクタ1103は、観測対象1104に対して、波線によるグリッドパターンを投影する。投影パターンは静的であるため、撮影に同期は必要ない。そのため、非常に高いFPS(Flames Per Second)での計測が可能となる。カメラ1101,1102とプロジェクタ1103とは、パーソナルコンピュータからなる画像処理装置1105に接続されている。
D(p1,P1(tP0))<θ and tp0∈Tp1
ここで、D(a,b)は二点ab間の距離であり、θはP1(tP0)に近い格子点の検索範囲の半径であり、Tp1は対応点候補tp1の集合である。
図10に、本発明の第3の実施形態にかかる画像処理システムを示す。6台のカメラ2101-2106(撮影装置)と6台のプロジェクタ2201-2206(投影装置)を用いる。プロジェクタ2201-2206は、観測対象2301に対して、波線によるグリッドパターンを投影する。投影パターンは静的であるため、撮影に同期は必要ない。そのため、非常に高いFPS(Flames Per Second)での計測が可能となる。カメラ2101-2106とプロジェクタ2201-2206とは、パーソナルコンピュータからなる画像処理装置2401に接続されている。
(h,s,v)=RGB2HSV(r,g,b)
(r’,g’,b’)=HSV2RGB(h,1,v) (11)
ただし、RGB2HSVとHSV2RGBとは、それぞれ色空間の間の変換を表し、色は[0,1]の範囲で表される。飽和色に変換することによって、図11Eに示したように、緑色のパターンの影響を抑制することができる。
D(pik,pil)<φ (12)
ここで、D(a,b)は二点ab間の距離であり、φはpikの検索範囲の半径である。
第1~3の実施形態において、マッチングコストを、カメラ画像とプロジェクタ画像(パターン画像)とのSSDとして計算していた。四角形の領域に対する単純なSSDは、データ項には適さないので、対象物体における格子点において、その周辺の領域を格子点の接平面で近似したパッチを用いた。第4の実施形態では、このような画像そのものの比較ではなく、線検出した結果を比較する。
図3A,3Bを参照して説明したように、投影するパターンのパラメータは、経験的に決定していた。そこで、形状計測結果が最も良くなるパターンを、事前に予測して、パラメータを決定する。
図23A-Cに、第1の実施形態の精度を評価した結果を示す。図23Aに入力画像を、図23Bに、上述の補間手法により生成された形状を示し、図23Cに、誤差を画像化したものを示す。立方体形状の計測により評価する。立方体の大きさは一辺0.2mであり、カメラからの距離は1.0mである。復元された立方体の各面に平面を当てはめ、各平面からの誤差をRMSEで算出した。2個の面のRMSEの平均値は0.36mmであり、面の間の角度は88.8度(正解は90.0度)であった。この誤差は、実用的に十分小さいと考えられる。
Claims (11)
- 投影装置により観測対象上に投影された投影パターンを、撮影装置により撮影し、前記投影装置および前記撮影装置に接続された画像処理装置において、該投影パターンを含む入力画像から形状復元を行う画像処理システムにおいて、前記画像処理装置が、
前記撮影装置により撮影された入力画像を取り込み、前記投影装置により投影された投影パターンの線検出を行う手段であって、該投影パターンは、波線によるグリッドパターンであることと、
前記線検出により抽出された縦横線の交点を、前記投影パターンと対応付けることにより形状復元を行う手段と
を備えたことを特徴とする画像処理システム。 - 前記波線は、一定の周期性を持つ波状の曲線であり、前記波線によるグリッドパターンは、一定の間隔に配置された複数の波線からなり、互いに交差する2方向の波線の集合であって、一方の波線の間隔が他方の波線の波長の整数倍でないことを特徴とする請求項1に記載の画像処理システム。
- 前記形状復元を行う手段は、
前記撮影装置により撮影された入力画像における各々の格子点周辺の領域を接平面で近似したパッチを、前記投影装置における出力画像に再投影する手段と、
前記再投影されたパッチと前記投影装置により投影された投影パターンの格子点の対応候補とのステレオマッチングのエネルギーを、格子点に割り当てるデータ項と隣接格子点との間で得られる正規化項との和により計算する手段と、
前記対応候補のステレオマッチングのエネルギーが、最小の対応候補である格子点を、前記投影パターンと対応付けることにより形状復元を行う手段と
を含むことを特徴とする請求項1に記載の画像処理システム。 - 前記形状復元を行う手段は、
3つの標本画素からなる三角メッシュを作成し、準画素単位の深さを計算する手段と、
前記三角メッシュを前記投影装置における出力画像に再投影したときの誤差を、全ての標本画素について計算し、前記計算された誤差を最小化し、前記標本画素以外の画素における深さを線形補間する手段と
を含むことを特徴とする請求項1に記載の画像処理システム。 - 前記撮影装置は、第1および第2の撮影装置からなり、
前記形状復元を行う手段は、前記格子点についての前記第1および第2の撮影装置の間で得られる正規化項を、前記対応候補のステレオマッチングのエネルギーに加算して、前記対応候補を取捨選択する手段を含むことを特徴とする請求項3に記載の画像処理システム。 - 前記形状復元を行う手段は、前記格子点について、前記第1および第2の撮影装置で得られた画素単位の深さを、平均により統合する手段を含むことを特徴とする請求項5に記載の画像処理システム。
- 前記投影装置は、第1および第2の投影装置からなり、
前記形状復元を行う手段は、前記第1の投影装置から投影された第1の投影パターンと、前記第2の投影装置から投影された第2の投影パターンとで一致する格子点について、画素単位の深さを最適化する手段を含むことを特徴とする請求項1に記載の画像処理システム。 - 前記形状復元を行う手段は、
前記格子点についての前記投影パターンと、前記線検出により得られた結果との差を、前記格子点の周囲の複数の位置で計算し、該計算された結果を前記対応候補のマッチングコストとして、最小の対応候補である格子点を、前記投影パターンと対応付けることにより形状復元を行う手段を含むことを特徴とする請求項1に記載の画像処理システム。 - 前記投影パターンを前記観測対象に投影したときに、同じエピポーラ線上の任意の2つの交点についての類似度を比較し、最も類似度が低くなるように、前記投影パターンのパラメータが選択されていることを特徴とする請求項1に記載の画像処理システム。
- 投影装置により観測対象上に投影された投影パターンを、撮影装置により撮影し、前記投影装置および前記撮影装置に接続された画像処理装置において、該投影パターンを含む入力画像から形状復元を行う画像処理方法であって、
前記画像処理装置が、前記撮影装置により撮影された入力画像を取り込み、前記投影装置により投影された投影パターンの線検出を行うステップであって、該投影パターンは、波線によるグリッドパターンであることと、
前記画像処理装置が、前記線検出により抽出された縦横線の交点を、前記投影パターンと対応付けることにより形状復元を行い、復元された画像を表示するステップと
を備えたことを特徴とする画像処理方法。 - 投影装置により観測対象上に投影された投影パターンを、撮影装置により撮影し、前記投影装置および前記撮影装置に接続された画像処理装置に、該投影パターンを含む入力画像から形状復元を行わせるコンピュータ実行可能命令からなるコンピュータプログラムにおいて、
前記撮影装置により撮影された入力画像を取り込み、前記投影装置により投影された投影パターンの線検出を行うステップであって、該投影パターンは、波線によるグリッドパターンであることと、
前記線検出により抽出された縦横線の交点を、前記投影パターンと対応付けることにより形状復元を行い、復元された画像を表示するステップと
を実行させることを特徴とするコンピュータプログラム。
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/418,663 US9633439B2 (en) | 2012-07-30 | 2013-06-28 | Image processing system, and image processing method |
JP2014527960A JP6270157B2 (ja) | 2012-07-30 | 2013-06-28 | 画像処理システムおよび画像処理方法 |
EP13825095.6A EP2881702B1 (en) | 2012-07-30 | 2013-06-28 | Image processing system, and image processing method |
ES13825095T ES2696198T3 (es) | 2012-07-30 | 2013-06-28 | Sistema de procesamiento de imágenes y método de procesamiento de imágenes |
CN201380040691.1A CN104541127B (zh) | 2012-07-30 | 2013-06-28 | 图像处理系统以及图像处理方法 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2012-168412 | 2012-07-30 | ||
JP2012168412 | 2012-07-30 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014020823A1 true WO2014020823A1 (ja) | 2014-02-06 |
Family
ID=50027538
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2013/004059 WO2014020823A1 (ja) | 2012-07-30 | 2013-06-28 | 画像処理システムおよび画像処理方法 |
Country Status (6)
Country | Link |
---|---|
US (1) | US9633439B2 (ja) |
EP (1) | EP2881702B1 (ja) |
JP (1) | JP6270157B2 (ja) |
CN (1) | CN104541127B (ja) |
ES (1) | ES2696198T3 (ja) |
WO (1) | WO2014020823A1 (ja) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2950268A1 (de) * | 2014-05-28 | 2015-12-02 | Wincor Nixdorf International GmbH | Verfahren und Vorrichtung zur Erfassung der dreidimensionalen Form eines Objekts |
JP2017032335A (ja) * | 2015-07-30 | 2017-02-09 | キヤノン株式会社 | 情報処理装置、情報処理方法、プログラム |
JP2017217215A (ja) * | 2016-06-07 | 2017-12-14 | 公立大学法人広島市立大学 | 3次元形状計測装置及び3次元形状計測方法 |
WO2018139237A1 (ja) | 2017-01-25 | 2018-08-02 | 国立研究開発法人産業技術総合研究所 | 画像処理方法 |
CN112399028A (zh) * | 2016-03-01 | 2021-02-23 | 奇跃公司 | 深度感测系统和方法 |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9791267B2 (en) * | 2014-05-27 | 2017-10-17 | Bourbaki 13, Inc. | Determining three-dimensional information from projections or placement of two-dimensional patterns |
US10473916B2 (en) * | 2014-09-30 | 2019-11-12 | Washington University | Multiple-view compressed-sensing ultrafast photography (MV-CUP) |
JP2016114963A (ja) * | 2014-12-11 | 2016-06-23 | 株式会社リコー | 入力操作検出装置、プロジェクタ装置、電子黒板装置、デジタルサイネージ装置、及びプロジェクタシステム |
US10068338B2 (en) * | 2015-03-12 | 2018-09-04 | Qualcomm Incorporated | Active sensing spatial resolution improvement through multiple receivers and code reuse |
US9846943B2 (en) | 2015-08-31 | 2017-12-19 | Qualcomm Incorporated | Code domain power control for structured light |
US10145670B2 (en) * | 2016-01-12 | 2018-12-04 | The Boeing Company | Systems and methods for projected grid-based location tracking |
DE102016002398B4 (de) * | 2016-02-26 | 2019-04-25 | Gerd Häusler | Optischer 3D-Sensor zur schnellen und dichten Formerfassung |
US10225535B2 (en) * | 2016-08-11 | 2019-03-05 | Rabin Esrail | Self-adjusting portable modular 360-degree projection and recording computer system |
US10560679B2 (en) * | 2016-08-30 | 2020-02-11 | Microsoft Technology Licensing, Llc | Deformation detection and automatic calibration for a depth imaging system |
CA3049681A1 (en) * | 2017-01-23 | 2018-07-26 | The Board Of Trustees Of The University Of Illinois | Adaptive cyber-physical system for efficient monitoring of unstructured environments |
US20230107110A1 (en) * | 2017-04-10 | 2023-04-06 | Eys3D Microelectronics, Co. | Depth processing system and operational method thereof |
US10460512B2 (en) * | 2017-11-07 | 2019-10-29 | Microsoft Technology Licensing, Llc | 3D skeletonization using truncated epipolar lines |
US11758100B2 (en) | 2019-09-11 | 2023-09-12 | The Johns Hopkins University | Portable projection mapping device and projection mapping system |
CN110702034A (zh) * | 2019-10-25 | 2020-01-17 | 湖北工业大学 | 高反光表面三维面形测量方法、服务器及系统 |
US20220230335A1 (en) * | 2021-01-20 | 2022-07-21 | Nicolae Paul Teodorescu | One-shot high-accuracy geometric modeling of three-dimensional scenes |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090005924A1 (en) | 2006-01-27 | 2009-01-01 | Toyota Jidosha Kabushiki Kaisha | Vehicle and Control Method Thereof |
JP2009300277A (ja) * | 2008-06-13 | 2009-12-24 | Saitama Univ | 画像処理装置、画像処理方法およびプログラム |
JP2011242183A (ja) * | 2010-05-17 | 2011-12-01 | Hiroshima City Univ | 画像処理装置、画像処理方法およびプログラム |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8224064B1 (en) * | 2003-05-21 | 2012-07-17 | University Of Kentucky Research Foundation, Inc. | System and method for 3D imaging using structured light illumination |
JP2007114071A (ja) | 2005-10-20 | 2007-05-10 | Omron Corp | 三次元形状計測装置、プログラム、コンピュータ読み取り可能な記録媒体、及び三次元形状計測方法 |
JP2009031150A (ja) | 2007-07-27 | 2009-02-12 | Omron Corp | 三次元形状計測装置、三次元形状計測方法、三次元形状計測プログラム、および記録媒体 |
US7768656B2 (en) | 2007-08-28 | 2010-08-03 | Artec Group, Inc. | System and method for three-dimensional measurement of the shape of material objects |
WO2010036403A2 (en) * | 2008-05-06 | 2010-04-01 | Flashscan3D, Llc | System and method for structured light illumination with frame subwindows |
JP5631025B2 (ja) | 2010-03-10 | 2014-11-26 | キヤノン株式会社 | 情報処理装置、その処理方法及びプログラム |
JP5630208B2 (ja) | 2010-10-25 | 2014-11-26 | 株式会社安川電機 | 形状計測装置、ロボットシステムおよび形状計測方法 |
US8633969B2 (en) * | 2011-02-09 | 2014-01-21 | Omnivision Technologies, Inc. | Apparatus and method for three-dimensional image capture with extended depth of field |
JP6097903B2 (ja) * | 2011-07-15 | 2017-03-22 | 有限会社テクノドリーム二十一 | 3次元形状の取得装置、処理方法およびプログラム |
-
2013
- 2013-06-28 US US14/418,663 patent/US9633439B2/en active Active
- 2013-06-28 CN CN201380040691.1A patent/CN104541127B/zh not_active Expired - Fee Related
- 2013-06-28 ES ES13825095T patent/ES2696198T3/es active Active
- 2013-06-28 JP JP2014527960A patent/JP6270157B2/ja active Active
- 2013-06-28 EP EP13825095.6A patent/EP2881702B1/en not_active Not-in-force
- 2013-06-28 WO PCT/JP2013/004059 patent/WO2014020823A1/ja active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090005924A1 (en) | 2006-01-27 | 2009-01-01 | Toyota Jidosha Kabushiki Kaisha | Vehicle and Control Method Thereof |
JP2009300277A (ja) * | 2008-06-13 | 2009-12-24 | Saitama Univ | 画像処理装置、画像処理方法およびプログラム |
JP2011242183A (ja) * | 2010-05-17 | 2011-12-01 | Hiroshima City Univ | 画像処理装置、画像処理方法およびプログラム |
Non-Patent Citations (21)
Title |
---|
"POV-Ray", PERSISTENCE OF VISION PTY. LTD., 2004 |
"Swiss Ranger SR-4000", MESA IMAGING AG., 2011 |
A. 0. ULUSOY; F. CALAKLI; G. TAUBIN: "One-shot scanning using de bruij n spaced grids", THE 7TH IEEE CONF. 3DIM, 2009 |
C. JE; S. W. LEE; R. H. PARK: "High-contrast color stripe pattern for rapid structured-light range imaging", ECCV, vol. 1, 2004, pages 95 - 107, XP019005813 |
CANESTA, INC: "Canesta Vision EP Development Kit", - 2010, Retrieved from the Internet <URL:http://www.canesta.com/devkit.htm> |
H. KAWASAKI; R. FURUKAWA; R. SAGAWA; Y. YAGI: "Dynamic scene shape reconstruction using a single structured light pattern", CVPR, 23 June 2008 (2008-06-23), pages 1 - 8, XP031297260 |
J. SALVI; J. BATLLE; E. M. MOUADDIB: "A robust coded pattern projection for dynamic 3D scene measurement", PATTERN RECOGNITION, vol. 19, no. 11, 1998, pages 1055 - 1065, XP004142894, DOI: doi:10.1016/S0167-8655(98)00085-3 |
J. TAJIMA; M. IWAKAWA: "3-D data acquisition by rainbow range finder", ICPR, 1990, pages 309 - 313, XP010020284, DOI: doi:10.1109/ICPR.1990.118121 |
L. ZHANG; B. CURLESS; S. M. SEITZ: "Space time stereo: Shape recovery for dynamic scenes", IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, June 2003 (2003-06-01), pages 367 - 374 |
L. ZHANG; B. CURLESS; S. SEITZ: "Rapid shape acquisition using color structured light and multi-pass dynamic programming", 3DPVT, 2002, pages 24 - 36, XP010596631 |
M. MARUYAMA; S. ABE: "Range sensing by projecting multiple slits with random cuts", SPIE OPTICS, ILLUMINATION, AND IMAGE SENSING FOR MACHINE VISION IV, vol. 1194, 1989, pages 216 - 224 |
MICROSOFT: "Xbox 360 Kinect", 2010, Retrieved from the Internet <URL:http://www.xbox.com/en-US/Kinect.> |
NOZOMU KASUYA: "Projector-Camera Active Stereo using Single-colored Wave Grid Pattern", ITE TECHNICAL REPORT, vol. 36, no. 43, 30 October 2012 (2012-10-30), pages 13 - 16, XP008176055 * |
P. FELZENSZWALB; D. HUTTENLOCHER: "Efficient belief propagation for early vision", IJCV, vol. 70, 2006, pages 41 - 54, XP019410150, DOI: doi:10.1007/s11263-006-7899-4 |
P. VUYLSTEKE; A. OOSTERLINCK: "Range image acquisition with a single binary-encoded light pattern", IEEE TRANS. ON PAMI, vol. 12, no. 2, 1990, pages 148 - 164, XP000103229, DOI: doi:10.1109/34.44402 |
R. SAGAWA; Y. OTA; Y. YAGI; R. FURUKAWA; N. ASADA; H. KAWASAKI: "Dense 3d reconstruction method using a single pattern for fast moving object", ICCV, 2009 |
S. G. NARASIMHAN; S. J. KOPPAL; S. YAMAZAKI: "Temporal dithering of illumination for fast active vision", PROC. EUROPEAN CONFERENCE ON COMPUTER VISION, October 2008 (2008-10-01), pages 830 - 844, XP019109367 |
S. RUSINKIEWICZ; O. HALL-HOLT; M. LEVOY: "Realtime 3D model acquisition", PROC. SIGGRAPH, 2002, pages 438 - 446 |
S. ZHANG; P. HUANG: "High-resolution, real-time 3D shape acquisition", PROC. CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOP, 2004, pages 28, XP010761973 |
T. WEISE; B. LEIBE; L. V. GOOL: "Fast 3D scanning with automatic motion compensation", CVPR, 2007 |
THE STANFORD 3D SCANNING REPOSITORY, 2012, Retrieved from the Internet <URL:http://www.graphics.stanford.edu/data/3Dscanrep> |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2950268A1 (de) * | 2014-05-28 | 2015-12-02 | Wincor Nixdorf International GmbH | Verfahren und Vorrichtung zur Erfassung der dreidimensionalen Form eines Objekts |
JP2017032335A (ja) * | 2015-07-30 | 2017-02-09 | キヤノン株式会社 | 情報処理装置、情報処理方法、プログラム |
CN112399028A (zh) * | 2016-03-01 | 2021-02-23 | 奇跃公司 | 深度感测系统和方法 |
US10964039B2 (en) * | 2016-03-01 | 2021-03-30 | Magic Leap, Inc. | Depth sensing systems and methods |
US20210183087A1 (en) * | 2016-03-01 | 2021-06-17 | Magic Leap, Inc. | Depth sensing systems and methods |
US11475583B2 (en) | 2016-03-01 | 2022-10-18 | Magic Leap, Inc. | Depth sensing systems and methods |
JP2017217215A (ja) * | 2016-06-07 | 2017-12-14 | 公立大学法人広島市立大学 | 3次元形状計測装置及び3次元形状計測方法 |
WO2018139237A1 (ja) | 2017-01-25 | 2018-08-02 | 国立研究開発法人産業技術総合研究所 | 画像処理方法 |
US11076093B2 (en) | 2017-01-25 | 2021-07-27 | National Institute Of Advanced Industrial Science And Technology | Image processing method |
Also Published As
Publication number | Publication date |
---|---|
JP6270157B2 (ja) | 2018-01-31 |
JPWO2014020823A1 (ja) | 2016-07-21 |
EP2881702A1 (en) | 2015-06-10 |
US9633439B2 (en) | 2017-04-25 |
US20150221093A1 (en) | 2015-08-06 |
ES2696198T3 (es) | 2019-01-14 |
EP2881702A4 (en) | 2016-03-23 |
CN104541127B (zh) | 2017-07-04 |
EP2881702B1 (en) | 2018-09-12 |
CN104541127A (zh) | 2015-04-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6270157B2 (ja) | 画像処理システムおよび画像処理方法 | |
Zhang | Absolute phase retrieval methods for digital fringe projection profilometry: A review | |
Fechteler et al. | Adaptive colour classification for structured light systems | |
Salvi et al. | A state of the art in structured light patterns for surface profilometry | |
JP5317169B2 (ja) | 画像処理装置、画像処理方法およびプログラム | |
CN106796661B (zh) | 投影光图案的系统、方法和计算机程序产品 | |
US20120176478A1 (en) | Forming range maps using periodic illumination patterns | |
JP5643645B2 (ja) | 有形物の形状の3次元測定のためのシステム及び方法 | |
Sagawa et al. | Grid-based active stereo with single-colored wave pattern for dense one-shot 3D scan | |
US20100328308A1 (en) | Three Dimensional Mesh Modeling | |
US20130038696A1 (en) | Ray Image Modeling for Fast Catadioptric Light Field Rendering | |
Takimoto et al. | 3D reconstruction and multiple point cloud registration using a low precision RGB-D sensor | |
JP5633058B1 (ja) | 3次元計測装置及び3次元計測方法 | |
JP6097903B2 (ja) | 3次元形状の取得装置、処理方法およびプログラム | |
US11398085B2 (en) | Systems, methods, and media for directly recovering planar surfaces in a scene using structured light | |
US9147279B1 (en) | Systems and methods for merging textures | |
JP5761750B2 (ja) | 画像処理方法および装置 | |
CN107517346A (zh) | 基于结构光的拍照方法、装置及移动设备 | |
Ruchay et al. | Accuracy analysis of 3D object reconstruction using RGB-D sensor | |
Rodrigues et al. | Structured light techniques for 3D surface reconstruction in robotic tasks | |
Furukawa et al. | One-shot entire shape acquisition method using multiple projectors and cameras | |
Cheng et al. | Structured light-based shape measurement system | |
KR101765257B1 (ko) | 삼차원 영상 정보 획득 방법 및 이를 구현한 컴퓨팅 장치 | |
Sagawa et al. | Parallel processing of grid-based one-shot structured-light system for online 3D reconstruction of moving objects | |
Ouji et al. | A space-time depth super-resolution scheme for 3D face scanning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13825095 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2014527960 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14418663 Country of ref document: US Ref document number: 2013825095 Country of ref document: EP |