US20190116326A1 - Apparatus and method for capturing still images and video using coded lens imaging techniques - Google Patents
Apparatus and method for capturing still images and video using coded lens imaging techniques Download PDFInfo
- Publication number
- US20190116326A1 US20190116326A1 US16/207,941 US201816207941A US2019116326A1 US 20190116326 A1 US20190116326 A1 US 20190116326A1 US 201816207941 A US201816207941 A US 201816207941A US 2019116326 A1 US2019116326 A1 US 2019116326A1
- Authority
- US
- United States
- Prior art keywords
- sensor
- image
- lens
- coded
- aperture
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims description 57
- 238000003384 imaging method Methods 0.000 title description 57
- 230000033001 locomotion Effects 0.000 claims description 19
- 230000005855 radiation Effects 0.000 claims description 19
- 230000005670 electromagnetic radiation Effects 0.000 claims 22
- 239000004065 semiconductor Substances 0.000 abstract description 11
- 239000000463 material Substances 0.000 abstract description 6
- 230000000903 blocking effect Effects 0.000 abstract description 2
- 230000003287 optical effect Effects 0.000 description 20
- 230000000737 periodic effect Effects 0.000 description 19
- 239000003550 marker Substances 0.000 description 18
- 230000000694 effects Effects 0.000 description 14
- 230000008569 process Effects 0.000 description 12
- 230000005540 biological transmission Effects 0.000 description 11
- 238000003491 array Methods 0.000 description 10
- 238000005314 correlation function Methods 0.000 description 10
- 238000001914 filtration Methods 0.000 description 10
- 239000007787 solid Substances 0.000 description 10
- 238000012546 transfer Methods 0.000 description 9
- 230000008901 benefit Effects 0.000 description 8
- 230000003321 amplification Effects 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 238000003199 nucleic acid amplification method Methods 0.000 description 7
- 238000012545 processing Methods 0.000 description 7
- 230000004075 alteration Effects 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 238000004422 calculation algorithm Methods 0.000 description 5
- 230000007423 decrease Effects 0.000 description 5
- 230000002829 reductive effect Effects 0.000 description 5
- 239000011521 glass Substances 0.000 description 4
- 230000000670 limiting effect Effects 0.000 description 4
- 230000036961 partial effect Effects 0.000 description 4
- 238000013139 quantization Methods 0.000 description 4
- 230000002238 attenuated effect Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000006073 displacement reaction Methods 0.000 description 3
- 230000009021 linear effect Effects 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 238000012805 post-processing Methods 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000003139 buffering effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000000354 decomposition reaction Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000005251 gamma ray Effects 0.000 description 2
- 238000009499 grossing Methods 0.000 description 2
- 230000003116 impacting effect Effects 0.000 description 2
- 239000003973 paint Substances 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 240000005020 Acaciella glauca Species 0.000 description 1
- 238000012935 Averaging Methods 0.000 description 1
- 206010010071 Coma Diseases 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- QBWCMBCROVPCKQ-UHFFFAOYSA-N chlorous acid Chemical compound OCl=O QBWCMBCROVPCKQ-UHFFFAOYSA-N 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 238000005286 illumination Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 210000003127 knee Anatomy 0.000 description 1
- CNQCVBJFEGMYDW-UHFFFAOYSA-N lawrencium atom Chemical compound [Lr] CNQCVBJFEGMYDW-UHFFFAOYSA-N 0.000 description 1
- 238000012886 linear function Methods 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- ORQBXQOJMQIAOY-UHFFFAOYSA-N nobelium Chemical compound [No] ORQBXQOJMQIAOY-UHFFFAOYSA-N 0.000 description 1
- 230000036963 noncompetitive effect Effects 0.000 description 1
- 230000009022 nonlinear effect Effects 0.000 description 1
- 238000009206 nuclear medicine Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 210000001747 pupil Anatomy 0.000 description 1
- 235000003499 redwood Nutrition 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
Images
Classifications
-
- H04N5/357—
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N25/00—Circuitry of solid-state image sensors [SSIS]; Control thereof
- H04N25/60—Noise processing, e.g. detecting, correcting, reducing or removing noise
- H04N25/61—Noise processing, e.g. detecting, correcting, reducing or removing noise the noise originating only from the lens unit, e.g. flare, shading, vignetting or "cos4"
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N25/00—Circuitry of solid-state image sensors [SSIS]; Control thereof
- H04N25/60—Noise processing, e.g. detecting, correcting, reducing or removing noise
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N25/00—Circuitry of solid-state image sensors [SSIS]; Control thereof
- H04N25/10—Circuitry of solid-state image sensors [SSIS]; Control thereof for transforming different wavelengths into image signals
- H04N25/11—Arrangement of colour filter arrays [CFA]; Filter mosaics
- H04N25/13—Arrangement of colour filter arrays [CFA]; Filter mosaics characterised by the spectral characteristics of the filter elements
- H04N25/134—Arrangement of colour filter arrays [CFA]; Filter mosaics characterised by the spectral characteristics of the filter elements based on three different wavelength filter elements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N25/00—Circuitry of solid-state image sensors [SSIS]; Control thereof
- H04N25/60—Noise processing, e.g. detecting, correcting, reducing or removing noise
- H04N25/61—Noise processing, e.g. detecting, correcting, reducing or removing noise the noise originating only from the lens unit, e.g. flare, shading, vignetting or "cos4"
- H04N25/611—Correction of chromatic aberration
-
- H04N5/3572—
Definitions
- This invention relates generally to the field of image capture and image processing. More particularly, the invention relates to an apparatus and method for capturing still images and video using coded lens techniques.
- Photographic imaging is commonly done by focusing the light coming from a scene using a single glass lens which is placed in front of a light sensitive detector such as a photographic film or a semiconductor sensor including CCD and CMOS sensors.
- a light sensitive detector such as a photographic film or a semiconductor sensor including CCD and CMOS sensors.
- coded aperture imaging For imaging high-energy radiation such as x-ray or gamma rays, other techniques must be used because such radiation cannot be diffracted using glass lenses.
- a number of techniques have been proposed including single pinhole cameras and multi-hole collimator systems.
- a particularly beneficial technique is “coded aperture imaging” wherein a structured aperture, consisting of a suitably-chosen pattern of transparent and opaque elements, is placed in front of a detector sensitive to the radiation to be imaged. When the aperture pattern is suitably chosen, the imaged scene can be digitally reconstructed from the detector signal.
- Coded aperture imaging has the advantage of combining high spatial resolution with high light efficiency. Coded aperture imaging of x-ray and gamma ray radiation using structured arrays of rectangular or hexagonal elements is known from R. H.
- a particularly useful class of coded imaging systems is known from E. E. F ENIMORE AND T. M. C ANNON : C ODED A PERTURE I MAGING W ITH U NIFORMLY R EDUNDANT A RRAYS. A PPL . O PT., 17:337-347, 1978 (hereinafter “Fenimore”).
- a basic aperture pattern is cyclically repeated such that the aperture pattern is a 2 ⁇ 2 mosaic of the basic pattern.
- the detector has at least the same size as the basic aperture pattern.
- the “fully coded FOV” (“FOV” shall be used herein to refer to “field-of-view”) is defined as the area within the FOV, within which a point source would cast a complete shadow of a cyclically shifted version of the basic aperture pattern onto the aperture.
- the “partially coded FOV” is defined as the area within the FOV, within which a point source would only cast a partial shadow of the basic aperture pattern onto the aperture.
- a collimator is placed in front of the detector which limits the FOV to the fully coded FOV, thus allowing an unambiguous reconstruction of the scene from the detector signal.
- a collimator has the undesired property of only transmitting light without attenuation which is exactly parallel to the optical axis. Any off-axis light passing through the collimator is attenuated, the attenuation increasing towards the limits of the FOV. At the limits of the FOV, the attenuation is 100%, i.e., no light can pass through the collimator at such angles. This effect will be denoted as “collimator attenuation” within this document. Both in the x-direction and in the y-direction, collimator attenuation is proportional to the tangent of the angle between the light and the optical axis.
- the effect of collimator attenuation may have to be reversed in order to obtain a photometrically correct image.
- the attenuation, especially the collimator attenuation is very high, i.e. this factor approaches zero.
- Inverting the collimator attenuation in this case involves amplifying the pixel values with a very large factor, approaching infinity at the limits of the FOV. Since any noise in the reconstruction will also be amplified by this factor, pixels close to the limits of the FOV may be very noisy or even unusable.
- the basic aperture pattern can be characterized by means of an “aperture array” of zeros and ones wherein a one stands for a transparent and a zero stands for an opaque aperture element.
- the scene within the FOV can be characterized as a two-dimensional array wherein each array element contains the light intensity emitted from a single pixel within the FOV.
- the sensor signal can be characterized as the two-dimensional, periodic cross-correlation function between the FOV array and the aperture array. It should be noted that the sensor signal as such has no resemblance with the scene being imaged.
- a “reconstruction filter” can be designed by computing the two-dimensional periodic inverse filter pertaining to the aperture array.
- the two-dimensional periodic inverse filter is a two-dimensional array which is constructed in such a way that all sidelobes of the two-dimensional, periodic cross-correlation function of the aperture array and the inverse filter are zero.
- URAs Uniformly Redundant Arrays
- URAs have a two-dimensional, periodic cross-correlation function whose sidelobe values are all identical.
- URAs have an inverse filter which has the same structure as the URA itself, except for a constant offset and constant scaling factor.
- Such reconstruction filters are optimal in the sense that any noise in the sensor signal will be subject to the lowest possible amplification during the reconstruction filtering.
- URAs can be algebraically constructed only for very few sizes.
- MURAs have the additional advantage that, with the exception of a single row and a single column, they can be represented as the product of two one-dimensional sequences, one being a function only of the column index and the other being a function only of the row index to the array.
- their inverse filter can also be represented as the product of two one-dimensional sequences. This property permits to replace the two-dimensional in-verse filtering by a sequence of two one-dimensional filtering operations, making the reconstruction process much more efficient to compute.
- near-field effects occur.
- the “near field” is defined as those ranges which are less than 10 times the sensor size, aperture size or distance between aperture and sensor, whichever of these quantities is the largest. If an object is in the near field, the sensor image can no longer be described as the two-dimensional cross-correlation between the scene and the aperture array. This causes artifacts when attempting to reconstructing the scene using inverse filtering.
- methods for reducing such near-field artifacts are disclosed. These methods involve imaging the scene using two separate coded apertures where the second aperture array is the inverse of the first aperture array (i.e. transparent elements are replaced by opaque elements and vice versa). The reconstruction is then computed from two sensor signals acquired with the two different apertures in such a manner that near-field artifacts are reduced in the process of combining the two sensor images.
- Coded aperture imaging to date has been limited to industrial, medical, and scientific applications, primarily with x-ray or gamma-ray radiation, and systems that have been developed to date are each designed to work within a specific, constrained environment.
- existing coded aperture imaging systems are each designed with a specific view depth (e.g. effectively at infinity for astronomy, or a specific distance range for nuclear or x-ray imaging).
- coded aperture imaging has been used with either controlled radiation sources (e.g. in nuclear, x-ray, or industrial imaging), or astronomical radiation sources that are relatively stable and effectively at infinity.
- existing coded aperture systems have had the benefit of operating within constrained environments, quite unlike, for example, a typical photographic camera using a lens.
- a typical photographic camera using a single lens i.e. a single lens per sensor or film frame; stereoscopic cameras have 2 lenses, but utilize a separate sensor or film frame per lens
- a single lens i.e. a single lens per sensor or film frame; stereoscopic cameras have 2 lenses, but utilize a separate sensor or film frame per lens
- No coded aperture system has ever been designed that can handle these types of unconstrained imaging environments that billions of photographic cameras with single lenses handle everyday.
- Photographic imaging in the optical spectrum using a single lens has a number of disadvantages and limitations.
- the main limitation of single lens photography is its finite depth-of-field (DOF), particularly at large aperture settings. Only scenes at a limited DOF can be in focus in a single lens image while any objects closer or farther away from the camera than the DOF will appear blurred in the image.
- DOF depth-of-field
- a single lens camera must be manually or automatically focused before an image can be taken. This is a disadvantage when imaging objects which are moving fast or unexpectedly such as in sports photography or photography of children or animals, particularly at large apertures with a short DOF. In such situations, the images may be out of focus because there was not enough time to focus or because the object moved unexpectedly when acquiring the image. Single lens photography does not allow a photographer to retrospectively change the focus once an image has been acquired.
- focusing a single lens camera involves adjusting the distance between one or more lenses and the sensor. This makes it necessary for a single lens camera to contain mechanically moving parts which makes it prone to mechanical failure.
- Various alternatives to glass lenses such as liquid lenses (see, e.g., B. H ENDRIKS & S TEIN K UIPER : T HROUGH A L ENS S HARPLY . IEEE S PECTRUM , D ECEMBER, 2004), have been proposed in an effort to mitigate the mechanical limitations of a glass lens, but despite the added design complexity and potential limitations (e.g., operating temperature range and aperture size) of such alternatives, they still suffer from the limitation of a limited focus range.
- single lens cameras have a limited dynamic range as a result of their sensors (film or semiconductor sensors) having a limited dynamic range.
- specialized semiconductor image sensors e.g. the D1000 by Pixim, Inc. of Mountain View, Calif.
- image sensors are much more expensive than conventional CCD or CMOS image sensors, and as such are not cost-competitive for many applications, including mass-market general photography.
- single lenses can provide a rough estimate of the distance between the lens and a subject object. But since most photographic applications require lenses designed to have as long a range of concurrent focus as possible, using focus for a distance estimate is extremely imprecise. Since a single lens can only be focused to a single distance range at a time, at best, a lens will provide an estimate of the distance to a single object range at a given time.
- CAI Coded Aperture Imaging
- CAI Application addresses many of the limitations of a single lens camera. Relative to a single lens camera, CAI makes it possible to make a thinner camera, a lighter camera, a camera with greater dynamic range, and also a camera which can reconstruct an image which is in focus throughout a large range of depth in the scene.
- FIG. 1 A visible light coded aperture camera according to one embodiment described in the CAI Application is illustrated in FIG. 1 .
- the illustrated embodiment includes a coded aperture 101 placed in front of a light sensitive grayscale or color semiconductor sensor 104 .
- the coded aperture 1012 is a pattern of circular, square, hexagonal, rectangular or other tiled elements, some of which are transparent to visible light (e.g. element 102 ) and some of which are opaque (e.g. element 103 ).
- coded aperture 101 has very few transparent elements.
- a typical coded aperture may have significantly more transparent elements (e.g., 50%).
- Visible light a from 2-dimensional or 3-dimensional scene 100 (which may be illuminated by ambient or artificial lighting) is projected through the coded aperture 101 onto image sensor 104 .
- the camera is capable of limiting the FOV to the fully coded FOV projected onto the sensor. In one embodiment, this is implemented by the use of a self-collimating coded aperture 101 (utilizing baffles for collimation, as explained below).
- the space between the coded aperture and the sensor is shielded by a light-opaque housing 105 (only the outline of which is shown in FIG. 1 ), preventing any light from reaching the sensor other than by passing through an open element of the coded aperture.
- the camera further includes an image sensor readout subsystem 110 with an interface 109 to the image sensor 104 (which may be similar to those used in prior coded aperture systems).
- the readout subsystem clocks out the analog image signal from the image sensor 104 and applies analog buffering, amplification and/or filtering as required by the particular image sensor.
- An example of such a readout subsystem 110 that also incorporates ND 120 is the NDX-1260 CleanCapture Image Processor by NuCore Technology, Inc. of Sunnyvale, Calif.
- the ability to adjust the zero offset 112 and gain 111 to analog pixel values read by the readout subsystem 110 will increase the dynamic range of the captured image, but is not essential if the image sensor has a sufficient dynamic range for the desired image quality without a zero-offset and gain adjustment.
- op amp operational amplifier
- the output of the readout subsystem 110 is coupled by interface 113 to at least one analog-to-digital converter (A/D) 120 which digitizes the analog output.
- A/D analog-to-digital converter
- the output of the A/D is coupled via interface 121 to an image reconstruction processor 130 , which in one embodiment incorporates a Digital Signal Processor (DSP) 132 and Random Access Memory (RAM) 131 .
- DSP Digital Signal Processor
- RAM Random Access Memory
- the digitized image from the interface 121 is stored in RAM 131 , and the DSP 132 post-processes the image so as to reconstruct the original scene 101 into a grayscale or color image.
- the image reconstruction processor 130 incorporates a general purpose CPU such as an Intel Corporation Pentium 4®, or similar general purpose processor.
- the image reconstruction processor 130 incorporates an Application-Specific Integrated Circuit (“ASIC”) which implements part or all of the reconstruction processing in dedicated digital structures.
- ASIC Application-Specific Integrated Circuit
- CAI CAI-reconstructed image
- the resolution of a CAI camera is limited by the larger of two primary factors: (a) the order of the aperture array, and (b) distortion in the projected image caused by diffraction. This is explained further in the following paragraphs.
- FIG. 2 shows several representative coded aperture array patterns of MURAs of “order” 101 , 61 and 31 (described in more detail in the CAI application).
- FIG. 2 also shows coded aperture array patterns of PBAs of order 8 and 24 .
- the PBAs 8 and 24 are shown enlarged relative to the MURAs to better show their patterns.
- the coded aperture array patterns are formed from a square array (with horizontal and vertical dimensions of the specified order) that is repeated twice in the horizontal and twice in the vertical dimension. So, for example, the MURA 101 pattern has a total size of 202 ⁇ 202. Note also that each of the aperture elements in the arrays is of the same size.
- a CAI camera can not resolve an image that is higher resolution than the order of its coded aperture array.
- a MURA 101 CAI camera can not resolve an image of higher resolution than 101 ⁇ 101 pixels.
- FIG. 3 shows one embodiment of the visible light coded aperture camera shown in FIG. 1 .
- the embodiment shown in FIG. 3 is not useful for many applications because the resolution of the reconstructed image is only 3 ⁇ 3 pixels, but it is illustrative of how a camera such as that shown in FIG. 1 works.
- a MURA order 3 (“MURA 3 ”) aperture array 301 contains 16 open apertures, such as open aperture 302 , and 20 closed apertures, such as closed aperture 303 .
- Color or grayscale sensor 304 is the same size as one quadrant (i.e. one 3 ⁇ 3 block of apertures) of the MURA 3 aperture array 301 and in this embodiment it is positioned centered relative to the MURA 3 aperture array 301 .
- Orthographic View 320 of FIG. 3 reveals more of the structure of the camera.
- Baffles (referred to as “collimators” in the CAI Application) 315 serve to collimate the light passing through open apertures, such as open aperture 302 . This restricts the FOV of each aperture projection onto color or grayscale sensor 304 . Closed apertures such as closed aperture 303 are covered with an opaque cover so they do not allow light to pass through.
- Sensor 304 is separated from MURA 3 aperture array 301 and baffles 317 to allow space for the overlapping projections from each of the open apertures. The entire unit is contained within a light-tight camera body 316 , which is shown to be transparent for the purposes of illustration. Note that in this particular example, even if sensor 304 is a very high-resolution sensor, only a 3 ⁇ 3 pixel image can be reconstructed.
- FIG. 4 illustrates how light is projected through the MURA 3 aperture array.
- Illustration 400 shows the MURA 3 aperture array 401 delineated by a solid black outline, with exemplary open aperture 402 and closed aperture 403 .
- the position of color or grayscale sensor 404 is delineated by a dotted outline.
- Open aperture 405 is delineated by a dashed line.
- the light that passes through aperture 405 projects onto a square area on the sensor plane shown as a gray square 406 .
- projection 406 is a square approximately 9 times larger than aperture 405 and centered on aperture 405 . Depending on how close or far sensor 404 is to the aperture array, this projection may correspond to a wider or narrower FOV. Baffles around aperture 405 (not shown in this illustration, but visible as baffles 317 in FIG. 3 ) are used in this embodiment to limit the extent of projection 406 to approximately 9 times larger than the size of aperture 405 .
- Illustration 410 shows the overlaying of the 4 projections from the upper right quadrant of aperture array 401 .
- the 4 open apertures 415 in the upper right quadrant are delineated with dashed outlines.
- the 4 projections 416 from these 4 apertures are shown as overlapping gray areas.
- Each projection like the projection 406 shown in illustration 400 , is a square approximately 9 times the size of its aperture and is centered on its aperture, and is delineated by a solid gray line.
- varying levels of gray scale are used to fill each area. The lightest gray indicates 1 projection, the next darker indicates 2 projections overlapping, the next darker indicates 3 projections overlapping, and finally the darkest indicates 4 projections overlapping.
- Illustration 420 shows the overlaying of all 16 projections from the entire aperture array 401 .
- the 16 open apertures 425 are delineated by dashed outlines.
- Each projection like the projection 406 shown in illustration 400 , is a square approximately 9 times the size of its aperture and centered on its aperture, and is delineated by a solid gray line.
- varying levels of gray scale are used as described in the previous paragraph. Note that in this embodiment each area of sensor 404 is shown covered by 4 overlapping projections.
- f/2.8 is good light transmission performance for a photographic lens, so the description of the MURA 3 coded aperture camera in the last few paragraphs characterizes a camera with potentially desirable light transmission characteristics. Unfortunately, only a 3 x 3 pixel image can be reconstructed by the system described.
- Each element in a CAI camera acts geometrically like a pinhole in a pinhole camera. Light passing through each aperture makes a projection onto the sensor, just as it would in a pinhole camera. And like a pinhole camera, a CAI camera is subject to the diffraction effects of light passing through a pinhole. In a pinhole, these diffraction effects create a point source projected pattern commonly known as the “Airy disk”.
- the primary lobe of the Airy disk roughly defines the smallest resolvable spot size from a given pinhole camera projection. At a given distance from the pinhole to the sensor, the Airy disk increases in size as the pinhole decreases in size. From a geometric point of view, the resolution (i.e.
- the optimal pinhole size of a 1′′ focal length (i.e. 1′′ thick) pinhole camera is about 0.007′′.
- the optimal pinhole size of a 10′′ focal length (i.e. 10′′ thick) pinhole camera is about 0.023′′.
- visible light CAI cameras are also subject to diffraction effects which may result in resolution/size trade-offs.
- the diffraction patterns are more complex than pinhole diffraction patterns because of the complexity of the aperture patterns, and consequently, determining the impact on image resolution and/or camera size requirements is more complex.
- the pixel resolution of the CAI image can be no higher than the order of the aperture array, to achieve a high-resolution image it is necessary to utilize high order aperture arrays which can potentially exhibit worse diffraction effects than lower order aperture arrays or, alternatively, require longer focal lengths (and, as a result, larger camera sizes) to mitigate those diffraction effects.
- plenoptic camera Another approach to improving the performance of a lens system in a digital camera is a plenoptic camera.
- the basic concept of a plenoptic camera is described in U.S. Pat. No. 5,076,687.
- the word “plenoptic” is not used in the patent, the device referenced in the patent is called a “plenoptic camera” by its inventor in a web page describing the camera at: http://www-bcs.mit.edu/people/jyawang/demos/plenoptic/plenoptic.html.
- the apparatus comprises: a coded lens array including a plurality of lenses arranged in a coded pattern with opaque material blocking array elements not containing lenses; and a light-sensitive semiconductor sensor coupled to the coded lens array and positioned at a specified distance behind the coded lens array, the light-sensitive sensor configured to sense light transmitted through the lenses in the coded lens array.
- FIG. 1 illustrates a visible light coded aperture camera according to one embodiment of the invention.
- FIG. 2 illustrates three exemplary MURA patterns and two exemplary PBA patterns employed in accordance with the underlying principles of the invention.
- FIG. 3 illustrates the configuration of a MURA order 3 coded aperture array, baffles, sensor, and a camera body in accordance with one embodiment of the invention.
- FIG. 4 illustrates the projection of light from transparent apertures in a MURA 3 coded aperture array in accordance with one embodiment of the invention.
- FIG. 5 illustrates a coded lens camera according to one embodiment of the invention.
- FIG. 6 illustrates the configuration of a MURA order 3 coded lens array, baffles, sensor, and a camera body in accordance with one embodiment of the invention.
- FIG. 7 illustrates the projection of light from transparent apertures in a MURA 3 coded lens array in accordance with one embodiment of the invention.
- FIG. 8 illustrates a side view of a MURA order 3 coded lens camera in accordance with one embodiment of the invention.
- FIG. 9 illustrates an exemplary RGB Bayer Pattern employed in one embodiment with the invention.
- FIG. 10 illustrates image sensors implemented as a multi-layer structure and used in one embodiment of the invention.
- FIG. 11 a illustrates one embodiment of the invention in which an output signal is digitized by an analog-to-digital converter (A/D) in order to allow digital image reconstruction and post-processing.
- A/D analog-to-digital converter
- FIG. 11 b illustrates a process for selecting zero offset and gain in accordance with one embodiment of the invention.
- FIG. 12 illustrates a coded lens imaging characteristic and a typical lens imaging characteristic.
- FIG. 13 illustrates a graph showing typical CMOS and CCD image sensor transfer characteristics.
- FIG. 14 illustrates a side view of a MURA order 3 coded lens camera with multi-element lens in accordance with one embodiment of the invention.
- FIG. 15 illustrates a gearing arrangement for simultaneously focusing all of the lenses in a coded lens array in accordance with one embodiment of the invention.
- FIG. 16 illustrates a side view of a multi-element coded lens system with a gearing system for simultaneously focusing all the lenses in a coded lens array in accordance with one embodiment of the invention.
- FIG. 17 a illustrates three examples of a projection and reconstruction of three flat scenes at a known range using a MURA 3 coded lens array in accordance with one embodiment of the invention.
- FIG. 17 b illustrates three examples of a projection and reconstruction of three flat scenes at a known range using a PBA 24 coded lens array in accordance with one embodiment of the invention.
- FIG. 18 illustrates a reconstruction of an image at different ranges to identify the correct range in accordance with one embodiment of the invention.
- FIG. 19 illustrates an image in which a person is standing close to a camera, while mountains are far behind the person.
- FIG. 20 illustrates how the person from FIG. 19 can readily be placed in a scene with a different background.
- FIG. 21 illustrates a photograph of an exemplary motion capture session.
- FIG. 5 A visible light coded lens array camera. for either single shot images or sequential (e.g. video) images, including readout electronics and display, according to one embodiment of the invention, is illustrated in FIG. 5 .
- the illustrated embodiment includes a coded lens array 501 placed in front of a light sensitive grayscale or color semiconductor sensor 504 .
- the coded lens array 501 is a pattern of circular, square, hexagonal or rectangular (or any pattern that can be tiled on a plane) apertures, some of which are transparent (i.e. “open”) to visible light (e.g. element 502 ) and some of which are opaque (i.e. “closed) to visible light (e.g. element 503 ).
- Each open aperture, such as 502 is covered by (or contains) a lens such as 508 , so that virtually all of the light passing through the open aperture passes through the lens.
- a typical coded lens array has approximately 50% transparent apertures, each with a lens.
- the coded lens array pattern shown is a MURA order 3 with a 4/5 ratio of transparent to opaque apertures.
- Visible light a from 2-dimensional or 3-dimensional scene 500 (which may be illuminated by ambient or artificial lighting) is projected through the lenses and open apertures of coded aperture array 501 onto image sensor 504 .
- the camera is capable of limiting the FOV to the fully coded FOV projected onto the sensor.
- the light contributions of overlapping projections in this fully coded FOV is shown in illustration 620 of FIG.
- this is implemented by the use of a self-collimating coded lens array 501 (self-collimation is accomplished through baffles 517 behind the coded lens array 501 , which are explained below).
- the space between the coded lens array and the sensor is shielded by a light-opaque housing 516 (only the outline of which is shown in FIG. 5 ), preventing any light from reaching the sensor other than by passing through a lens and open aperture of the coded lens array 501 .
- the camera further includes an image sensor readout subsystem 510 with an interface 509 to the image sensor 504 .
- the readout subsystem clocks out the analog image signal from the image sensor 504 and applies analog buffering, amplification and/or filtering as required by the particular image sensor.
- An example of such a readout subsystem 510 that also incorporates A/D 520 is the NDX-1260 CleanCapture Image Processor by NuCore Technology, Inc. of Sunnyvale, Calif.
- the ability to adjust the zero offset 512 and gain 511 to analog pixel values read by the readout subsystem 510 will increase the dynamic range of the captured image, but is not essential if the image sensor has a sufficient dynamic range for the desired image quality without a zero-offset and gain adjustment.
- op amp operational amplifier
- the output of the readout subsystem 510 is coupled by interface 513 to at least one analog-to-digital converter (A/D) 520 which digitizes the analog output.
- A/D analog-to-digital converter
- the output of the A/D is coupled via interface 521 to an image reconstruction processor 530 , which in one embodiment incorporates a Digital Signal Processor (DSP) 532 and Random Access Memory (RAM) 531 .
- DSP Digital Signal Processor
- RAM Random Access Memory
- the digitized image from the interface 521 is stored in RAM 531 , and the DSP 532 post-processes the image so as to reconstruct the original scene 500 into a grayscale or color image.
- the image reconstruction processor 530 incorporates a general purpose CPU such as an Intel Corporation Pentium 4®, or similar general purpose processor.
- the image reconstruction processor 530 incorporates an Application-Specific Integrated Circuit (“ASIC”) which implements part or all of the reconstruction processing in dedicated digital structures.
- ASIC Application-Specific Integrated Circuit
- FIG. 6 shows one embodiment of the visible light coded lens array camera shown in FIG. 5 .
- a MURA order 3 (“MURA 3 ”) lens array 601 contains 16 open apertures, such as open aperture 602 , and 20 closed apertures, such as closed aperture 603 .
- Each open aperture, such as 602 contains one lens.
- the lenses are round, but in alternative embodiments the lens may be other shapes (e.g. squares or hexagons) that may more completely fill the open aperture 602 area. But, regardless of the shape of lens 608 in the present embodiment, any remaining area of the open aperture 602 not filled bylens 608 must be opaque or nearly opaque.
- Color or grayscale sensor 604 is the same size as one quadrant (i.e.
- illustration 610 shows sensor 604 's placement location behind MURA 3 lens array 601 by showing it through the circles that illustrate the shape of the lenses. This is done simply for the sake of illustration, and this may not what would be seen upon visual inspection of an actual system due to the refraction effects of the lenses if an observer would look through them.
- Orthographic View 620 of FIG. 6 reveals more of the structure of the camera.
- Baffles (referred to as “collimators” in the CAI Application) 617 serve to collimate the light passing through the lens and open apertures, such as open aperture 602 and lens 608 . This restricts the FOV of each aperture projection onto color or grayscale sensor 604 . Closed apertures such as closed aperture 603 are covered with an opaque cover so they do not allow light to pass through.
- Sensor 604 is separated from MURA 3 aperture array 611 and baffles 617 to allow space for the overlapping projections from each of the open apertures.
- the entire unit is contained within a light-tight camera body 616 , which is shown to be transparent for the purposes of illustration.
- FIG. 7 illustrates how light is projected through the MURA 3 coded lens array 701 .
- Illustration 700 shows the MURA 3 coded lens array 701 , with exemplary open aperture and lens 702 and closed aperture 703 .
- the position of color or grayscale sensor 704 that would be located behind coded lens array 701 is delineated by a dotted outline.
- Lens 705 is delineated by a dashed line.
- the light that passes through lens 705 projects onto a square area on the sensor plane shown as a gray square 706 .
- aperture array 701 is shown in illustration 700 as overlaying the projection, much of projection 706 is obstructed by closed apertures. Nonetheless, the perimeter of projection 706 can be seen delineated by a solid gray outline.
- projection 706 is a square approximately 9 times larger than open aperture square around lens 705 and centered on lens 705 . Depending on how close or far sensor 704 is to the aperture array, this projection may correspond to a wider or narrower FOV. Baffles around open aperture 705 (not shown in this illustration, but visible as baffles 617 in FIG. 6 are used in this embodiment to limit the extent of projection 706 to approximately 9 times larger than the size of lens 705 .
- Illustration 710 shows the overlaying of the 4 projections from the upper right quadrant of aperture array 701 .
- the 4 lenses of open apertures 715 in the upper right quadrant are delineated with dashed outlines.
- the 4 projections 716 from these 4 lenses are shown as overlapping gray areas.
- Each projection like the projection 706 shown in illustration 700 , is a square approximately 9 times the size of the open aperture square surrounding its lens and is centered on its lens, and is delineated by a solid gray line.
- each area is filled with varying levels of gray scale. The lightest gray indicates 1 projection, the next darker indicates 2 projections overlapping, the next darker indicates 3 projections overlapping, and finally the darkest indicates 4 projections overlapping.
- Illustration 720 shows the overlaying of all 16 projections from the entire aperture array 701 .
- the 16 lenses of all open apertures 725 are delineated by dashed outlines.
- Each projection like the projection 706 shown in illustration 700 , is a square approximately 9 times the size of the open aperture square surrounding its lens and centered on its lens, and is delineated by a solid gray line.
- varying levels of gray scale are used as described in the previous paragraph. Note that in this embodiment each area of sensor 704 is shown covered by 4 overlapping projections.
- the description of the MURA 3 coded lens array camera in the last few paragraphs characterizes a camera with potentially desirable characteristics.
- the MURA 3 coded lens array camera illustrated in FIGS. 5, 6 and 7 is capable of reconstructing an image at least up to the approximate diffraction limits of each of the lenses in the MURA 3 coded lens array. For example, in the case of lenses 12 mm lenses with a 36 mm focal length and a 53 degree FOV, more 2000 ⁇ 2000 resolution (4 megapixels) is achievable within the diffraction limits.
- the size of the sensor as being approximately equal to the size of one quadrant (i.e. one-half size in each dimension) as the size of the coded lens array.
- the sensor dimensions are independent from the coded lens array dimensions, but the system is configured in such a way that the coded lens array projects a pattern onto the sensor that is equivalent to the pattern that would have been projected had the sensor been equal to the size of one quadrant of a coded lens array and with appropriate spacing and focal length such as the coded lens camera configurations described herein.
- the reconstruction of the image using the techniques described herein are reliant on the configuration of overlapping pattern of images of the scene projected onto the sensor, not on the particular configuration of the coded lens array relative to the sensor. If a different coded lens array configuration than one described herein can achieve a similar overlapping pattern on the sensor, then the image reconstruction will be the same. For example, if telephoto lenses in a MURA 3 pattern are positioned far from the sensor, but the optical path of each is angled in such a way that the projected pattern on the sensor is the same as the pattern shown in FIG. 7 , then the image can still be reconstructed correctly.
- the resulting output 533 from the reconstruction processor is a 2-dimensional array of grayscale or color pixels representing the scene within the FOV of the camera.
- the pixel data is transmitted through a digital interface to a computer (or other image processing device).
- the digital interface for transferring the reconstructed image data may be any digital interface capable of handling the bandwidth from the camera for its required application such as for example, a IEEE 1394 (“FireWire”) interface or a USB 2.0 interface (which would be suitable for current still and video camera applications).
- the underlying principles of the invention are not limited to any particular digital interface.
- the camera includes a display 540 (e.g., an LCD or OLED display), for presenting the reconstructed images to the photographer, but in this embodiment, display device 540 and interface 533 are optional.
- the camera does not include reconstruction processor 530 .
- the digitized image data from the A/D converter 520 is coupled through interface 521 to an output buffer where the image data is packetized and formatted to be output through a digital interface.
- the digital interface would typically be coupled to an external computing means such as a personal computer, either to be processed and reconstructed immediately, or stored on a mass storage medium (e.g., magnetic or optical disc, semiconductor memory, etc.) for processing and reconstruction at a later time.
- the external computing device has a display for presenting the reconstructed images to the photographer.
- the digital interface is coupled directly to a mass storage medium (e.g., magnetic or optical disc, semiconductor memory, etc.).
- the digital interface for transferring the reconstructed image data could be any digital interface capable of handling the bandwidth from the camera for its required application (e.g., IEEE 1394 (“FireWire”) interface or a USB 2.0 interface).
- the coded lens array 501 is a Modified Uniformly Redundant Array (“MURA”) pattern.
- the coded lens array 501 is a Perfect Binary Array (“PBA”) pattern.
- the coded lens array 501 is a Uniformly Redundant Array (“URA”) pattern.
- the coded lens array 501 is a random pattern (although the performance of the system typically will not be as optimal with a random pattern as it will with a MURA, PBA, or URA).
- the basic aperture pattern would be the same size as the sensor, and the overall coded lens array would be a 2 ⁇ 2 mosaic of this basic aperture pattern.
- Each transparent aperture in the array contains a lens.
- Three exemplary MURA patterns and one PBA pattern are illustrated in FIG. 2 .
- MURA 101 is a 101 ⁇ 101 element pattern
- MURA 61 is a 61 ⁇ 61 element pattern
- MURA 31 is a 31 ⁇ 31 element pattern.
- PBA 8 is a 8 ⁇ 8 element pattern
- PBA 24 is a 24 ⁇ 24 element pattern.
- the PBA patterns are illustrated as enlarged relative to the MURA patterns. In each pattern, each black area is opaque and each white area is transparent (open) and would contain a lens.
- the coded aperture consists of a microlens array such as those manufactured by Suss Micro-optics of Neuchatel, Switzerland.
- a microlens array is an array of typically plano-convex lenses fabricated in a typically a rectilinear or hexagonal grid.
- a microlens array would be used for the coded lens array with a lens at each location on the grid, but those lenses occurring at “closed” aperture location would be painted over with an opaque paint or an opaque material would be lithographically coated at the “closed” aperture locations..
- a microlens array would be fabricated with only lenses at locations of an “open” aperture in the coded lens array. “Closed” aperture locations in the coded lens array would be either painted with an opaque paint, or a opaque material would be lithographically coated at the “closed” aperture locations.
- the distance between the coded lens array and the sensor plane is chosen in such a way that each of the projections of the individual lenses is in focus.
- the sensor plane is therefore placed at the focal plane of the lenses.
- the sensor plane might be placed slightly behind the focal plane of lenses in order to focus at the desired distance.
- the distance between the coded lens array and the sensor plane may therefore not be chosen arbitrarily, but a constraint between focal length, image plane to sensor plane distance, and distance of the object to be image must be observed.
- One embodiment of the camera employs techniques to limit the FOV (FOV) to the fully coded FOV (FCFOV).
- the techniques of limiting the FOV may be dimensioned in such a way that the FOV is slightly larger than the FCFOV, i.e., in such a way that the FOV is composed of the FCFOV plus a small part of the partially coded FOV (PCFOV).
- PCFOV partially coded FOV
- FOV limitation is achieved by placing baffles either in front of or behind the lenses in order to limit the maximum angles at which rays can pass through the coded lens array and reach the sensor.
- the length of the baffles determines the size of the FOV: The longer the baffles, the narrower the FOV of the coded lens camera.
- FIG. 8 illustrates a side view of the projected FOVs of each of the lenses in a MURA 3 coded lens camera.
- the baffles 801 are placed behind the lenses 802 , i.e. on the side of the lens facing the sensor 804 . It should be noted, however, that the baffles may also be placed in front of the lenses, i.e. on the side of the lens facing the scene.
- placing the baffles behind the lenses has the advantage that the exit pupil 803 of the lens system is moved closer towards the sensor plane. This way the size of the diffraction patterns caused by each lens is reduced and hence the achievable resolution of the overall imaging system is increased.
- FIG. 8 further shows how the FOV of each lens is determined by the marginal rays 805 , passing through the edges of the lens and passing just by the edge of the baffles on the opposite side.
- the right hand illustration 810 of FIG. 8 shows how the projections caused by the individual lenses overlap in the sensor plane.
- Each lens has the same angular field of view. However, due to the displacement of the lenses towards each other, there is a parallax for objects at a finite distance. Therefore, the field of view of the overall imaging system is approximately the same as the field of view of an individual lens, but may be slightly larger for objects at a finite distance due to this parallax effect.
- FIG. 8 shows a complete row of lenses. However, in a coded lens imaging system, some of the positions in each row will not contain any lens but be blocked. The figure only shows the complete row of lenses for illustrative purposes. Different rows of lenses in a coded lens array will contain lenses in different positions. Since typically each position will contain a lens in at least one row, the overall field of view can be derived as depicted in FIG. 8 .
- baffle attenuation is compensated for by multiplying each pixel of the reconstructed image with the inverse of the baffle attenuation the pixel has been subjected to.
- the baffle attenuation is known from the geometry of the lenses and baffles. This way, in the absence of any noise, a constant-intensity surface is reconstructed to a constant-intensity image.
- the signal-to-noise ratio (SNR) of the reconstructed image is highest in the center of the image and decreases towards the edges of the image, reaching the value zero at the edges of the FOV.
- this problem is alleviated by using only a central region of the reconstructed image while discarding the periphery of the reconstructed image.
- the problem is further alleviated by applying a noise-reducing smoothing filter to image data at the periphery of the reconstructed image.
- Wiener filters are known to be optimum noise-reducing smoothing filters, given that the signal-to-noise ratio of the input signal to the Wiener filter is known.
- the signal-to-noise ratio varies across the image.
- the SNR is known for each pixel or each region of the reconstructed image.
- noise-reduction is achieved by applying a local Wiener filtering operation with the filter characteristic varying for each pixel or each region of the reconstructed image according to the known SNR variations.
- a coded lens camera is subject to the focus limitations of the lenses in its coded lens array.
- DOF Depth of Field
- the DOF is typically increased by narrowing the aperture of the lens, which reduces the light from the scene that reaches the sensor.
- a principal advantage of the coded lens camera over a conventional single lens camera is that as the effective lens aperture is narrowed to increase the DOF, the amount of light from the scene reaching the sensor is not substantially reduced.
- a coded lens array typically has about 50% transparent apertures with lenses and 50% opaque apertures, so typically 50% of the light from the scene passes through the coded lens array.
- 12.5% represents the average light transmission of square apertures with 50% open apertures and is a reasonable approximation for a coded lens system.
- 12.5% light transmission is approximately equivalent to a f/2.8 aperture on a single lens (which has 12.7% light transmission).
- f/2.8 aperture is a very wide aperture setting.
- f/2.8 corresponds to a 17.9 mm aperture.
- focus limits are subjective and will vary from photographer to photographer, but the same criteria are utilized for the different conditions considered in this section, so the results can be considered relative to one another. These calculations were made using a Depth of Field online calculator at http://www.dofmaster.com/dofjs.html). Any object in the scene closer than the near focus or farther than the far focus will be subject to a reduction in sharpness.
- 8.82′ is a short DOF
- the f/2.8 setting passes about 12.7% of the light from the scene.
- a 50 mm square PBA 8 coded lens array is utilized, again focused on an object 25′ in the distance.
- the PBA 8 pattern shown in FIG. 2 would be utilized, with a lens placed in each transparent (i.e. illustrated as white) aperture of the PBA 8 .
- each lens would be about 3.1 mm in diameter, which is about the same diameter as a conventional single 50 mm lens stopped down to f/16.
- the DOF of the PBA 8 coded lens array would be roughly the same as the DOF of a conventional 50 mm lens stopped down to f/16.
- this embodiment of a coded lens array has a DOF comparable to an f/16 conventional lens with the light transmission characteristics of an f/2.8 conventional lens.
- the same coded lens array described in the previous paragraph is used with a Nikon D100 camera, but the coded lens array is focused on an object 26′ in the distance instead of 25′ away.
- the near focus limit is 12.9′ and the far focus limit is infinity. Since everything is in focus from a certain distance through infinity, the coded lens array is functioning as a “hyperfocal lens”, with its focus distance set to the “hyperfocal distance”.
- This configuration is useful for certain applications where all of the objects in the scene are at least 12.9′ away, and then the lenses in the coded lens array can be set to a fixed focus and do not need to be adjusted. Note that if an object in the scene is slightly closer than 12.9′, it still may be usefully imaged.
- the coded lens arrays shown in most of the figures have only a single lens element in each transparent aperture. Although this may be sufficient for some applications, in other applications, it is desirable to use multiple lens elements to correct for image aberrations, such as geometric distortion, coma, and chromatic aberrations.
- image aberrations such as geometric distortion, coma, and chromatic aberrations.
- an entire lens industry has been devoted to designing multi-element lenses to address lens aberration issues, and this vast corpus of prior art work will not be repeated here. Suffice it to say that typically, 3 elements or more are needed for photographic-quality imaging, and further, that typically, one or more of these elements needs to translate back-and-forth on the optical axis for focusing, unless the camera has a fixed focus. Frequently, such back-and-forth motion is accomplished by a rotating mechanism that turns a collar around part or all of the lens, which in turn engages a thread which moves one or more of the lens elements along the optical axis.
- FIG. 14 illustrates a side view of a coded lens array with three-element lenses.
- the lens shapes shown are simply for illustrative purposes, and the actual lens shapes would vary depending on the optical characteristics desired, using any of a vast number of prior art photographic lens designs.
- Each aperture would have 3 such lenses in a stack within one or more concentric cylinders. Baffles would extend behind the last lens toward the sensor so as to limit the FOV of the projection. Note that each aperture position is shown containing a stack of lenses in this illustration. In practice, opaque apertures would not contain lenses, or they would be covered so as not to permit light to pass through them.
- FIG. 15 illustrates an arrangement of gears with hollow centers within a coded lens array, each gear rotating around either a lens (if the location is a transparent aperture) or rotating over an opaque aperture without a lens. (For the sake of illustration, the teeth of adjacent gears are not touching each other, but in practice they would typically fit together snugly.)
- Gear 1501 is coupled to the shaft of an electric motor, which is either manually controlled or is controlled by an auto-focus mechanism. As the electric motor turns, it turns gear 1501 , which in turn transfers the rotational motion to all the gears in the coded lens array.
- gear 1501 turns clockwise, it turns gear 1502 counterclockwise, which then turns gears 1503 and 1504 both clockwise, and then gears 1503 and 1504 both turn gear 1505 counter-clockwise.
- gear 1501 turns all of the gears in the coded lens array, with each successive gear in the horizontal or vertical direction turning the opposite way.
- FIG. 16 shows a side view of a three-element coded lens array utilizing the gearing system shown in FIG. 15 .
- all lens array positions are shown with lenses. In practice, opaque lens array positions would not have lenses and would have their apertures closed so they block light.
- each lens array position has two fixed lenses 1601 and 1602 , and one lens 1603 that translates back-and-forth along the optical axis.
- Electric motor 1620 is powered by either a manual or auto-focus means, and it turns gear 1621 , which in turn drives the other gears in the coded lens array, as previously described in FIG. 15 , including FIG. 16 's gear 1604 .
- Gear 1604 turns hollow cylinder 1605 , which in turn drives hollow cylinder 1606 , which holds lens 1603 .
- Hollow cylinder 1606 is coupled to hollow cylinder 1605 in such a way that it is able to translate back-and-forth along the optical axis (left-to-right as shown in FIG. 16 ).
- Hollow cylinder 1606 has screw thread 1607 on its outside surface, which notches pins such as pin 1608 that are secured to structure 1609 . As hollow cylinder 1606 rotates, screw thread 1607 causes it to translate back-and-forth along the optical axis.
- each subsequent gear in the coded lens array rotates in the opposite direction.
- each subsequent hollow cylinder holding a lens is threaded with the opposite pitch, such as screw thread 1610 has opposite pitch of screw thread 1607 .
- the middle lenses of the lens array all move in the same direction when the electric motor 1620 actuates gear 1621 , despite the fact each other gear position is rotating in an opposite direction.
- the same structure 1609 that holds the lens array mechanism continues behind the lenses to form the baffles.
- Such structure 1609 may be made of a metal such as aluminum, plastic, or any other sufficiently sturdy, but light-opaque material.
- FIG. 16 shows a side view, but in practice the baffle form a box around the perimeter of each transparent aperture, and function to limit the FOV of the projection from each lens stack that projects onto sensor 1630 .
- the sensor pixel size is chosen such as to be in the same order of magnitude as the resolution of the coded lens array. It should be noted that this resolution is determined by the diffraction patterns of the individual lenses. If the sensor pixel size is chosen significantly larger than the size of the diffraction patterns, resolution of the imaging system is wasted. If, on the other hand, the sensor pixel size is chosen significantly smaller than the size of the diffraction patterns, no additional usable information is gained.
- the choice of the lens size it should be noted that there is a tradeoff between the size of the diffraction patterns and the achievable DOF. The smaller a lens is chosen, the larger its diffraction pattern and the better its DOF. It is important to note, however, that there is a degree of freedom in the choice of the lens size in order to achieve the best compromise between resolution and DOF of a specific application. In coded aperture imaging, however, this degree of freedom does not exist. Rather, in coded aperture imaging the sensor pixel size and aperture element size are constrained to be more or less identical.
- the sensor 504 of FIG. 5 is a CCD sensor. More specifically, a color CCD sensor using a color filter array (“CFA”), also know as a Bayer pattern, is used for color imaging.
- CFA color filter array
- a CFA is a mosaic pattern of red, green and blue color filters placed in front of each sensor pixel, allowing it to read out three color planes (at reduced spatial resolution compared to a monochrome CCD sensor).
- FIG. 9 illustrates an exemplary RGB Bayer Pattern.
- Each pixel cluster 900 consists of 4 pixels 901 - 904 , with color filters over each pixel in the color of (G)reen, (R)ed, or (B)lue.
- each pixel cluster in a Bayer pattern has 2 Green pixels ( 901 and 904 ), 1 Red ( 902 ) and 1 Blue ( 903 ). Pixel Clusters are typically packed together in an array 905 that makes up the entire CFA. It should be noted, however, that the underlying principles of the invention are not limited to a Bayer pattern.
- a multi-layer color image sensor is used.
- Color sensors can be implemented without color filters by exploiting the fact that subsequent layers in the semiconductor material of the image sensor absorb light at different frequencies while transmitting light at other frequencies.
- Foveon, Inc. of Santa Clara, Calif. offers “Foveon X3” image sensors with this multi-layer structure. This is illustrated in FIG. 10 in which semiconductor layer 1001 is an array of blue-sensitive pixels, layer 1002 is an array of green-sensitive pixels, and layer 1003 is an array of red-sensitive pixels. Signals can be read out from these layers individually, thereby capturing different color planes.
- This method has the advantage of not having any spatial displacement between the color planes. For example, pixels 1011 - 1013 are directly on top of one another and the red, green and blue values have no spatial displacement between them horizontally or vertically.
- each of the 3 RGB color planes are read out from a color imaging sensor (CFA or multi-layer) and are reconstructed individually.
- the reconstruction algorithms detailed below are applied individually to each of the 3 color planes, yielding 3 separate color planes of the reconstructed image. These can then be combined into a single RGB color image.
- the analog output signal of imaging sensor 1101 is digitized by an analog-to-digital converter (A/D) 1104 in order to allow digital image reconstruction and post-processing.
- A/D analog-to-digital converter
- the sensor output is first amplified by an op amp 1100 before feeding it into the A/D.
- the op amp 1100 applies a constant zero offset z ( 1102 ) and a gain g ( 1103 ) to the image sensor 1101 output signal.
- offset 1102 and gain 1103 are chosen in such a way that the full dynamic range of the A/D 1104 is exploited, i.e., that the lowest possible sensor signal value s min corresponds to zero and the highest possible sensor signal value s max corresponds to the maximum allowed input signal of the A/D 1104 without the A/D 1104 going into saturation.
- FIG. 12 depicts the characteristic of the resulting system. Note that as described above, the dynamic range of the scene is compressed by coded lens imaging; therefore, zero offset and gain may be higher than in conventional imaging with a single lens. In one embodiment, zero offset and gain are automatically chosen in an optimal fashion by the coded lens camera according to the following set of operations, illustrated in the flowchart in FIG. 11 b:
- an initial zero offset is selected as the maximum possible zero offset and a relatively large initial step size is selected for the zero offset.
- an initial gain is selected as the maximum possible gain and a relatively large initial step size is selected for the gain.
- an image is acquired using the current settings and a determination is made at 1113 as to whether there are any pixels in the A/D output with a zero value. If there are pixels with a zero value, then the current zero offset step size is subtracted from the current zero offset at 1114 and the process returns to 1112 .
- an image is acquired using the current settings.
- a determination is made as to whether there are any pixels in the A/D output with the maximum output value (e.g. 255 for an 8-bit A/D). If there are pixels with the maximum value, then the current gain step size is subtracted from the current gain at 1119 and the process returns to 1117 .
- the effects of zero offset and gain have to be reversed.
- each sensor pixel is exposed to light emitted by different pixels of the scene, reaching the sensor pixel through different lenses within the coded lens array.
- the reconstruction algorithms used in coded lens imaging assume that sensor image is the linear sum of all sensor images which each individual lens would have projected onto the sensor. Therefore, in one embodiment, the sensor output signal s is an exactly linear function of the number p of photons hitting each sensor pixel during the exposure time.
- the function describing the dependency of the sensor output signal from the actual photon count of each sensor pixel is called the “transfer characteristic” of the sensor.
- CCD imaging sensors have a linear transfer characteristic over a large range of intensities while CMOS imaging sensors have a logarithmic transfer characteristic.
- FIG. 13 A graph showing typical CMOS and CCD image sensor transfer characteristics is shown in FIG. 13 .
- the dynamic range of the sensor signal may be different from the dynamic range of the imaged scene. Since each sensor pixel is exposed to multiple scene pixels across the entire FOV, the coded lens array has an averaging effect on the range of intensities. Even scenes with a high dynamic range (e.g. dark foreground objects and bright background objects) produce sensor signals with a lower dynamic range.
- the dynamic range of the original scene is reconstructed independently of the dynamic range of the imaging sensor. Rather, the limited dynamic range of the imaging sensor (finite number of bits for quantization) leads to quantization errors which can be modeled as noise in the sensor image. This quantization noise also causes noise in the reconstruction.
- a MURA lens array is constructed in the following way. First consider a Legendre sequence of length p where p is an odd prime.
- a 1 represents a lens and a 0 represents an opaque element in the coded lens array.
- the periodic inverse filter g (i, j) pertaining to this MURA is given by:
- g (i, j) (2 a (i, j) ⁇ 1)/K if i>0 or j>0.
- the periodic inverse filter pertaining to a MURA therefore has the same structure as the MURA itself, except for a constant offset and constant scaling factor, and for the exception of a single element which is inverted with respect to the original MURA.
- FIG. 2 shows various sizes of MURA lens array patterns.
- a PBA according to Busboom can be used as a lens array. Its periodic inverse filter has exactly the same structure as the PBA itself, except for a constant offset and constant scaling factor.
- the formulas and algorithms for generating PBAs can be found in A. B usBooM: A RRAYS UND R EKONSTRUKTIONSALGORITHMEN FUER BILDGEBENDE S YSTEME MIT CODIERTER A PERTUR . VDI V ERLAG , D UESSELDORF , 1999, ISBN 3-18-357210-9 , PAGES 52-56.
- PBAs of order 8 and 24 are illustrated in FIG. 2 . They are enlarged relative to the MURA patterns.
- the sensor image is given by the periodic cross-correlation function of the object function with the coded lens array, magnified by a geometric magnification factor f as described above.
- the periodic cross-correlation function of the measured sensor image with an appropriately magnified version of the periodic inverse filter is computed. In the absence of noise and other inaccuracies of the measured sensor image, the result equals the original object function.
- reconstruction of the scene from the sensor signal is performed in a digital signal processor (“DSP”) (e.g., DSP 132 ) integrated into the camera or in a computing device external to the camera.
- DSP digital signal processor
- scene reconstruction consists of the following sequence of operations:
- the inverse filtering of operation (2) can be decomposed into a sequence of two one-dimensional filter operations, one of which is applied per image row and the other of which is applied per image column. This decomposition may reduce the computational complexity of (2) in the case of large array orders.
- FIG. 17 a illustrates three examples of the projection and reconstruction of three flat scenes at a known range using the procedure described in the preceding paragraph.
- a 3 ⁇ 3 MURA pattern was used for the lens array ( 1700 ).
- the distance (pitch) between two adjacent lenses in the array was 3 mm.
- Each lens had a focal length of 5 mm which was also the distance between the lens array and the sensor.
- the sensor was a 10 ⁇ 10 mm sensor with 30 ⁇ 30 um square pixels.
- Scene 1701 is a flat (2-dimensional) test pattern of 307 ⁇ 307 pixels. It is projected through the 3 ⁇ 3 element MURA lens array 1700 onto the image sensor, resulting in the sensor image 1711 .
- Sensor image 1711 is adjusted and reconstructed per the process described above resulting in reconstruction 1721 .
- flat 307 ⁇ 307 pixel image 1702 is projected through the lens array 1700 resulting in sensor image 1712 and is processed to result in reconstruction 1722 .
- flat 307 ⁇ 307 pixel image 1703 is projected through the lens array 1700 resulting in sensor image 1713 and is processed to result in reconstruction 1723 .
- FIG. 17 b illustrates three similar examples as FIG. 17 a .
- a 24 ⁇ 24 PBA pattern was used as the lens array pattern ( 1750 ).
- the lenses had a pitch of 0.39 mm such that the total size of the lens array was similar to that of FIG. 17 a (18.72 ⁇ 18.72 mm in FIGS. 17 b and 18 ⁇ 18 mm in FIG. 17 a ).
- the same sensor as in the example of FIG. 17 a was used.
- the lenses had again a focal length of 5 mm.
- Scene 1701 is projected through the 24 ⁇ 24 element PBA lens array 1750 onto the image sensor, resulting in the sensor image 1731 .
- Sensor image 1731 is adjusted and reconstructed per the process described above resulting in reconstruction 1741 .
- flat 307 ⁇ 307 pixel image 1702 is projected through the lens array 1750 resulting in sensor image 1732 and is processed to result in reconstruction 1742 .
- flat 307 ⁇ 307 pixel image 1703 is projected through the lens array 1750 resulting in sensor image 1733 and is processed to result in reconstruction 1743 . It can be observed from the sensor images ( 1711 - 1713 and 1731 - 1733 ) in the two examples that increasing the order of the lens array flattens the contrast in the sensor image. In the sensor images 1731 - 1733 of FIG. 17 b , no more details of the original scene are recognizable. However, as can be seen from the reconstructions 1741 - 1743 , the sensor images still contain all the information necessary for reconstructing the original scene.
- sensor images 1711 - 1713 and 1731 - 1733 may be quantized at a given number of bits per pixel (e.g. 8), but may yield in the reconstructed images 1721 - 1723 and 1741 - 1743 an image with a useful dynamic range comparable to a higher number of bits per pixel (e.g. 10).
- operation (2) of the sequence of operations described above in section “Reconstruction of a Scene with One Object at a Known Range” are repeated for different expected object ranges o, when the true object range is uncertain or unknown.
- a set of multiple reconstructions is obtained from the same sensor signal.
- the one where the expected object range is identical with or closest to the true object range will be the most accurate reconstruction of the real scene, while those reconstructions with a mismatch between expected and true range will contain artifacts.
- artifacts will be visible in the reconstruction as high-frequency artifacts, such as patterns of horizontal or vertical lines or ringing artifacts in the neighborhood of edges within the reconstruction.
- the one with the least artifacts is manually or automatically selected.
- This allows a change in the range of reconstruction without the need to pre-focus the camera and, in particular, without the need to mechanically move parts of the camera, as would be required with a conventional single lens camera, or to pre-select an expected object range. Further, this allows the user to decide about the desired range of reconstruction after the image acquisition (i.e. retrospectively).
- the range of reconstruction is automatically selected from the set of reconstructions by identifying the reconstruction with the least amount of high-frequency artifacts and the smoothest intensity profile.
- a simple, but highly effective criterion for “focusing” a coded lens camera i.e., for determining the correct range from a set of reconstructions, is to compute the mean m and the standard deviation ⁇ of all gray level values of each reconstruction. Further, the ratio m/ ⁇ is computed for each reconstruction. The reconstruction for which this ratio takes on its maximum is chosen as the optimal reconstruction, i.e., as the reconstruction which is “in focus.” This technique produces the best results if the objects in the scene are in focus in each of the individual projections.
- FIG. 18 illustrates how a scene is reconstructed at a set of different ranges.
- a similar system configuration as in FIG. 17 b was used for producing FIG. 18 , i.e. a 24 ⁇ 24 PBA pattern was used for projection.
- the original scene was the test image 1701 from FIG. 17 b which was imaged at a range of 1,000 mm.
- Reconstructions were computed from the resulting sensor image at assumed ranges of 500 mm ( 1801 ), 800 mm ( 1802 ), 1,000 mm ( 1803 ) and 5,000 mm ( 1804 ).
- the reconstruction in the lower left-hand corner at the correct range of 1,000 mm looks “clean” while the reconstructions at different ranges contain strong high-frequency artifacts.
- FIG. 18 illustrates how a scene is reconstructed at a set of different ranges.
- FIG. 18 also shows the standard deviation (“stddev”) of the gray values in each of the four reconstructions.
- FIG. 18 further shows the quotients (m/s) of the gray value mean, divided by the gray value standard deviation, for each of the four reconstructions. This value starts at 0.0977 at an assumed range of 500 mm, then continuously increases to a maximum of 2.0 at the correct range of 1,000 mm, then continuously decreases, reaching a value of 0.1075 at an assumed range of 5,000 mm.
- the example shows how the true range of the scene can be easily computed from a set of reconstructions by choosing the reconstruction at which the quotient m/s takes on its maximum.
- a partial reconstruction of parts of the image is computed using different expected object ranges o.
- a partial reconstruction is computed by only evaluating the periodic cross-correlation function in operation (2) above in section “Reconstruction of a Scene with One Object at a Known Range” for a subset of all pixels of the reconstructed image, thus reducing the computational complexity of the reconstruction.
- This subset of pixels may be a sub-sampled version of the image, a contiguous region of the image, or other suitable subsets of pixels. Then, the two one-dimensional periodic filtering operations only need to be evaluated for a subset of rows and/or columns of the reconstructed image.
- the one with the least amount of high-frequency artifacts and the smoothest intensity profile is identified in order to determine the true object range o.
- a full reconstruction is then performed. This way, the computational complexity of reconstructing the scene while automatically determining the true object range o can be reduced.
- a set of full image reconstructions at different object ranges o is computed. Since objects in different parts of the scene may be at different ranges, the reconstructions are decomposed into several regions. For each region, the object range o which yields the least amount of high-frequency artifacts and the smoothest intensity profile is identified. The final reconstruction is then assembled region by region whereas for each region the reconstruction with the optimum object range o is selected. This way, images with infinite depth of FOV (from close-up to infinity) can be reconstructed from a single sensor signal.
- the combined reconstruction is of lower quality than a flat reconstruction of a flat scene, i.e., of a scene with only a single object at a single range.
- the presence of other regions in the scene which are “out of focus” do not only cause the out-of-focus regions to be of inferior quality in the reconstruction, but also cause the in-focus region to contain artifacts in the reconstruction.
- an iterative reconstruction procedure is employed which eliminates this crosstalk among different regions in the scene at different ranges.
- the iterative reconstruction procedure according to one embodiment of the invention consists of the following set of operations.
- the output signal of the coded lens camera (in addition to the two-dimensional image information) also contains range information for each image pixel or for several image regions, as determined from finding the object range o for each region with the least amount of high-frequency artifacts and the smoothest intensity profile.
- the reconstruction assigns a z value indicating the distance from the camera to the object at that pixel position in the image. This way, three-dimensional image data can be obtained from a single, two-dimensional sensor signal.
- the range data allows the camera, an external imaging manipulation system, or the user, utilizing an image manipulation application or system to easily segment the two-dimensional image into different regions pertaining to different parts of the scene, such as separating objects in the foreground of a scene from the background of a scene.
- Chroma-keying is a technique commonly used in video and photographic production to separate a foreground image from a solid background color.
- a “blue screen” or “green screen” is used, which is a very carefully colored and illuminated screen that is placed behind a performer or object while the scene is photographed or captured on video or film.
- a hardware or software system separates the presumably distinctively colored foreground image from the fairly uniformly colored background image, so that the foreground image can be com posited into a different scene.
- the weatherperson on a TV news show is chroma-keyed against a blue or green screen, then com posited on top of a weather map.
- Such blue or green screens are quite inconvenient for production. They are large and bulky, they require careful illumination and must be kept very clean, and they must be placed far enough behind the foreground object so as not to create “backwash” of blue or green light onto the edges of the foreground object.
- an image can be captured without a blue or green screen, and the z value provided with each pixel will provide a compositing system with enough information to separate a foreground object from its background (i.e., by identifying which pixels in the scene contain the image of closer objects and should be preserved in the final image, and which pixels in the scene contain the image of further away objects and should be removed from the final image). This would be of substantial benefit in many applications, including photographic, video, and motion picture production, as well as consumer applications (e.g. separating family members in various pictures from the background of each picture so they may be composited into a group picture with several family members).
- FIG. 20 shows how a person 1901 from FIG. 19 can readily be placed in a scene with a different background, such as the castle 2002 with the background mountains 2002 removed from the picture. This is simply accomplished by replacing every pixel in the image reconstructed from FIG. 19 that has a z value greater than that of person 1901 with a pixel from the image of the castle 2002 .
- the processing of z values may be implemented using virtually any type of image processor including, for example, a DSP, ASIC or a general purpose processor.
- the per-pixel distance ranging capability of one embodiment also has applications in optical performance motion capture (“mocap”).
- Mocap is currently used to capture the motion of humans, animals and props for computer-generated animation, including video games (e.g. NBA Live 2005 from Electronic Arts of Redwood City, Calif.), and motion pictures (e.g. “The Polar Express”, released by the Castle Rock Entertainment, a division of Time Warner, Inc, New York, N.Y.).
- Such mocap systems e.g. those manufactured by Vicon Motion Systems, Ltd. of Oxford, United Kingdom
- Retroreflective markers or other distinctive markings
- the video cameras simultaneously capture images of the markers, each capturing the markers within its FOV that is not obstructed.
- software analyzes all of the video frames and by triangulation, tries to identify the position of each marker in 3D space.
- FIG. 21 is a photograph of an exemplary motion capture session.
- the three bright rings of light are rings of LEDs around the single lenses of the video cameras 2101 - 2103 .
- the performers are wearing tight-fitting black suits.
- the gray dots on the suits are retroreflective markers that reflect the red LED light back to the camera lenses causing the markers to stand out brightly relative to the surrounding environment.
- Four such retroreflective markers on the knees of the left performer are identified as 2111 - 2114 .
- a frame of a given video camera shows a marker centered at a given (x, y) pixel position
- the image is really showing two markers lined up one behind the other, leaving one completely obscured.
- the performer's motion may separate the markers to different (x, y) positions, but it can be difficult to determine which marker was the one in front and which was the one in back in the previous frame (e.g. the marker further away may appear slightly smaller, but the size difference may be less than the resolution of the camera can resolve).
- a performer may roll on the floor, obscuring all of the markers on one side.
- single lens video cameras are replaced by video cameras utilizing coded lens techniques described herein.
- the coded lens cameras not only capture images of the markers, but they also capture the approximate depth of each marker. This improves the ability of the mocap system to identify markers in successive frames of capture. While a single lens camera only provides useful (x, y) position information of a marker, a coded lens camera provides (x, y, z) position information of a marker (as described above). For example, if one marker is initially in front of the other, and then in a subsequent frame the markers are separated, it is easy for the coded lens camera to identify which marker is closer and which is further away (i.e., using the z value). This information can then be correlated with the position of the markers in a previous frame before one was obscured behind the other, which identifies which marker is which, when both markers come into view.
- one marker is only visible by one mocap camera, and it is obscured from all other mocap cameras (e.g. by the body of the performer).
- a single lens mocap camera it is not possible to triangulate with only one camera, and as such the markers (x, y, z) position can not be calculated.
- the distance to the marker is known, and as a result, its (x, y, z) position can be easily calculated.
- coded lens cameras are used in robot vision systems.
- a conventional lens camera can not provide distance information for a robotic armature to determine the (x, y, z) position of a part that it needs to pick up and insert in an assembly, but a coded lens camera can.
- coded lens cameras are employed within security systems. Because they have the ability to use low dynamic range sensors to capture high dynamic range scenes, they can provide usable imagery in situations where there is backlighting that would normally wash out the image in a conventional single lens camera. For example, if an intruder is entering a doorway, if there is bright daylight outside the doorway, a conventional single lens camera may not be able to resolve a useful image both outside the doorway and inside the doorway, whereas a coded lens camera can.
- Embodiments of the invention may include various steps as set forth above.
- the steps may be embodied in machine-executable instructions which cause a general-purpose or special-purpose processor to perform certain steps.
- the various operations described above may be software executed by a personal computer or embedded on a PCI card within a personal computer.
- the operations may be implemented by a DSP or ASIC.
- various components which are not relevant to the underlying principles of the invention such as computer memory, hard drive, input devices, etc, have been left out of the figures and description to avoid obscuring the pertinent aspects of the invention.
- Elements of the present invention may also be provided as a machine-readable medium for storing the machine-executable instructions.
- the machine-readable medium may include, but is not limited to, flash memory, optical disks, CD-ROMs, DVD ROMs, RAMs, EPROMs, EEPROMs, magnetic or optical cards, propagation media or other type of machine-readable media suitable for storing electronic instructions.
- the present invention may be downloaded as a computer program which may be transferred from a remote computer (e.g., a server) to a requesting computer (e.g., a client) by way of data signals embodied in a carrier wave or other propagation medium via a communication link (e.g., a modem or network connection).
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Studio Devices (AREA)
Abstract
An apparatus for capturing images. In one embodiment, the apparatus comprises: a coded lens array including a plurality of lenses arranged in a coded pattern and with opaque material blocking array elements that do not contain lenses; and a light-sensitive semiconductor sensor coupled to the coded lens array and positioned at a specified distance behind the coded lens array, the light-sensitive sensor configured to sense light transmitted through the lenses in the coded lens array.
Description
- This application is a continuation of U.S. patent application Ser. No. 13/652,259, filed on Oct. 15, 2012, which is a continuation of U.S. patent application Ser. No. 13/226,461, filed on Sep. 6, 2011, now U.S. Pat. No. 8,288,704, Issued on Oct. 16, 2012, which is a continuation of U.S. patent application Ser. No. 12/691,500, filed Jan. 21, 2010, now U.S. Pat. No. 8,013,285, Issued on Sep. 6, 2011, which is a continuation of U.S. patent application Ser. No. 11/210,098 entitled “Apparatus And Method For Capturing Still Images And Video Using Coded Lens Imaging Technique” filed on Aug. 22, 2005, now U.S. Pat. No. 7,671,321, Issued on Mar. 2, 2010, which is a continuation-in-part of U.S. patent application Ser. No. 11/039,029, entitled, “Apparatus And Method For Capturing Still Images And Video Using Coded Aperture Techniques” filed on Jan. 18, 2005, now U.S. Pat. No. 7,767,949, Issued on Aug. 3, 2010, and claims the benefit of U.S. Provisional Application No. 60/701,435 entitled, “Apparatus And Method For Capturing Still Images And Video Using Coded Lens Imaging Techniques”, filed on Jul. 20, 2005. These applications are incorporated by reference in their entirety.
- This invention relates generally to the field of image capture and image processing. More particularly, the invention relates to an apparatus and method for capturing still images and video using coded lens techniques.
- Photographic imaging is commonly done by focusing the light coming from a scene using a single glass lens which is placed in front of a light sensitive detector such as a photographic film or a semiconductor sensor including CCD and CMOS sensors.
- For imaging high-energy radiation such as x-ray or gamma rays, other techniques must be used because such radiation cannot be diffracted using glass lenses. A number of techniques have been proposed including single pinhole cameras and multi-hole collimator systems. A particularly beneficial technique is “coded aperture imaging” wherein a structured aperture, consisting of a suitably-chosen pattern of transparent and opaque elements, is placed in front of a detector sensitive to the radiation to be imaged. When the aperture pattern is suitably chosen, the imaged scene can be digitally reconstructed from the detector signal. Coded aperture imaging has the advantage of combining high spatial resolution with high light efficiency. Coded aperture imaging of x-ray and gamma ray radiation using structured arrays of rectangular or hexagonal elements is known from R. H. D
ICKE : SCATTER -HOLE CAMERA FOR X-RAYS AND GAMMA RAYS. ASTROHYS . J., 153:L101-L106, 1968 (hereinafter “Dicke”), and has been extensively applied in astronomical imaging and nuclear medicine. - A particularly useful class of coded imaging systems is known from E. E. F
ENIMORE AND T. M. CANNON : CODED APERTURE IMAGING WITH UNIFORMLY REDUNDANT ARRAYS. APPL . OPT., 17:337-347, 1978 (hereinafter “Fenimore”). In this class of systems, a basic aperture pattern is cyclically repeated such that the aperture pattern is a 2×2 mosaic of the basic pattern. The detector has at least the same size as the basic aperture pattern. In such a system, the “fully coded FOV” (“FOV” shall be used herein to refer to “field-of-view”) is defined as the area within the FOV, within which a point source would cast a complete shadow of a cyclically shifted version of the basic aperture pattern onto the aperture. Likewise, the “partially coded FOV” is defined as the area within the FOV, within which a point source would only cast a partial shadow of the basic aperture pattern onto the aperture. According to Dicke, a collimator is placed in front of the detector which limits the FOV to the fully coded FOV, thus allowing an unambiguous reconstruction of the scene from the detector signal. - From J. G
UNSON AND B. POLYCHRONOPULOS : OPTIMUM DESIGN OF A CODED MASK X-RAY TELESCOPE FOR ROCKET APPLICATIONS . MON. NOT. R. ASTRON . SOC., 177:485-497, 1976 (hereinafter “Gunson”) it is further known to give the opaque elements of the aperture a finite thickness such that the aperture itself acts as a collimator and limits the FOV to the fully coded FOV. Such a “self-collimating aperture” allows the omission of a separate collimator in front of the detector. - It should be noted that besides limiting the FOV, a collimator has the undesired property of only transmitting light without attenuation which is exactly parallel to the optical axis. Any off-axis light passing through the collimator is attenuated, the attenuation increasing towards the limits of the FOV. At the limits of the FOV, the attenuation is 100%, i.e., no light can pass through the collimator at such angles. This effect will be denoted as “collimator attenuation” within this document. Both in the x-direction and in the y-direction, collimator attenuation is proportional to the tangent of the angle between the light and the optical axis.
- After reconstructing an image from a sensor signal in a coded aperture imaging system, the effect of collimator attenuation may have to be reversed in order to obtain a photometrically correct image. This involves multiplying each individual pixel value with the inverse of the factor by which light coming from the direction which the pixel pertains to, has been attenuated. It should be noted that close to the limits of the FOV, the attenuation, especially the collimator attenuation, is very high, i.e. this factor approaches zero. Inverting the collimator attenuation in this case involves amplifying the pixel values with a very large factor, approaching infinity at the limits of the FOV. Since any noise in the reconstruction will also be amplified by this factor, pixels close to the limits of the FOV may be very noisy or even unusable.
- In a coded aperture system according to Fenimore or Gunson, the basic aperture pattern can be characterized by means of an “aperture array” of zeros and ones wherein a one stands for a transparent and a zero stands for an opaque aperture element. Further, the scene within the FOV can be characterized as a two-dimensional array wherein each array element contains the light intensity emitted from a single pixel within the FOV. When the scene is at infinite distance from the aperture, it is known that the sensor signal can be characterized as the two-dimensional, periodic cross-correlation function between the FOV array and the aperture array. It should be noted that the sensor signal as such has no resemblance with the scene being imaged. However, a “reconstruction filter” can be designed by computing the two-dimensional periodic inverse filter pertaining to the aperture array. The two-dimensional periodic inverse filter is a two-dimensional array which is constructed in such a way that all sidelobes of the two-dimensional, periodic cross-correlation function of the aperture array and the inverse filter are zero. By computing the two-dimensional, periodic cross-correlation function of the sensor signal and the reconstruction filter, an image of the original scene can be reconstructed from the sensor signal.
- It is known from Fenimore to use a so-called “Uniformly Redundant Arrays” (URAs) as aperture arrays. URAs have a two-dimensional, periodic cross-correlation function whose sidelobe values are all identical. URAs have an inverse filter which has the same structure as the URA itself, except for a constant offset and constant scaling factor. Such reconstruction filters are optimal in the sense that any noise in the sensor signal will be subject to the lowest possible amplification during the reconstruction filtering. However, URAs can be algebraically constructed only for very few sizes.
- It is further known from S. R. G
OTTESMAN AND E. E. FENIMORE : NEW FAMILY OF BINARY ARRAYS FOR CODED APERTURE IMAGING. APPL . OPT., 28:4344-4352, 1989 (hereinafter “Gottesman”) to use a modified class of aperture arrays called “Modified Uniformly Redundant Arrays” (MURAs) which exist for all sizes p×p where p is an odd prime number. Hence, MURAs exist for many more sizes than URAs. Their correlation properties and noise amplification properties are near-optimal and almost as good as the properties of URAs. MURAs have the additional advantage that, with the exception of a single row and a single column, they can be represented as the product of two one-dimensional sequences, one being a function only of the column index and the other being a function only of the row index to the array. Likewise, with the exception of a single row and a single column, their inverse filter can also be represented as the product of two one-dimensional sequences. This property permits to replace the two-dimensional in-verse filtering by a sequence of two one-dimensional filtering operations, making the reconstruction process much more efficient to compute. - It is further known from A. B
USBOOM: ARRAYS UND REKONSTRUKTIONSALGORITHMEN FUER BILDGEBENDE SYSTEME MIT CODIERTER APERTUR . VDI VERLAG , DUESSELDORF, 1999, ISBN 3-18-357210-9 (hereinafter “Busboom”) to use so-called “Perfect Binary Arrays” (PBAs) which exist for allsizes 3s 2r×3s 2r and allsizes 3s 2r−1×3s 2r+1 where s=0, 1, 2. . . and r=1, 2, 3. . . Hence, PBAs also exist for many sizes, especially for many square sizes with an even number of columns and rows. Their correlation properties and noise amplification properties are as good as those of URAs. - If the scene is at a finite distance from the aperture, a geometric magnification of the sensor image occurs. It should be noted that a point source in the scene would cast a shadow of the aperture pattern onto the sensor which is magnified by a factor of f =(o+a)/o compared to the actual aperture size where o is the distance between the scene and the aperture and a is the distance between the aperture and the sensor. Therefore, if the scene is at a finite distance, the sensor image needs to be filtered with an accordingly magnified version of the reconstruction filter.
- If the scene is very close to the aperture, so-called near-field effects occur. The “near field” is defined as those ranges which are less than 10 times the sensor size, aperture size or distance between aperture and sensor, whichever of these quantities is the largest. If an object is in the near field, the sensor image can no longer be described as the two-dimensional cross-correlation between the scene and the aperture array. This causes artifacts when attempting to reconstructing the scene using inverse filtering. In Lanza, et al., U.S. patent application Ser. No. 6,737,652, methods for reducing such near-field artifacts are disclosed. These methods involve imaging the scene using two separate coded apertures where the second aperture array is the inverse of the first aperture array (i.e. transparent elements are replaced by opaque elements and vice versa). The reconstruction is then computed from two sensor signals acquired with the two different apertures in such a manner that near-field artifacts are reduced in the process of combining the two sensor images.
- Coded aperture imaging to date has been limited to industrial, medical, and scientific applications, primarily with x-ray or gamma-ray radiation, and systems that have been developed to date are each designed to work within a specific, constrained environment. For one, existing coded aperture imaging systems are each designed with a specific view depth (e.g. effectively at infinity for astronomy, or a specific distance range for nuclear or x-ray imaging). Secondly, to date, coded aperture imaging has been used with either controlled radiation sources (e.g. in nuclear, x-ray, or industrial imaging), or astronomical radiation sources that are relatively stable and effectively at infinity. As a result, existing coded aperture systems have had the benefit of operating within constrained environments, quite unlike, for example, a typical photographic camera using a lens. A typical photographic camera using a single lens (i.e. a single lens per sensor or film frame; stereoscopic cameras have 2 lenses, but utilize a separate sensor or film frame per lens) is designed to simultaneously handle imaging of scenes containing 3-dimensional objects with varying distances from close distances to effective infinite distance; and is designed to image objects reflecting, diffusing, absorbing, refracting, or retro-reflecting multiple ambient radiation sources of unknown origin, angle, and vastly varying intensities. No coded aperture system has ever been designed that can handle these types of unconstrained imaging environments that billions of photographic cameras with single lenses handle everyday.
- Photographic imaging in the optical spectrum using a single lens has a number of disadvantages and limitations. The main limitation of single lens photography is its finite depth-of-field (DOF), particularly at large aperture settings. Only scenes at a limited DOF can be in focus in a single lens image while any objects closer or farther away from the camera than the DOF will appear blurred in the image.
- Further, a single lens camera must be manually or automatically focused before an image can be taken. This is a disadvantage when imaging objects which are moving fast or unexpectedly such as in sports photography or photography of children or animals, particularly at large apertures with a short DOF. In such situations, the images may be out of focus because there was not enough time to focus or because the object moved unexpectedly when acquiring the image. Single lens photography does not allow a photographer to retrospectively change the focus once an image has been acquired.
- Still further, focusing a single lens camera involves adjusting the distance between one or more lenses and the sensor. This makes it necessary for a single lens camera to contain mechanically moving parts which makes it prone to mechanical failure. Various alternatives to glass lenses, such as liquid lenses (see, e.g., B. H
ENDRIKS & STEIN KUIPER : THROUGH A LENS SHARPLY . IEEE SPECTRUM , DECEMBER, 2004), have been proposed in an effort to mitigate the mechanical limitations of a glass lens, but despite the added design complexity and potential limitations (e.g., operating temperature range and aperture size) of such alternatives, they still suffer from the limitation of a limited focus range. - Still further, single lens cameras have a limited dynamic range as a result of their sensors (film or semiconductor sensors) having a limited dynamic range. This is a severe limitation when imaging scenes which contain both very bright areas and very dark areas. Typically, either the bright areas will appear overexposed while the dark areas have sufficient contrast, or the dark areas will appear underexposed while the bright areas have sufficient contrast. To address this issue, specialized semiconductor image sensors (e.g. the D1000 by Pixim, Inc. of Mountain View, Calif.) have been developed that allow each pixel of an image sensor to sampled each with a unique gain so as to accommodate different brightness regions in the image. But such image sensors are much more expensive than conventional CCD or CMOS image sensors, and as such are not cost-competitive for many applications, including mass-market general photography.
- Because of the requirement to focus, single lenses can provide a rough estimate of the distance between the lens and a subject object. But since most photographic applications require lenses designed to have as long a range of concurrent focus as possible, using focus for a distance estimate is extremely imprecise. Since a single lens can only be focused to a single distance range at a time, at best, a lens will provide an estimate of the distance to a single object range at a given time.
- Coded Aperture Imaging (CAI) (as disclosed in co-pending application entitled “Apparatus And Method For Capturing Still Images And Video Using Coded Aperture Techniques,” Ser. No. 11/039,029, filed Jan. 18, 2005; hereinafter “CAI Application”) addresses many of the limitations of a single lens camera. Relative to a single lens camera, CAI makes it possible to make a thinner camera, a lighter camera, a camera with greater dynamic range, and also a camera which can reconstruct an image which is in focus throughout a large range of depth in the scene.
- A visible light coded aperture camera according to one embodiment described in the CAI Application is illustrated in
FIG. 1 . The illustrated embodiment includes a codedaperture 101 placed in front of a light sensitive grayscale orcolor semiconductor sensor 104. The codedaperture 1012 is a pattern of circular, square, hexagonal, rectangular or other tiled elements, some of which are transparent to visible light (e.g. element 102) and some of which are opaque (e.g. element 103). Note that for illustration clarity purposes, codedaperture 101 has very few transparent elements. A typical coded aperture may have significantly more transparent elements (e.g., 50%). Visible light a from 2-dimensional or 3-dimensional scene 100 (which may be illuminated by ambient or artificial lighting) is projected through the codedaperture 101 ontoimage sensor 104. The camera is capable of limiting the FOV to the fully coded FOV projected onto the sensor. In one embodiment, this is implemented by the use of a self-collimating coded aperture 101 (utilizing baffles for collimation, as explained below). The space between the coded aperture and the sensor is shielded by a light-opaque housing 105 (only the outline of which is shown inFIG. 1 ), preventing any light from reaching the sensor other than by passing through an open element of the coded aperture. - The camera further includes an image
sensor readout subsystem 110 with aninterface 109 to the image sensor 104 (which may be similar to those used in prior coded aperture systems). The readout subsystem clocks out the analog image signal from theimage sensor 104 and applies analog buffering, amplification and/or filtering as required by the particular image sensor. An example of such areadout subsystem 110 that also incorporatesND 120 is the NDX-1260 CleanCapture Image Processor by NuCore Technology, Inc. of Sunnyvale, Calif. The ability to adjust the zero offset 112 and gain 111 to analog pixel values read by the readout subsystem 110 (e.g., using at least one operational amplifier (op amp)) will increase the dynamic range of the captured image, but is not essential if the image sensor has a sufficient dynamic range for the desired image quality without a zero-offset and gain adjustment. - In one embodiment, the output of the
readout subsystem 110 is coupled byinterface 113 to at least one analog-to-digital converter (A/D) 120 which digitizes the analog output. The output of the A/D is coupled viainterface 121 to animage reconstruction processor 130, which in one embodiment incorporates a Digital Signal Processor (DSP) 132 and Random Access Memory (RAM) 131. The digitized image from theinterface 121 is stored inRAM 131, and theDSP 132 post-processes the image so as to reconstruct theoriginal scene 101 into a grayscale or color image. In accordance with another embodiment, theimage reconstruction processor 130 incorporates a general purpose CPU such as anIntel Corporation Pentium 4®, or similar general purpose processor. In yet another embodiment, theimage reconstruction processor 130 incorporates an Application-Specific Integrated Circuit (“ASIC”) which implements part or all of the reconstruction processing in dedicated digital structures. This grayscale or color image reconstructed byreconstruction processor 130 is output throughinterface 133 to be displayed on adisplay device 140. - However, one limitation of CAI is the resolution of the reconstructed image. The resolution of a CAI camera is limited by the larger of two primary factors: (a) the order of the aperture array, and (b) distortion in the projected image caused by diffraction. This is explained further in the following paragraphs.
-
FIG. 2 shows several representative coded aperture array patterns of MURAs of “order” 101, 61 and 31 (described in more detail in the CAI application).FIG. 2 also shows coded aperture array patterns of PBAs oforder 8 and 24. (ThePBAs 8 and 24 are shown enlarged relative to the MURAs to better show their patterns.), Note that the coded aperture array patterns are formed from a square array (with horizontal and vertical dimensions of the specified order) that is repeated twice in the horizontal and twice in the vertical dimension. So, for example, theMURA 101 pattern has a total size of 202×202. Note also that each of the aperture elements in the arrays is of the same size. Although it appears that some of the apertures are larger than others, this is simply because adjacent apertures combine to create what appears to be a larger aperture. A CAI camera can not resolve an image that is higher resolution than the order of its coded aperture array. For example, aMURA 101 CAI camera can not resolve an image of higher resolution than 101×101 pixels. - For purposes of illustration,
FIG. 3 shows one embodiment of the visible light coded aperture camera shown inFIG. 1 . The embodiment shown inFIG. 3 is not useful for many applications because the resolution of the reconstructed image is only 3×3 pixels, but it is illustrative of how a camera such as that shown inFIG. 1 works. A MURA order 3 (“MURA 3 ”)aperture array 301 contains 16 open apertures, such asopen aperture 302, and 20 closed apertures, such asclosed aperture 303. Color orgrayscale sensor 304 is the same size as one quadrant (i.e. one 3×3 block of apertures) of theMURA 3aperture array 301 and in this embodiment it is positioned centered relative to theMURA 3aperture array 301. -
Orthographic View 320 ofFIG. 3 reveals more of the structure of the camera. Baffles (referred to as “collimators” in the CAI Application) 315 serve to collimate the light passing through open apertures, such asopen aperture 302. This restricts the FOV of each aperture projection onto color orgrayscale sensor 304. Closed apertures such asclosed aperture 303 are covered with an opaque cover so they do not allow light to pass through.Sensor 304 is separated fromMURA 3aperture array 301 and baffles 317 to allow space for the overlapping projections from each of the open apertures. The entire unit is contained within a light-tight camera body 316, which is shown to be transparent for the purposes of illustration. Note that in this particular example, even ifsensor 304 is a very high-resolution sensor, only a 3×3 pixel image can be reconstructed. -
FIG. 4 illustrates how light is projected through theMURA 3 aperture array.Illustration 400 shows theMURA 3aperture array 401 delineated by a solid black outline, with exemplaryopen aperture 402 andclosed aperture 403. The position of color orgrayscale sensor 404 is delineated by a dotted outline.Open aperture 405 is delineated by a dashed line. The light that passes throughaperture 405 projects onto a square area on the sensor plane shown as agray square 406. Note that becauseaperture array 401 is shown overlaying the projection inillustration 400, much ofprojection 406 is obstructed by closed apertures. Nonetheless, the perimeter ofprojection 406 can be seen delineated by a solid gray outline. - In this embodiment,
projection 406 is a square approximately 9 times larger thanaperture 405 and centered onaperture 405. Depending on how close orfar sensor 404 is to the aperture array, this projection may correspond to a wider or narrower FOV. Baffles around aperture 405 (not shown in this illustration, but visible asbaffles 317 inFIG. 3 ) are used in this embodiment to limit the extent ofprojection 406 to approximately 9 times larger than the size ofaperture 405. - Note that in this embodiment only a small percentage of the area of
projection 406overlaps sensor 404. Part of this overlap is visible through anopen aperture 409 and part of it is obscured byclosed aperture 408. -
Illustration 410 shows the overlaying of the 4 projections from the upper right quadrant ofaperture array 401. (For clarity, inillustrations MURA 3aperture array 401 is shown.) The 4open apertures 415 in the upper right quadrant are delineated with dashed outlines. The 4projections 416 from these 4 apertures are shown as overlapping gray areas. Each projection, like theprojection 406 shown inillustration 400, is a square approximately 9 times the size of its aperture and is centered on its aperture, and is delineated by a solid gray line. To indicate the number of overlapping projections in each area of the sensor plane, varying levels of gray scale are used to fill each area. The lightest gray indicates 1 projection, the next darker indicates 2 projections overlapping, the next darker indicates 3 projections overlapping, and finally the darkest indicates 4 projections overlapping. -
Illustration 420 shows the overlaying of all 16 projections from theentire aperture array 401. The 16open apertures 425 are delineated by dashed outlines. Each projection, like theprojection 406 shown inillustration 400, is a square approximately 9 times the size of its aperture and centered on its aperture, and is delineated by a solid gray line. To indicate the number of overlapping projections in each area of the sensor plane, varying levels of gray scale are used as described in the previous paragraph. Note that in this embodiment each area ofsensor 404 is shown covered by 4 overlapping projections. In practice, it is correct that there will be 4 overlapping projections over the vast majority of the sensor area, but because of tolerance variations, diffraction effects, and varying distances to objects in the observed scene, there may be fewer or more overlapping projections near the borders of projections, which are shown as solid gray lines in illustration 411. - Note also that most of the light hitting the
MURA 3aperture array 401 is projected beyond the edges ofsensor 404, and as a result this light is not used for the reconstruction. If the area of the rightmost column of theMURA 3aperture array 401 is disregarded (since all apertures in that column are closed, it does not contribute any light to the camera and can be removed from the system without impacting the image reconstruction), approximately 13% of the light hitting the remaining area of theMURA 3aperture array 401 is actually projected onto thesensor 404. A conventional single f/2.8 lens transmits approximately 12.7% of the light hitting the lens, so the 13% light transmission performance of thisMURA 3 coded aperture array camera can be seen as comparable to a conventional f/2.8 lens. - Generally speaking, f/2.8 is good light transmission performance for a photographic lens, so the description of the
MURA 3 coded aperture camera in the last few paragraphs characterizes a camera with potentially desirable light transmission characteristics. Unfortunately, only a 3 x 3 pixel image can be reconstructed by the system described. - Each element in a CAI camera acts geometrically like a pinhole in a pinhole camera. Light passing through each aperture makes a projection onto the sensor, just as it would in a pinhole camera. And like a pinhole camera, a CAI camera is subject to the diffraction effects of light passing through a pinhole. In a pinhole, these diffraction effects create a point source projected pattern commonly known as the “Airy disk”. The primary lobe of the Airy disk roughly defines the smallest resolvable spot size from a given pinhole camera projection. At a given distance from the pinhole to the sensor, the Airy disk increases in size as the pinhole decreases in size. From a geometric point of view, the resolution (i.e. minimum point source projection spot size) of images from a pinhole camera also increases as the pinhole gets smaller. So, for any given distance of pinhole to sensor, there is an optimum pinhole size where the point source projection spot size equals the size of the primary lobe of the Airy disk. If the pinhole is made smaller than this optimum size, resolution decreases because the Airy disk increases in size. If the pinhole is made larger than this optimum size, resolution decreases because a point source projection spot size increases. Since the characterization of resolution of a pinhole camera is subjective, different formulae have been proposed for calculating the optimal pinhole diameter. One such formula is A=SQRT(55 F), where A is the pinhole diameter in thousandths of an inch, F is the camera focal length in inches, and SQRT( )is the square root function.
- Note that achievable resolution in a pinhole camera increases as the focal length of the camera increases. Unfortunately, the physical size of the camera typically increases in proportion to the focal length, and as a result, a very large camera is needed for high resolution pinhole images. For example (using the formula A=SQRT(55 F)), the optimal pinhole size of a 1″ focal length (i.e. 1″ thick) pinhole camera is about 0.007″. For a “normal” viewing angle of about 53°, this results in about a 134.8 pixel diagonal dimension, or about a 95×95 pixel resolution image. The optimal pinhole size of a 10″ focal length (i.e. 10″ thick) pinhole camera is about 0.023″. With a 53° viewing angle, this results in about a 426.4 diagonal resolution, or about a 301×301 resolution image. (Note that different photographers will use different subjective criteria in assessing the resolvable resolution of a pinhole camera. The resolution calculated here is based on one interpretation of resolvable resolution. Other interpretations may lead higher or lower resolution assessments, but will normally be within a 2× range higher or lower than the numbers presented here.)
- Like pinhole cameras, visible light CAI cameras are also subject to diffraction effects which may result in resolution/size trade-offs. The diffraction patterns are more complex than pinhole diffraction patterns because of the complexity of the aperture patterns, and consequently, determining the impact on image resolution and/or camera size requirements is more complex. But because the pixel resolution of the CAI image can be no higher than the order of the aperture array, to achieve a high-resolution image it is necessary to utilize high order aperture arrays which can potentially exhibit worse diffraction effects than lower order aperture arrays or, alternatively, require longer focal lengths (and, as a result, larger camera sizes) to mitigate those diffraction effects.
- Another approach to improving the performance of a lens system in a digital camera is a plenoptic camera. The basic concept of a plenoptic camera is described in U.S. Pat. No. 5,076,687. Although the word “plenoptic” is not used in the patent, the device referenced in the patent is called a “plenoptic camera” by its inventor in a web page describing the camera at: http://www-bcs.mit.edu/people/jyawang/demos/plenoptic/plenoptic.html. In 2005, Stanford University researchers published a paper (Stanford Tech Report CTSR 2005-02) describing an application of a plenoptic camera implementation that achieves the DOF of a conventional f/22 lens while capturing the equivalent light from the scene that would be gathered by an f/4 lens. Unfortunately, this increase in light gathering ability comes at a theoretically linear cost of image resolution. The prototype constructed by the team resulted in about 2× beyond the theoretical resolution losses, so with a 4000×4000 pixel sensor they were able to reconstruct only a 296×296 image which exhibited the f/22 DOF with f/4 light capture (i.e. a 16 megapixel sensor yielded a 90 kilopixel image). While such a system might be useful for certain specialized applications, the enormous losses of sensor resolution would likely make such a system non-competitive for general photographic applications.
- An apparatus and method are described for capturing images. In one embodiment, the apparatus comprises: a coded lens array including a plurality of lenses arranged in a coded pattern with opaque material blocking array elements not containing lenses; and a light-sensitive semiconductor sensor coupled to the coded lens array and positioned at a specified distance behind the coded lens array, the light-sensitive sensor configured to sense light transmitted through the lenses in the coded lens array.
- A better understanding of the present invention can be obtained from the following detailed description in conjunction with the drawings, in which:
-
FIG. 1 illustrates a visible light coded aperture camera according to one embodiment of the invention. -
FIG. 2 illustrates three exemplary MURA patterns and two exemplary PBA patterns employed in accordance with the underlying principles of the invention. -
FIG. 3 illustrates the configuration of aMURA order 3 coded aperture array, baffles, sensor, and a camera body in accordance with one embodiment of the invention. -
FIG. 4 illustrates the projection of light from transparent apertures in aMURA 3 coded aperture array in accordance with one embodiment of the invention. -
FIG. 5 illustrates a coded lens camera according to one embodiment of the invention. -
FIG. 6 illustrates the configuration of aMURA order 3 coded lens array, baffles, sensor, and a camera body in accordance with one embodiment of the invention. -
FIG. 7 illustrates the projection of light from transparent apertures in aMURA 3 coded lens array in accordance with one embodiment of the invention. -
FIG. 8 illustrates a side view of aMURA order 3 coded lens camera in accordance with one embodiment of the invention. -
FIG. 9 illustrates an exemplary RGB Bayer Pattern employed in one embodiment with the invention. -
FIG. 10 illustrates image sensors implemented as a multi-layer structure and used in one embodiment of the invention. -
FIG. 11a illustrates one embodiment of the invention in which an output signal is digitized by an analog-to-digital converter (A/D) in order to allow digital image reconstruction and post-processing. -
FIG. 11b illustrates a process for selecting zero offset and gain in accordance with one embodiment of the invention. -
FIG. 12 illustrates a coded lens imaging characteristic and a typical lens imaging characteristic. -
FIG. 13 illustrates a graph showing typical CMOS and CCD image sensor transfer characteristics. -
FIG. 14 illustrates a side view of aMURA order 3 coded lens camera with multi-element lens in accordance with one embodiment of the invention. -
FIG. 15 illustrates a gearing arrangement for simultaneously focusing all of the lenses in a coded lens array in accordance with one embodiment of the invention. -
FIG. 16 illustrates a side view of a multi-element coded lens system with a gearing system for simultaneously focusing all the lenses in a coded lens array in accordance with one embodiment of the invention. -
FIG. 17a illustrates three examples of a projection and reconstruction of three flat scenes at a known range using aMURA 3 coded lens array in accordance with one embodiment of the invention. -
FIG. 17b illustrates three examples of a projection and reconstruction of three flat scenes at a known range using aPBA 24 coded lens array in accordance with one embodiment of the invention. -
FIG. 18 illustrates a reconstruction of an image at different ranges to identify the correct range in accordance with one embodiment of the invention. -
FIG. 19 illustrates an image in which a person is standing close to a camera, while mountains are far behind the person. -
FIG. 20 illustrates how the person fromFIG. 19 can readily be placed in a scene with a different background. -
FIG. 21 illustrates a photograph of an exemplary motion capture session. - A system and method for capturing still images and video using coded lens imaging techniques is described below. In the description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced without some of these specific details. In other instances, well-known structures and devices are shown in block diagram form to avoid obscuring the underlying principles of the invention.
- A visible light coded lens array camera. for either single shot images or sequential (e.g. video) images, including readout electronics and display, according to one embodiment of the invention, is illustrated in
FIG. 5 . The illustrated embodiment includes a codedlens array 501 placed in front of a light sensitive grayscale orcolor semiconductor sensor 504. The codedlens array 501 is a pattern of circular, square, hexagonal or rectangular (or any pattern that can be tiled on a plane) apertures, some of which are transparent (i.e. “open”) to visible light (e.g. element 502) and some of which are opaque (i.e. “closed) to visible light (e.g. element 503). Each open aperture, such as 502, is covered by (or contains) a lens such as 508, so that virtually all of the light passing through the open aperture passes through the lens. A typical coded lens array has approximately 50% transparent apertures, each with a lens. The coded lens array pattern shown is aMURA order 3 with a 4/5 ratio of transparent to opaque apertures. Visible light a from 2-dimensional or 3-dimensional scene 500 (which may be illuminated by ambient or artificial lighting) is projected through the lenses and open apertures of codedaperture array 501 ontoimage sensor 504. (The camera is capable of limiting the FOV to the fully coded FOV projected onto the sensor. The light contributions of overlapping projections in this fully coded FOV is shown in illustration 620 ofFIG. 6 .) In one embodiment, this is implemented by the use of a self-collimating coded lens array 501 (self-collimation is accomplished throughbaffles 517 behind the codedlens array 501, which are explained below). The space between the coded lens array and the sensor is shielded by a light-opaque housing 516 (only the outline of which is shown inFIG. 5 ), preventing any light from reaching the sensor other than by passing through a lens and open aperture of the codedlens array 501. - The camera further includes an image
sensor readout subsystem 510 with aninterface 509 to theimage sensor 504. The readout subsystem clocks out the analog image signal from theimage sensor 504 and applies analog buffering, amplification and/or filtering as required by the particular image sensor. An example of such areadout subsystem 510 that also incorporates A/D 520 is the NDX-1260 CleanCapture Image Processor by NuCore Technology, Inc. of Sunnyvale, Calif. The ability to adjust the zero offset 512 and gain 511 to analog pixel values read by the readout subsystem 510 (e.g., using at least one operational amplifier (op amp)) will increase the dynamic range of the captured image, but is not essential if the image sensor has a sufficient dynamic range for the desired image quality without a zero-offset and gain adjustment. - In one embodiment, the output of the
readout subsystem 510 is coupled byinterface 513 to at least one analog-to-digital converter (A/D) 520 which digitizes the analog output. The output of the A/D is coupled viainterface 521 to animage reconstruction processor 530, which in one embodiment incorporates a Digital Signal Processor (DSP) 532 and Random Access Memory (RAM) 531. The digitized image from theinterface 521 is stored inRAM 531, and theDSP 532 post-processes the image so as to reconstruct theoriginal scene 500 into a grayscale or color image. In accordance with another embodiment, theimage reconstruction processor 530 incorporates a general purpose CPU such as anIntel Corporation Pentium 4®, or similar general purpose processor. In yet another embodiment, theimage reconstruction processor 530 incorporates an Application-Specific Integrated Circuit (“ASIC”) which implements part or all of the reconstruction processing in dedicated digital structures. This grayscale or color image reconstructed byreconstruction processor 530 is output throughinterface 533 to be displayed on adisplay device 540. -
FIG. 6 shows one embodiment of the visible light coded lens array camera shown inFIG. 5 . A MURA order 3 (“MURA 3 ”)lens array 601 contains 16 open apertures, such asopen aperture 602, and 20 closed apertures, such asclosed aperture 603. Each open aperture, such as 602, contains one lens. In the illustrated embodiment, the lenses are round, but in alternative embodiments the lens may be other shapes (e.g. squares or hexagons) that may more completely fill theopen aperture 602 area. But, regardless of the shape oflens 608 in the present embodiment, any remaining area of theopen aperture 602 not filledbylens 608 must be opaque or nearly opaque. Color orgrayscale sensor 604 is the same size as one quadrant (i.e. one 3 x 3 block of apertures) of theMURA 3aperture array 601 and in this embodiment it is positioned centered relative to theMURA 3aperture array 601, as shown inillustration 610. (Illustration 610 showssensor 604's placement location behindMURA 3lens array 601 by showing it through the circles that illustrate the shape of the lenses. This is done simply for the sake of illustration, and this may not what would be seen upon visual inspection of an actual system due to the refraction effects of the lenses if an observer would look through them.) - Orthographic View 620 of
FIG. 6 reveals more of the structure of the camera. Baffles (referred to as “collimators” in the CAI Application) 617 serve to collimate the light passing through the lens and open apertures, such asopen aperture 602 andlens 608. This restricts the FOV of each aperture projection onto color orgrayscale sensor 604. Closed apertures such asclosed aperture 603 are covered with an opaque cover so they do not allow light to pass through.Sensor 604 is separated fromMURA 3 aperture array 611 and baffles 617 to allow space for the overlapping projections from each of the open apertures. The entire unit is contained within a light-tight camera body 616, which is shown to be transparent for the purposes of illustration. -
FIG. 7 illustrates how light is projected through theMURA 3 codedlens array 701.Illustration 700 shows theMURA 3 codedlens array 701, with exemplary open aperture andlens 702 andclosed aperture 703. The position of color orgrayscale sensor 704 that would be located behind codedlens array 701 is delineated by a dotted outline.Lens 705 is delineated by a dashed line. The light that passes throughlens 705 projects onto a square area on the sensor plane shown as agray square 706. Note that becauseaperture array 701 is shown inillustration 700 as overlaying the projection, much ofprojection 706 is obstructed by closed apertures. Nonetheless, the perimeter ofprojection 706 can be seen delineated by a solid gray outline. - In this embodiment,
projection 706 is a square approximately 9 times larger than open aperture square aroundlens 705 and centered onlens 705. Depending on how close orfar sensor 704 is to the aperture array, this projection may correspond to a wider or narrower FOV. Baffles around open aperture 705 (not shown in this illustration, but visible asbaffles 617 inFIG. 6 are used in this embodiment to limit the extent ofprojection 706 to approximately 9 times larger than the size oflens 705. - Note that in this embodiment only a small percentage of the area of
projection 706overlaps sensor 704. Part of this overlap is visible (illustratively, although not necessarily physically) through the lens ofopen aperture 709 and part of it is obscured (illustratively) byclosed aperture 708 and the area around the lens inopen aperture 709. -
Illustration 710 shows the overlaying of the 4 projections from the upper right quadrant ofaperture array 701. (For clarity inillustration MURA 3 codedlens array 701 is shown.) The 4 lenses ofopen apertures 715 in the upper right quadrant are delineated with dashed outlines. The 4projections 716 from these 4 lenses are shown as overlapping gray areas. Each projection, like theprojection 706 shown inillustration 700, is a square approximately 9 times the size of the open aperture square surrounding its lens and is centered on its lens, and is delineated by a solid gray line. To indicate the number of overlapping projections in each area of the sensor plane, each area is filled with varying levels of gray scale. The lightest gray indicates 1 projection, the next darker indicates 2 projections overlapping, the next darker indicates 3 projections overlapping, and finally the darkest indicates 4 projections overlapping. -
Illustration 720 shows the overlaying of all 16 projections from theentire aperture array 701. The 16 lenses of allopen apertures 725 are delineated by dashed outlines. Each projection, like theprojection 706 shown inillustration 700, is a square approximately 9 times the size of the open aperture square surrounding its lens and centered on its lens, and is delineated by a solid gray line. To indicate the number of overlapping projections in each area of the sensor plane, varying levels of gray scale are used as described in the previous paragraph. Note that in this embodiment each area ofsensor 704 is shown covered by 4 overlapping projections. In practice, it is correct that there will be 4 overlapping projections over the vast majority of the sensor area, but because of tolerance variations, diffraction effects, lens aberrations and varying distances to objects in the observed scene, there may be fewer or more overlapping projections near the borders of projections, which are shown as solid gray lines inillustration 720. - Note also that most of the light hitting the
MURA 3 codedlens array 701 is projected beyond the edges ofsensor 704, and as a result this light is not used for the reconstruction. If the area of the rightmost column of theMURA 3 codedlens array 701 is disregarded (since all apertures in that column are closed, it does not contribute any light to the camera and can be removed from the system without impacting the image reconstruction), approximately 10.2% (because round lenses are used in this embodiment, if square lenses were used in an alternate embodiment, the number would be approximately 13%) of the light hitting the remaining area of theMURA 3aperture array 701 is actually projected onto thesensor 704. A conventional single f/3.1 lens transmits approximately 10.2% of the light hitting the lens, so the 10.2% light transmission performance of thisMURA 3 coded aperture array camera can be seen as comparable to a conventional f/3.1 lens. - Generally speaking, f/3.1 is good light transmission performance for a photographic lens, so the description of the
MURA 3 coded lens array camera in the last few paragraphs characterizes a camera with potentially desirable characteristics. And unlike aMURA 3 coded aperture array camera, such as that illustrated inFIGS. 3 and 4 , which is limited to a 3×3 pixel resolution in the reconstruction, theMURA 3 coded lens array camera illustrated inFIGS. 5, 6 and 7 is capable of reconstructing an image at least up to the approximate diffraction limits of each of the lenses in theMURA 3 coded lens array. For example, in the case of lenses 12 mm lenses with a 36 mm focal length and a 53 degree FOV, more 2000×2000 resolution (4 megapixels) is achievable within the diffraction limits. - The preceding illustrated examples show the size of the sensor as being approximately equal to the size of one quadrant (i.e. one-half size in each dimension) as the size of the coded lens array. Although this is a typical configuration, in one embodiment the sensor dimensions are independent from the coded lens array dimensions, but the system is configured in such a way that the coded lens array projects a pattern onto the sensor that is equivalent to the pattern that would have been projected had the sensor been equal to the size of one quadrant of a coded lens array and with appropriate spacing and focal length such as the coded lens camera configurations described herein. In other words, the reconstruction of the image using the techniques described herein are reliant on the configuration of overlapping pattern of images of the scene projected onto the sensor, not on the particular configuration of the coded lens array relative to the sensor. If a different coded lens array configuration than one described herein can achieve a similar overlapping pattern on the sensor, then the image reconstruction will be the same. For example, if telephoto lenses in a
MURA 3 pattern are positioned far from the sensor, but the optical path of each is angled in such a way that the projected pattern on the sensor is the same as the pattern shown inFIG. 7 , then the image can still be reconstructed correctly. - According to one embodiment of the system illustrated in
FIG. 5 , the resultingoutput 533 from the reconstruction processor is a 2-dimensional array of grayscale or color pixels representing the scene within the FOV of the camera. In one embodiment, the pixel data is transmitted through a digital interface to a computer (or other image processing device). Thus, the output of the coded aperture camera will appear to any attached device as if it is the output of a conventional digital camera. The digital interface for transferring the reconstructed image data may be any digital interface capable of handling the bandwidth from the camera for its required application such as for example, a IEEE 1394 (“FireWire”) interface or a USB 2.0 interface (which would be suitable for current still and video camera applications). Of course, the underlying principles of the invention are not limited to any particular digital interface. Preferably, the camera includes a display 540 (e.g., an LCD or OLED display), for presenting the reconstructed images to the photographer, but in this embodiment,display device 540 andinterface 533 are optional. - According to one embodiment, the camera does not include
reconstruction processor 530. Instead, the digitized image data from the A/D converter 520 is coupled throughinterface 521 to an output buffer where the image data is packetized and formatted to be output through a digital interface. The digital interface would typically be coupled to an external computing means such as a personal computer, either to be processed and reconstructed immediately, or stored on a mass storage medium (e.g., magnetic or optical disc, semiconductor memory, etc.) for processing and reconstruction at a later time. Preferably, the external computing device has a display for presenting the reconstructed images to the photographer. Alternatively, or in addition, the digital interface is coupled directly to a mass storage medium (e.g., magnetic or optical disc, semiconductor memory, etc.). The digital interface for transferring the reconstructed image data could be any digital interface capable of handling the bandwidth from the camera for its required application (e.g., IEEE 1394 (“FireWire”) interface or a USB 2.0 interface). - According to one embodiment of the invention, the coded
lens array 501 is a Modified Uniformly Redundant Array (“MURA”) pattern. According to another embodiment of the invention, the codedlens array 501 is a Perfect Binary Array (“PBA”) pattern. According to another embodiment of the invention, the codedlens array 501 is a Uniformly Redundant Array (“URA”) pattern. And according to yet another embodiment of the invention, the codedlens array 501 is a random pattern (although the performance of the system typically will not be as optimal with a random pattern as it will with a MURA, PBA, or URA). Typically, the basic aperture pattern would be the same size as the sensor, and the overall coded lens array would be a 2×2 mosaic of this basic aperture pattern. Each transparent aperture in the array contains a lens. Three exemplary MURA patterns and one PBA pattern are illustrated inFIG. 2 .MURA 101 is a 101×101 element pattern,MURA 61 is a 61×61 element pattern, andMURA 31 is a 31×31 element pattern. PBA 8 is a 8×8 element pattern, andPBA 24 is a 24×24 element pattern. The PBA patterns are illustrated as enlarged relative to the MURA patterns. In each pattern, each black area is opaque and each white area is transparent (open) and would contain a lens. - In one embodiment, the coded aperture consists of a microlens array such as those manufactured by Suss Micro-optics of Neuchatel, Switzerland. A microlens array is an array of typically plano-convex lenses fabricated in a typically a rectilinear or hexagonal grid. In one embodiment, a microlens array would be used for the coded lens array with a lens at each location on the grid, but those lenses occurring at “closed” aperture location would be painted over with an opaque paint or an opaque material would be lithographically coated at the “closed” aperture locations..
- In another embodiment a microlens array would be fabricated with only lenses at locations of an “open” aperture in the coded lens array. “Closed” aperture locations in the coded lens array would be either painted with an opaque paint, or a opaque material would be lithographically coated at the “closed” aperture locations.
- According to the present invention the distance between the coded lens array and the sensor plane is chosen in such a way that each of the projections of the individual lenses is in focus. For imaging an object at infinity, the sensor plane is therefore placed at the focal plane of the lenses. For imaging an object at a finite distance, the sensor plane might be placed slightly behind the focal plane of lenses in order to focus at the desired distance. Unlike in coded aperture imaging, the distance between the coded lens array and the sensor plane may therefore not be chosen arbitrarily, but a constraint between focal length, image plane to sensor plane distance, and distance of the object to be image must be observed.
- One embodiment of the camera employs techniques to limit the FOV (FOV) to the fully coded FOV (FCFOV). Alternatively, the techniques of limiting the FOV may be dimensioned in such a way that the FOV is slightly larger than the FCFOV, i.e., in such a way that the FOV is composed of the FCFOV plus a small part of the partially coded FOV (PCFOV). This way, the FOV of a coded lens camera can be increased at the expense of only a very minor degradation in image quality.
- According to one embodiment, FOV limitation is achieved by placing baffles either in front of or behind the lenses in order to limit the maximum angles at which rays can pass through the coded lens array and reach the sensor.
- Note that the length of the baffles determines the size of the FOV: The longer the baffles, the narrower the FOV of the coded lens camera.
-
FIG. 8 illustrates a side view of the projected FOVs of each of the lenses in aMURA 3 coded lens camera. In this example, thebaffles 801 are placed behind thelenses 802, i.e. on the side of the lens facing thesensor 804. It should be noted, however, that the baffles may also be placed in front of the lenses, i.e. on the side of the lens facing the scene. - However, placing the baffles behind the lenses has the advantage that the
exit pupil 803 of the lens system is moved closer towards the sensor plane. This way the size of the diffraction patterns caused by each lens is reduced and hence the achievable resolution of the overall imaging system is increased. -
FIG. 8 further shows how the FOV of each lens is determined by themarginal rays 805, passing through the edges of the lens and passing just by the edge of the baffles on the opposite side. Let l denote the length of the baffles (l=18 mm inFIG. 8 ) and let further d denote the diameter of a single lens. Then, as can be seen fromFIG. 8 , the angular field of view a is given by - tan a/2=d/l
- or
- a=2 atan (d/l).
- In the example shown in
FIG. 8 where d=12 mm and/=18 mm, an angular field of view of a=67.38° results. - The
right hand illustration 810 ofFIG. 8 shows how the projections caused by the individual lenses overlap in the sensor plane. Each lens has the same angular field of view. However, due to the displacement of the lenses towards each other, there is a parallax for objects at a finite distance. Therefore, the field of view of the overall imaging system is approximately the same as the field of view of an individual lens, but may be slightly larger for objects at a finite distance due to this parallax effect. - It should be noted that
FIG. 8 shows a complete row of lenses. However, in a coded lens imaging system, some of the positions in each row will not contain any lens but be blocked. The figure only shows the complete row of lenses for illustrative purposes. Different rows of lenses in a coded lens array will contain lenses in different positions. Since typically each position will contain a lens in at least one row, the overall field of view can be derived as depicted inFIG. 8 . - When using baffles, light passing through the coded lens array parallel to the optical axis will not be attenuated. However, light passing through the coded lens array at an angle with respect to the optical axis will be partially blocked by the baffles.
- As a result, after imaging and reconstructing a scene in a coded lens camera, the sensitivity of the camera is higher in the center of the FOV (light parallel to the optical axis) than it is towards the edges of the FOV (larger angles with respect to the optical axis), due to the baffle attenuation. Thus, when imaging a constant-intensity surface, the reconstruction will be bright in the center and darker and darker towards the edges of the image. Therefore, in one embodiment of the invention, baffle attenuation is compensated for by multiplying each pixel of the reconstructed image with the inverse of the baffle attenuation the pixel has been subjected to. The baffle attenuation is known from the geometry of the lenses and baffles. This way, in the absence of any noise, a constant-intensity surface is reconstructed to a constant-intensity image.
- It should be noted, however, that inverting the baffle attenuation also causes any noise in the reconstruction to be amplified with the same factor as the signal. Therefore, the signal-to-noise ratio (SNR) of the reconstructed image is highest in the center of the image and decreases towards the edges of the image, reaching the value zero at the edges of the FOV.
- According to one embodiment of the invention, this problem is alleviated by using only a central region of the reconstructed image while discarding the periphery of the reconstructed image. According to another embodiment, the problem is further alleviated by applying a noise-reducing smoothing filter to image data at the periphery of the reconstructed image.
- From the literature, Wiener filters are known to be optimum noise-reducing smoothing filters, given that the signal-to-noise ratio of the input signal to the Wiener filter is known. In the reconstructed image of a coded lens camera, the signal-to-noise ratio varies across the image. The SNR is known for each pixel or each region of the reconstructed image. According to one embodiment, noise-reduction is achieved by applying a local Wiener filtering operation with the filter characteristic varying for each pixel or each region of the reconstructed image according to the known SNR variations.
- Unlike a coded aperture camera, which projects an image in focus at all scene object distances, a coded lens camera is subject to the focus limitations of the lenses in its coded lens array. Typically, in a conventional single lens camera, the Depth of Field (DOF) (i.e. the range from near focus to far focus) of the camera is inversely proportional to the camera's light gathering capability. This is because the DOF is typically increased by narrowing the aperture of the lens, which reduces the light from the scene that reaches the sensor.
- Although a coded lens camera does have focus limitations, a principal advantage of the coded lens camera over a conventional single lens camera is that as the effective lens aperture is narrowed to increase the DOF, the amount of light from the scene reaching the sensor is not substantially reduced.
- Consider the following: A coded lens array typically has about 50% transparent apertures with lenses and 50% opaque apertures, so typically 50% of the light from the scene passes through the coded lens array. The overlapping projections of the coded lens array typically projects onto an
area 4 times the area of the sensor, so approximately 25% of the projected light hits the sensor. So, in total, typically 25%*50%=12.5% of the light from the scene that is incident upon the coded lens array reaches the sensor. (Of course, less light may be transmitted due to attenuation from using round lenses instead of square lenses, the baffles, lens imperfections, and aberrations, and also, more light may be transmitted because a given aperture pattern may have more open than closed apertures, but geometrically, 12.5% represents the average light transmission of square apertures with 50% open apertures and is a reasonable approximation for a coded lens system.) 12.5% light transmission is approximately equivalent to a f/2.8 aperture on a single lens (which has 12.7% light transmission). - With a typical single lens system an f/2.8 aperture is a very wide aperture setting. On a 50 mm lens, f/2.8 corresponds to a 17.9 mm aperture. Consider a Nikon D100 6 megapixel camera with a 50 mm lens. If the lens is focused on a subject at a 25′ (25 foot) distance, the near focus limit is approximately 21.3′ and the far focus limit is approximately 30.2′ (30.2′−21.3′=8.82′ of total DOF). (Note: focus limits are subjective and will vary from photographer to photographer, but the same criteria are utilized for the different conditions considered in this section, so the results can be considered relative to one another. These calculations were made using a Depth of Field online calculator at http://www.dofmaster.com/dofjs.html). Any object in the scene closer than the near focus or farther than the far focus will be subject to a reduction in sharpness. Although 8.82′ is a short DOF, the f/2.8 setting passes about 12.7% of the light from the scene.
- Consider now an f/16 setting for the same Nikon D100 with a 50 mm lens. Now the aperture diameter is only 3.1 mm and only 0.4% of the light from the scene reaches the sensor. If the lens is focused on a subject at a 25′ distance, the near focus limit is approximately 13′ and the far focus limit is 805′. So, everything in the scene from 13′ to 805′ is in focus, for a 792′ DOF. Clearly, this is a dramatic improvement in DOF over the 8.82′ DOF at f/2.8. But it comes at a dramatic cost in light transmission. f/16 only transmits 0.4%/12.7%=3% of the light transmitted by f/2.8, so it can only be used with very well-illuminated scenes.
- Consider the same Nikon D100, but instead of using a single conventional 50 mm lens, a 50 mm square PBA 8 coded lens array is utilized, again focused on an object 25′ in the distance. The PBA 8 pattern shown in
FIG. 2 would be utilized, with a lens placed in each transparent (i.e. illustrated as white) aperture of the PBA 8. Since a PBA 8 is a 16×16 aperture array and in this embodiment it is 50 mm in length on each side, each lens would be about 3.1 mm in diameter, which is about the same diameter as a conventional single 50 mm lens stopped down to f/16. And as a result, the DOF of the PBA 8 coded lens array would be roughly the same as the DOF of a conventional 50 mm lens stopped down to f/16. But, because the coded lens array transmits approximately 12.5% of the light from the scene, its light transmission is similar to f/2.8. So, this embodiment of a coded lens array has a DOF comparable to an f/16 conventional lens with the light transmission characteristics of an f/2.8 conventional lens. - In another embodiment, the same coded lens array described in the previous paragraph is used with a Nikon D100 camera, but the coded lens array is focused on an object 26′ in the distance instead of 25′ away. In this case the near focus limit is 12.9′ and the far focus limit is infinity. Since everything is in focus from a certain distance through infinity, the coded lens array is functioning as a “hyperfocal lens”, with its focus distance set to the “hyperfocal distance”. This configuration is useful for certain applications where all of the objects in the scene are at least 12.9′ away, and then the lenses in the coded lens array can be set to a fixed focus and do not need to be adjusted. Note that if an object in the scene is slightly closer than 12.9′, it still may be usefully imaged. It simply will not be captured at the highest resolution, but as objects continue to get closer than 12.9′, they will get increasingly fuzzier (i.e. lower resolution). So, for applications that require high resolution for objects closer than 12.9′, a focusing means for the lenses in the coded lens array will be required.
- For clarity of illustration, the coded lens arrays shown in most of the figures have only a single lens element in each transparent aperture. Although this may be sufficient for some applications, in other applications, it is desirable to use multiple lens elements to correct for image aberrations, such as geometric distortion, coma, and chromatic aberrations. For over a century, an entire lens industry has been devoted to designing multi-element lenses to address lens aberration issues, and this vast corpus of prior art work will not be repeated here. Suffice it to say that typically, 3 elements or more are needed for photographic-quality imaging, and further, that typically, one or more of these elements needs to translate back-and-forth on the optical axis for focusing, unless the camera has a fixed focus. Frequently, such back-and-forth motion is accomplished by a rotating mechanism that turns a collar around part or all of the lens, which in turn engages a thread which moves one or more of the lens elements along the optical axis.
-
FIG. 14 illustrates a side view of a coded lens array with three-element lenses. The lens shapes shown are simply for illustrative purposes, and the actual lens shapes would vary depending on the optical characteristics desired, using any of a vast number of prior art photographic lens designs. Each aperture would have 3 such lenses in a stack within one or more concentric cylinders. Baffles would extend behind the last lens toward the sensor so as to limit the FOV of the projection. Note that each aperture position is shown containing a stack of lenses in this illustration. In practice, opaque apertures would not contain lenses, or they would be covered so as not to permit light to pass through them. -
FIG. 15 illustrates an arrangement of gears with hollow centers within a coded lens array, each gear rotating around either a lens (if the location is a transparent aperture) or rotating over an opaque aperture without a lens. (For the sake of illustration, the teeth of adjacent gears are not touching each other, but in practice they would typically fit together snugly.)Gear 1501 is coupled to the shaft of an electric motor, which is either manually controlled or is controlled by an auto-focus mechanism. As the electric motor turns, it turnsgear 1501, which in turn transfers the rotational motion to all the gears in the coded lens array. By way of example, ifgear 1501 turns clockwise, it turnsgear 1502 counterclockwise, which then turnsgears turn gear 1505 counter-clockwise. Extending this example, it can be seen that the motion ofgear 1501 turns all of the gears in the coded lens array, with each successive gear in the horizontal or vertical direction turning the opposite way. -
FIG. 16 shows a side view of a three-element coded lens array utilizing the gearing system shown inFIG. 15 . For the purposes of illustration, all lens array positions are shown with lenses. In practice, opaque lens array positions would not have lenses and would have their apertures closed so they block light. In this embodiment, each lens array position has two fixedlenses lens 1603 that translates back-and-forth along the optical axis. -
Electric motor 1620 is powered by either a manual or auto-focus means, and it turnsgear 1621, which in turn drives the other gears in the coded lens array, as previously described inFIG. 15 , includingFIG. 16 's gear 1604.Gear 1604 turnshollow cylinder 1605, which in turn driveshollow cylinder 1606, which holdslens 1603.Hollow cylinder 1606 is coupled tohollow cylinder 1605 in such a way that it is able to translate back-and-forth along the optical axis (left-to-right as shown inFIG. 16 ).Hollow cylinder 1606 hasscrew thread 1607 on its outside surface, which notches pins such aspin 1608 that are secured tostructure 1609. Ashollow cylinder 1606 rotates,screw thread 1607 causes it to translate back-and-forth along the optical axis. - As can be seen in
FIG. 15 , each subsequent gear in the coded lens array rotates in the opposite direction. As a result each subsequent hollow cylinder holding a lens is threaded with the opposite pitch, such asscrew thread 1610 has opposite pitch ofscrew thread 1607. In this way, the middle lenses of the lens array all move in the same direction when theelectric motor 1620 actuatesgear 1621, despite the fact each other gear position is rotating in an opposite direction. - In this embodiment, the
same structure 1609 that holds the lens array mechanism continues behind the lenses to form the baffles.Such structure 1609 may be made of a metal such as aluminum, plastic, or any other sufficiently sturdy, but light-opaque material. Note thatFIG. 16 shows a side view, but in practice the baffle form a box around the perimeter of each transparent aperture, and function to limit the FOV of the projection from each lens stack that projects ontosensor 1630. - Unlike in coded aperture imaging where sensor pixel size and aperture element size are typically chosen such as to be in the same order of magnitude, in coded lens imaging the individual lenses may be much larger than the sensor pixel size.
- In one embodiment, the sensor pixel size is chosen such as to be in the same order of magnitude as the resolution of the coded lens array. It should be noted that this resolution is determined by the diffraction patterns of the individual lenses. If the sensor pixel size is chosen significantly larger than the size of the diffraction patterns, resolution of the imaging system is wasted. If, on the other hand, the sensor pixel size is chosen significantly smaller than the size of the diffraction patterns, no additional usable information is gained.
- Regarding the choice of the lens size it should be noted that there is a tradeoff between the size of the diffraction patterns and the achievable DOF. The smaller a lens is chosen, the larger its diffraction pattern and the better its DOF. It is important to note, however, that there is a degree of freedom in the choice of the lens size in order to achieve the best compromise between resolution and DOF of a specific application. In coded aperture imaging, however, this degree of freedom does not exist. Rather, in coded aperture imaging the sensor pixel size and aperture element size are constrained to be more or less identical.
- According to one embodiment, the
sensor 504 ofFIG. 5 is a CCD sensor. More specifically, a color CCD sensor using a color filter array (“CFA”), also know as a Bayer pattern, is used for color imaging. A CFA is a mosaic pattern of red, green and blue color filters placed in front of each sensor pixel, allowing it to read out three color planes (at reduced spatial resolution compared to a monochrome CCD sensor).FIG. 9 illustrates an exemplary RGB Bayer Pattern. Eachpixel cluster 900 consists of 4 pixels 901-904, with color filters over each pixel in the color of (G)reen, (R)ed, or (B)lue. Note that each pixel cluster in a Bayer pattern has 2 Green pixels (901 and 904), 1 Red (902) and 1 Blue (903). Pixel Clusters are typically packed together in anarray 905 that makes up the entire CFA. It should be noted, however, that the underlying principles of the invention are not limited to a Bayer pattern. - In an alternative embodiment, a multi-layer color image sensor is used. Color sensors can be implemented without color filters by exploiting the fact that subsequent layers in the semiconductor material of the image sensor absorb light at different frequencies while transmitting light at other frequencies. For example, Foveon, Inc. of Santa Clara, Calif. offers “Foveon X3” image sensors with this multi-layer structure. This is illustrated in
FIG. 10 in whichsemiconductor layer 1001 is an array of blue-sensitive pixels,layer 1002 is an array of green-sensitive pixels, andlayer 1003 is an array of red-sensitive pixels. Signals can be read out from these layers individually, thereby capturing different color planes. This method has the advantage of not having any spatial displacement between the color planes. For example, pixels 1011-1013 are directly on top of one another and the red, green and blue values have no spatial displacement between them horizontally or vertically. - According to one embodiment of the present invention, each of the 3 RGB color planes are read out from a color imaging sensor (CFA or multi-layer) and are reconstructed individually. In one embodiment, the reconstruction algorithms detailed below are applied individually to each of the 3 color planes, yielding 3 separate color planes of the reconstructed image. These can then be combined into a single RGB color image.
- As illustrated in
FIG. 11a , the analog output signal ofimaging sensor 1101 is digitized by an analog-to-digital converter (A/D) 1104 in order to allow digital image reconstruction and post-processing. In order to exploit the full dynamic range of the A/D 1104, the sensor output is first amplified by anop amp 1100 before feeding it into the A/D. Theop amp 1100 applies a constant zero offset z (1102) and a gain g (1103) to theimage sensor 1101 output signal. The input signal to the A/D 1104 is s′=g(s−z) where s is theimage sensor 1101 output signal. In one embodiment, offset 1102 and gain 1103 are chosen in such a way that the full dynamic range of the A/D 1104 is exploited, i.e., that the lowest possible sensor signal value smin corresponds to zero and the highest possible sensor signal value smax corresponds to the maximum allowed input signal of the A/D 1104 without the A/D 1104 going into saturation. -
FIG. 12 depicts the characteristic of the resulting system. Note that as described above, the dynamic range of the scene is compressed by coded lens imaging; therefore, zero offset and gain may be higher than in conventional imaging with a single lens. In one embodiment, zero offset and gain are automatically chosen in an optimal fashion by the coded lens camera according to the following set of operations, illustrated in the flowchart inFIG. 11 b: - At 1110, an initial zero offset is selected as the maximum possible zero offset and a relatively large initial step size is selected for the zero offset. At 1111 an initial gain is selected as the maximum possible gain and a relatively large initial step size is selected for the gain.
- At 1112, an image is acquired using the current settings and a determination is made at 1113 as to whether there are any pixels in the A/D output with a zero value. If there are pixels with a zero value, then the current zero offset step size is subtracted from the current zero offset at 1114 and the process returns to 1112.
- Otherwise, if there are no pixels with a zero value, a check is made at 1115 as to whether the current zero offset step size is the minimum possible step size. If this is not the case, then at 1116 a, the current zero offset step size is added to the current zero offset, making sure that the maximum possible zero offset is not exceeded. The current zero offset step size is then decreased at 1116 b (e.g., by dividing it by 10) and the process returns to 1112.
- Otherwise, at step 1117, an image is acquired using the current settings. At 1118, a determination is made as to whether there are any pixels in the A/D output with the maximum output value (e.g. 255 for an 8-bit A/D). If there are pixels with the maximum value, then the current gain step size is subtracted from the current gain at 1119 and the process returns to 1117.
- Otherwise, at 1120, a determination is made as to whether the current gain step size is the minimum possible step size. If this is not the case, then at 1121 a, the current gain step size is added to the current gain, making sure the maximum possible gain is not exceeded. The current gain step size is then decreased at 1121 b (e.g., by dividing it by 10) and the process returns to 1117. Otherwise, the process ends with the current zero offset and gain settings.
- Before applying the reconstruction algorithm, the effects of zero offset and gain have to be reversed. In one embodiment, this is done by digitally computing the corrected sensor signal s* from the A/D output signal s″ whereas s″ is the output of the A/D pertaining to the A/D input signal s′ and s*=s′/g+z. Note that in the absence of noise in the
op amp 1100 and in the absence of quantization errors, s* would equal the original analog sensor output signal s. - In coded lens imaging, each sensor pixel is exposed to light emitted by different pixels of the scene, reaching the sensor pixel through different lenses within the coded lens array. The reconstruction algorithms used in coded lens imaging assume that sensor image is the linear sum of all sensor images which each individual lens would have projected onto the sensor. Therefore, in one embodiment, the sensor output signal s is an exactly linear function of the number p of photons hitting each sensor pixel during the exposure time. The function describing the dependency of the sensor output signal from the actual photon count of each sensor pixel is called the “transfer characteristic” of the sensor. CCD imaging sensors have a linear transfer characteristic over a large range of intensities while CMOS imaging sensors have a logarithmic transfer characteristic. A graph showing typical CMOS and CCD image sensor transfer characteristics is shown in
FIG. 13 . When the transfer characteristic s=f (p) of the sensor is known, it can be compensated for by means of a lookup table. That is, instead of using the value e for the reconstruction, the value LUT (s*)=LUT (s″ l g+z) is used where LUT is a lookup table compensating for any non-linear effects in the sensor transfer characteristic. Once the operations above have been completed, the adjusted sensor image is stored in the memory of the DSP, ASIC or other type ofimage reconstruction processor 530 of the camera in preparation for image reconstruction. - It should be noted that in coded lens photography, the dynamic range of the sensor signal may be different from the dynamic range of the imaged scene. Since each sensor pixel is exposed to multiple scene pixels across the entire FOV, the coded lens array has an averaging effect on the range of intensities. Even scenes with a high dynamic range (e.g. dark foreground objects and bright background objects) produce sensor signals with a lower dynamic range. In the process of image reconstruction, the dynamic range of the original scene is reconstructed independently of the dynamic range of the imaging sensor. Rather, the limited dynamic range of the imaging sensor (finite number of bits for quantization) leads to quantization errors which can be modeled as noise in the sensor image. This quantization noise also causes noise in the reconstruction. The noise is more prominent close to the edges of the reconstructed image as described above, since in these areas a high multiplier must be applied for compensating for baffle attenuation. As a result, imaging a scene with high dynamic intensity range with an imaging sensor with low dynamic range causes the reconstructed image to be more noisy, but not to have lower dynamic range. This is in contrast to conventional single lens photography where the dynamic range of the imaging sensor directly limits the maximum dynamic range of the scene which can be imaged.
- The following set of operations are used in one embodiment of the invention to reconstruct scenes from sensor images that are captured and adjusted as described above. According to Gottesman, a MURA lens array is constructed in the following way. First consider a Legendre sequence of length p where p is an odd prime. The Legendre sequence l (i) where i=0, 1, . . . , p−1 is defined as:
- l (0)=0,
- l (i)=+1if for any k=1, 2, . . . , p−1 the relation k2 mod p=l is satisfied
- l (i)=−1 otherwise.
- Then the MURA a (i, j) of size p×p is given by:
- a (0, j)=0 for j=0, 1, . . . , p−1,
- a (i, 0)=1 for i=1, 2, . . . p−1,
- a (i, j)=(l (i)*l (j)+1)/2 for i=1, 2, . . . , p−1 and j=1, 2, . . . , p−1.
- In this MURA array, a 1 represents a lens and a 0 represents an opaque element in the coded lens array. The number of lenses in a single period of this MURA is K=(p2−1)/2. The periodic inverse filter g (i, j) pertaining to this MURA is given by:
- g (0, 0)=+1/K,
- g (i, j)=(2 a (i, j)−1)/K if i>0 or j>0.
- It can be shown that the periodic cross-correlation function phi (n, m) between a (i, j) and g (i, j) is 1 for n=0 and m=0, and 0 otherwise. The periodic inverse filter pertaining to a MURA therefore has the same structure as the MURA itself, except for a constant offset and constant scaling factor, and for the exception of a single element which is inverted with respect to the original MURA.
FIG. 2 shows various sizes of MURA lens array patterns. - In a similar manner, a PBA according to Busboom can be used as a lens array. Its periodic inverse filter has exactly the same structure as the PBA itself, except for a constant offset and constant scaling factor. The formulas and algorithms for generating PBAs can be found in A. B
usBooM: ARRAYS UND REKONSTRUKTIONSALGORITHMEN FUER BILDGEBENDE SYSTEME MIT CODIERTER APERTUR . VDI VERLAG , DUESSELDORF , 1999, ISBN 3-18-357210-9, PAGES 52-56. PBAs oforder 8 and 24 are illustrated inFIG. 2 . They are enlarged relative to the MURA patterns. - When an object at a constant distance is imaged with a coded lens array, the sensor image is given by the periodic cross-correlation function of the object function with the coded lens array, magnified by a geometric magnification factor f as described above. For reconstructing the original object, the periodic cross-correlation function of the measured sensor image with an appropriately magnified version of the periodic inverse filter is computed. In the absence of noise and other inaccuracies of the measured sensor image, the result equals the original object function.
- Performing the inverse filtering then consists of the following set of operations:
- 1. Compute the periodic inverse filter pertaining to the coded lens array pattern.
- 2. Compute a geometrically magnified version of this inverse filter in such a way that the distance between two adjacent elements of the inverse filter equals the separation of two adjacent lens projections of the scene in the sensor plane. The magnified version of the inverse filter is resampled according to the sensor resolution in such a way that all values between two filter elements are padded with zeros and the filter elements are represented as non-zeros peaks, each having the size of a single pixel. According to one embodiment of the invention, if the distance between two adjacent lens projections is not an integer multiple of the pixel size, standard interpolation techniques known from signal processing are used in order to compute the magnified version of the inverse filter. In this case, each filter element may spread across more than one pixel. It should be noted that the separation between two adjacent lens projections varies with the distance of the object from the coded lens camera. Therefore, different inverse filters may be used in order to reconstruct objects at different distances.
- 3. Compute the two-dimensional, periodic cross-correlation function between the sensor image and the inverse filter, resampled to the sensor resolution according to step (2).
- 4. Divide each pixel of the result of 3. by K, the number of lenses in a single period of the MURA or PBA or other lens array pattern.
Reconstruction of a Scene with One Object at a Known Range - As mentioned above, in one embodiment, reconstruction of the scene from the sensor signal is performed in a digital signal processor (“DSP”) (e.g., DSP 132) integrated into the camera or in a computing device external to the camera. In one embodiment, scene reconstruction consists of the following sequence of operations:
- 1. Linearize the transfer characteristic of the output signal of the sensor such that the linearized output signal of each sensor pixel is proportional to the number of photons counted by the sensor pixel.
- 2. Periodically cross-correlate the sensor signal with the appropriately magnified periodic inverse filter pertaining to the coded lens array.
- 3. Clip the result to non-negative pixel values.
- 4. Compensate for baffle attenuation by multiplying each pixel with an appropriate amplification factor.
- 5. Optionally smooth the off-axis parts of the result which are more subject to noise amplification during (4) than the center part of the result.
- It should be noted that if the aperture array is a MURA, the inverse filtering of operation (2) can be decomposed into a sequence of two one-dimensional filter operations, one of which is applied per image row and the other of which is applied per image column. This decomposition may reduce the computational complexity of (2) in the case of large array orders.
-
FIG. 17a illustrates three examples of the projection and reconstruction of three flat scenes at a known range using the procedure described in the preceding paragraph. In the example, a 3×3 MURA pattern was used for the lens array (1700). The distance (pitch) between two adjacent lenses in the array was 3 mm. Each lens had a focal length of 5 mm which was also the distance between the lens array and the sensor. The sensor was a 10×10 mm sensor with 30×30 um square pixels.Scene 1701 is a flat (2-dimensional) test pattern of 307×307 pixels. It is projected through the 3×3 elementMURA lens array 1700 onto the image sensor, resulting in thesensor image 1711.Sensor image 1711 is adjusted and reconstructed per the process described above resulting inreconstruction 1721. Note that theextreme corners 1730 ofreconstruction 1721 are not accurately reconstructed. This is due to the attenuation of light during the projection through the baffles at the extreme edges of the image . In the same manner, flat 307×307pixel image 1702 is projected through thelens array 1700 resulting insensor image 1712 and is processed to result inreconstruction 1722. In the same manner, flat 307×307pixel image 1703 is projected through thelens array 1700 resulting insensor image 1713 and is processed to result inreconstruction 1723. -
FIG. 17b illustrates three similar examples asFIG. 17a . However, inFIG. 17b a 24×24 PBA pattern was used as the lens array pattern (1750). The lenses had a pitch of 0.39 mm such that the total size of the lens array was similar to that ofFIG. 17a (18.72×18.72 mm inFIGS. 17b and 18×18 mm inFIG. 17a ). The same sensor as in the example ofFIG. 17a was used. The lenses had again a focal length of 5 mm.Scene 1701 is projected through the 24×24 elementPBA lens array 1750 onto the image sensor, resulting in thesensor image 1731.Sensor image 1731 is adjusted and reconstructed per the process described above resulting inreconstruction 1741. In the same manner, flat 307×307pixel image 1702 is projected through thelens array 1750 resulting insensor image 1732 and is processed to result inreconstruction 1742. In the same manner, flat 307×307pixel image 1703 is projected through thelens array 1750 resulting insensor image 1733 and is processed to result inreconstruction 1743. It can be observed from the sensor images (1711-1713 and 1731-1733) in the two examples that increasing the order of the lens array flattens the contrast in the sensor image. In the sensor images 1731-1733 ofFIG. 17b , no more details of the original scene are recognizable. However, as can be seen from the reconstructions 1741-1743, the sensor images still contain all the information necessary for reconstructing the original scene. - It is noted that, as described above, sensor images 1711-1713 and 1731-1733 may be quantized at a given number of bits per pixel (e.g. 8), but may yield in the reconstructed images 1721-1723 and 1741-1743 an image with a useful dynamic range comparable to a higher number of bits per pixel (e.g. 10).
- Reconstruction of a Scene with One Object at an Unknown Range
- In one embodiment, operation (2) of the sequence of operations described above in section “Reconstruction of a Scene with One Object at a Known Range” are repeated for different expected object ranges o, when the true object range is uncertain or unknown. By this technique a set of multiple reconstructions is obtained from the same sensor signal. Within this set of reconstructions, the one where the expected object range is identical with or closest to the true object range will be the most accurate reconstruction of the real scene, while those reconstructions with a mismatch between expected and true range will contain artifacts. These artifacts will be visible in the reconstruction as high-frequency artifacts, such as patterns of horizontal or vertical lines or ringing artifacts in the neighborhood of edges within the reconstruction.
- According to one embodiment of the present invention, among this set of reconstructions, the one with the least artifacts is manually or automatically selected. This allows a change in the range of reconstruction without the need to pre-focus the camera and, in particular, without the need to mechanically move parts of the camera, as would be required with a conventional single lens camera, or to pre-select an expected object range. Further, this allows the user to decide about the desired range of reconstruction after the image acquisition (i.e. retrospectively). Preferably, the range of reconstruction is automatically selected from the set of reconstructions by identifying the reconstruction with the least amount of high-frequency artifacts and the smoothest intensity profile.
- A simple, but highly effective criterion for “focusing” a coded lens camera, i.e., for determining the correct range from a set of reconstructions, is to compute the mean m and the standard deviation σ of all gray level values of each reconstruction. Further, the ratio m/σ is computed for each reconstruction. The reconstruction for which this ratio takes on its maximum is chosen as the optimal reconstruction, i.e., as the reconstruction which is “in focus.” This technique produces the best results if the objects in the scene are in focus in each of the individual projections.
-
FIG. 18 illustrates how a scene is reconstructed at a set of different ranges. A similar system configuration as inFIG. 17b was used for producingFIG. 18 , i.e. a 24×24 PBA pattern was used for projection. The original scene was thetest image 1701 fromFIG. 17b which was imaged at a range of 1,000 mm. Reconstructions were computed from the resulting sensor image at assumed ranges of 500 mm (1801), 800 mm (1802), 1,000 mm (1803) and 5,000 mm (1804). In the figure, it can clearly be seen that the reconstruction in the lower left-hand corner at the correct range of 1,000 mm looks “clean” while the reconstructions at different ranges contain strong high-frequency artifacts.FIG. 18 also shows the standard deviation (“stddev”) of the gray values in each of the four reconstructions.FIG. 18 further shows the quotients (m/s) of the gray value mean, divided by the gray value standard deviation, for each of the four reconstructions. This value starts at 0.0977 at an assumed range of 500 mm, then continuously increases to a maximum of 2.0 at the correct range of 1,000 mm, then continuously decreases, reaching a value of 0.1075 at an assumed range of 5,000 mm. The example shows how the true range of the scene can be easily computed from a set of reconstructions by choosing the reconstruction at which the quotient m/s takes on its maximum. - According to one embodiment, only a partial reconstruction of parts of the image is computed using different expected object ranges o. A partial reconstruction is computed by only evaluating the periodic cross-correlation function in operation (2) above in section “Reconstruction of a Scene with One Object at a Known Range” for a subset of all pixels of the reconstructed image, thus reducing the computational complexity of the reconstruction. This subset of pixels may be a sub-sampled version of the image, a contiguous region of the image, or other suitable subsets of pixels. Then, the two one-dimensional periodic filtering operations only need to be evaluated for a subset of rows and/or columns of the reconstructed image. From the set of partial reconstructions, the one with the least amount of high-frequency artifacts and the smoothest intensity profile is identified in order to determine the true object range o. For the identified true object range o, a full reconstruction is then performed. This way, the computational complexity of reconstructing the scene while automatically determining the true object range o can be reduced.
- Reconstruction of a Scene with Multiple Objects at Unknown Ranges
- According to one embodiment, a set of full image reconstructions at different object ranges o is computed. Since objects in different parts of the scene may be at different ranges, the reconstructions are decomposed into several regions. For each region, the object range o which yields the least amount of high-frequency artifacts and the smoothest intensity profile is identified. The final reconstruction is then assembled region by region whereas for each region the reconstruction with the optimum object range o is selected. This way, images with infinite depth of FOV (from close-up to infinity) can be reconstructed from a single sensor signal.
- The combined reconstruction is of lower quality than a flat reconstruction of a flat scene, i.e., of a scene with only a single object at a single range. The presence of other regions in the scene which are “out of focus” do not only cause the out-of-focus regions to be of inferior quality in the reconstruction, but also cause the in-focus region to contain artifacts in the reconstruction. In other words, there is a “crosstalk” between the out-of-focus and the in-focus regions. This crosstalk and techniques for suppressing it are addressed in the following.
- As explained before, the “flat” reconstruction of a region r1 at range o1 would only be accurate if the entire scene were at a constant range o1. If, however, other regions are at different ranges, there will be “crosstalk” affecting the reconstruction of region r1. Therefore, according to one embodiment, an iterative reconstruction procedure is employed which eliminates this crosstalk among different regions in the scene at different ranges. The iterative reconstruction procedure according to one embodiment of the invention consists of the following set of operations.
- 1. Computing a “flat” reconstruction, i.e., a reconstruction assuming a homogeneous range across the entire scene, at a set of ranges 0 1, 0 2, . . . , on.
- 2. Using the flat reconstructions obtained this way to decompose the scene into a number of contiguous regions r1, r2, . . . , rm and corresponding ranges o1, o2, . . . , om. The decomposition is done in such a way that for each region its reconstruction r1 at range o1 is “better”, i.e., contains less high-frequency artifacts and has a smoother intensity profile, than all reconstructions of the same region at other ranges.
- 3. For each of the reconstructed regions ri (i=1, 2, . . . , m) computing its contribution si to the sensor image. This is done by computing the two-dimensional, periodic cross-correlation function of ri with the lens array pattern. Note that if the reconstructions of all the regions were perfect, then the sum of all sensor image contributions would equal the measured sensor image s.
- 4. For each of the reconstructed regions ri (i=1, 2, . . . , m) subtracting the sensor image contributions of all other regions from the measured sensor image, i.e.,
-
- Note that each Δsi (i=1, 2, . . . , m) now contains a sensor image pertaining only to region ri, the contributions of all other regions ri, j≠i, being mostly suppressed. Due to the fact that the reconstruction of the other regions will not be perfect but contain reconstruction errors, there will be some remaining crosstalk, i.e. the Δsi will contain some residual contributions from the other regions. However, this crosstalk is much lower than the crosstalk without computation of a difference sensor image.
- 5. Utilizing the Δsi (i=1, 2, . . . , m) to compute a refined reconstruction r′i for each region at range oi. Optionally, this step can be repeated with a number of different ranges around the initial range oj in order to also refine the range estimate oi. In this case, for each region the reconstruction and range with the least high-frequency artifacts and the smoothest intensity profile are selected.
- 6. Optionally, going back to operation (3) for an additional refinement of each region.
- According to one embodiment, the output signal of the coded lens camera (in addition to the two-dimensional image information) also contains range information for each image pixel or for several image regions, as determined from finding the object range o for each region with the least amount of high-frequency artifacts and the smoothest intensity profile. Thus, for every pixel reconstructed in the image, in addition to the reconstruction deriving a single intensity value (for grayscale visible light, infrared, ultraviolet or other single frequency radiation) or three intensity values for visible red, green, blue color light, the reconstruction assigns a z value indicating the distance from the camera to the object at that pixel position in the image. This way, three-dimensional image data can be obtained from a single, two-dimensional sensor signal. Further, the range data allows the camera, an external imaging manipulation system, or the user, utilizing an image manipulation application or system to easily segment the two-dimensional image into different regions pertaining to different parts of the scene, such as separating objects in the foreground of a scene from the background of a scene.
- Chroma-keying is a technique commonly used in video and photographic production to separate a foreground image from a solid background color. Typically, a “blue screen” or “green screen” is used, which is a very carefully colored and illuminated screen that is placed behind a performer or object while the scene is photographed or captured on video or film. Either in real-time or through post-processing, a hardware or software system separates the presumably distinctively colored foreground image from the fairly uniformly colored background image, so that the foreground image can be com posited into a different scene. For example, typically the weatherperson on a TV news show is chroma-keyed against a blue or green screen, then com posited on top of a weather map.
- Such blue or green screens are quite inconvenient for production. They are large and bulky, they require careful illumination and must be kept very clean, and they must be placed far enough behind the foreground object so as not to create “backwash” of blue or green light onto the edges of the foreground object. Utilizing the principles of the embodiment of the previous paragraph, an image can be captured without a blue or green screen, and the z value provided with each pixel will provide a compositing system with enough information to separate a foreground object from its background (i.e., by identifying which pixels in the scene contain the image of closer objects and should be preserved in the final image, and which pixels in the scene contain the image of further away objects and should be removed from the final image). This would be of substantial benefit in many applications, including photographic, video, and motion picture production, as well as consumer applications (e.g. separating family members in various pictures from the background of each picture so they may be composited into a group picture with several family members).
-
FIG. 20 shows how aperson 1901 fromFIG. 19 can readily be placed in a scene with a different background, such as thecastle 2002 with thebackground mountains 2002 removed from the picture. This is simply accomplished by replacing every pixel in the image reconstructed fromFIG. 19 that has a z value greater than that ofperson 1901 with a pixel from the image of thecastle 2002. Once again, the processing of z values may be implemented using virtually any type of image processor including, for example, a DSP, ASIC or a general purpose processor. - The per-pixel distance ranging capability of one embodiment also has applications in optical performance motion capture (“mocap”). Mocap is currently used to capture the motion of humans, animals and props for computer-generated animation, including video games (e.g. NBA Live 2005 from Electronic Arts of Redwood City, Calif.), and motion pictures (e.g. “The Polar Express”, released by the Castle Rock Entertainment, a division of Time Warner, Inc, New York, N.Y.). Such mocap systems (e.g. those manufactured by Vicon Motion Systems, Ltd. of Oxford, United Kingdom) typically utilize a number of single lens video cameras surrounding a performance stage. Retroreflective markers (or other distinctive markings) are placed all over the bodies of performers and upon props. The video cameras simultaneously capture images of the markers, each capturing the markers within its FOV that is not obstructed. Finally, software analyzes all of the video frames and by triangulation, tries to identify the position of each marker in 3D space.
-
FIG. 21 is a photograph of an exemplary motion capture session. The three bright rings of light are rings of LEDs around the single lenses of the video cameras 2101-2103. The performers are wearing tight-fitting black suits. The gray dots on the suits are retroreflective markers that reflect the red LED light back to the camera lenses causing the markers to stand out brightly relative to the surrounding environment. Four such retroreflective markers on the knees of the left performer are identified as 2111-2114. - Because all of the markers look the same in a camera image, one of the challenges faced by mocap systems is determining which marker image corresponds to which marker (or markers) in the scene, and then tracking them frame-to-frame as the performers or props move. Typically, the performer stands roughly in a known position, with the markers placed in roughly known positions on the performer's body (or on a prop). The cameras all capture an initial frame, and the software is able to identify each marker because of the approximately known position of the performer and the markers on the performer. As the performer moves, the markers move in and out of the fields of view of the cameras, and often become obscured from the one, several or even all cameras as the performer moves around. This creates ambiguities in the mocap system's ability to continue to identify and track the markers.
- For example, if a frame of a given video camera shows a marker centered at a given (x, y) pixel position, it is quite possible that the image is really showing two markers lined up one behind the other, leaving one completely obscured. In the next frame, the performer's motion may separate the markers to different (x, y) positions, but it can be difficult to determine which marker was the one in front and which was the one in back in the previous frame (e.g. the marker further away may appear slightly smaller, but the size difference may be less than the resolution of the camera can resolve). As another example, a performer may roll on the floor, obscuring all of the markers on one side. When the performer stands up, many markers suddenly appear in a camera's image and it may be difficult to identify which marker is which. A number of algorithms have been developed to improve this marker identification process, but it is still the case that in a typical motion capture session, human operators must “clean up” the captured data by manually correcting erroneous marker identification, frame-by-frame. Such work is tedious, time-consuming and adds to the cost of mocap production.
- In one embodiment of the invention, single lens video cameras are replaced by video cameras utilizing coded lens techniques described herein. The coded lens cameras not only capture images of the markers, but they also capture the approximate depth of each marker. This improves the ability of the mocap system to identify markers in successive frames of capture. While a single lens camera only provides useful (x, y) position information of a marker, a coded lens camera provides (x, y, z) position information of a marker (as described above). For example, if one marker is initially in front of the other, and then in a subsequent frame the markers are separated, it is easy for the coded lens camera to identify which marker is closer and which is further away (i.e., using the z value). This information can then be correlated with the position of the markers in a previous frame before one was obscured behind the other, which identifies which marker is which, when both markers come into view.
- Additionally, it is sometimes the case that one marker is only visible by one mocap camera, and it is obscured from all other mocap cameras (e.g. by the body of the performer). With a single lens mocap camera, it is not possible to triangulate with only one camera, and as such the markers (x, y, z) position can not be calculated. With a coded lens camera, however, the distance to the marker is known, and as a result, its (x, y, z) position can be easily calculated.
- In another embodiment, coded lens cameras are used in robot vision systems. For example, in manufacturing applications a conventional lens camera can not provide distance information for a robotic armature to determine the (x, y, z) position of a part that it needs to pick up and insert in an assembly, but a coded lens camera can.
- In one embodiment, coded lens cameras are employed within security systems. Because they have the ability to use low dynamic range sensors to capture high dynamic range scenes, they can provide usable imagery in situations where there is backlighting that would normally wash out the image in a conventional single lens camera. For example, if an intruder is entering a doorway, if there is bright daylight outside the doorway, a conventional single lens camera may not be able to resolve a useful image both outside the doorway and inside the doorway, whereas a coded lens camera can.
- Embodiments of the invention may include various steps as set forth above. The steps may be embodied in machine-executable instructions which cause a general-purpose or special-purpose processor to perform certain steps. For example, the various operations described above may be software executed by a personal computer or embedded on a PCI card within a personal computer. Alternatively, or in addition, the operations may be implemented by a DSP or ASIC. Moreover, various components which are not relevant to the underlying principles of the invention such as computer memory, hard drive, input devices, etc, have been left out of the figures and description to avoid obscuring the pertinent aspects of the invention.
- Elements of the present invention may also be provided as a machine-readable medium for storing the machine-executable instructions. The machine-readable medium may include, but is not limited to, flash memory, optical disks, CD-ROMs, DVD ROMs, RAMs, EPROMs, EEPROMs, magnetic or optical cards, propagation media or other type of machine-readable media suitable for storing electronic instructions. For example, the present invention may be downloaded as a computer program which may be transferred from a remote computer (e.g., a server) to a requesting computer (e.g., a client) by way of data signals embodied in a carrier wave or other propagation medium via a communication link (e.g., a modem or network connection).
- Throughout the foregoing description, for the purposes of explanation, numerous specific details were set forth in order to provide a thorough understanding of the present system and method. It will be apparent, however, to one skilled in the art that the system and method may be practiced without some of these specific details. For example, while the embodiments of the invention are described above in the context of a “camera,” the underlying principles of the invention may be implemented within virtually any type of device including, but not limited to, PDA's, cellular telephones, and notebook computers. Accordingly, the scope and spirit of the present invention should be judged in terms of the claims which follow.
Claims (24)
1. A system for three-dimensional motion capture comprising:
a plurality of objects in motion that emit or reflect electromagnetic radiation (EMR);
a receiver that receives the emitted or reflected EMR from the plurality of objects; and
the system resolves ambiguities in identifying two or more of the plurality of objects by using the received EMR to determine the distance between the receiver and at least one object.
2. The system as in claim 1 wherein the plurality of objects are markers or distinctive markings placed on performers.
3. The system as in claim 1 wherein the receiver is a camera.
4. The system as in claim 1 wherein the system utilizes the distance between the receiver and at least one object to resolve ambiguities in determining the position of at least one object among the plurality of objects.
5. The system as in claim 1 where the system correlates the positions of the plurality of markers at a first time interval with the positions of the plurality of markers in a second time interval to track the motions of the plurality of markers between the time intervals.
6. The system as in claim 1 further wherein the EMR is a wavelength that is visible to the human eye.
7. The system as in claim 1 further wherein the EMR is a wavelength that is not visible to the human eye.
8. The system as in claim 1 wherein the EMR is infrared (IR) radiation.
9. The system as in claim 1 wherein the EMR is ultraviolet (UV) radiation.
10. The system as in claim 1 wherein the EMR is X-ray radiation.
11. The system as in claim 1 wherein the EMR is radiation at a single frequency.
12. The system as in claim 1 wherein the EMR is radiation at multiple frequencies.
13. A method for three-dimensional motion capture comprising:
a plurality of objects in motion emitting or reflecting electromagnetic radiation (EMR);
a receiver receiving the emitted or reflected EMR from the plurality of objects; and
the receiver resolving ambiguities in identifying two or more of the plurality of objects by using the received EMR to determine the distance between the receiver and at least one object.
14. The method as in claim 13 wherein the plurality of objects are markers or distinctive markings placed on performers.
15. The method as in claim 13 wherein the receiver is a camera.
16. The method as in claim 13 wherein the receiver utilizes the distance between the receiver and at least one object to resolve ambiguities in determining the position of at least one object among the plurality of objects.
17. The method as in claim 13 where the receiver correlates the positions of the plurality of markers at a first time interval with the positions of the plurality of markers in a second time interval to track the motions of the plurality of markers between the time intervals.
18. The method as in claim 13 further wherein the EMR is a wavelength that is visible to the human eye.
19. The method as in claim 13 further wherein the EMR is a wavelength that is not visible to the human eye.
20. The method as in claim 13 wherein the EMR is infrared (IR) radiation.
21. The method as in claim 13 wherein the EMR is ultraviolet (UV) radiation.
22. The method as in claim 13 wherein the EMR is X-ray radiation.
23. The method as in claim 13 wherein the EMR is radiation at a single frequency.
24. The method as in claim 13 wherein the EMR is radiation at multiple frequencies.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/207,941 US20190116326A1 (en) | 2005-01-18 | 2018-12-03 | Apparatus and method for capturing still images and video using coded lens imaging techniques |
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/039,029 US7767949B2 (en) | 2005-01-18 | 2005-01-18 | Apparatus and method for capturing still images and video using coded aperture techniques |
US70143505P | 2005-07-20 | 2005-07-20 | |
US11/210,098 US7671321B2 (en) | 2005-01-18 | 2005-08-22 | Apparatus and method for capturing still images and video using coded lens imaging techniques |
US12/691,500 US8013285B2 (en) | 2005-01-18 | 2010-01-21 | Apparatus and method for capturing still images and video using coded lens imaging techniques |
US13/226,461 US8288704B2 (en) | 2005-01-18 | 2011-09-06 | Apparatus and method for capturing still images and video using coded lens imaging techniques |
US13/652,259 US10148897B2 (en) | 2005-07-20 | 2012-10-15 | Apparatus and method for capturing still images and video using coded lens imaging techniques |
US16/207,941 US20190116326A1 (en) | 2005-01-18 | 2018-12-03 | Apparatus and method for capturing still images and video using coded lens imaging techniques |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/652,259 Continuation US10148897B2 (en) | 2005-01-18 | 2012-10-15 | Apparatus and method for capturing still images and video using coded lens imaging techniques |
Publications (1)
Publication Number | Publication Date |
---|---|
US20190116326A1 true US20190116326A1 (en) | 2019-04-18 |
Family
ID=47677317
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/652,259 Active US10148897B2 (en) | 2005-01-18 | 2012-10-15 | Apparatus and method for capturing still images and video using coded lens imaging techniques |
US16/207,941 Abandoned US20190116326A1 (en) | 2005-01-18 | 2018-12-03 | Apparatus and method for capturing still images and video using coded lens imaging techniques |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/652,259 Active US10148897B2 (en) | 2005-01-18 | 2012-10-15 | Apparatus and method for capturing still images and video using coded lens imaging techniques |
Country Status (1)
Country | Link |
---|---|
US (2) | US10148897B2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11595575B2 (en) | 2020-05-11 | 2023-02-28 | Samsung Electronics Co., Ltd. | Image sensor |
Families Citing this family (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11394436B2 (en) | 2004-04-02 | 2022-07-19 | Rearden, Llc | System and method for distributed antenna wireless communications |
US10886979B2 (en) | 2004-04-02 | 2021-01-05 | Rearden, Llc | System and method for link adaptation in DIDO multicarrier systems |
US11309943B2 (en) | 2004-04-02 | 2022-04-19 | Rearden, Llc | System and methods for planned evolution and obsolescence of multiuser spectrum |
US10425134B2 (en) | 2004-04-02 | 2019-09-24 | Rearden, Llc | System and methods for planned evolution and obsolescence of multiuser spectrum |
US10749582B2 (en) | 2004-04-02 | 2020-08-18 | Rearden, Llc | Systems and methods to coordinate transmissions in distributed wireless systems via user clustering |
US8654815B1 (en) | 2004-04-02 | 2014-02-18 | Rearden, Llc | System and method for distributed antenna wireless communications |
US9819403B2 (en) | 2004-04-02 | 2017-11-14 | Rearden, Llc | System and method for managing handoff of a client between different distributed-input-distributed-output (DIDO) networks based on detected velocity of the client |
US9826537B2 (en) | 2004-04-02 | 2017-11-21 | Rearden, Llc | System and method for managing inter-cluster handoff of clients which traverse multiple DIDO clusters |
US10985811B2 (en) | 2004-04-02 | 2021-04-20 | Rearden, Llc | System and method for distributed antenna wireless communications |
US10277290B2 (en) | 2004-04-02 | 2019-04-30 | Rearden, Llc | Systems and methods to exploit areas of coherence in wireless systems |
US11451275B2 (en) | 2004-04-02 | 2022-09-20 | Rearden, Llc | System and method for distributed antenna wireless communications |
US9685997B2 (en) | 2007-08-20 | 2017-06-20 | Rearden, Llc | Systems and methods to enhance spatial diversity in distributed-input distributed-output wireless systems |
US9001231B2 (en) * | 2011-06-03 | 2015-04-07 | Rambus Inc. | Image acquisition using oversampled one-bit poisson statistics |
RU2616175C2 (en) * | 2011-09-28 | 2017-04-12 | Конинклейке Филипс Н.В. | Object distance determination by image |
JP2014082541A (en) * | 2012-10-12 | 2014-05-08 | National Institute Of Information & Communication Technology | Method, program and apparatus for reducing data size of multiple images including information similar to each other |
US11050468B2 (en) | 2014-04-16 | 2021-06-29 | Rearden, Llc | Systems and methods for mitigating interference within actively used spectrum |
US11189917B2 (en) | 2014-04-16 | 2021-11-30 | Rearden, Llc | Systems and methods for distributing radioheads |
US11190947B2 (en) | 2014-04-16 | 2021-11-30 | Rearden, Llc | Systems and methods for concurrent spectrum usage within actively used spectrum |
US10194346B2 (en) | 2012-11-26 | 2019-01-29 | Rearden, Llc | Systems and methods for exploiting inter-cell multiplexing gain in wireless cellular systems via distributed input distributed output technology |
US9923657B2 (en) | 2013-03-12 | 2018-03-20 | Rearden, Llc | Systems and methods for exploiting inter-cell multiplexing gain in wireless cellular systems via distributed input distributed output technology |
US9973246B2 (en) | 2013-03-12 | 2018-05-15 | Rearden, Llc | Systems and methods for exploiting inter-cell multiplexing gain in wireless cellular systems via distributed input distributed output technology |
US10164698B2 (en) | 2013-03-12 | 2018-12-25 | Rearden, Llc | Systems and methods for exploiting inter-cell multiplexing gain in wireless cellular systems via distributed input distributed output technology |
US10488535B2 (en) | 2013-03-12 | 2019-11-26 | Rearden, Llc | Apparatus and method for capturing still images and video using diffraction coded imaging techniques |
US10547358B2 (en) | 2013-03-15 | 2020-01-28 | Rearden, Llc | Systems and methods for radio frequency calibration exploiting channel reciprocity in distributed input distributed output wireless communications |
US11290162B2 (en) | 2014-04-16 | 2022-03-29 | Rearden, Llc | Systems and methods for mitigating interference within actively used spectrum |
US10070072B2 (en) * | 2014-12-18 | 2018-09-04 | Savannah River Nuclear Solutions, Llc | System and method for detecting high-energy photons |
US9955140B2 (en) * | 2015-03-11 | 2018-04-24 | Microsoft Technology Licensing, Llc | Distinguishing foreground and background with inframed imaging |
GB2539387B (en) | 2015-06-09 | 2021-04-14 | Oxford Metrics Plc | Motion capture system |
JP7259757B2 (en) * | 2017-10-19 | 2023-04-18 | ソニーグループ株式会社 | IMAGING DEVICE, AND IMAGE PROCESSING DEVICE AND METHOD |
CN109903719A (en) * | 2017-12-08 | 2019-06-18 | 宁波盈芯信息科技有限公司 | A kind of the structure light coding method for generating pattern and device of space-time code |
US10297697B1 (en) * | 2018-11-01 | 2019-05-21 | H3D, Inc. | Coded aperture system of imaging with a plurality of detectors in a spaced-apart configuration |
JP2022516038A (en) * | 2018-12-21 | 2022-02-24 | スコピオ ラブズ リミテッド | Compressed acquisition of microscopic images |
US11501474B2 (en) | 2019-02-18 | 2022-11-15 | Argospect Technologies Inc. | Collimators for medical imaging systems and image reconstruction methods thereof |
US11165969B1 (en) * | 2020-08-03 | 2021-11-02 | Sky Castle Toys LLC | System and method for adding auxiliary lights to a camera to create fluorescence in selected features of a captured image |
US11838221B2 (en) * | 2022-01-13 | 2023-12-05 | Verizon Patent And Licensing Inc. | Systems and methods for multi-cloud virtualized instance deployment and execution |
CN115375600B (en) * | 2022-10-20 | 2023-04-07 | 福建亿榕信息技术有限公司 | Reconstructed image quality weighing method and system based on self-encoder |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5225876A (en) * | 1989-12-23 | 1993-07-06 | Dornier Luftfahrt Gmbh | Range finding camera |
US6157040A (en) * | 1997-05-20 | 2000-12-05 | Sick Ag | Optoelectronic sensor |
US6324296B1 (en) * | 1997-12-04 | 2001-11-27 | Phasespace, Inc. | Distributed-processing motion tracking system for tracking individually modulated light points |
WO2005022373A2 (en) * | 2003-08-29 | 2005-03-10 | Canon Kabushiki Kaisha | Object information sensing apparatus, pointing device, and interface system |
US20050105772A1 (en) * | 1998-08-10 | 2005-05-19 | Nestor Voronka | Optical body tracker |
Family Cites Families (43)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IT959979B (en) * | 1972-06-28 | 1973-11-10 | Honeywell Inf Systems | OPTICAL ASSOCIATIVE MEMORY |
US4209780A (en) | 1978-05-02 | 1980-06-24 | The United States Of America As Represented By The United States Department Of Energy | Coded aperture imaging with uniformly redundant arrays |
US4360797A (en) | 1978-05-02 | 1982-11-23 | The United States Of America As Represented By The United States Department Of Energy | Coded aperture imaging with uniformly redundant arrays |
US4417791A (en) | 1982-08-19 | 1983-11-29 | Jonathan Erland | Process for composite photography |
US4855061A (en) | 1988-04-26 | 1989-08-08 | Cpc Engineering Corporation | Method and apparatus for controlling the coagulant dosage for water treatment |
US5699798A (en) | 1990-08-10 | 1997-12-23 | University Of Washington | Method for optically imaging solid tumor tissue |
US5076687A (en) | 1990-08-28 | 1991-12-31 | Massachusetts Institute Of Technology | Optical ranging apparatus |
JP2774738B2 (en) | 1992-05-27 | 1998-07-09 | シャープ株式会社 | Image coding restoration system |
US5903388A (en) | 1992-06-11 | 1999-05-11 | Sedlmayr Steven R | High efficiency electromagnetic beam projector and systems and method for implementation thereof |
US5606165A (en) | 1993-11-19 | 1997-02-25 | Ail Systems Inc. | Square anti-symmetric uniformly redundant array coded aperture imaging system |
US5479026A (en) | 1994-05-16 | 1995-12-26 | United Technologies Corporation | System having optically encoded information |
US5424533A (en) | 1994-06-21 | 1995-06-13 | United Technologies Corporation | Self illuminating touch activated optical switch |
US6710797B1 (en) | 1995-09-20 | 2004-03-23 | Videotronic Systems | Adaptable teleconferencing eye contact terminal |
US5756026A (en) | 1996-01-05 | 1998-05-26 | Fiberco, Inc. | Method for control of post molding fabric curl and distortion |
US5809422A (en) | 1996-03-08 | 1998-09-15 | Watkins Johnson Company | Distributed microcellular communications system |
AU3295097A (en) | 1996-05-31 | 1998-01-05 | Massachusetts Institute Of Technology | Coded aperture imaging |
US5757005A (en) | 1996-10-04 | 1998-05-26 | California Institute Of Technology | Advanced x-ray imaging spectrometer |
US6141104A (en) | 1997-09-09 | 2000-10-31 | Image Guided Technologies, Inc. | System for determination of a location in three dimensional space |
US6271900B1 (en) * | 1998-03-31 | 2001-08-07 | Intel Corporation | Integrated microlens and color filter structure |
US6533674B1 (en) | 1998-09-18 | 2003-03-18 | Acushnet Company | Multishutter camera system |
JP2001007007A (en) * | 1999-06-23 | 2001-01-12 | Ushio Sogo Gijutsu Kenkyusho:Kk | Wavelength monitoring device for excimer laser light for exposing semiconductor |
JP3821614B2 (en) * | 1999-08-20 | 2006-09-13 | 独立行政法人科学技術振興機構 | Image input device |
TW510131B (en) | 2000-05-24 | 2002-11-11 | Chi Mei Electronic Corp | Image input/output device |
US6643386B1 (en) | 2000-08-10 | 2003-11-04 | Omnivision Technologies, Inc. | Method and apparatus for adding watermarks to images and/or video data streams |
US6737652B2 (en) | 2000-09-29 | 2004-05-18 | Massachusetts Institute Of Technology | Coded aperture imaging |
DE60134950D1 (en) * | 2001-02-08 | 2008-09-04 | Sgs Thomson Microelectronics | Reference data encoding for a solid-state imaging device |
US7339521B2 (en) | 2002-02-20 | 2008-03-04 | Univ Washington | Analytical instruments using a pseudorandom array of sources, such as a micro-machined mass spectrometer or monochromator |
US7196728B2 (en) | 2002-03-27 | 2007-03-27 | Ericsson, Inc. | Method and apparatus for displaying images in combination with taking images |
ATE325354T1 (en) | 2003-07-02 | 2006-06-15 | Berner Fachhochschule Hochschu | METHOD AND DEVICE FOR IMAGING WITH CODED APERTURE |
US7152984B1 (en) * | 2003-08-13 | 2006-12-26 | Microfab Technologies Inc. | Cat's eye retro-reflector array coding device and method of fabrication |
JP2007506143A (en) | 2003-09-17 | 2007-03-15 | セーガン インダストリーズ インコーポレーティッド | Flash imaging apparatus, manufacturing method and usage thereof |
TWI236546B (en) * | 2004-04-15 | 2005-07-21 | Pixart Imaging Inc | Image sensing device of improving image quality and reducing color shift effect |
WO2006086085A2 (en) | 2004-12-28 | 2006-08-17 | Hypermed, Inc. | Hyperspectral/multispectral imaging in determination, assessment and monitoring of systemic physiology and shock |
US7767949B2 (en) | 2005-01-18 | 2010-08-03 | Rearden, Llc | Apparatus and method for capturing still images and video using coded aperture techniques |
US7671321B2 (en) | 2005-01-18 | 2010-03-02 | Rearden, Llc | Apparatus and method for capturing still images and video using coded lens imaging techniques |
GB0602380D0 (en) | 2006-02-06 | 2006-03-15 | Qinetiq Ltd | Imaging system |
GB2434937A (en) | 2006-02-06 | 2007-08-08 | Qinetiq Ltd | Coded aperture imaging apparatus performing image enhancement |
GB2434935A (en) | 2006-02-06 | 2007-08-08 | Qinetiq Ltd | Coded aperture imager using reference object to form decoding pattern |
US7792423B2 (en) | 2007-02-06 | 2010-09-07 | Mitsubishi Electric Research Laboratories, Inc. | 4D light field cameras |
US8243353B1 (en) | 2008-04-07 | 2012-08-14 | Applied Science Innovations, Inc. | Holography-based device, system and method for coded aperture imaging |
KR101483714B1 (en) | 2008-06-18 | 2015-01-16 | 삼성전자 주식회사 | Apparatus and method for capturing digital image |
GB0822281D0 (en) | 2008-12-06 | 2009-01-14 | Qinetiq Ltd | Optically diverse coded aperture imaging |
GB201104873D0 (en) | 2011-03-23 | 2011-05-04 | Mbda Uk Ltd | Encoded image processing apparatus and method |
-
2012
- 2012-10-15 US US13/652,259 patent/US10148897B2/en active Active
-
2018
- 2018-12-03 US US16/207,941 patent/US20190116326A1/en not_active Abandoned
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5225876A (en) * | 1989-12-23 | 1993-07-06 | Dornier Luftfahrt Gmbh | Range finding camera |
US6157040A (en) * | 1997-05-20 | 2000-12-05 | Sick Ag | Optoelectronic sensor |
US6324296B1 (en) * | 1997-12-04 | 2001-11-27 | Phasespace, Inc. | Distributed-processing motion tracking system for tracking individually modulated light points |
US20050105772A1 (en) * | 1998-08-10 | 2005-05-19 | Nestor Voronka | Optical body tracker |
WO2005022373A2 (en) * | 2003-08-29 | 2005-03-10 | Canon Kabushiki Kaisha | Object information sensing apparatus, pointing device, and interface system |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11595575B2 (en) | 2020-05-11 | 2023-02-28 | Samsung Electronics Co., Ltd. | Image sensor |
Also Published As
Publication number | Publication date |
---|---|
US10148897B2 (en) | 2018-12-04 |
US20130038766A1 (en) | 2013-02-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20190116326A1 (en) | Apparatus and method for capturing still images and video using coded lens imaging techniques | |
US8013285B2 (en) | Apparatus and method for capturing still images and video using coded lens imaging techniques | |
US7767949B2 (en) | Apparatus and method for capturing still images and video using coded aperture techniques | |
US11681061B2 (en) | Apparatus and method for capturing still images and video using diffraction coded imaging techniques | |
Venkataraman et al. | Picam: An ultra-thin high performance monolithic camera array | |
US8290358B1 (en) | Methods and apparatus for light-field imaging | |
Talvala et al. | Veiling glare in high dynamic range imaging | |
WO2010048618A1 (en) | Systems and methods for high resolution imaging | |
Galstian | Smart mini-cameras | |
KR20030028553A (en) | Method and apparatus for image mosaicing | |
JPWO2012120584A1 (en) | Imaging device and distance measuring device | |
US20100329566A1 (en) | Device and method for processing digital images captured by a binary image sensor | |
WO2019078336A1 (en) | Imaging device and signal processing device | |
US12147001B2 (en) | Apparatus and method for capturing still images and video using diffraction coded imaging techniques | |
US20200396380A1 (en) | Signal processing device and imaging device | |
Schöberl et al. | Building a high dynamic range video sensor with spatially nonregular optical filtering | |
RU2589750C2 (en) | Mobile device with optical elements | |
Moore | Integrating 3-D Modelling with Unmanned Aerial Vehicles in Subterranean Environments to aid Archaeological Stratigraphy | |
Solomatin | INFORMATION CAPACITY OF FACET OPTOELECTRONIC SYSTEMS | |
KR20210137886A (en) | Image sensor | |
Konnik et al. | Using spatially varying pixels exposure technique for increasing accuracy of the optical-digital pattern recognition correlator | |
EP2985992A1 (en) | Apparatus and method for providing an image | |
Bauer | Efficient Pixel Binning of Photographs |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |