[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

EP3718051A1 - Stereo-korrespondenzsuche - Google Patents

Stereo-korrespondenzsuche

Info

Publication number
EP3718051A1
EP3718051A1 EP18884507.7A EP18884507A EP3718051A1 EP 3718051 A1 EP3718051 A1 EP 3718051A1 EP 18884507 A EP18884507 A EP 18884507A EP 3718051 A1 EP3718051 A1 EP 3718051A1
Authority
EP
European Patent Office
Prior art keywords
image
search
digital data
comparing
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP18884507.7A
Other languages
English (en)
French (fr)
Other versions
EP3718051A4 (de
Inventor
James A. MCCOMBE
Christoph BIRKHOLD
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mine One GmbH
Original Assignee
Mine One GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mine One GmbH filed Critical Mine One GmbH
Publication of EP3718051A1 publication Critical patent/EP3718051A1/de
Publication of EP3718051A4 publication Critical patent/EP3718051A4/de
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/30Determination of transform parameters for the alignment of images, i.e. image registration
    • G06T7/33Determination of transform parameters for the alignment of images, i.e. image registration using feature-based methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • G06T7/55Depth or shape recovery from multiple images
    • G06T7/593Depth or shape recovery from multiple images from stereo images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • G06T2207/10012Stereo images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20016Hierarchical, coarse-to-fine, multiscale or multiresolution image processing; Pyramid transform
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20076Probabilistic image processing

Definitions

  • the present invention provides methods, systems, devices and computer software/program code produc ts that enable the foregoing aspects and others.
  • V3D Virtual 3-D
  • the present invention provides methods, systems, devices, and computer software/program code products suitable for a wide range of applications, including, but not l imited to: facilitating video communications and presentation of image and video content in telecommunications applications; and facilitating video communications and presentation of image and video content for virtual reali ty (VR), augmented reality (AR) and head-mounted display (HMD) systems.
  • VR virtual reali ty
  • AR augmented reality
  • HMD head-mounted display
  • Methods, systems, devices, and computer software/program code products in accordance with the invention are suitable tor implementation or execution in, or in conjunction with, commercially available computer graphics processor configurations and systems including one or more display screens for displaying images, cameras for capturing images, and graphics processors for rendering images for storage or for display, such as on a display screen, and for processing data values for pixels in an image representation.
  • the cameras, graphics processors and display screens can be of a form provided in commercially available smartphones, tablets and other mobile telecommunications devices, as well as in commercially available laptop and desktop computers, which may communicate using commercially available network architectures including client/server and chent/network/cloud architectures.
  • digital processors which can include graphic processor units, including general purpose graphical processor units (GPGFUs) such as those commercially available on cellphones, smartphones, tablets and other commercially available telecommunications and computing devices, as well as in digital display devices and digital cameras.
  • GPGFUs general purpose graphical processor units
  • Those skilled in the art to which this invention pertains will understand the structure and operation of digital processors, GPGPUs and similar digital graphics processor units.
  • One aspect of the invention relates to methods, systems, devices and computer software/prograrn code products operable to search for correspondence between a first selected feature in a first linage and an element within a selected search domain in a second image, wherein the searching comprises: in a digital processing resource comprising a digital processor:
  • the computation of a minimum difference value and a maximum difference value comprises computation of an estimated minimum difference value and an estimated maximum difference value.
  • Another aspect comprises iteratively executing the comparing (B), the comparing (C), the identifying (D), and the rejecting (E), «sing higher resolution digital data representations of the first selected feature and higher resolution digital data representations in the multi-level aggregate structure.
  • the first and second images are images captured by at least one camera from different viewpoints of substantially the same scene.
  • the selected search domain is selected based on the relative positions of the respective viewpoints of the first and second images.
  • the first and second images are rectified to a common projection such that selected search domain within the second image is oriented along a selected axis of the second image.
  • Another aspect of the invention relates to methods, systems, devices and computer
  • the selected search method comprises:
  • the selected search method further comprises: iteratively executing the comparing (i), the comparing (H), the identifying (Hi), and the rejecting (iv), using higher resolution digital data representations of tire first selected feature and higher resolution digital data representations in the multi-level aggregate structure;
  • the correspondence information is a transform.
  • the correspondence information is an offset representing a position within the search domain
  • the correspondence information is 2 -dimensional value representing a position within an image
  • the correspondence information is computed based on a known difference in viewpoints between first and second cameras that captured the first and the second image, respectively.
  • the correspondence information is a disparity value representing the distance between the location of the first selected feature in the first image and the location of the corresponding element in the second image
  • the feature is a pixel
  • the feature is a matrix of pixels
  • the matrix of pixels is a structure of adjacent pixels
  • the multi-level aggregate data structure contains dements; each lower resolution element comprising the minimum and maximum values for one or more higher resolution elements
  • a related, further aspect of the invention comprises methods, systems, devices and computer soft ware/program code products operable to; in a digital processing resource comprising a digital processor;
  • C) utilize the variance value to determine whether to execute another comparing operation in a higher resolution digital data representation in the multi-level aggregate digital data structure, wherein, if the determined variance value is less than a selected threshold value, then not executing another comparing operation for the first selected subset in a higher resolution digital data representation in the multi-level aggregate digital data structure.
  • the multi-level aggregate digital data structure has a configuration substantially similar to the configuration of a MTP map.
  • the comparing utilizes a selected error function
  • the comparing utilizes a selected comparison function.
  • the comparison function comprises a kernel .
  • applying the error function executing the comparing produces a scalar error value for a given transform.
  • the error function comprises a comparison function and selected logic applicable to weight results of individual comparisons.
  • the second image is pre-processed to generate a multi-level aggregate digital data structure, in which each element in a given resolution level subsumes the minimum and maximum values from the resolution level above the given resolution level.
  • the multi-level aggregate digital data structure is characterized by a selected reduction ratio.
  • only selected resolution levels of the multi-level aggregate digital data structure are created in an initial structure generating operation.
  • only selected lowest resolution levels of the multi-level aggregate digital data structure are generated in an initial structure generating operation.
  • all, or substantially all, levels of the multi-level aggregate digital data structure projected to be required in executing the comparing are generated in an initial structure generating operation.
  • Another aspect of the invention relates to methods, systems, devices and computer
  • sofiwarevprograra code products operable to search for correspondence between a first selected feature in an input signal and an element within a selected search domain in a search signal, wherein the searching comprises: in a digital processing resource comprising a digital processor:
  • a related aspect comprises; iteratively executing the comparing (B), the comparing ( €), the identifying (D), and the rejecting (E), using higher resolution digital data representations of the first selected feature and higher resolution digital data representations in the multi-level aggregate structure, and rejecting comparison results having difference values above the threshold difference value.
  • the input signal and the search signal are 1 -dimensional signals.
  • the input signal and the search signal are of different formats.
  • either the input signal or the search signal comprises any of: image data, pixel depth data, 3-dimensional -point data, 3-dimensional vertex data, L1DA data, neural network weights, computer readable instructions, abstract signal descriptions, compressed representations, audio data, market data, price signal data, or analytics.
  • either the input signal or the search signal comprises a combination of any of: image data, pixel depth data, 3-dimensional point data, 3 -dimensional vertex data, LlDAR data, neural network weights, computer readable instructions, abstract signal descriptions, compressed
  • one of the input signal or the searc h signa l comprises an abstract representation describing features of the given signal, and the other signal comprises image data.
  • Another aspect of the invention relates to methods, systems, devices and computer
  • determining a variance value comprises: utilizing a deviation calculation. in another aspect, determining a variance value comprises: utilizing a squared deviation calculation.
  • determining a variance value comprises: calculating the variance based on a running standard deviation.
  • determining a variance value comprises: utilizing a heuristic.
  • the comparing is executed for a plurality of subsets of the selected search domain, and the determining comprises utilizing the results of the comparing.
  • Another aspect of the invention relates to methods, systems, devices and computer
  • the variance is the absolute value of the difference between the minimum and maximum values contained in a lower resolution element of the multi-level aggregate digital data structure.
  • Another aspect comprises selecting a threshold for variance below which corresponding image detail is treated as irrelevant.
  • the variance threshold is a selected fixed value.
  • the variance threshold is determined, in the processing structure, based on parameters of the images or of the devices or systems from which the images originate.
  • the variance threshold is determined based on physical properties of cameras and camera processing dements associated with a stereo image disparity search.
  • the variance threshold is determined based on a selected model.
  • the model alters the variance threshold based on selected criteria.
  • the selected criteria comprise any of: absolute signal values contained within a currently selected element of a currently selected multi-level aggregate digital data structure; overall image-representative values across larger sections of an image being searched; available processing power; human perceptual factors; display technology considerations; relative and absolute image positions; inputs from additional connected components; or user-supplied inputs or hints.
  • Another aspect comprises adjusting the variance threshold based on availability of computational or processing power or resources.
  • Another aspect comprises adjusting the variance threshold to selectively prioritize execution of computational operations.
  • multi-level aggregate digital data structure elements with highest variance are designated as candidates for further refinement based on higher resolution versions of a selected image.
  • Another aspect comprises iteratively repeating the designating, as variances associated with all transforms across a given image being processed become smaller, such that correlations between images are refined as available computational or processing power permits.
  • Another aspect of the invention relates to methods, systems, devices and computer
  • features are selected from a multi-level aggregate digital data structure representat ive of the fi ts! image ; and if the currently executed comparison was not at the highest possible resolution, then accessing a next higher resolution digital data representation in the multi-level aggregate digital data structure representative of the first image to execute a subsequent comparison.
  • aggregate down-sampled versions of the first image represented by a currently selected multi-level aggregate digital data structure, represent the bounds of the higher resolution first image elements.
  • the aggregate down-sampled versions of the first image represent the bounds of the higher resolution first image elements in a manner analogous to the respective multi-level aggregate digital data structure representing the bounds of the second image.
  • the executed comparison function considers two ranges, wherein the first range ts the range of subsumed values from each element in the first image and the second range is the range of subsumed values from each element in the second image.
  • the minimum value for the error is based on the distance between the nearest points hi each range.
  • the minimum error can he zero, and the maximum error can be based on the distance between the two points in each range that are furthest from each other.
  • Another aspect of the invention relates to methods, systems, devices and computer
  • softwarc/program code products operable to search for a correspondence between a first selected feature in a first image and an element within a selected search domain in a second image, the searching comprising; in a digital processing resource comprising a digital processor;
  • (D) estimating minimum and maximum possible errors for correspondence for the selecte feature in the first image at a full resolution, by executing the comparing and determining at reduced resolution on multi-level aggregate digital data structures representative of the first image and the selected subset of the selected search domain in the second image, respectively.
  • such operations are executed to determine a transform applicable to the selected feature in the first image such that the selected feature in the first image is matchable to the selected subset of the selected search domain of the second image with a level of error lower than a selected error threshold, the determined transform corresponding to the processing parameters resulting in die smallest error value.
  • Another aspect of the invention relates to methods, systems, devices and computer
  • software/progra code products operable to search for a correspondence between a first selected feature in a first image and an element within a selected search domain in a second image, the searching comprising; in a digital processing resource comprising a digital processor;
  • such operations are executed to determine a transform applicable to the selected feature in the first image such that the selected feature in the first image is matchable to the selected subset of the selected search domain of the second image with a level of error lower than a selected error threshold; wherein the determined transform corresponding to the processing parameters resulting in the smallest error value.
  • each resolution level contains at least one element, and m which a gi ven element at a given resolution level can reference both minimum and maximum values of other elements subsumed within the given element at higher resolution levels of the multi-level aggrega te digital da ta structure.
  • Another aspect farther comprises: in a digital processing structure: applying correspondence searching operations to multi -dimensional data.
  • the image data has n dimensions, and is greater than or equal to 2.
  • applying correspondence searching operations to multi-dimensional image data comprises: in a digital processing structure: searching a multi-dimensional image data field using a selected comparison kernel
  • using a selected comparison kernel comprises: evaluating- the selected comparison kernel; then, based on the evaluation, associating with a representative element of the kernel, a designation of eligibility; and then utilizing the designation of eligibility to determine whether the representative element, and any containe higher resolution elements, should be refected from further consideration.
  • Another aspect further comprises: if a selected relatively lower resolution element is designated ineligible, rejecting from further consideration the selected lower resolution element and rejecting any corresponding subsumed higher resolution elements
  • Another aspect further comprises: if a selected relatively lower resolution element is designated eligible, executing comparison operations on contained higher resolution elements.
  • Another aspect further comprises: hi a digital processing structure; expressing an «-dimensional image data field in a form comprising fewer than n dimensions, by applying a dimension collapsing operation.
  • applying a dimension collapsing operation comprises executing a search characterized by a selected search pattern.
  • the search pattern comprises a scan line search.
  • Another aspect further comprises executing a search characterized by a selected search patern, wherein the search pattern comprises a scan line search
  • Another aspect further comprises: executing a search characterized by a selected search pattern, wherein the search pattern comprises a scan line search.
  • the search patern comprises a Hilbert curve.
  • the search pattern comprises a Morton order.
  • successive search operations are executed in multiple passes through data to be searched.
  • evaluation of a comparison kernel finds a minimum and maximum error for the kernel
  • a subsequent pass comprises determining a minimum value of the maximum error for the kernel to be used as a threshold when determining which elements are eligible for further consideration.
  • a further subsequent pass comprises comparing all eligible elements at a next resolution level
  • Another aspect further comprises: optimizing execution of processing operations to substantially fully utilize all available digital processing resources.
  • Another aspect further comprises:
  • Another aspect further comprises: optimizing execution of processing operations to minimize computational work and memory traffic.
  • multiple features of the input image are tested in parallel against the selected search domain.
  • processing operations are executed in accordance with digital instructions executing on a computing device.
  • the computing device comprises a GPU (General Processing Unit).
  • the computing device comprises a CPU (Central Processing Unit).
  • CPU Central Processing Unit
  • the computing device comprises a TPU (Tensor Processing Unit).
  • TPU Transistsor Processing Unit
  • processing operations are executed by dedicated hardware circuitry.
  • data generated for use in a given processing operation are retained for future use.
  • data for use in a given processing operation are generated on an as-needed basis.
  • Another aspect further comprises; in a digital processing resource:
  • a search operation for the additional feature is executed after execution of the search operation for the selected first feature.
  • the search operation for the additional feature is executed in parallel with the search operation for the selected first feature.
  • the result of a given search operation is a transform indicating a best correspondence between a feature from the input image and a best match for the input image feature in the search image.
  • the transform is invertible to generate a corresponding transform between a corresponding feature in the search image and a corresponding feature in the input image.
  • Another aspect further comprises selecting a first portion of a gi ven image to be a search image, and selecting a second different portion of the given image to be an input image.
  • Another aspect further comprises; applying comparison and search processes across a plurality of images simultaneously, such that one input image is simultaneously compared for correspondence with more than one search image.
  • Another aspect further comprises; applying comparison and search processes across a plurality of images simultaneously, such that multiple input images are simultaneously compared for correspondence with a given search image,
  • the input image differs in format from the search image.
  • Another aspect further comprises; executing a transform to enable comparison between an input image and a search image of different formats,
  • the applie comparison function adaptively interprets different respective sets of data associated with each image, to enable comparison between an input image an a search image of different formats.
  • the present invention enables the features described herein to be provided at reasonable computational cost, and in a manner easily accommodated within the digital processing capabilities and form factors of modern mobile devices such as tablets and smartphones as well as the form factors of laptops, PCs, computer-driven televisions, computer-driven projector devices, and the like, does not dramatically alter the economies of building such devices, and is viable within current or near-current communications neiwork/connectivity architectures.
  • FIG. ! is a schematic block diagram depicting a MLADDS structure in accordance with an exemplar ⁇ ' practice of the in vention.
  • PIG. 2 is a schematic flowchart depicting a lossless search process in accordance with an exemplary practice of the invention.
  • FIG. 3 depicts computational functions, in accordance with the invention, that represent one possible method for calculating minimum and maximum error values when comparing two elements, each expressing minimum and maximum signal bounds values.
  • FIG. 4 comprised of sub-figures 4.1 , 4.2, 4.3 and 4.4, shows exemplar ⁇ ' plots of the minimum and maximum errors for a feature from an input signal compare across a search domain at four separate resolution levels.
  • FIG. 5 is a schematic diagram depicting operations performed, in accordance with an exemplars' practice of the invention, to search a two-dimensional (2-D) image or image signal
  • FIGS 6 - 22 are flowcharts depicting exemplary practices of the invention.
  • FIG. 23 is a schematic block diagram depicting exemplar ⁇ ' devices or systems in which aspects of the present invention may be practiced or embodied.
  • the present invention relates to methods, systems, devices and computer software/prqgram products that enable efficiently finding stereo correspondence between a feature or set of features in a first signal and a search domain hi a second signal.
  • the process of finding an acceptable correspondence results in the determination of the transformation (ofte a position or translation) for the feature from the first signal such that it matches a part of the second signal with an acceptable error. In accordance with exemplary practices and embodiments of the present invention, this is accomplished by estimating the minimum and maximum possible errors for correspondence between features at a full resolution, by performing correspondence testing at reduced resolution on aggregations of the signal data.
  • difference values A comparison between a feature at a given resolution and an element in a multi-level aggregate digital data structure (MLADDS) may result in a difference value indicating the difference between the feature and die element being compared.
  • difference values A comparison between a feature at a given resolution and an element in a multi-level aggregate digital data structure (MLADDS) may result in a difference value indicating the difference between the feature and die element being compared.
  • comparisons with Sow difference values can be said to have low error values, because there is a more precise match between the feature and the element.
  • a comparison function When a comparison function is applied across an entire search domain at a given resolution, it can be called an error function.
  • a 2 -dimensional plot: of the values of the error function can be used to identi fy the elements within the search domain of the lowest error or best correspondence between the feature and the dement,
  • One known technique for determining stereo correspondence between an input signal and a search signal is to -perform a linear search, in such a search, a portion of the input signal is transformed and compared for errors against the corresponding part of the search signal using a comparison function or an error function. Changes to the transform parameters are iteratively applied in a linear fashion across the entire possible domain being searched, and the transform parameters resulting in the smallest error are chosen for that portion of the input signal.
  • a portion of the input signal or a processed version of the signa l for which the algorithm is searching for a corresponding match can be referred to, various, as a’’feature" or as a "snippet.”
  • the comparison function is often a kernel.
  • An error function is any function that produces a scalar error value for a given transform, and can include one or more comparison functions along with logic to interpret or weight the results of the individual comparisons.
  • the linear search can be applied to multipie resolutions of the input signal and the search signal, as taught by the Applicants' commonly owned patent applications listed above and incorporated herein by reference. Further, an error function that combines the results from comparisons of the sigt l al at more than one resolution can increase the accuracy of the overall results.
  • a tower resolution signal is produced from a high resolution signal can vary, but can utilize averaging, down-sampling, or applying a low-pass filter.
  • the lower resolution version of a signal can be referred to as an aggregate signal or a downsampled signal.
  • the technique of parallel searching across multiple resolutions produces good results, but is computationally very costly.
  • An aspect of the present invention is that, unl ike executing the search operation in each resolution as an independent process, in exemplary practices of the present invention each resolution is searched serially, where the search at a higher resolution i predicated on the results of the search at the lower resolution.
  • a known optimization that can be employed is a refinement search to reduce the computational cost of the search operation.
  • the search space is sampled first at a coarse resolution, and then higher resolution samples are taken only when the error is below a threshold. This process is repeated tor successively higher resolutions until a final result is determined from samples in a high resolution version of the signal.
  • this type of error might result in the search ignoring a small object because it does not make a distinct contribution to a low resolution representation of the image.
  • signals which can be processed by the invention, can comprise, hut are not limited to, any of the following: image data, pixel depth data, 3-dimensional point data, 3- dtmensioaal vertex data, L1DAR data, neural network weights, computer readable instructions, abstract signal descriptions compressed representations, audio data, market data, price signal data or analytics.
  • exemplary practices of the present invention employ a multi-level aggregate digital data structure (MLADDS), of which a bounds structure is one form.
  • a MLADDS is a multi-resolution representation of the signal in which each element at a given le vel can reference both the minimum and maximum values of the elements contained within it at all higher resol utions.
  • Certain aspec ts of a bounds structure or MLAD DS are analogous to aspects of a class of structures known as a "MIP map.” (The word MIP is an acronym of the Latin phrase "multum in parvo".
  • bounds structure or MLADDS can be represented as one or more MIP maps, but does not necessarily imply all of the specific limitations from the traditional use or construction of MIP maps. More particularly, as will be described herein, the bounds structure or MLADDS of exemplary practices of the present invention, and the manner in which it is employed in connection with the present invention, are distinct from and contrast with previously known uses and constructions of M ⁇ R maps. While the term "bounds structure * ’ is also used herein, referring to one form of MLADDS, it will be understood that the invention can be embodied using various forms of MLADDS.
  • an image (or a signal that ma be representative of an image) is preprocessed to create an MLADDS 100 like that schematically depicted in FIG. 1.
  • Each element in a gi ven MIP level encompasses the minimum and maximum values from the MIP level abo ve it. This interval, expressed in an element of the MLADDS, therefore expresses the overall range of all of the contained elements from the original signal.
  • each MIP Level contains a Min value and a Max value representative of minimum and maximum values from the MIP level above it.
  • FIG. 1 illustrates a 2 to I MIP reduction, but other MIP reductions are also valid, an may be useful, depending on tradeoffs associated with the hardware in volve in the practice of the invention.
  • MIP level nomenclature A brief discussion about MIP level nomenclature is appropriate here.
  • MIP level 3 is a lower level than MIP level 2. Because the higher resolution levels of a bounds structure in accordance with the present invention, or indeed any type multi-level aggregate structure, may be unneeded to search certain portions of the signal, it may be advantageous to generate only the lowest levels.
  • One aspect of the present invention involves performing a lossless stereo search using an MLADDS.
  • This technique avoids wasted computation by avoiding the application of the comparison function to data for which the combined error function cannot produce the best result across the search domain.
  • the lossless search is capable of producing the same result as a multi-resolution linear search applied with tire same input signal across the same search domain
  • FIG. 2 is a flowchart depicting a lossless search process 200 in accordance with an exemplary practice of the invention in particular, FIG. 2 descri bes an exemplars 1 process of lossless! ⁇ ' searching a search signal for correspondence with features from an input signal.
  • the left-hand side of FIG, 2 indicates the search signal 210 and input signal 212
  • the search signal 210 is used as an input to operation 214: create or retrieve MLADDS structure for search domain within search signal.
  • the input signal 212 is used as an input signal to operation 216: generate or retrieve lowest resolution aggregate signal for input signal.
  • the result or output of operation 216 is used as an input to operation 218: select feature from current resolution aggregate signal to locate within the corresponding resolution search signal.
  • operation 218 executes comparison function to compu te minimum and maximum error for each eligible element of the corresponding resolution in the MLADDS structure.
  • operation 220 executes comparison function to compu te minimum and maximum error for each eligible element of the corresponding resolution in the MLADDS structure.
  • the result or output of box 220 is used as an input to box 222, a determination of whether the currently executed comparison was at the highest resolution. If "Yes", then the process proceeds to operation 224: determine the lowest value for the error. As noted in box 224, in the exemplary practice depicted in FIG. 2, this error determination may incorporate error values from multiple resolutions of the aggregate map.
  • the result or output of operation 224 yields box 226, which is the output of the lossless search process example depicted in FIG. 2: foe result of the feature search is the transform that produces the lowest error.
  • the result or output of determination 228 leads to operation 230; indicate that each dement for which the minimum error is greater than the previously determined lowest maximum error is ineligible.
  • operation 232 the result or output of operation 230 leads to operation 232; generate or retrieve the next higher resolution aggregate signal for the input signal.
  • the lossless search process thus loops back to bo 218; select a feature from the current resolution aggregate signal to locate within the corresponding resolution search signal; and the process iterates,
  • the aggregate downsampled versions of the input signal must also represent the bounds of the higher resolution input signal elements, in a functionally similar way that the MLADDS structure represents the bounds of the search signal.
  • the comparison function must consider two ranges; the range of contained values from each deme t in the input signal and die range of contained values from each element in the search signal if those ranges are disjoint, the minimum value for the error is based on the distance between the nearest points in each range. If those ranges overlap, the the minimum error can be zero. Similarly, the maximum error can be based on the distance between the two points in each range that are furthest from each other.
  • FIG. 3 is a depiction of computational functions 302, 304, in accordance with the invention, that represent one possible method for calculating minimum and maximum error values when comparing two elements, each expressing minimum and maximum : signal bounds values.
  • These exemplary functions 302, 304, respectively, are;
  • FIG 4 comprised of sub-figures 4.1 , 4 2, 4.3 and 4.4, shows exemplary plots of the minimum and maximum errors for a feature from an input signal compared across a search domain at four separate resolution levels.
  • FIG. 4 highlights areas of the signal that are rejected from further consideration in a lossless search.
  • the lowest value of foe for comparisons at MIP level 3, the lowest value of foe
  • the resulting transform is able to express a correspondingly higher degree of precision. This is due to the increased resolution in the signal allowing finer details to be compared, producing a more accurate result. In many instances, however, this increased precision is not necessary, la some cases, the high precision portion of the transform may even be invalid, due to spurious noise in one or both of the signals. Contrastingly, a high variance in a lower resolution element indicates foe potential tor a high frequency transition or some other meaningful detail that can he correlated between the signals with high potential accuracy. (It should be noted that the term "variance” is used herein in a manner substantially consistent with its broad English definition. In particular, foe term “variance”, as used herein, is NOT limited to meaning only the square of the deviation.)
  • the variance is the absolute value of the difference between the minimum and maximum signal values contained within a lower resolution element of a ML ADDS structure.
  • an element represents a singleton sample and therefore the variance is zero, because there is no minimum and maximum signal values that are different from foe singleton value itself in embodiments where it is possible to infinitely subdivide signal elements or manufacture new samples from an analog scarce, individual samples would not be singletons
  • the elemen t can be determined not to contain significant detail, and therefore a further examination of higher resolution levels of the signal will likely not produce valuable information.
  • a low resolution element in ie MLADDS structure containing many pixels that all display a featureless smooth background will have a very low associated variance.
  • ike lower precision transforms obtained from the lower resolution versions of the MLADDS structure are likely to be sufficient to adequately represent the transform.
  • the threshold for variance below which the detail is considered irrelevant may depend on many factors.
  • the threshold may be a fixed value, or a value calculated from parameters of the systems on which the signals originate.
  • the threshold may be based on the physical properties of the cameras and camera processing pipeline for a stereo image disparity search.
  • the threshold can also be based on a model. That model may alter the allowable variance threshold depending on numerous other factors and consideration including, but not limited to. one or more of the following: absolute signal values contained with the element, overall signal values across larger sections of a signal, available processing power, perceptual factors, display technology
  • the threshold can be adjusted to accommodate the availability of computational resource.
  • the variance at each element could function as a type of prioritization. Elements with the highest variance would then be candidates for further refinement based on higher resolution versions of the signal This would continue as the variances associated with all transforms across the signal became smaller, in this way, correlations between the signals could be refined as the available processing power allows.
  • FIG. 5 is a diagram depicting operations performed, in accordance with an exemplary practice of the invention, to search a two-dimensional (2-D) image or image signal
  • FIG. 5 illustrates how a multi-dimensional signal can be searched, by execution, for example in a digital processing resource, of operations 500.
  • FIG. 5 shows a 12 x 12 table 502 schematically representing Search Signal Date: the central portion of FIG. 5 shows a 6 x 6 table 504 representing Bounds MIP Level 1; and the right-hand side of FIG. 5 shows a 3 x 3 table 506 representing Bounds MIP Level 2.
  • a 2-D input signal feature is compared with a 2-D search signal using a 3x3 comparison kernel 50S, designated by the dark outlines in, e.g. the Search Signal Data table of FIG, 5.
  • a comparison kernel 508 is evaluated, an indication of eligibility is associated with a
  • the center element of the kernel is used. This indication of eligibility is then used to determine whether the element, and any contained or subsumed higher resolution elements, should be rejected from further consideration.
  • the example signal of FIG. 5 uses a 2:1 MIP reduction in each dimension, four comparisons at the higher resolution mip level result from every comparison that is not rejected. If a low resolution element is inel igible, it may be rejected and all contained higher resolution elements are also rejected and therefore not compared. If the low resolution element is eligible, multiple comparisons on the higher resolution signal may be performed. In the noted example, four comparisons are performed for each eligible dement
  • This approach can be extended into higher dimensions, including images or signals with three or more dimensions.
  • resolution element will depend on the dimensionality of the signal and the mip reduction
  • a signal comprising multiple dimensions may be expressed as a signal comprising fewer dimensions.
  • a given image may he a two dimensional signal, but searching a single scan line involves a simitar behavior to searching a one dimensional signal. Successively searching multiple scan lines still preserves the one dimensional behavior for the entire image.
  • a wide range of search patterns have the same dimension collapsing property, such as. for example, following a Hilbert curve or Morton order.
  • successive search operations would occur in passes.
  • a comparison kernel would find the minimum and maximum error for the kernel. This would be followed by a subsequent operation to determine the minimum value of the maximum error, to be used as a threshold when considering which elements are eligible for further consideration.
  • the next pass would then be responsible for comparing all of the eligible elements at the next raip level and so on.
  • an entire image representing an input signal is processed to create a bound structure.
  • a second image representing a search signal is also processed to create a second search MLADDS structure.
  • Features from the input image and associated MLADDS structure are searched within the search image and associated MLADDS structure by evaluating the comparison kernel in parallel against as much of the image as possible.
  • the algorithm is optimized to saturate all available computation elements.
  • each feature from the input signal is tested. If the error is determined to be within acceptable bounds for further processing, then processing can continue to the next level of the MLADDS structure.
  • processing can continue to the next level of the MLADDS structure.
  • the algorithm is optimized to minimize unnecessary computational work and memory traffic. In spite of the more serial nature of this embodiment, multiple features from the input signal can be searched in parallel, so the algorithm can still be amenable to large vector processors.
  • the present invention can be embodied in software executing on a GPU or similar computational device, processor or resource. It could also be embodied in software executed by a CPU, software executed by a TPU (Tensor Processing Unit), or by dedicated hardware circuitry.
  • a CPU Central Processing Unit
  • TPU Transistor Processing Unit
  • Data created for use in one operation may be retained for future use, or intermediate data can be created on an as-needed basis.
  • This is implementation-specific, and does not alter the invention.
  • one implementation may construct MLADDS structures for an entire input signal and an entire search signal . Then the search operation may select a feature from the input signal to locate within the search domain in the search signal. An additional feature can be selected from the already constructed MLADDS structures and another search operation may occur subsequently or in parallel.
  • the result of each search operation is a transformation indicating the best correspondence between the feature, subset or snippet from the input signal and its best match in the search signal.
  • a transformation may be inverted to produce a corresponding transformation between the feature in the search signal and the corresponding feature in the input signal. For this reason, which signal is considered the search signal and which signal is considered the input signal may be arbitrary in some
  • some embodiments may choose parts of a signal to behave as a search signal, while parts of the same signal may behave as an input signal.
  • one input signal could be compared for correspondence with multiple search signals simultaneously.
  • multiple input signals could be compared against the same search signal. This may be desirable in a system with more than two input devices producing signals requiring correlation, for example a device with three or more cameras.
  • the input signal may differ substantially from the search signal in format.
  • an input signal may comprise an abstract: representation to describe the features of the input signal while the search signal may comprise image data from a camera.
  • the input signal or the search signal or both may comprise one or more of: image data, neural network weights, computer readable instructions, abstrac t signal descriptions, compressed representations, or combinations of the aforementioned.
  • the comparison function would appropriately interpret the data provided from the signal.
  • the telecommunications devices can include known forms of cellphones, smartphones, and other known forms of mobile devices, tablet computers, desktop and laptop computers, and known forms of digital network components and se 'er/cloud network/client
  • Computer software can encompass any set of computer-readable programs instructions encoded on a non- transitory computer readable medi um.
  • a computer readable medium can encompass any form of computer readable element, including, but not limited to, a computer bard disk, computer floppy disk, computer-readable flash drive, computer-readable RA or ROM element or any other known means of encoding, storing or providing digital information, whether local to or remote from the cellphone, smartphone, tablet computer, PC, laptop, computer-driven television, or other digital processing device or system.
  • Various forms of computer readable elements and media are well known hi the computing arts, and their selection is left to the imp!ementer.
  • modules can be implemented using computer program modules and digital processing hardware elements, including memory units and other data storage units, and including commercially available processing units, memory units, computers, servers, smartphones and other computing and telecommunications devices.
  • modules include computer program instructions, objects, components, data structures, and the like that can be executed to perform selected tasks or achieve selected outcomes.
  • data storage module can refer to any appropriate memory element usable for storing program instructions, machine readable files, databases, and other data structures.
  • the various digital processing, memory and storage elements described herein can be implemented to operate on a single computing device or system, such as a server or collection of servers, or they can be implemented and inter-operated on various devices across a network, whether in a server-client arrangement, server-ctoud-elient arrangement, or other configuration in which client devices can communicate with allocated resources, functions or applications programs, or with a server, via a communications network,
  • One implementation comprises a complete device, including four cameras, capable of encoding content and receiving (full-duplex communication).
  • Another is an Apple iPhone-based implementation that can receive and present immersive content (receive-oniy).
  • the Applicants used the following hardware and software structures and tools, among others, to create the two noted implementations, collectively:
  • An Intel Core i7-f>770HQ processor which includes on-chip the following:
  • OpenCL API using Intel Media SDK running on Linux operating system to implement, among other aspects: image Rectification, Fast Dense Disparity Bstmiate(s) (FDDE) and Multi-level Disparity Histogram aspects.
  • FDDE Fast Dense Disparity Bstmiate(s)
  • Multi-level Disparity Histogram aspects 5.
  • OpcnGL API running on Linux operating system to implement Multiple Native Disparity Map Voting and image reconstruction.
  • the Apple iOS SDK was used to access accelerometer, gyroscope and compass fo device orientation and to access video decode hardware; and the OpenGL BS API to implement multiple native disparity map voting and image reconstruction to enable an iPhone-based prototype of a receiving device.
  • FIGS. 6 - 22 are flowcharts illustrating method aspects and exemplary practices of the invention.
  • the methods and elements depicted in these flowcharts are examples only ; the organization, groupings, order and number of operations in the exemplary practices can be varied; and the exemplary practices and methods can be grouped, arranged or ordered differently, and include different or additional functions, whether singly or hi combination, while still being within the spirit and scope of the present invention.
  • FIG . 6 shows a method 6(H) for searching for correspondence between a first selected feature in a first image and an element within a selected search domain in a second image, according to an exemplary practice of the invention, comprising the following operations; fe KX.Sgarc ls fo : ⁇ BlPBtMBl idence between 3 ⁇ 4 First selected feature in a first image and an element within a selected search domain in a second image:
  • multi-level aggregate digital data structure representative of second image
  • die multi-level aggregate digital data structure comprising digital data representations at different resolutions
  • FIG. 7A shows a method 700 for determining disparity between a first image and a second image, according to an exemplar practice of the invention, comprising the following operations:
  • 700 Determining disparity between a first image and a second image: 70 . Generate a multi-level aggregate digital data structure representative of the second image, the multilevel aggregate digital data structure comprising digital data representations at different resolutions;
  • FIG. 7B shows a selected search method, useful, for example, in element 703 of FIG 7A (which m turn shows a method for determining disparity between a first image and a second image), according to an exemplary practice of the invention, comprising the following operations:
  • FIG. 8 shows a method 800 for searching for correspondence between a first selected feature in an input signal and an element within a selected search domain in a search signal, according to an exemplary practice of the invention, comprising foe following operations: Searching for correspondence between a first selected feature in an input signal and an element
  • the multilevel aggregate digital data structure comprising digital data representations at different resolutions
  • FIG. 9 shows a method 900 for variance-aware searching for correspondence between a first selected feature in a first image and an element within a selected search domain in a second image, according to an exemplary practice of the invention, comprising the following operations:
  • a multi-level aggregate digital data structure representative of the second image the multilevel aggregate digital data structure comprising digital data representations at different resolutions;
  • FIG. 10 shows a method 1000 for variance-aware searching for correspondence between a first selected feature in an input: signal and an element within a selected search domain in a search signal, according to an exemplary practice of the invention, comprising the following operations:
  • the multilevel aggregate digital data structure comprising digital data representations at different resolutions
  • 1003 determine, based on the comparing, a variance value for the first selected subset
  • FIG. 1 1 shows a method 1100 for lossless search for correspondence betwee a first selected feature in a first image and an element within a selected search domain in a second image, according to an exemplary practice of the invention, comprising the following operations:
  • FIG. 12 shows a method 1200 for searching for correspondence between a first selected feature in a first image and an element within a selected search domain in a second image, according to an exemplary practice of the in vention, comprising the following operations:
  • FIG. 13 shows a method 1300 for searching for a correspondence between a first selected feature in a first image and an element within a selected search domain in a second image, according to an exemplary practice of the invention, comprising the fol lowing operations:
  • FIG. 14 depicts additional aspects relating to die MLADDS, searching, and other aspects of the invention, as follows:
  • the comparing can utilize a selected error .function.
  • the comparing can utilize a selected comparison function.
  • the comparison function can comprise a kernel
  • Error function comprises a comparison function and selected logic applicable to weight results of individual comparisons.
  • MLADDS is characterized by a selected reduction ratio.
  • FIG. .15 depicts a further aspect 1500 of processing in which:
  • searching can further comprise:
  • FIG. 16 shows additional aspects relating to lossless search and other aspects, as follows:
  • Executed comparison function considers two ranges, wherein the first range is the range of subsumed values from each element in the first image and the second range is the range of subsumed values from each element in the second image)
  • the minimum value for the error is based on the distance between the nearest points in each range
  • the minimum error can be zero, and the maximum error can be based on the distance between the two points in each range that are furthest from each other
  • Variance is defined as the absolute value of the difference between the minimum and maximum values contained in a lower resolution element of the bounds struc ture
  • Selected criteria comprise any of absolute signal values contained within a currently selected element of a currently selected bounds structure; overall image-representative values across larger sections of an image being searched; a vailable processing power; human perceptual factors; display technology considerations; relative and absolute image positions; inputs from additional connected components; or user-supplied inputs or hints),
  • P1.G. 17 shows features relating to variance threshold and other aspects, as follows:
  • FIG 18 shows features relating to multi -dimensional signal or image data and other aspects, as follows;
  • Image data has n dimensions, and n is equal to or greater than 3)
  • Applying correspondence searching operations to multi-dimensional image data comprises; in a digital processing structure, search a multi-dimensional image data field using a selected comparison kernel )
  • Using a selected comparison kernel comprises; evaluate the selected comparison kernel; then, base on the evaluation, associating, with a representative element of the kernel, designation of eligibility; and then utilize the designation of eligi bility to determine whether the representative element, and any contained higher resolution elements, should be rejected from further consideration)
  • Applying a dimension collapsing operation comprises executing a search characterized by a selected search pattern) (1809.
  • Search pattern comprises a scan Hoe search.
  • Subsequent pass comprises determinin the minimum value of the maximu error for the kernel, to be used as a threshold when determining which elements are eligible for further consideration.
  • FIG. 19 shows features relating to execution of processing operation and other aspects, as follows:
  • First image representing an input image
  • second image representing a search image
  • features from the input image and associated first bounds structure are searched within the search image and associated second bounds structure by evaluating a selected comparison kernel in parai!el against as much of the search image as possible
  • Processing operations are executed in accordance with digital instructions executing on
  • Computing device comprises a GPU; a CPU; or a TFU).
  • FIG. 20 shows additional aspects, as follows.
  • FIG. 2 f shows features relating to input and search imagcs/signals, as follows: : Search image/signa
  • Comparison function enables comparison of an input signal and a search signal.
  • Either the input signal or the search signal comprises any of: image data, pixel depth data, 3- dirnensionai point data, 3 -dimensional vertex data, LIDAR data, neural network weights, computer readable instructions, abstract signal descriptions, compressed representations, audio data, market data, price signal data, or analytics.
  • Either the inpu signal or the search signal comprises a combination of any of: image data, pixel depth data, 3-dimensional point data, 3-dimensional vertex data, LIDAR data, neural network weights, computer readable instructions, abstract signal descriptions, compressed representations, audio data, market: data, price signal data, or analytics.
  • FIG. 22 shows additional aspects of exemplary practices of the invention, as follows:
  • the computation of a minimum difference value and a maximum difference value can comprise computation of an estimated minimum difference value and an estimated maximum difference value.
  • the first and second images are images captured by at least one camera from different vie points of substantially the same scene.
  • the selected search domain is selec ted based on the relative positions of the respective viewpoints of the first and second images.
  • the correspondence information can be a transform
  • the correspondence information can be an offset representing a position within the search domain.
  • the correspondence information can be a 2-dimensional value representing a position within an image.
  • the correspondence information can he computed based on a known difference in viewpoints between first and second cameras that captured the first and the second image, respectively.
  • the correspondence information can be a disparity value representing the distance between the location of the first selected feature in the first image and the location of the corresponding element in the second image.
  • the selected feature can be a pixel
  • the selected feature can be a matrix of pixels.
  • the matrix of pixels can be a structure of adjacent pixels.
  • the multi-level aggregate data structure contains elements; each lower resolution element comprising the minimum and maximum values for one or more higher resolution elements) (2215.
  • Process can further include comparing a digital data representation of the selected feature from the first image, at a given resolution, against a digital data representation of a first selected subset of a selected search domain in the second image, at a given resolution, in the multi-level aggregate digital data structure; determining, based on the comparing, a variance value for the first selected subset; and utilizing the variance value to determine whether to execute another comparing operation in a higher resolution digital data representation in the multi-level aggregate digital data structure, wherein, if the determined variance value is less than a selected threshold value, then not executing another comparing operation for the first selected subset in a higher resolution digital data representation in the multi-level aggregate digital data structure.) (2216, Process can further include iteratively execute the comparing, identifying and rejecting functions, using higher resolution digital data representations of the first selected feature and higher resolution digital data representations in the multi-level aggregate
  • FIG. 23 is a schematic block diagram depicting exemplary devices or systems in which aspects of the present invention may be practiced or embodied.
  • FIG. 23 is a schematic block diagram showing a device or system 2300 in which the invention may be practiced.
  • the device or system 2300 may be implemented «sing known forms of di gital processing hardware, such as known forms of smartphones, tablets and/or other forms of digital processing and imaging devices, supplemented a necessary in accordance with the teachings of the present invention. Arrows in FIG. 23 indicate flow of digital data and/or signals between elements.
  • device of system 2300 can comprise at least a first digital camera or camera pipeline 2302, a second digital camera or camera pipeline 2304, and a digital processing resource 2306 comprising one or more digital processors) 2308, memory elements) 2310, bidler(s) 2312, other storage 2314, and interfaces 2316 to external devices and/or networks.
  • the digital processing resource 2306 is operable to receive digital data from the cameras or camera pipelines 2302, 2304, process the data in accordance with the invention, and provide outputs, based on such processing, to internal (t.e., within device or system 2300 ⁇ display element 2330 or storage 2320: and/or to external display, storage or network elements (collectively 2350).
  • the externa! display, storage or network elements 2350 may comprise the Internet devices, processors or other networks connected to or via the Internet, or other network-connected elements or destinations.
  • the digital processing resource may receive or consume digital information from such a network or networks 2350, such as for processing by the digital processing resource 2306.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)
EP18884507.7A 2017-12-03 2018-12-03 Stereo-korrespondenzsuche Pending EP3718051A4 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201762594006P 2017-12-03 2017-12-03
PCT/US2018/063699 WO2019109106A1 (en) 2017-12-03 2018-12-03 Stereo correspondence search

Publications (2)

Publication Number Publication Date
EP3718051A1 true EP3718051A1 (de) 2020-10-07
EP3718051A4 EP3718051A4 (de) 2021-09-01

Family

ID=66665303

Family Applications (1)

Application Number Title Priority Date Filing Date
EP18884507.7A Pending EP3718051A4 (de) 2017-12-03 2018-12-03 Stereo-korrespondenzsuche

Country Status (2)

Country Link
EP (1) EP3718051A4 (de)
WO (1) WO2019109106A1 (de)

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100319151B1 (ko) * 1999-02-01 2002-01-05 구자홍 다중레벨 이미지 데이터 생성 방법과 다중레벨 이미지 데이터를 이용한 이미지 검색방법
US8401276B1 (en) * 2008-05-20 2013-03-19 University Of Southern California 3-D reconstruction and registration
US8817071B2 (en) * 2009-11-17 2014-08-26 Seiko Epson Corporation Context constrained novel view interpolation
US9762794B2 (en) * 2011-05-17 2017-09-12 Apple Inc. Positional sensor-assisted perspective correction for panoramic photography
US9378542B2 (en) * 2011-09-28 2016-06-28 The United States Of America As Represented By The Secretary Of The Army System and processor implemented method for improved image quality and generating an image of a target illuminated by quantum particles
US9135700B2 (en) * 2012-05-11 2015-09-15 Dako Denmark A/S Method and apparatus for image scoring and analysis

Also Published As

Publication number Publication date
WO2019109106A1 (en) 2019-06-06
EP3718051A4 (de) 2021-09-01

Similar Documents

Publication Publication Date Title
US11481869B2 (en) Cross-domain image translation
US11810326B2 (en) Determining camera parameters from a single digital image
WO2018166438A1 (zh) 图像处理方法、装置及电子设备
WO2021073493A1 (zh) 图像处理方法及装置、神经网络的训练方法、合并神经网络模型的图像处理方法、合并神经网络模型的构建方法、神经网络处理器及存储介质
US20190355103A1 (en) Guided hallucination for missing image content using a neural network
CN117597703B (zh) 用于图像分析的多尺度变换器
CN111524106A (zh) 颅骨骨折检测和模型训练方法、装置、设备和存储介质
CN110910437B (zh) 一种复杂室内场景的深度预测方法
CN113674146A (zh) 图像超分辨率
CN111831844A (zh) 图像检索方法、图像检索装置、图像检索设备及介质
EP4088226A1 (de) Erzeugung radioaktiver daten
US11934958B2 (en) Compressing generative adversarial neural networks
CN108921801B (zh) 用于生成图像的方法和装置
WO2023159757A1 (zh) 视差图生成方法和装置、电子设备及存储介质
JP2024508867A (ja) 画像クラスタリング方法、装置、コンピュータ機器及びコンピュータプログラム
WO2022125237A1 (en) Parsing an ink document using object-level and stroke-level processing
WO2022081226A1 (en) Dual-stage system for computational photography, and technique for training same
US11550387B2 (en) Stereo correspondence search
CN113822871A (zh) 基于动态检测头的目标检测方法、装置、存储介质及设备
EP3718051A1 (de) Stereo-korrespondenzsuche
CN112070853A (zh) 图像生成方法及装置
CN116030290A (zh) 在设备上检测数字对象并且生成对象掩膜
CN113610856A (zh) 训练图像分割模型和图像分割的方法和装置
CN113837948B (zh) 图像生成方法、设备及存储介质
US12079957B2 (en) Modeling continuous kernels to generate an enhanced digital image from a burst of digital images

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20200702

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20210803

RIC1 Information provided on ipc code assigned before grant

Ipc: G06K 9/64 20060101AFI20210728BHEP

Ipc: G06K 9/46 20060101ALI20210728BHEP

Ipc: G06T 7/11 20170101ALI20210728BHEP

Ipc: G06T 7/33 20170101ALI20210728BHEP

Ipc: G06T 7/593 20170101ALI20210728BHEP

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20230801