WO2005029835A2 - Registration of separations - Google Patents
Registration of separations Download PDFInfo
- Publication number
- WO2005029835A2 WO2005029835A2 PCT/US2004/028782 US2004028782W WO2005029835A2 WO 2005029835 A2 WO2005029835 A2 WO 2005029835A2 US 2004028782 W US2004028782 W US 2004028782W WO 2005029835 A2 WO2005029835 A2 WO 2005029835A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- transformation
- candidate
- frame
- digital image
- image
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/46—Colour picture communication systems
- H04N1/56—Processing of colour picture signals
- H04N1/58—Edge or detail enhancement; Noise or error suppression, e.g. colour misregistration correction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/30—Determination of transform parameters for the alignment of images, i.e. image registration
- G06T7/33—Determination of transform parameters for the alignment of images, i.e. image registration using feature-based methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/30—Determination of transform parameters for the alignment of images, i.e. image registration
- G06T7/38—Registration of image sequences
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/24—Aligning, centring, orientation detection or correction of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/48—Matching video sequences
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
Definitions
- This invention relates to image processing, and more particularly to the registration of separations.
- color separations In addition to the creation of color separations through the original Technicolor process, color separations also have been produced and used for the archival of color film because black and white film stock generally has a much greater shelf-life than color film, hi this process, the color film stock is used to expose one reel of black and white film with sequential records of red, green, and blue so that each frame is printed three times on the resultant reel to form a sequential separation.
- Attorney Docket: 16406-002WO1 Film studios may recombine the three color separations onto a single reel of color film using a photographic process that is performed in a film laboratory.
- an optical film printer is employed to resize and reposition each source reel, one at a time, hi particular, three passes are made.
- the magenta source reel is projected through an appropriate color filter onto the destination reel.
- the destination reel is rewound, the next source reel is loaded and resized, and the color filter is changed.
- a human operator determines a global alignment (and scaling if necessary) for the entire set of frames within the reel or, alternatively, within selected scenes on a scene-by-scene basis, with each scene including several, if not hundreds, ' of frames.
- IP interpositive
- automatically registering digital images includes accessing first and second digital images, and a first transformation.
- the first image includes first content for a feature in a first frame
- the second image includes second content for the feature.
- the first transformation has been determined between at least an aspect of a third digital image and a corresponding aspect of a fourth digital image, with the third and fourth digital images including content for the feature in a second frame.
- the first frame represents the feature at a first point in time
- the second frame represents the feature at a time that either precedes or succeeds the first point in time.
- a second transformation is automatically determined based on the first transformation, and the second transformation reduces a misregistration between at least an aspect of the first digital image and a corresponding aspect of the second digital image.
- the second transformation is automatically applied to the first digital image to reduce the misregistration between at least the aspect of the first digital image and the corresponding aspect of the second digital image.
- Attorney Docket: 16406-002 WO 1 Determining the second transformation also may be based on the first content and the second content. Determining the second transformation may include determining a candidate transformation based only on content from the first frame, and biasing toward either the first transformation or toward the candidate transformation.
- the first transformation may include a first translation.
- Determining the second transformation may include accessing a target image that is based on one or more of the first and second digital images.
- a first transformation may be applied to the target image to obtain a first transformed target image.
- a first distortion may be determined based on the first transformed target image.
- the candidate transformation may be applied to the target image to obtain a second transformed target image.
- a candidate distortion may be determined based on the second transformed target image.
- Biasing may include computing a product of the candidate distortion and a threshold, and comparing the product to the first distortion.
- the second transformation may be set equal to the first transformation based on results of the comparison of the product to the first distortion.
- the second transformation may include a translation, and the translation may include a multi-dimensional vector.
- the second frame may immediately precede the first frame in a film, or the second frame may immediately follow the first frame in a film. Multiple transformations, including the first transformation, may be accessed.
- the multiple transformations each may relate to at least one of multiple frames that are distinct from the first frame. Determining the second transformation may be further based on each of the multiple transformations. An initial transformation may be accessed, in which the initial transformation reduces the misregistration between the first digital image and the second digital image. Determining the second transformation may be further based on the initial transformation. Determining the second transformation may include determining a most common element, an average, Attorney Docket: 16406-002WO1 or a median of a set that includes the multiple transformations and the initial transformation. The multiple transformations may include (i) a previous-frame transformation that relates to a previous frame that precedes the first frame, and (ii) a successive- frame transformation that relates to a successive frame that succeeds the first frame.
- Determining the second transformation may include (i) determining a previous sign change between the previous-frame transformation and the initial transformation, (ii) determining a successive sign change between the successive-frame transformation and the initial transformation, and (iii) smoothing the initial transformation based on the previous sign change and the successive sign change.
- the first transformation may be determined.
- the second transformation may equal the first transformation.
- the aspect of the first digital image may include an edge, subportion, or feature of the first digital image, and the corresponding aspect of the second digital image may include a corresponding edge, subportion, or feature of the second digital image.
- the aspect of the first digital image may include the first content for the feature, and the corresponding aspect of the second digital image may include the second content for the feature.
- the first content may include color information for the feature.
- an apparatus has stored thereon a result of applying the second transformation to the first digital image as recited in the above general aspect.
- automatically registering digital images includes accessing first and second digital images.
- the first image includes first content for a feature in a first frame, and the second image includes second content for the feature.
- a first candidate transformation is determined, the first candidate transformation reducing a misregistration between the first and second digital images.
- a second candidate transformation is accessed.
- a distance is determined between the first candidate transformation and the second candidate transformation. The distance is compared to a threshold. The first candidate transformation is discounted based on results of the comparison of the distance to the threshold.
- Whether to use the first Attorney Docket: 16406-002WO 1 candidate transformation as a final transformation is determined based on results of the discounting of the first candidate transformation.
- a misregistration distortion resulting from application of the first candidate transformation to the first digital image may be determined. Determining whether to use the first candidate transformation as a final transformation may be further based on the misregistration distortion.
- the first candidate transformation may include a first translation
- the second candidate transformation may include a second translation.
- the second candidate transformation may be determined.
- the determination of the second candidate transformation may be based only on content from the first frame.
- a distortion value may be determined that indicates a level of misregistration between the first and second digital images after applying one of the multiple candidate transformations.
- the distortion values for each of the multiple candidate transformations may be compared.
- One of the multiple candidate transformations may be selected as the second candidate transformation based on results of the comparison of the distortion values for each of the multiple candidate transformations.
- the threshold may be a constant.
- a third candidate transformation may be accessed.
- a first distance may be determined between the first candidate transformation and the third candidate transformation.
- a second distance may be determined between the second candidate transformation and the third candidate transformation.
- the first distance may be compared to the second distance.
- the first candidate transformation may be discounted based on results of the comparison of the first distance to the second distance.
- the first and second digital images may be associated with a first block in the first frame.
- the third candidate transformation may be a transformation for reducing Attorney Docket: 16406-002WO 1 misregistration between a third digital image and a fourth digital image that are associated with a second block in the first frame.
- a series of additional first candidate transformations may be determined after determining the first candidate transformation.
- the first of the additional first candidate transformations may be, for example, no closer to the third candidate transformation than is the first candidate transformation.
- Each of the remaining additional first candidate transformations may be, for example, no closer to the third candidate transformation than is the previous additional first candidate transformation in the series.
- the first and second digital images may be associated with a first block in the first frame.
- Determining the second candidate transformation may include selecting as the second candidate transformation a candidate transformation for reducing misregistration between a third digital image and a fourth digital image that are associated with a second block in the first frame.
- Accessing the first digital image may include accessing a portion of a first edge map, the first edge map corresponding to a first color separation of the first frame.
- Accessing the second digital image may include accessing a portion of a second edge map, the second edge map corresponding to a second color separation of the first frame.
- the first content may include edge information in the first edge map, and the second content may include edge information in the second edge map.
- a third candidate transformation may be. determined.
- the threshold may be a function of distance between the second candidate transformation and the third candidate transformation.
- the second candidate transformation may be determined, based only on content from a second frame, to reduce a misregistration between a third digital image and a fourth digital image.
- Each of the third and fourth digital images may include content for the feature in the second frame.
- the first frame may represent the feature at a first point in time
- the second frame may represent the feature at a time that either precedes or succeeds the first point in time.
- the threshold may be a distance between the second candidate transformation and a third candidate transformation.
- the third candidate transformation may be a candidate transformation to reduce misregistration between the first and second digital Attorney Docket: 16406-002WO 1 images. Discounting may include adding an additional distortion term to the misregistration distortion.
- automatically registering digital images includes accessing a composite color image and automatically separating from the composite color image first and second component digital images.
- the first component digital image includes first content for a feature in a first frame.
- the second component digital image includes second content for the feature in the first frame.
- a transformation is automatically determined for the first component digital image to reduce a misregistration between at least an aspect of the first and second digital component images.
- the transformation is automatically applied to the first component digital image to reduce the misregistration and to produce a transformed first component digital image.
- a new composite color image is formed using the transformed first component digital image and the second component digital image.
- the composite color image may be configured according to a video format, and separating the first and second component digital images may include converting the composite color image into an RGB format with the first and second component digital images each corresponding to separate color components of the RGB format. Forming the new composite color image may include converting the transformed first component digital image and the second component digital image from the RGB format into the video format.
- an apparatus has stored thereon the new composite color image of the above general aspect.
- registering frames includes accessing a first frame and a second frame that are successive frames. A resultant transformation is automatically determined for registering at least part of the second frame based on content in the first frame and on content in the second frame.
- Determining the resultant transformation may include determining a first transformation for at least part of the first frame based on content of the first frame.
- An initial transformation may be determined for at least a corresponding part of the second frame based on content of the second frame.
- the resultant transformation may be determined based on the first transformation and the initial transformation.
- the Attorney Docket: 16406-002WO 1 resultant transformation may be applied to at least part of the second frame to produce a transformed part.
- an apparatus has stored thereon the above transformed part.
- an apparatus stores information relating to an image in a sequence of images.
- the information is configured to allow generation of a first component image of the image and a second component image of the image, and the first and second component images include first and second content, respectively, that relates to a specific portion of the image.
- the second component image has been transformed to reduce misregistration between the first and second component images as compared to misregistration between the first component image and a prior version of the second component image. The transformation is based on registration information from another image in the sequence of images.
- the second component image may include analog information or digital information or both.
- the first and second component images may each include a color separation, displays of which are configured to collectively form at least part of a single resultant image. One or more of the color separations may be a digital color separation.
- the second component image may be part of a frame in a film.
- the apparatus may include a reel, a video, an optical disc, or a computer readable medium.
- the second component image may have been indirectly transformed by being based on one or more images that have been directly transformed.
- the second component image maybe one of a set of corresponding images stored in a YUV format.
- the set of corresponding images, including the second component image may be based on another set of corresponding images having an RGB format.
- One or more of the other set of corresponding images may have been directly transformed to reduce misregistration between two or more images of the other set of corresponding images, whereby the second component image is indirectly transformed by being based on the one or more images that have been directly fransformed.
- the information may include encoded information configured to be decoded to generate the first and second component images.
- the first and second component images may be interleaved with each other.
- Attorney Docket: 16406-002WO1 A computer program may be stored on a computer-readable medium and may include instructions for causing a computer to perform one or more of the above general aspects and, if desired, one or more of the additional features.
- An apparatus may include one or more processors programmed to perform one or more of the above general aspects and, if desired, one or more of the additional features.
- An apparatus may include a result of a process applying one or more of the above general aspects or variations of these aspects.
- Various general aspects address methods for the registration of separations or images that may correspond to separations. The separation may relate to film or other fields.
- Such methods may include one or more of a variety of features, such as, for example: (1) accessing a composite color image or a component image (also referred to as a separation) relating, for example, to one or more colors or luminance; (2) accessing component images that are based on a variety of separations, including, for example, digitized film separations for which each of the component images includes a set of gray-level pixels; (3) correcting one or more film distortions; (4) automatically determining a transformation to reduce a film distortion; (5) enhancing spatial continuity within a frame; (6) enhancing temporal continuity across consecutive frames; (7) applying multiple criteria of merit to a set of features to determine a set of features to use in determining a transformation; (8) determining transformations for areas in an image or a separation in a radial order; (9) comparing areas in images or separations by weighting feature pixels differently than non-feature pixels; (10) determining distortion values for transformations by applying a partial distortion measure and/or using a spiral search configuration; (11) determining
- the described implementations may achieve, one or more of the following features. For example, they may provide an automatic and efficient digital image registration process for color film separations.
- the process may operate in the digital domain to enable the use of a number of digital image processing techniques, and may require minimal human intervention.
- the process may be computationally efficient, Attorney Docket: 16406-002WO 1 and may be capable of determining alignments on a frame-by-frame basis.
- the process also may address the local nature of the misregistration within an image that results from such causes as film shrinkage due to aging.
- the process may compensate, correct, or avoid one or more of the described distortions.
- FIG. 1 is a picture illustrating misregistration.
- FIG. 2 is a block diagram of an implementation of a registration method.
- FIG. 3 is a diagram of a partitioning of an image into blocks.
- FIG. 4 is a diagram of one implementation of a processing order for the partition of FIG. 3.
- FIG. 5 is a diagram highlighting areas in which one implementation applies feathering.
- FIG. 6 is a picture illustrating a result of applying one implementation of a registration technique to the picture in FIG. 1.
- FIG. 7 illustrates three sequential composite frames, their respective color components, and an alignment vector corresponding to the red component of each frame.
- FIG. 8 is a graph illustrating a distance distortion metric.
- FIG. 9 is a diagram of a frame for illustrating a distortion calculation.
- FIG. 10 is a graph illustrating another distance distortion metric.
- FIG. 11 illustrates a process for registering composite color images.
- DETAILED DESCRIPTON The film processes described earlier, as well as other processes, may be subject to one or more of a variety of well-known film distortions. These include Attorney Docket: 16406-002WO 1 static misregistration, dynamic misregistration, differential resolution, and loss of resolution. Although referred to as film distortions, these distortions also may be present in other applications and environments. For example, registration of separations may be required, and one or more film distortions may be present, in photography, astronomy, and medical applications.
- Static misregistration may be experienced due to one or more of a variety of reasons, six examples of which follow.
- the original cameras typically were adjusted mechanically by a technician with a micrometer.
- the alignment results therefore often varied, usually from camera to camera, within a single movie title.
- Third, differences between the camera and the printer may have caused color shifting.
- Fourth, the photographic compositing process discussed above typically operates on either a reel-by-reel basis or a scene-by-scene basis, rendering it difficult to correct misalignments that may occur on a frame-by-frame basis.
- the photographic compositing process provides a global alignment for a particular image, the process does not necessarily address the local misalignments that may occur within an image.
- First, the film separations in a camera are subject to intermittent motion, stopping and starting, for example, twenty four times every second. All three separations must stop in precise alignment in order to obtain proper registration. However, such precise timing is difficult to achieve and maintain.
- the film may move in the camera or subsequent film printer leading to color fringing that moves in like manner.
- film may be spliced together as part of a normal editing process, resulting in splices that are physically thicker than the film.
- a small vertical bump in one or more color film separations may occur.
- the photographic compositing Attorney Docket: 16406-002WO1 process may not operate on a frame-by-frame basis, the process may not capture these types of misalignments. Differential resolution also may arise due to one or more of a variety of reasons.
- the nature of the light path and lens coatings in the Technicolor cameras typically caused the three film separations to have drastically different resolution or sharpness.
- the cyan separation typically was located behind the yellow separation in what was known as a bipack arrangement. Light that passed through the yellow separation was filtered and unfortunately diffused before striking the cyan separation.
- the yellow (inverted blue) separation typically had a greater resolution compared to the cyan (inverted red) separation
- the magenta (inverted green) separation typically had a resolution that was similar to that of the yellow (inverted blue) separation.
- This difference in resolution may result in red fringing that encircles many objects. Loss of resolution may arise, for example, from the use of an optical printer.
- Digital image registration can be used to address one or more of these film distortions.
- One aspect of digital image registration includes the process of aligning two or more digital images by applying a particular mapping between the images.
- Each digital image consists of an array of pixels having a dimensionality that may be quantified by multiplying the image width by the image height.
- the gray-level value I(x, y) represents how much of the particular color (for example, red, green, or blue) is present at the corresponding pixel location (x, y). If II represents a first image, 12 represents a second image, II (x, y) represents a pixel value at location (x, y) within image II, and I2(x, y) represent a pixel value at location (x, y) within image 12, the mapping between the two images can be expressed as: I2(x, y) corresponds to gll(f(x, y)), where f is a two dimensional spatial coordinate transformation that can be characterized by a pixel alignment vector, and g is an intensity transformation that, for example, can be characterized by an interpolation function.
- a registration algorithm may be used to find a spatial transformation or alignment vector, and in some cases, to find an intensity transformation, to match the images.
- Fig. 1 illustrates one visual manifestation of misregistration that can occur due to one or more sources of distortion.
- Fig. 2 illustrates a system 200 including a digitization unit 210 that receives three separation images and outputs three digital, and possibly transformed, color component images.
- a feature selection unit 220 receives these digital images and, after processing, outputs them to an alignment vector determination unit 230.
- the alignment vector determination unit 230 determines transformations for two of the images against the third, with the third being used as a reference. In other implementations that employ other than three images, the alignment vector determination unit 230 would produce transformations for N- 1 images, where N is the total number of images.
- An alignment vector application unit 240 receives the two transformations from the alignment vector determination unit 230 and the two non-reference digital images from the digitization unit 210. The alignment vector application unit 240 modifies these two non-reference images using the transformations.
- a composite phase unit 250 combines the two modified images and the reference image into a composite image.
- Digitization Unit Multiple color separations are input into the digitization unit 210.
- the digitization unit 210 accepts multiple photographic negative images (for example, yellow, cyan, and magenta) and outputs multiple photographic positive images (for example, blue, red, and green) as digital data in the form of a set of gray-level pixels.
- Other implementations may perform one or more of a variety of other transformations, such as, for example, positive-to-negative, in lieu of or in addition to the negative-to-positive transformation; perform no transformation at all; or accept digitized data and thereby obviate the need for digitization.
- Attorney Docket: 16406-002WO1 Feature Selection Unit Each of the digital color component images is input into the feature selection unit 220.
- the feature selection unit 220 selects a feature or feature set, such as, for example, one or more edges, objects, landmarks, locations, pixel intensities, or contours.
- the feature selection unit 220 identifies a set of edges, optionally or selectively refines this set, and outputs an edge map (labeled E R , EQ, or E B ) that may be in the form of an image consisting of edge and non-edge type pixels for each color component image.
- An edge detection filter for example, a Canny filter, may be incorporated in or accessed by the feature selection unit 220 and may be applied to a digital color component image in order to obtain a set of edges.
- the edge information may be combined or separated into, for example, orthogonal sets.
- One implementation obtains separate horizontal and vertical edge information.
- the edge maps can be further refined to attempt to identify a set of useful edges.
- the set of edges may be pruned to a smaller set so as to reduce the inclusion of edge pixels having properties that could cause misleading misregistration results.
- the feature selection unit 220 performs this pruning by applying one or more criteria of merit to each edge in order to determine whether that particular edge should be included or rejected. Thereafter, one or more second criteria of merit may be applied to the collection of included edges in order to determine whether the entire set should be retained or if the entire set should be rejected.
- the alignment vector can be determined in some other manner, such as, for example, by applying the techniques discussed below with respect to the alignment vector determination unit 230.
- Several techniques may be used to refine a set of edges by enforcing a minimum edge requirement and/or emphasizing high intensity areas as described below. Examples of these techniques include the use of horizontal/vertical information and the use of high intensity selection, both of which are discussed below.
- Attorney Docket: 16406-002WO 1 Horizontal/Vertical information When searching for horizontal and vertical translational shifts, or more generally, alignments vectors, one implementation determines whether there is enough useful vertical and horizontal edge information within the area under consideration to make a useful alignment determination.
- each edge is first compared to a criterion of merit that determines the vertical and horizontal extent of the edge in both absolute and relative (with respect to the other direction) terms. Thereafter, the set of edges that has been determined to have sufficient vertical or horizontal extent is compared to another criterion of merit in order to determine whether this new set should be retained or rejected in its entirety. For instance, in one implementation, determining the sufficiency of vertical/horizontal edge information may include identifying a connected edge.
- a connected edge may be identified by identifying a set of adjacent pixels that each have characteristics of an edge and that each have at least one neighbor pixel with characteristics of an edge. For each connected edge in the area under consideration, a determination may be made as to whether there is sufficient vertical and horizontal information.
- T_yl, and T_y2 represent preset or configurable thresholds.
- Total_x a value for the total number of horizontal edge candidate pixels, may be computed by adding N_x to Total_x for each edge for which N_x is greater than T_xl and x_info is greater than T_x2. That is, an edge is included as a horizontal edge candidate and the total number of horizontal edge candidate pixels is incremented by N_x if N_x and x_info are greater than the thresholds T_xl and T_x2, respectively. Otherwise, none of the pixels for the connected edge are used to determine vertical shifts.
- Total_y a value for the total number of vertical edge candidate pixels, may be computed by adding N_y to Total_y for each edge for which N_y is greater than T_yl and y_info is greater than T_y2. That is, an edge is included as a vertical edge candidate and the total number of vertical edge candidate pixels is incremented by N_y if N_y and y_info are greater than the thresholds T_yl and T_y2, respectively. Otherwise, none of the pixels for the connected edge are used to determine horizontal shifts.
- the total number of candidate edges for each direction, Total_x and Total_y are compared to the preset threshold,
- T otal If Total_x is greater than Tjotal, all of the pixels associated with the identified horizontal edge candidates are considered horizontal edge pixels. Otherwise, if Total_x is less than or equal to Tjotal, the number of horizontal edge candidates is deemed insufficient, and, as such, none of the edges within the area are used for the vertical shift determination. If Total_y is greater than Tjotal, all the pixels associated with the identified vertical edge candidates are considered vertical edge pixels. Otherwise, if Total_y is less than or equal to Tjotal, the number of vertical edge candidates is deemed insufficient, and, as such, none of the edges within the area are used for the horizontal shift determination.
- an alternate method of obtaining the alignment values for that area in one or more directions may be used.
- Several alternative methods are discussed below with respect to the alignment vector determination unit 230.
- Attorney Docket: 16406-002WO1 High Intensity Selection In general, a misregistration at bright areas of an image is more observable and objectionable than a misregistration at darker areas of an image. For example, the eye would more readily observe a red area extending beyond a white area than a red area extending beyond a brown area. As such, it may be desirable to target or exclusively select edges that exist within high intensity areas.
- Such targeting/selection may be achieved through the construction of an edge map using a process that compares the gray-level pixel intensities associated with each color component image to a threshold.
- high intensity selection may be applied to a refined edge map generated using the previously-described or some other refinement technique, individually or in combination.
- RE_x indicates a particular pixel in the new refined edge map for the x color component image, where x can be either red, green, or blue
- E_x indicates a corresponding pixel in the original edge map for the x color component image (where E_x contains either edge or non-edge valued pixels)
- P_r indicates the original gray-level intensity value for the corresponding pixel for the red component image
- P_g indicates the original gray-level intensity value for the corresponding pixel for the green component image
- P_b indicates the original gray- level intensity value for the corresponding pixel for the blue component image
- T_h indicates a preset pixel intensity threshold.
- RE_x is an edge pixel if E_x is an edge pixel, P_r > T_h, P_g > T_h, and P_b > T_h. Otherwise, RE_x is not an edge pixel.
- the definition of a high intensity edge may be relaxed or expanded to be more inclusive. For instance, in one implementation, edge pixels within a window (of relatively small horizontal and/or vertical extent) relative to a high intensity edge also may be categorized as high intensity edge pixels.
- the refinement procedure for assessing horizontal and vertical edge information can be Attorney Docket: 16406-002WO1 applied to generate a more useful set of high intensity edges.
- the initial edge maps that is, the edge map obtained before the high intensity edge refinement process was applied
- the edge refinement technique for assessing horizontal and vertical edge information then can be applied to this edge map to obtain a useful set of edges within this area. If there is not a sufficient number of edges in this case, an alternate method of obtaining the horizontal and/or vertical alignment for that area may be used, as discussed below.
- a new corresponding image consisting of edge and non-edge valued pixels is created and transferred from the feature selection unit 220 to the alignment vector determination unit 230.
- the alignment vector determination unit 230 may operate on different types of feature maps. Nonetheless, consistent with the examples set forth previously, operation of the alignment vector determination unit 230 will be described, in detail, primarily for edges. After the edges are obtained for each color component image, they are compared between pairs of color component images in order to determine the alignment vector that will lead that pair of images to be aligned, typically in an optimal manner. Other implementations may, for example, accept an alignment vector that satisfies a particular performance threshold. In one implementation, each pair consists of one color component image that serves as a reference image, and a second color component image that serves as a non-reference image.
- one color component image is maintained as a reference image that does not undergo any alignment throughout the film sequence, thus ensuring a constant temporal reference throughout the film sequence to be registered.
- the green reference image typically is chosen as the reference image due to its relatively high contrast and resolution.
- a red, a blue, or some other color component image may be selected as a reference, or the reference may be varied with time.
- Other implementations may select a reference, if any, as warranted by a particular application.
- Attorney Docket: 16406-002 WO 1 There are various possible spatial transformations that can be used to align the color component images of a film frame.
- the fransformation is represented as one or more translational alignments in the horizontal and/or vertical directions.
- the fransformation in that implementation can be described using I(x, y) to denote a pixel intensity at location (x, y) for a particular color component image, and let F(x, y) to denote a pixel intensity at location (x, y) after the translational alignment has been imposed on the color component image.
- a translational transformation can be performed, for example, either globally for the entire image or locally within different areas of the image.
- the misalignment experienced at the outer areas of the image may differ from the misalignment experienced at the center portion of the image.
- different alignment vectors are applied to different areas of the image.
- localized alignment vectors are determined for various areas of the image, as described below.
- the color component image is divided into areas arranged in a manner such that the center of at least one area and the center of at least one other area are in different proximity to the center of the image.
- areas can have overlapping pixels, that is, some pixels can belong to more than one area within the non-reference image. Further, all areas of an image need not necessarily be processed.
- the different areas of the image will be referred to as blocks. Fig.
- a distortion value (or alternatively, a similarity value) is computed between a defined set of pixels associated with that block and the corresponding set of pixels in the reference image for a given translational alignment vector (deltax, deltay) using a registration metric such as that defined below.
- a pixel at location (x+deltax, y+deltay) in the reference image is defined to be the corresponding pixel to a pixel at location (x, y) within a block in the non-reference image for a translational alignment vector of (deltax, deltay).
- the set of pixels used to compute the registration metric value associated with the block can be a subset of the total pixels associated with the block.
- One or more of various registration metrics can be used.
- One general class of measures includes feature-based measures that weight comparisons involving feature pixels in a base image (reference or non- reference) differently, for example, more heavily, than comparisons involving non- feature pixels, where a feature pixel is a pixel determined to possess particular characteristics.
- the measure may be characterized as accumulating distortion for each pixel in the non-reference image satisfying the conditions that the pixel is identified as part of a feature, but the corresponding pixel in the reference image is not identified as part of a feature.
- the term "part,” as well as any other similar term, is used in this application broadly to refer to either "all” or "less than all.”
- the above pixels may, in general, contain all of the feature or less than all of the feature.
- the maximum potential distortion for each tested (deltax, deltay) vector would be equal to the number of feature (e.g., edge) pixels within the non-reference image.
- a total distortion value is computed for a number of candidate (deltax, deltay) alignment vectors, within a particular "window," W, of size (2Wx+l)*(2Wy+l), where Wx, Wy are integers greater than or equal to zero, the absolute value of Wx is greater than or equal to deltax, and the absolute value of Wy is greater than or equal to deltay.
- the (deltax, deltay) vector that provides the lowest distortion value among the set of distortion values associated with the candidate alignment vectors is then selected as the alignment vector, (deltax_selected, deltay_selected) .
- the alignment vectors in the associated implementation can be determined by determining an initial alignment, defined as (deltax_i, deltay_i), for the image.
- the center of the image is used to establish the initial alignment vector upon which other blocks of the image base their alignment vector.
- the center can comprise the inner 25% of the image, which may overlap, partially or completely, an arbitrary number of blocks.
- the (deltax, deltay) pair that is chosen is the pair that provides the lowest distortion using the one-sided mismatch accumulator distortion measure among a number of candidate (deltax, deltay) vectors.
- the alignment vectors for the individual blocks of the image are determined by processing the blocks in a radial manner.
- the order in which the areas are processed and in which their alignments are determined is based, in one implementation, upon a radial path that begins near the center of the image and then progresses outward.
- a radial ordering can be attained, for example, if the blocks are grouped into four different rings.
- a radial ordering refers to processing blocks based on some measure of their distance from a chosen reference point, and, further, processing the blocks in either a generally increasing or a generally decreasing distance from the chosen reference point, such as for example, the center of an image.
- a radial ordering also may process blocks randomly within a ring, where the rings are processed according to either a generally increasing or generally decreasing distance using some measure of distance.
- An inner ring is a ring that is positioned a smaller distance, using some measure, from the chosen reference point than a ring under consideration.
- an outer ring is positioned a larger distance from a chosen reference point than a ring under consideration.
- An innermost ring has no ring that is closer to the chosen reference point. Fig.
- FIG. 4 illustrates four different rings. These rings are concentric.
- the determination of the selected alignment vectors of the blocks proceeds, in this implementation, by first processing the blocks in the first ring (ring 0) consisting of blocks 27, 28, 35, and 36.
- blocks within the second ring (ring 1) are processed, that is, blocks 18-21, 29, 37, 45-42, 34, and 26.
- blocks within the third ring (ring 2) are processed, that is, blocks 9-14, 22, 30, 38, 46, 54-49, 41, 33, 25, and 17.
- blocks within the fourth ring (ring 3) are processed, that is, blocks 0-7, 15, 23, 31, 39, 47, 55, 63-56, 48, 40, 32, 24, 16, and 8.
- each ring is processed in a clockwise manner.
- a translation alignment vector is determined by establishing an initial translation alignment vector for the block.
- these initial translation alignment vectors may be determined based on the alignment vectors of their neighboring block(s), where these neighbors belong to the set of blocks that have already been processed and that share a common border or pixel(s) with the block under consideration.
- the blocks may not share a common border or pixel or the initial vector may be set by default or chosen at random.
- the initial alignment vector for the block under consideration may be equal to a function of the neighbors of the block under consideration that have already been processed. If a clockwise progression is used, the set of neighbors for block 21 that have already been processed consists of blocks 20 and 28. Similarly, the set of neighbors for block 6 that have already been processed consists of blocks 5, 13, and 14.
- the function can be defined in a number of ways. For example, the function may be a weighting of the alignment vectors among each of the neighbors or the alignment vector of one or more neighbors that provide the minimum distortion for the block under consideration. In implementations that emphasize the radial configuration, the neighbor can be chosen to be the inward radial neighbor of the current block under consideration.
- An inward radial neighbor is any neighboring block having a distance, using some measure, that is no further from the chosen reference point than is the block under consideration. This implies, for example, that the initial translational alignment vector for blocks 9, 10, and 17 would all be equal to the selected translational alignment vector determined for block 18, and that the initial translational alignment vector for block 11 would be equal to the selected translational alignment vector determined for block 19.
- the initial estimate can be computed in a similar manner or any other suitable manner.
- the blocks within the inner ring can use the translational alignment vector determined for the center portion of the image as their initial estimate.
- the center portion of the image can use a preset initial alignment vector, which may be, for example, no initial displacement, or the displacement for the central block or blocks of a previous frame.
- the distortion associated with a number of candidate alignment vectors that represent different displacements can be calculated.
- Wx and Wy are integers greater or equal to 0, and the dependence of Wx and Wy on m and n indicates that the horizontal and vertical window areas can be different dimensions for different rings or even different blocks within a ring.
- the alignment vector that corresponds to the displacement that produces the minimum distortion among the candidate displacements chosen from this set then is selected to be the alignment vector, (deltax_selected, deltay_selected), for the block.
- Wx_in and Wy_in respectively, resulting in a large increase in efficiency.
- Wx(m, n) and Wy(m, n) are a number of strategies that may be employed to determine the selected candidate within a particular window of dimension (2*Wx+l)*(2*Wy+l).
- a straightforward approach is to check every displacement possibility within the window.
- Another implementation uses a spiral search with a partial distortion Attorney Docket: 16406-002WO1 measure to determine the selected displacement or alignment vector.
- the different displacements are considered in an order that begins at the location associated with the initial alignment vector and proceeds radially outward in a spiral scanning path.
- the one-sided mismatch accumulator is a cumulative distortion, it is possible to periodically compare the current minimum distortion to the distortion accumulated after only a partial number of the pixels within the block (that have been chosen to be used in the distortion calculation) have been processed. If the partial distortion sum is found to be greater than and/or equal to the current minimum distortion, then the candidate location cannot provide the minimum distortion and the other pixels in the block need not be processed.
- a spiral search with a partial distortion measure reduces the computational complexity associated with the search of all the candidate locations.
- the initial alignment vector is a function of the neighboring blocks' selected alignment vectors
- the block under consideration will have lower distortion with this alignment vector or with an alignment vector that corresponds to a displacement that is close in distance to this initial alignment vector rather than an alignment vector that corresponds to a displacement that is farther away in distance from the initial alignment vector.
- the distortion will exceed the current minimum value before a complete check of all of the pixels associated with the block that are chosen to be used in the distortion calculation.
- a method that does not search all displacement possibilities within the window can be used in order to reduce the computational complexity of the search.
- an iterative algorithm can be employed in which the selected alignment vector (corresponding to a particular candidate displacement) is first chosen for one direction (e.g., vertical), and this result then is used as the initial alignment vector in the search for the selected alignment in the orthogonal direction (e.g., horizontal), and this process then is iterated until a particular stopping condition is met.
- Such an implementation may use different features, for example, vertical and horizontal edges, to determine the alignment vectors for the horizontal and vertical directions, respectively.
- Attorney Docket: 16406-002WO1 For the case where separate horizontal and vertical edge information is retained, the following provides an example of a method that can be used to select one candidate alignment vector from a set of candidate alignment vectors for a given block. First, initial conditions are set (step 1).
- the distortion_y associated with the initial alignment vector (deltax_i, deltay_i) is determined using the horizontal edge information, and minimum_y, the minimum distortion in the vertical direction, is set equal to distortion_y, and the selected vertical displacement deltay_selected(0) is set equal to deltayj.
- step 2-1-2 The minimum distortion_y among the set of calculated distortion values then is found, and deltay_selected is set to be the sum of the deltay that produces this minimum distortion value and deltay_selected(i-l), and this distortion value is set to be the new minimum distortion value, minimum_y (step 2-1-2).
- a determination then is made as to whether the stopping condition for a particular direction has been met (step 2-1-3). If so, step (2-1) will not be repeated after step (2-2). Next, the selected horizontal shift is determined using the vertical edge information (step 2-2).
- the minimum distortion_x among the set of calculated distortion values then is found, deltax_selected is set to be the sum of the deltax that produces this minimum distortion value and deltax_selected(i-l), and this distortion value is set to be the new Attorney Docket: 16406-002WO1 minimum distortion value, minimum_x (step 2-2-2).
- a determination then is made as to whether the stopping condition for a particular direction has been met.
- step (2-2) will not be repeated after step (2-1).
- a similar implementation that reduces the number of candidate locations searched can be performed using edge information that is captured for both vertical and horizontal directions simultaneously (for example, the edge information is based on the magnitude of the edge strength), h such a case, the distortion value computation at (deltax_selected(i-l), deltay_selected(i-l)) for iteration i need not be calculated because it already has been calculated in iteration i-1, and the single edge- map information is used instead of the horizontal and vertical edge maps discussed above. If there is not a sufficient number of useful edges in a particular direction within a block to be registered within the non-reference image or within a corresponding block in the reference image, an alternative method may be performed to select an alignment vector for this block.
- the alignment for that direction can simply be taken to be the initial alignment for that direction.
- a larger area encompassing more blocks (or even the entire image) can be used to determine the selected alignment vector for this block.
- Other alternative or additional methods can also be used to select an alignment vector for this block.
- the alignment vector for the center is set to the selected alignment vector determined from the entire image.
- the alignment vector application unit 240 aligns each block using these vectors or a modification of them.
- the alignment is straightforward. If, however, the image has been segregated into multiple blocks, the image can be spatially aligned by a number of different methods.
- One technique that can be used for alignment involves applying a uniform alignment to each block by its corresponding alignment vector. However, different blocks within that color component image may have different alignment vectors. In such a case, discontinuities may exist at a boundary between blocks.
- the alignment vector application unit 240 attempts to reduce the perceptual effects of discontinuities by "feathering" at the boundaries between blocks with different alignment vectors, as described below.
- this technique is applied only to the non-reference images (recall that the non- reference images are shifted with respect to the reference image).
- the feathering process will be described hereinafter with reference to horizontal and vertical alignment values to maintain consistency with early examples, the feathering process also is applicable to other types of transformations. Feathering is performed, in this implementation, along the y-axis boundaries between two horizontally neighboring blocks, along the x-axis boundaries between two vertically neighboring blocks, and for the four-corner boundaries between four neighboring blocks.
- Fig. 5 provides an example 500 identifying several pixels that are affected by the feathering scheme described above when the image is divided into sixteen uniformly sized areas.
- feathering is performed across a particular-sized horizontal window.
- the window 510 may be used.
- feathering is performed across a particular-sized vertical window.
- the window 520 may be used.
- feathering is performed across a particular-sized vertical and a particular-sized horizontal window.
- the window 530 (arrows point to corners of window 530 in Fig. 5) may be used.
- the window size is determined as a function of the maximum (max) of the difference between the x alignment values of the neighboring blocks and the difference between the y alignment values of the neighboring blocks.
- maximum the maximum of the difference between the x alignment values of the neighboring blocks
- y alignment values the difference between the y alignment values of the neighboring blocks.
- many techniques may be used to determine the size and/or shape of the various windows. These windows need not be rectangular or continuous.
- new alignment values are obtained by linearly interpolating between the different alignment values of the neighboring blocks under consideration. Another implementation uses non-linear interpolation.
- the interpolated alignment values then are used to obtain the new intensity value of the pixel at a particular location.
- the pixel at the location corresponding to the selected alignment value is used as the value for the current location.
- the intensity values of the pixels that correspond to the two integer- valued displacements closest in distance to the selected displacement are appropriately weighted and combined to obtain the final new intensity value.
- the calculation of the new alignment vectors within the feathering window may be performed in the alignment vector determination unit 230 rather than the alignment vector application unit 240. The calculation of the new intensity values within the feathering window would still be performed in the alignment vector application unit 240.
- Other implementations may perform many of the various described operations in different orders or in different functional blocks.
- the horizontal alignment value for the point (x4, y4) is computed as (dxl+(x4-(x3-(fwsize/2)))*(dx2- dxlV(fwsize)) and the vertical alignment value for the point (x4, y4) is computed as (dyl+(x4-(x3-(fwsize/2)))*(dy2-dyl)/(fwsize)).
- the new value at a particular location is the value of the pixel at the calculated displacement location (or the weighted combination of the intensities at the nearest integer grid points). Note that special care may need to be applied to the boundaries of the image when the feathering approach is used. Another implementation adjusts for discontinuities by using warping.
- each block can be identified with a control point at its center.
- the horizontal and vertical alignment values that were obtained for each block can become the alignment values for the block's control point.
- the alignment values for the remaining pixels within the image may be obtained by interpolating between the alignment values of the nearest control points. These alignment values are then applied to the pixels within the non-reference image.
- Composite Phase Unit Once the non-reference images are aligned by the alignment vector application unit 240, the images can be recombined into a composite color frame by the composite phase unit 250.
- Fig. 6 illustrates a composite frame after the composite phase unit 250 has been applied to three color component images.
- a laser film printer is optionally used to avoid the loss of resolution incurred with an optical printer.
- Temporal Implementations The selection of a transformation for a particular frame may use temporal information, that is, information from one or more frames that precede, succeed, or both precede and succeed, in time, the frame under consideration.
- temporal information that is, information from one or more frames that precede, succeed, or both precede and succeed, in time, the frame under consideration.
- using information from Attorney Docket: 16406-002WO 1 neighboring frames may enable a temporally smoother visual experience.
- it may be beneficial to take advantage of the fact that consecutive frames often are shifted similarly during the photographic process.
- strategies that may be employed to use information from other frames to influence the selection of the transformation for a current frame, and a variety of implementations are illustrated below.
- Equation 1 shows that the new alignment vector may be a function of the alignment vectors and image data associated with the current frame, as well as past and future frames.
- the similarity (or lack thereof) between a current and neighboring (past or future) frame, either reference frames or non- reference frames influences the extent to which alignment vectors based on the neighboring frames are used in the determination of the new alignment vector for the current frame.
- Equation 1 may be generalized to include additional information in the determination of V_i_new.
- additional information may relate to the current frame or other frames, or may be unrelated to the frames.
- Attorney Docket: 16406-002WO 1 Referring to Fig. 7, three sequential composite frames 710, 720, and 730 are displayed, along with the red, green, and blue color component images for each frame 710, 720, and 730.
- a red component 712, a green component 714, and a blue component 716 may be combined to form composite frame 710.
- a red component 722, a green component 724, and a blue component 726 may be combined to form composite frame 720.
- a red component 732, a green component 734, and a blue component 736 may be combined to form composite frame 730.
- Fig. 7 illustrates that V_(i-1) is a function of color components 712 and 714, V_i is a function of color components 722 and 724, and V_(i+1) is a function of color components 732 and 734.
- the alignment vectors from past and future frames that are used to determine the new alignment vector (for a current frame) are generated using information from the respective past or future frame only.
- the new alignment vector (for a current frame) may be based on the new alignment vectors for one or more past or future frames.
- V_i_new f(V_(i-j)_new, ... , V_(i-l)_new, V_i, ... , V i+k), R_(i-j), R_i, ... , R_(i+k), G_(i-j), ... , G_i, ... , G_(i+k)).
- V_i_new could be a function of new vectors for future frames and could also be a function of all color component images for one or more frames.
- Such an implementation may determine an alignment vector for the first frame of the series using only spatial information for the first frame, and then may apply the alignment vector to the entire series of frames, thus determining the alignment vectors for the remaining frames in the series using only temporal information (and not using any spatial information per se from the remaining frames).
- Temporal information may also be used at various different points in a registration process. For example, a first pass may be used to determine alignment vectors (optimal or otherwise) for each frame without considering temporal information, then a second pass may be used to consider temporal information, or vice versa.
- a single pass may be used that considers both the information within a frame (non-temporal) and information (including alignment vectors) from one or more previous frames (temporal).
- Single pass implementations also may consider information from future frames, but would not have access to alignment vectors for future frames unless those alignment vectors were already determined.
- implementations may use temporal information in various different ways. For example, the temporal and non-temporal information may be considered separately and only combined at the end of the process of determining an alignment vector.
- Such an implementation may, for example, determine two proposed alignment vectors (with a first of the two being determined using only temporal information and a second of the two being determined using only spatial information), and then compare results achieved with the two proposed alignment vectors.
- One such implementation proceeds by (i) determining an alignment vector V_i for a current frame "i" based only on information in the current frame, (ii) determimng the distortion that results from the application of V_i to frame "i,” (iii) determining the distortion resulting from the application of V_(i-1) to frame "i,” and (iv) selecting as V_i_new either V_i or Attorney Docket: 16406-002WO 1
- V_(i-1) depending on which alignment vector resulted in the lower distortion for frame "i.”
- temporal and non-temporal information may be integrated such that each is used in the generation of a single proposed alignment vector.
- Such an implementation may proceed by, for example, using V_(i-1) as an initial alignment vector for frame "i,” and then iterating through various other alignment vectors in a window around V_(i-1) to find the vector that produces the lowest distortion for frame "i.”
- the temporal information (V_(i-1)) is used as a starting point and is modified based on non-temporal information (the distortion results for frame "i").
- temporal information may be considered, for example, before or after non-temporal information.
- Implementations also may, for example, consider temporal information both before and after, or at the same time as, non-temporal information.
- Using both temporal and non-temporal information may involve a tradeoff.
- using temporal information produces better temporal stability
- using non-temporal (spatial) information produces better spatial registration.
- the "best" alignment vector chosen using the temporal information may not be the same as the "best" alignment vector chosen using the spatial information.
- there is a tradeoff between the objectives of temporal stability and spatial registration and the implementation must determine which objective to favor or bias toward.
- Such a bias could be constant for a series of frames or may vary depending on factors such as scene content.
- alignment vectors for frames that make up a static scene may be biased toward temporal stability, and alignment vectors for frames that involve movement may be biased toward greater registration within each frame.
- Bias may be achieved in various ways. For example, consider an implementation in which a distortion is computed by applying the best alignment vector of the previous frame, V_(i-1), to the current frame "i.” If the distortion associated with using V_(i-1) is "low enough,” then V_(i-1) also may be used as the
- “Low enough” may be defined, for example, as a distortion value that is within a particular adaptive threshold of the Attorney Docket: 16406-002WO 1 minimum distortion that results from applying V_i (the alignment vector selected for the current frame when only information from the current frame is used).
- An adaptive threshold may be defined as, for example, 110% of the distortion associated with V , and such an implementation would be biased toward using V_(i-1) in that higher distortions would be tolerated with V_(i-1) as compared to V_i.
- Bias Factor may be, for example, set to 1.1 as discussed above.
- the bias factor also may be, for example, variable.
- a variable Bias Factor may be varied based on, for example, the content of the frame and/or the previous frame, and may be varied, for example, manually or automatically. Bias also may be applied when using the spatial distance between alignment vectors as a metric.
- a metric may be the distance between alignment vector V_i, which is based only on current frame "i," and alignment vector V_(i-1), which is based only on previous frame "i-l .” Note that such a distance metric does not consider the actual frame content per se, but only the distance between alignment vectors.
- V_(i-1) may be selected as the alignment vector for frame "i" (V_i_new) if the distance between V_(i-1) and V_i is less than a threshold.
- a larger threshold provides a greater bias toward V_(i-1), and toward temporal stability.
- Implementations may use a constant threshold, such as, for example, 1 or 2 pixels for a one-dimensional implementation. Implementations also may use an adaptive threshold, such as, for example, setting the allowable distance equal to a percentage of the length of one or more of the alignment vectors.
- the threshold may be based on, for example, the resolution of the frame (for example, 2048x1536) and/or the content of the frame (for example, the frame content may be part of a static scene or a moving scene).
- Attorney Docket: 16406-002WO 1 Referring to FIG. 8, a graph 800 shows a two-dimensional implementation of a spatial distance metric using a constant threshold. Alignment vector V_i is shown surrounded by a dashed circle 810 of fixed radius equal to the constant threshold. Alignment vector V_(i-1) is shown as lying within circle 810, revealing that the spatial distance between V_i and V_(i-1) is less than the threshold. Implementations also may apply the spatial distance metric, as with many other metrics, separately to each of multiple dimensions.
- various other distortion metrics also may be used, such as, for example, the one-sided mismatch accumulator discussed earlier. Additional distortion metrics are discussed further below, and a distortion metric also may be defined as a function of one or more other metrics.
- Several implementations are now described for using temporal information. These examples include a mode filter, a running average filter, a running median filter, and a sign change filter. These filters may be referred to as smoothing filters and may be implemented as a second pass that occurs after a first pass in which the set of alignment vectors is determined by considering only information in the current frame.
- Vjnode is determined that will be applied to frames 1 through N.
- V_N For example, for a set of one-dimensional alignment vectors ⁇ 12, 14, 15, 16,
- V_mode 16 16, 16, 16, 15, 12 ⁇ , V_mode is 16.
- V_i 1, ... , N
- fractional distances could be used by, for example, interpolating pixel values at locations corresponding to alignment values of 14 and 15.
- V_i (mean) uses a moving window of size k+1 that is forward-looking. Implementations may be, for example, forward-looking, backward- looking, or a combination of forward-looking and backward-looking.
- S_i[x] is the xth vector in set S_i.
- the "extent” is chosen such that if the intervals are larger than the "extent,” then the sign changes are infrequent enough in time and the changes may not be as perceivable to the viewer, so that correction may not be necessary. By contrast, if an interval is not larger than "extent,” then the sign changes are too frequent and the changes may be perceived by a viewer and, thus , may require correction.
- V_(origi+count-l)) V_(origi-l).
- the pseudo-code may be applied to the set of alignment vectors ⁇ 12, 12, 12, 10, 10, 10, 12 ⁇ .
- the sign change filter generally attempts to determine when frames (or, for example, separations or edge maps related to the frames), or portions thereof, in a sequence are being transformed in alternating directions. Accordingly, the sign change filter may be extended to other transformations by associating a sign change with an appropriate indicator of the direction of the transformation.
- the indicator may include, for example, a shift indicator, a rotation indicator, and a scaling indicator.
- other types of filters and combinations may be used.
- a median filter and average filter may be applied to fixed blocks
- a mode filter may be applied to a moving window
- a mode filter may include a threshold requiring, for example, that the percentage of frames in a block that have the "most common" value be at least a threshold percentage
- a spline filter may be applied
- filters may look forward, backward, or both.
- the new alignment vectors that result from a smoothing filter may be subject to additional constraints in an effort to maintain the misregistration within a particular frame (or the cumulative misregistration within a set of frames) at or below a negligible or unperceivable level, if possible.
- a distortion constraint may be imposed in conjunction with or subsequent to a smoothing operation.
- the sign change filter discussed above uses a threshold distortion constraint of distortion_max, and if the distortion resulting from a proposed smoothing operation exceeds distortion nax, then the smoothing operation is not performed.
- Distortion constraints may be, for example, fixed or variable and may be based on various distortion metrics, such as, for example, spatial distance and/or mismatched edge pixels.
- a frame may be divided into blocks, as discussed earlier, and the new alignment vector for a particular block may depend on the alignment vectors and new alignment vectors for neighboring blocks within the same frame.
- implementations may determine new alignment vectors for, for example, one or more color components of a frame.
- a transformation for a reference separation also may be determined and used to provide increased temporal stability.
- a composite frame 900 includes a first block 910 and a second block 920.
- Block 920 includes three parallel lines: a top green line Gl, a bottom green line G2, and a red line R that is equidistant between line Gl and line G2 at a distance "dl" from both lines Gl and G2. Lines Gl and G2 are isolated in the green separation and line R is isolated in the red separation. Considering block 920 alone, it is not clear whether line R corresponds to line Gl or to line G2.
- the optimal alignment vector for a neighboring block within the current frame, V_n is used as an initial candidate alignment vector for the current frame, and candidate alignment vectors, V_i(cand), having progressively larger distances from the initial candidate vector are considered.
- candidate vectors are considered, the implementation keeps track of which candidate vector, V_i(opt), has produced the lowest distortion, D_i(opt).
- Total distortion is the sum of the normal distortion, calculated according to the distortion metric being used (for example, a one-sided mismatch accumulator), and
- DextraJL The above distortion algorithm can be applied to frame 900. Assume that the registration process has been applied to block 910 to produce an optimal alignment vector for block 910 of dl, and that the registration process is now being applied to neighboring block 920.
- V_i(opt) is initially set to V_n, which is dl, and which produces a minimum distortion, D_i(opt). Another vector that presumably produces the minimum distortion is -dl .
- Dextra_i is considered as a candidate Attorney Docket: 16406-002WO1 alignment vector, Dextra_i then is computed for the candidate alignment vector of- dl. Assuming that D_i(opt) is not zero and that 2*dl > Cl, then Dextra_i will be positive.
- the candidate alignment vector of-dl will be discounted by the additional distortion of Dextra_i and will not be selected as V_i(opt).
- candidate alignment vectors are not discounted if those vectors are near the currently-optimal vector or near the neighboring block's optimal vector.
- this algorithm may provide advantages even if the minimum distortion is not achieved. Additionally, this algorithm may be adapted for use with systems that do not necessarily select the optimal vector or the mirtimum distortion. Referring to FIG.
- a graph 1000 shows a two-dimensional example illustrating the distances used in the above equation for Dextraj V_i(opt) initially is set to V_n, and graph 1000 shows the scenario after other candidate vectors have been considered and a new V_i(opt) has been determined.
- the new V_i(opt) and V_n are separated by a distance d2, and the distance d2 defines a circle 1010 around V_n. Any future candidate alignment vectors that are within circle 1010, and that are considered before another V_i(opt) is determined, will not be discounted.
- a circle 1020 is defined around V_i(opt) having a radius of Cl, and any future candidate alignment vectors that are within circle 1020, and that are considered before another V_i(opt) is determined, also will not be discounted. Both circles 1010 and 1020 may change if a new candidate alignment vector becomes V_i(opt), with circle 1010 possibly changing in radius and circle 1020 changing its center point. Note that an alternate solution is to discount the proposed alignment vector if the alignment vector is too far away from the best alignment vector determined for the corresponding block within the previous frame. Such a solution may be considered to be an extension of the temporal implementations described previously.
- the various distortion metrics discussed herein may be applied to transformations other than alignment vectors.
- distortion metrics relying on spatial distance may be generalized as relying on the magnitude of the transformation. Accordingly, distance metrics may be applied to other transformations by replacing the distance with the magnitude of some characteristic of Attorney Docket: 16406-002WO1 the transformation, such as, for example, the magnitude of a rotation or a scaling in an affine fransformation.
- a process 1100 shows that the implementations and techniques described herein also may be applied to composite color images, that is, images that have more than one color component.
- the color composite image may be accessed (1110) and processed (1120) to separate out various components, and one or more of the registration implementations and techniques described herein may be applied to the components (1130).
- the individual components that are separated out from the color composite image may relate, for example, to a single color, to multiple colors, or to luminance (brightness).
- Composite color images may have been formed by, for example, combining film separations to produce a composite RGB or YCM image.
- the composite RGB or YCM image may be processed to separate the color components and then a registration algorithm may be applied to the separated color components, and the registered components may be combined again to form a new pomposite color image.
- film separations may have been combined through some other mechanism (such as a photographic process currently performed in film labs by movie studios) and the combination may have been subsequently converted to some other format, such as, for example, video.
- the video may use, for example, a YUV format in which Y represents the luminance, and U and V represent the chrominance (color).
- video in YUV format is input, the YUV input is converted into separated R, G, and B color components using a mathematical conversion, and the separated RGB color components are registered using one of the registration algorithms described herein.
- the registered RGB components may be, for example, combined into a composite RGB image, or converted back to YUV so that the YUV components may be combined into a composite video image.
- the ability to perform registration on a composite color image may be beneficial, for example, when the original reels of film corresponding to a movie property are no longer available. For example, the movie property may only be available in another format, such as a production video master.
- the ability to perform Attorney Docket: 16406-002WO 1 such registration also may be beneficial when the cost associated with restoring the film from its original YCM separations is too high.
- the original YCM separations may have been extensively processed to produce a production video master, such as, for example, by performing color correction and other processes, including restorative processes.
- the improvements from the processing already performed may be retained.
- the disclosed registration algorithms are able to use input that has been preprocessed and/or converted from different formats, many types of media may be processed with the disclosed registration algorithms.
- the implementations and techniques described herein also may be applied to successive YCM, in which the Y, C, and M components are printed successively for each frame on the same reel of black and white film.
- Misregistration among these components may occur, for example, for one of the reasons discussed earlier (e.g., film shrinkage).
- the resulting component images, and a composite image based on the component images may be stored on a DVD or other storage device.
- the registered component image(s) and the composite image are each said to have been transformed or registered.
- a composite image also may be based on a transformation of the component images, as may happen, for example, if (i) a set of component images in an RGB format are registered, (ii) the registered RGB component images are converted into another format, such as YUV, and (iii) the YUV component images are formed into a composite image.
- the YUV composite image, and the YUV component images also are said to have been transformed or registered, albeit indirectly, by being based on the registered RGB component images.
- the component images may be separate from each other or, for example, interleaved with each other. Interleaving may be performed, for example, for each pixel, or for a block of pixels.
- the component images may be encoded using, for example, an MPEG-2 standard, and the encoded component images may be stored or transmitted.
- the encoded data may be decoded to generate the component images, or varied instances of the component images due to lossy Attorney Docket: 16406-002WO1 compression, and the component images may be combined to form a composite image.
- the generated component images even if varied due to losses in compression, are still considered to have been fransformed to reduce misregistration, even though the generated component images have had some additional compression-related distortion introduced
- the repeated algoritlim may be applied to the aligned/transformed image or to the original image in an effort, for example, to provide a better-registered resulting image.
- the high intensity selection procedure may be performed based on the aligned/transformed image to determine if new bright edge result from the alignment/transformation.
- Spectral separations are used, for example, in: (1) color film applications capturing, for example, different color frequencies, (2) astronomical applications capturing, for example, radio frequencies and/or optical frequencies, and (3) medical applications capturing, for example, different magnetic (MRI), X-ray, and sound (ultrasound) frequencies.
- spectral separations may be captured from various frequency Attorney Docket: 16406-002WO1 sources, including, for example, electromagnetic and sound waves.
- Non-spectral separations may be obtained from, for example, variations in pressure, temperature, energy, or power.
- implementations and features described may be implemented in a process, a device, a combination of devices employing a process, or in a computer readable medium or storage device (for example, a floppy disk, a hard disk, RAM, ROM, firmware, electromagnetic waves encoding or transmitting instructions, or some combination) embodying instructions for such a process.
- a computer including a programmable device (for example, a processor, programmable logic device, application specific integrated circuit, controller chip, ROM, or RAM) with appropriate programmed instructions and, if needed, a storage device (for example, an external or internal hard disk, a floppy disk, a CD, a DVD, a cassette, a tape, ROM, or RAM).
- the computer may include, for example, one or more general-purpose computers (for example, personal computers), one or more special-purpose computers (for example, devices specifically programmed to communicate with each other), or some combination.
- the implementations described, or variations of the implementations may produce a modified image or series of images, such as, for example, an entire movie that has been modified to reduce one or more distortions, such as, for example, misregistration.
- the modified image(s) may be stored, permanently or temporarily, on a storage device, computer readable medium, or programmable device.
- Storage devices may be, for example, analog devices, digital devices, or a combination of analog and digital.
- Storage devices may include, for example, a reel, a VHS tape, a video, an optical disc (including, for example, a DVD), and the other examples of storage devices listed earlier.
- the use of headers and sections on this document is intended for ease of referencing the disclosure contained in the document, and is not intended to limit the applicability or relevance of material discussed in any given section. To the contrary, the material discussed in each section may be relevant to discussions in other sections, Attorney Docket: 16406-002WO 1 and the use of headers and sections does not restrict the application of the material to the other sections. In particular, material from one section may be combined with the material from any and all of the other sections. Accordingly, other implementations are within the scope of the following claims.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Signal Processing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP04809671.3A EP1665768B1 (en) | 2003-09-05 | 2004-09-03 | Registration of separations |
MXPA06002505A MXPA06002505A (en) | 2003-09-05 | 2004-09-03 | Registration of separations. |
AU2004305781A AU2004305781B2 (en) | 2003-09-05 | 2004-09-03 | Registration of separations |
CA2537533A CA2537533C (en) | 2003-09-05 | 2004-09-03 | Registration of separations |
HK06113439.6A HK1094827A1 (en) | 2003-09-05 | 2006-12-06 | Registration of separations |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/035,337 US7092584B2 (en) | 2002-01-04 | 2002-01-04 | Registration of separations |
US50037103P | 2003-09-05 | 2003-09-05 | |
US60/500,371 | 2003-09-05 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2005029835A2 true WO2005029835A2 (en) | 2005-03-31 |
WO2005029835A3 WO2005029835A3 (en) | 2006-01-12 |
Family
ID=34594140
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2004/028782 WO2005029835A2 (en) | 2002-01-04 | 2004-09-03 | Registration of separations |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2005029835A2 (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030128280A1 (en) | 2002-01-04 | 2003-07-10 | Perlmutter Keren O. | Registration of separations |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69331874T2 (en) * | 1992-12-28 | 2002-11-14 | Canon K.K., Tokio/Tokyo | One-chip integrated circuit for use in a controlling focusing means |
JPH10178611A (en) * | 1996-12-17 | 1998-06-30 | Canon Inc | Electronic camera and recording controller |
-
2004
- 2004-09-03 WO PCT/US2004/028782 patent/WO2005029835A2/en active Application Filing
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030128280A1 (en) | 2002-01-04 | 2003-07-10 | Perlmutter Keren O. | Registration of separations |
Non-Patent Citations (1)
Title |
---|
SCHALLAUER, AUTOMATIC RESTORATION ALGORITHMS FOR 35MM FILM |
Also Published As
Publication number | Publication date |
---|---|
WO2005029835A3 (en) | 2006-01-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7280707B2 (en) | Registration of separations | |
US9773314B2 (en) | Registration of separations | |
US9619897B2 (en) | Correction of blotches in component images | |
CA2537533C (en) | Registration of separations | |
WO2005029835A2 (en) | Registration of separations | |
JP4709649B2 (en) | Reduction of resolution difference of separated colors |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BW BY BZ CA CH CN CO CR CU CZ DK DM DZ EC EE EG ES FI GB GD GE GM HR HU ID IL IN IS JP KE KG KP KZ LC LK LR LS LT LU LV MA MD MK MN MW MX MZ NA NI NO NZ PG PH PL PT RO RU SC SD SE SG SK SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): BW GH GM KE LS MW MZ NA SD SZ TZ UG ZM ZW AM AZ BY KG MD RU TJ TM AT BE BG CH CY DE DK EE ES FI FR GB GR HU IE IT MC NL PL PT RO SE SI SK TR BF CF CG CI CM GA GN GQ GW ML MR SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2537533 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: PA/a/2006/002505 Country of ref document: MX |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2004305781 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2004809671 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2004305781 Country of ref document: AU Date of ref document: 20040903 Kind code of ref document: A |
|
WWP | Wipo information: published in national office |
Ref document number: 2004305781 Country of ref document: AU |
|
WWP | Wipo information: published in national office |
Ref document number: 2004809671 Country of ref document: EP |