US9734610B2 - Image processing device, image processing method, and image processing program - Google Patents
Image processing device, image processing method, and image processing program Download PDFInfo
- Publication number
- US9734610B2 US9734610B2 US14/766,166 US201314766166A US9734610B2 US 9734610 B2 US9734610 B2 US 9734610B2 US 201314766166 A US201314766166 A US 201314766166A US 9734610 B2 US9734610 B2 US 9734610B2
- Authority
- US
- United States
- Prior art keywords
- text
- color
- region
- translucent
- background
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
- 238000012545 processing Methods 0.000 title claims abstract description 92
- 238000003672 processing method Methods 0.000 title claims description 6
- 238000000034 method Methods 0.000 claims description 17
- 230000006870 function Effects 0.000 claims description 9
- 238000004590 computer program Methods 0.000 claims 1
- 239000003086 colorant Substances 0.000 description 11
- 238000013459 approach Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000012706 support-vector machine Methods 0.000 description 2
- 101001109518 Homo sapiens N-acetylneuraminate lyase Proteins 0.000 description 1
- 102100022686 N-acetylneuraminate lyase Human genes 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/60—Editing figures and text; Combining figures or text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/0007—Image acquisition
-
- G06T5/005—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/50—Image enhancement or restoration using two or more images, e.g. averaging or subtraction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/77—Retouching; Inpainting; Scratch removal
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20212—Image combination
- G06T2207/20221—Image fusion; Image merging
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20212—Image combination
- G06T2207/20224—Image subtraction
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09G—ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
- G09G2340/00—Aspects of display data processing
- G09G2340/12—Overlay of images, i.e. displayed pixel being the result of switching between the corresponding input pixels
Definitions
- One aspect of the present invention relates to an image processing device, an image processing method, and an image processing program.
- Non Patent Literature 1 There is a technique that removes text from an image and complements that region by using the pixels of the surrounding background region (see Non Patent Literature 1 below). By using such a technique, it is possible to naturally restore the background hidden behind the text.
- Non Patent Literature 1 When the technique of the above-described Non Patent Literature 1 is applied to an image in which a text region has transparency (translucent), the restoration of the background can be incomplete or insufficient.
- information of the background covered by text is predicted using information of the surrounding region.
- a part of the background covered by translucent text is different from the surrounding region in design, that part is not restored to its original state, and a difference from the original background becomes significant.
- An image processing device for processing an image where translucent text is superimposed on an original image, including an acquisition unit configured to acquire an edge between a text region where the translucent text is superimposed and a background region where the translucent text is not superimposed, a pair acquisition unit configured to acquire a plurality of pairs of a representative color in the text region and a representative color in the background region adjacent to each other with the edge placed therebetween, a first specifying unit configured to specify a text color and transparency of the translucent text from the plurality of pairs, and a second specifying unit configured to specify a pixel color before the translucent text is superimposed for each pixel in the text region from a pixel color after the translucent text is superimposed, the text color and the transparency.
- An image processing method is an image processing method performed by an image processing device for processing an image where translucent text is superimposed on an original image, the method including an acquisition step of acquiring an edge between a text region where the translucent text is superimposed and a background region where the translucent text is not superimposed, a pair acquisition step of acquiring a plurality of pairs of a representative color in the text region and a representative color in the background region adjacent to each other with the edge placed therebetween, a first specifying step of specifying a text color and transparency of the translucent text from the plurality of pairs, and a second specifying step of specifying a pixel color before the translucent text is superimposed for each pixel in the text region from a pixel color after the translucent text is superimposed, the text color and the transparency.
- An image processing program is an image processing program causing a computer to function as an image processing device for processing an image where translucent text is superimposed on an original image, the program causing the computer to implement an acquisition unit configured to acquire an edge between a text region where the translucent text is superimposed and a background region where the translucent text is not superimposed, a pair acquisition unit configured to acquire a plurality of pairs of a representative color in the text region and a representative color in the background region adjacent to each other with the edge placed therebetween, a first specifying unit configured to specify a text color and transparency of the translucent text from the plurality of pairs, and a second specifying unit configured to specify a pixel color before the translucent text is superimposed for each pixel in the text region from a pixel color after the translucent text is superimposed, the text color and the transparency.
- a computer-readable recording medium is a computer-readable recording medium that stores an image processing program causing a computer to function as an image processing device for processing an image where translucent text is superimposed on an original image, the program causing the computer to implement an acquisition unit configured to acquire an edge between a text region where the translucent text is superimposed and a background region where the translucent text is not superimposed, a pair acquisition unit configured to acquire a plurality of pairs of a representative color in the text region and a representative color in the background region adjacent to each other with the edge placed therebetween, a first specifying unit configured to specify a text color and transparency of the translucent text from the plurality of pairs, and a second specifying unit configured to specify a pixel color before the translucent text is superimposed for each pixel in the text region from a pixel color after the translucent text is superimposed, the text color and the transparency.
- the text color and the transparency of translucent text itself are obtained from a plurality of pairs of the representative color of the text region and the representative color of the background region. Because the pixel color of the text region is the mixture of the pixel color before text is superimposed (original background color) and the color of the text itself, if the color of the translucent text itself is determined, the pixel color of the text region can be restored to the original background color. In this manner, by obtaining the color of the translucent text itself and then specifying the pixel color before superimposition for each pixel in the text region, it is possible to more reliably restore the original background on a pixel-by-pixel basis.
- a superimposition color of each pixel in the text region where the translucent text is superimposed may be determined by a relational expression representing a relationship between the superimposition color, a color of a corresponding pixel in the original image, and the text color and the transparency of the translucent text
- the pair acquisition unit may specify a first pair of a first superimposition color in the text region and a first background color in the background region adjacent to each other with the edge placed therebetween and a second pair of a second superimposition color in the text region and a second background color in the background region adjacent to each other with the edge placed therebetween, the second superimposition color being different from the first superimposition color and the second background color being different from the first background color
- the first specifying unit may specify the text color and the transparency of the translucent text based on a first equation where the first pair is substituted into the relational expression and a second equation where the second pair is substituted into the relational expression
- the second specifying unit may specify a pixel value in the
- the first specifying unit may acquire a pair of a cluster of the text region and a cluster of the background region when the cluster of the text region is placed between the clusters of the two background regions with the same representative color.
- the image processing device may further include an inverse transform unit configured to obtain a range of pixel values in a small region centering on an edge of the translucent text on which anti-aliasing has been performed, and binarizes each pixel value in the small region to one of a first reference value and a second reference value.
- an inverse transform unit configured to obtain a range of pixel values in a small region centering on an edge of the translucent text on which anti-aliasing has been performed, and binarizes each pixel value in the small region to one of a first reference value and a second reference value.
- the first reference value and the second reference value may be end points of the range of pixel values.
- the first specifying unit may acquire a rectangular region including the background region and the text region where the translucent text on which anti-aliasing has been performed is located, and acquires the representative color of each of the text region and the background region adjacent to each other with the edge in a direction along an outer edge of the rectangle region placed therebetween.
- the first specifying unit may acquire statistics of pixel values of each cluster as a representative color.
- the acquisition unit may determine that text is translucent when pixel values of a text part are dispersed.
- the acquisition unit may determine that text is translucent when there is an edge continuous from the background region to the text region.
- FIG. 1 is a view showing the concept of restoration processing according to an embodiment.
- FIG. 2 is a view showing the concept of restoration processing according to the embodiment.
- FIG. 3 is a view showing a hardware configuration of an image processing device according to the embodiment.
- FIG. 4 is a block diagram showing a functional configuration of the image processing device according to the embodiment.
- FIG. 5 is a flowchart showing an operation of the image processing device according to the embodiment.
- FIG. 6 is a view showing a configuration of an image processing program according to the embodiment.
- FIG. 7 is a view illustrating processing using Histogram of Oriented Gradient (HOG).
- FIG. 8 is a view showing an example of anti-aliasing.
- the image processing device 10 is a computer system that removes translucent text superimposed on an original image and restores the background hidden by the translucent text.
- text in this specification is the concept including a mark, a figure or the like of a certain shape.
- translucent text in this specification is text having transparency. The transparency is defined as 0 to 1. When the transparency is 0, the text is solid, and when the transparency is 1, the text is completely transparent. Accordingly, in this specification, the transparency of translucent text is more than 0 and less than 1. A specific value of transparency is not particularly limited as long as it is within the above range.
- the image processing device 10 receives input of a text image generated by superimposing translucent text on an original image.
- the image processing device 10 acquires a rectangular region circumscribing the text (circumscribing rectangle) from the text image and specifies the boundary (edge) between the background region and the text region in each circumscribing rectangle.
- the image processing device 10 specifies a region to be processed (which is referred to hereinafter as “target region”), where the background is to be restored.
- the target region includes a background region in which translucent text is not superimposed and a text region in which translucent text is superimposed, and it is composed of one or a plurality of circumscribing rectangles.
- the target region L shown in FIG. 1 is one circumscribing rectangle where the translucent text K is superimposed on the background J. In this embodiment, it is assumed that the color and transparency of translucent text itself are uniform.
- the image processing device 10 performs color clustering on both of the background region and the text region, and acquires a plurality of pairs of the representative color of the background region and the representative color (representative superimposition color) of the text region which are adjacent to each other with the edge placed therebetween in the target region L.
- the image processing device 10 acquires a first pair of the representative color O 1 of the background region and the representative color I 1 of the text region and a second pair of the representative color O 2 of the background region and the representative color I 2 of the text region.
- the image processing device 10 specifies the text color T and the transparency r of the translucent text itself from the two pairs.
- the image processing device 10 specifies and restores the original color of each pixel in the region covered by the translucent text from the color of the translucent text itself and the color of the text region.
- the background J is restored. Because the restoration is performed for each pixel, the circles Jp and Jq are also restored that have been completely hidden by the translucent text K.
- the variable O is the color of the background itself covered by the translucent text, and it is the color restored by the image processing device 10 .
- the variables T and r are the text color and the transparency of the translucent text itself as described above.
- the variable I is the color of a pixel in the text region after the translucent text is superimposed (the apparent color in the text image).
- the variable I is, in other words, the mixture of the color of the background itself and the color of the translucent text itself.
- the target region is one circumscribing rectangle in FIG. 1
- the target region may include a plurality of circumscribing rectangles as described above. Note that, however, in the case of acquiring a pair of representative colors from the two circumscribing rectangles, the assumption is required that the color settings of two text letters are the same.
- FIG. 2 shows an example where the target region L is composed of two circumscribing rectangles La and Lb.
- the circumscribing rectangle La is a region where the translucent text Ka is superimposed on the background Ja
- the circumscribing rectangle Lb is a region where the translucent text Kb is superimposed on the background Jb.
- the image processing device 10 acquires one pair from each of the two circumscribing rectangles obtained from adjacent two text letters.
- the image processing device 10 acquires a pair of the representative color O 1 of the background region and the representative color I 1 of the text region from the circumscribing rectangle La, and acquires a pair of the representative color O 2 of the background region and the representative color I 2 of the text region from the circumscribing rectangle Lb.
- the image processing device 10 specifies the text color T and the transparency r of the translucent text itself from the two pairs, and restores the original color of each pixel in the text region from the color of the translucent text itself and the color of the text region.
- this processing is based on the assumption that the basic color and the transparency of a plurality of text letters to be processed are the same.
- a target region is not limited to a rectangle, and a target region may be in any shape as long as it contains a text region.
- FIG. 3 shows a hardware configuration of the image processing device 10 .
- the image processing device 10 includes a CPU 101 that executes an operating system, an application program and the like, a main storage unit 102 such as ROM and RAM, an auxiliary storage unit 103 such as a hard disk or a flash memory, a communication control unit 104 such as a network card or a wireless communication module, an input device 105 such as a keyboard and a mouse, and an output device 106 such as a display.
- a CPU 101 that executes an operating system, an application program and the like
- main storage unit 102 such as ROM and RAM
- an auxiliary storage unit 103 such as a hard disk or a flash memory
- a communication control unit 104 such as a network card or a wireless communication module
- an input device 105 such as a keyboard and a mouse
- an output device 106 such as a display.
- the functional elements of the image processing device 10 which are described later, are implemented by loading given software onto the CPU 101 or the main storage unit 102 , making the communication control unit 104 , the input device 105 , the output device 106 and the like operate under control of the CPU 101 , and performing reading and writing of data in the main storage unit 102 or the auxiliary storage unit 103 .
- the data and databases required for processing are stored in the main storage unit 102 or the auxiliary storage unit 103 .
- the image processing device 10 may be composed of one computer or may be composed of a plurality of computers.
- the image processing device 10 includes, as functional elements, an acquisition unit 11 , a text color specifying unit (pair acquisition unit, first specifying unit) 12 , and a restoration unit (second specifying unit) 13 .
- the acquisition unit 11 is a functional element that acquires the edge between a background region and a text region.
- the acquisition unit 11 can acquire information of the edge of each target region by using an arbitrary method.
- the acquisition unit 11 may acquire image data to which information of a target region and an edge are added from a device (not shown) other than the image processing device 10 .
- the acquisition unit 11 may perform color processing to specify a target region in image data, and further detect an edge in each target region by using the existing technique such as the Canny method.
- color clustering is one type of region-based approach. This technique reduces the number of colors by using the k-means method for clustering and, after clustering, divides a region for each color cluster based on topology analysis.
- classifier-based approach such as support vector machine (SVM).
- SVM support vector machine
- a target region may be extracted by performing postprocessing by classifier-based approach after using region-based approach.
- the acquisition unit 11 outputs information of the edge of each target region to the text color specifying unit 12 .
- the text color specifying unit 12 is a functional element that specifies the text color and the transparency of translucent text.
- the “text color of translucent text” is not the apparent color but the color (basic color) that serves as a basis for the apparent color. Because the translucent text part in an image is mixed with the background, it is necessary to know the color of the translucent text itself.
- the text color specifying unit 12 specifies the color for each target region in the following way.
- the text color specifying unit 12 acquires two pairs of the representative color of the text region and the representative color of the background region which are adjacent to each other with the edge placed therebetween as shown in FIGS. 1 and 2 .
- the reason of acquiring the representative color of each region, not the color of each pixel, is to facilitate calculation.
- the text color specifying unit 12 divides each of the text region and the background region into a plurality of clusters by using color clustering and obtains the representative color of each cluster.
- the way of determining the representative color is not limited, and the text color specifying unit 12 may set the statistics (average, median etc.) of pixel colors in a cluster as the representative color, for example. Alternatively, the text color specifying unit 12 may use the representative color of each cluster which is input by a user.
- the text color specifying unit 12 specifies the cluster of the text region and the cluster of the background region which are adjacent to each other with the edge placed therebetween as a pair, and obtains the representative color of each cluster in the pair. At this time, the text color specifying unit 12 acquires two pairs.
- the text color specifying unit 12 acquires a pair of the representative color I 1 in the text region and the representative color O 1 in the background region and a pair of the representative color I 2 in the text region and the representative color O 2 in the background region.
- the text color specifying unit 12 solves the simultaneous equations (2) and thereby obtains the text color T and the transparency r of the translucent text itself.
- the values T and r are obtained by the following equations (3) and (4).
- r ( I 1 ⁇ I 2 )/( O 1 ⁇ O 2 ) (4)
- the text color specifying unit 12 After obtaining the values T and r for each target region, the text color specifying unit 12 outputs information of the target region to which the values T and r for the translucent text have been added to the restoration unit 13 .
- the restoration unit 13 is a functional element that restores the original color of the background covered by translucent text.
- the restoration unit 13 performs the following processing on each target region.
- the restoration unit 13 obtains the original color O org of each of the pixels p 1 to p N by using the above equation (1).
- the restoration unit 13 applies the color (i.e. the current pixel color) I c of the pixel p i after the translucent text is superimposed and the text color T and the transparency r of the translucent text input from the text color specifying unit 12 to the equation (1) and thereby obtains the color O org of the pixel p i before the translucent text is superimposed (the following equation (5)). Then, the restoration unit 13 changes the color of the pixel p i from I c to O org .
- the restoration unit 13 changes the pixel color of all of the pixels p 1 to p N by using the above equation (5) and thereby removes the translucent text and restores the original background color.
- the restoration unit 13 After performing the above processing for all target regions, the restoration unit 13 outputs data of the image in which the background is restored.
- the way of output is not limited, and the restoration unit 13 may store the restored image data into an arbitrary storage device (not shown) or display the restored image on a monitor or print it out.
- the acquisition unit 11 acquires one target region from a text image (Step S 11 , acquisition step), and acquires information of an edge in the target region (Step S 12 , acquisition step).
- the text color specifying unit 12 acquires two pairs of the representative color in the text region and the representative color in the background region which are adjacent to each other with the edge placed therebetween (Step S 13 , pair acquisition step).
- the text color specifying unit 12 substitutes information of the two pairs (two representative colors in the text region and two representative colors in the background region) to the above simultaneous equations (2) and thereby obtains the text color and the transparency of the translucent text (Step S 14 , first specifying step).
- the restoration unit 13 applies the text color and the transparency of the translucent text and the current pixel color to the above equation (1) for one pixel in the text region and thereby specifies the original pixel color (Step S 15 , second specifying step). After that, the restoration unit 13 replaces the current pixel color with the specified color and thereby restores the original color of the pixel (Step S 16 ). The restoration unit 13 performs this processing for all pixels in the text region (see Step S 17 ).
- Steps S 11 to S 17 the processing of Steps S 11 to S 17 is performed for each of the target regions and, in the end, image data in which the translucent text is removed and the background is restored is output.
- An image processing program P for implementing the image processing device 10 is described hereinafter with reference to FIG. 6 .
- the image processing program P includes a main module P 10 , an acquisition module P 11 , a text color specifying module P 12 , and a restoration module P 13 .
- the main module P 10 is a part that exercises control over the image processing function.
- the functions implemented by executing the acquisition module P 11 , the text color specifying module P 12 and the restoration module P 13 are equal to the functions of the acquisition unit 11 , the text color specifying unit 12 , and the restoration unit 13 described above, respectively.
- the image processing program P may be provided in the form of being recorded in a static manner on a tangible recording medium such as CD-ROM or DVD-ROM or semiconductor memory, for example. Further, the image processing program P may be provided as a data signal superimposed onto a carrier wave through a communication network.
- the text color and the transparency of translucent text itself are obtained from a plurality of pairs of the representative color of the text region and the representative color of the background region. Because the pixel color of the text region is the mixture of the pixel color before text is superimposed (original background color) and the color of the text itself, if the color of the translucent text itself is determined, the pixel color of the text region can be restored to the original background color. In this manner, by obtaining the color of the translucent text itself and then specifying the pixel color before superimposition for each pixel in the text region, it is possible to more reliably restore the original background on a pixel-by-pixel basis.
- the original background can be restored on a pixel-by-pixel basis, it is possible to restore a part that is completely covered by translucent text and not continuous with the exposed background region, such as the regions Jp and Jq in FIG. 1 , to the original color or the color close to that color.
- the text color specifying unit 12 may use Histogram of Oriented Gradient (HOG) to obtain the representative color, in consideration of the case where the boundary in the background and the edge of text happen to overlap.
- HOG is a technique that utilizes the characteristics that, for a pair of edge pixels that form a text width, the slope of the edge of the both is the same but the orientation of each edge is opposite to each other.
- the text color specifying unit 12 searches for a part where the cluster of a text region is placed between the clusters of two background regions with the same representative color by using HOG, and uses the representative colors of the clusters that meet this condition as a pair.
- the cluster of the text region with the representative color I 1 is placed between the clusters of the two background regions with the representative color O 1 .
- the text color specifying unit 12 generates a pair of the two representative colors I 1 and O 1 .
- the cluster of the text region with the representative color I 1 is not placed between the clusters of the two background regions with the representative color O 2 and not placed between the clusters of the two background regions with the representative color O 3 .
- the text color specifying unit 12 does not generate a pair including the representative color I 2 . In this case, the text color specifying unit 12 needs to obtain a pair of representative colors from another circumscribing rectangle.
- anti-aliasing When translucent text is inserted, anti-aliasing that smoothes the edge of text is performed in some cases. Pixels are arranged in the horizontal direction (x-direction) and the vertical direction (y-direction), and anti-aliasing is performed for the outline (edge) of text that runs obliquely to the lines of pixels. As a result of anti-aliasing, the pixels have a color between the background color and the text color.
- the above equation (1) cannot be applied to the pixels on which anti-aliasing has been performed. For example, in the example of FIG. 8 , this processing has been performed in the outline of substantially the right half of the text “R”, and if the equation (1) is applied to this part, the accuracy of restoration decreases.
- the image processing device 10 may exclude or remove the anti-alias by the following technique.
- the image processing device 10 may remove the anti-alias by changing the pixels on which anti-aliasing has been performed to either one of the color of a pixel on which it is not performed in the text region or the color of a pixel in the background region adjacent to the edge.
- the image processing device 10 acquires a small region of W ⁇ W pixels centering on the pixel determined as an edge (edge pixel).
- W a specific value of W is not limited, the value needs to be set to at least include the part where anti-aliasing has been applied.
- the image processing device 10 plots the pixels in this region on a three-dimensional coordinate system indicating the RGB color space and obtains the approximate straight line that fits those pixels.
- the image processing device 10 can obtain the approximate straight line by using the least squares method, RANSAC (RANdom SAmple Consensus), LMedS (Least MEDian Square) and the like.
- the image processing device 10 projects the points indicating pixels on the straight line and specifies two pixels corresponding to the two projected points (end points) located at both ends on the straight line as reference pixels.
- the image processing device 10 then changes the color of the remaining pixels other than the reference pixels to either one of the colors of the two reference pixels (the first reference value and the second reference value). Specifically, when one remaining pixel is p r the image processing device 10 selects the color of the reference pixel at the shorter distance from the pixel p r as the color of the pixel p r .
- the distance between the pixel p r and the reference pixel may be the actual distance between the points plotted on the three-dimensional coordinate system or the distance between the projected points on the approximate straight line.
- the image processing device removes the anti-alias by transforming the region around the edge to binary values.
- This removal processing may be performed at any time before the restoration unit 13 carries out the restoration processing.
- the removal processing may be performed before the text color specifying unit 12 selects a pair of representative colors.
- the removal processing may be performed by the text color specifying unit 12 or the restoration unit 13 .
- the text color specifying unit 12 or the restoration unit 13 functions also as an inverse transform unit.
- the text color specifying unit 12 may remove the cluster containing the pixel on which anti-aliasing has been performed when selecting the representative color of the cluster in the text region. Specifically, the text color specifying unit 12 may acquire the representative color of each of the text region and the background region adjacent to each other with the edge placed therebetween, which extends in the direction (the x-direction or y-direction) along the outer edge of the circumscribing rectangle, and generate a pair of those representative colors.
- the text color specifying unit 12 when the text color specifying unit 12 selects the representative color of a cluster in the text region, it may select the median or average of the pixel values contained the cluster as the representative color. Alternatively, the text color specifying unit 12 may calculate the histogram of color and select the color indicating the highest value (the color with the highest frequency of appearance) as the representative color. Because there are only few pixels on which anti-aliasing has been performed and they are exceptional, it is possible to reduce the effect of anti-aliasing by using the statistics (for example, the median, the average or the highest frequency of appearance) of pixel values in this manner.
- the statistics for example, the median, the average or the highest frequency of appearance
- the image processing device 10 may have the function of determining whether text is translucent or solid.
- the image processing device 10 (for example, the acquisition unit 11 ) may determine that it is translucent text when the pixel values in the text region are dispersed and determine that it is solid text when the pixel values are not dispersed.
- the image processing device 10 (for example, the acquisition unit 11 ) may determine that it is translucent text when there is an edge that is continuous from the background region to the text region.
Landscapes
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Image Processing (AREA)
- Editing Of Facsimile Originals (AREA)
- Image Analysis (AREA)
Abstract
Description
- NPL1: He, Liu, Michael Bleyer, and Margrit Gelautz. “Object Removal by Depth-guided Inpainting.” Austrian Association for Pattern Recognition, 2011.
I=Or+(1−r)T (1)
where the variable O is the color of the background itself covered by the translucent text, and it is the color restored by the
O 1 r+(1−r)T=I 1
O 2 r+(1−r)T=I 2 (2)
T=(I 1 −O 1 r)/(1−r)=(I 2 −O 2 r)/(1−r) (3)
r=(I 1 −I 2)/(O 1 −O 2) (4)
I c =O org r+(1−r)T
Accordingly, O org ={I c−(1−r)T}/r (5)
Claims (11)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2013/057075 WO2014141412A1 (en) | 2013-03-13 | 2013-03-13 | Image processing device, image processing method, and image processing program |
Publications (2)
Publication Number | Publication Date |
---|---|
US20160005200A1 US20160005200A1 (en) | 2016-01-07 |
US9734610B2 true US9734610B2 (en) | 2017-08-15 |
Family
ID=49595891
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/766,166 Active 2033-04-24 US9734610B2 (en) | 2013-03-13 | 2013-03-13 | Image processing device, image processing method, and image processing program |
Country Status (3)
Country | Link |
---|---|
US (1) | US9734610B2 (en) |
JP (1) | JP5324017B1 (en) |
WO (1) | WO2014141412A1 (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2016058879A (en) * | 2014-09-09 | 2016-04-21 | キヤノン株式会社 | Image processing apparatus and image processing method |
CN106937103B (en) * | 2015-12-31 | 2018-11-30 | 深圳超多维科技有限公司 | A kind of image processing method and device |
CN109300169B (en) * | 2018-09-06 | 2023-04-07 | 华东师范大学 | Semitransparent image color migration method based on linear transformation |
CN112053419B (en) * | 2020-09-21 | 2023-09-22 | 长沙景嘉微电子股份有限公司 | Image superposition processing method and device, storage medium and electronic device |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5615282A (en) * | 1990-02-05 | 1997-03-25 | Scitex Corporation Ltd. | Apparatus and techniques for processing of data such as color images |
US5781225A (en) * | 1992-05-19 | 1998-07-14 | Eastman Kodak Company | Method and apparatus for improving electronic recording of depth images |
US6134345A (en) * | 1998-08-28 | 2000-10-17 | Ultimatte Corporation | Comprehensive method for removing from an image the background surrounding a selected subject |
US20090066716A1 (en) * | 2007-09-07 | 2009-03-12 | Palm, Inc. | Video Blending Using Time-Averaged Color Keys |
US8947452B1 (en) * | 2006-12-07 | 2015-02-03 | Disney Enterprises, Inc. | Mechanism for displaying visual clues to stacking order during a drag and drop operation |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH10214337A (en) * | 1997-01-28 | 1998-08-11 | Namco Ltd | Picture forming device |
JP2007249743A (en) * | 2006-03-17 | 2007-09-27 | Nagasaki Univ | Identification method for moving object, identification device for moving object and program making identification processing for moving object to be performed |
JP5732935B2 (en) * | 2011-03-15 | 2015-06-10 | 株式会社リコー | Information processing apparatus, print control program, and computer-readable recording medium |
-
2013
- 2013-03-13 WO PCT/JP2013/057075 patent/WO2014141412A1/en active Application Filing
- 2013-03-13 US US14/766,166 patent/US9734610B2/en active Active
- 2013-03-13 JP JP2013525061A patent/JP5324017B1/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5615282A (en) * | 1990-02-05 | 1997-03-25 | Scitex Corporation Ltd. | Apparatus and techniques for processing of data such as color images |
US5781225A (en) * | 1992-05-19 | 1998-07-14 | Eastman Kodak Company | Method and apparatus for improving electronic recording of depth images |
US6134345A (en) * | 1998-08-28 | 2000-10-17 | Ultimatte Corporation | Comprehensive method for removing from an image the background surrounding a selected subject |
US8947452B1 (en) * | 2006-12-07 | 2015-02-03 | Disney Enterprises, Inc. | Mechanism for displaying visual clues to stacking order during a drag and drop operation |
US20090066716A1 (en) * | 2007-09-07 | 2009-03-12 | Palm, Inc. | Video Blending Using Time-Averaged Color Keys |
Non-Patent Citations (2)
Title |
---|
Communication issued by the International Searching Authority dated Sep. 24, 2015 in counterpart International Application No. PCT/JP2013/057075. |
Liu He, et al., "Object Removal by Depth-guided Inpainting", Institute for Software Technology and Interactive Systems Interactive and Multimedia Systems Group (IMS) Vienna University of Technology, Austria, 2011, total 8 pages. |
Also Published As
Publication number | Publication date |
---|---|
US20160005200A1 (en) | 2016-01-07 |
WO2014141412A1 (en) | 2014-09-18 |
JP5324017B1 (en) | 2013-10-23 |
JPWO2014141412A1 (en) | 2017-02-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10719954B2 (en) | Method and electronic device for extracting a center position of an infrared spot | |
US9477885B2 (en) | Image processing apparatus, image processing method and image processing program | |
US8634659B2 (en) | Image processing apparatus, computer readable medium storing program, and image processing method | |
CN108960229B (en) | Multidirectional character detection method and device | |
US8805077B2 (en) | Subject region detecting apparatus | |
US11455502B2 (en) | Learning device, classification device, learning method, classification method, learning program, and classification program | |
KR20200037887A (en) | Method and apparatus for generating synthetic picture | |
US11017260B2 (en) | Text region positioning method and device, and computer readable storage medium | |
US9734610B2 (en) | Image processing device, image processing method, and image processing program | |
CN110415237B (en) | Skin flaw detection method, skin flaw detection device, terminal device and readable storage medium | |
US20160283786A1 (en) | Image processor, image processing method, and non-transitory recording medium | |
CN112906532B (en) | Image processing method and device, electronic equipment and storage medium | |
CN110866965A (en) | Mapping drawing method and device for three-dimensional model | |
CN108960012A (en) | Feature point detecting method, device and electronic equipment | |
US20160035116A1 (en) | Image processing device, image processing method, information storage medium, and program | |
US9779513B2 (en) | Image processing device, image processing method, and image processing program | |
JP6546385B2 (en) | IMAGE PROCESSING APPARATUS, CONTROL METHOD THEREOF, AND PROGRAM | |
CN115880362B (en) | Code region positioning method, device, computer equipment and computer readable storage medium | |
US11704807B2 (en) | Image processing apparatus and non-transitory computer readable medium storing program | |
JP4825888B2 (en) | Document image processing apparatus and document image processing method | |
CN107103321B (en) | The generation method and generation system of road binary image | |
EP2645698A1 (en) | Trapping method and apparatus | |
US11232335B2 (en) | Printing sub-images to create complete image | |
CN112419141B (en) | Picture processing method and device and computer equipment | |
US20230023611A1 (en) | Method of identifying characters in images, electronic device, and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: RAKUTEN, INC., JAPAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:CHIBA, NAOKI;LIU, XINHAO;SIGNING DATES FROM 20150707 TO 20150715;REEL/FRAME:036266/0403 |
|
AS | Assignment |
Owner name: RAKUTEN, INC., JAPAN Free format text: CHANGE OF ADDRESS;ASSIGNOR:RAKUTEN, INC.;REEL/FRAME:037690/0315 Effective date: 20150907 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |
|
AS | Assignment |
Owner name: RAKUTEN GROUP, INC., JAPAN Free format text: CHANGE OF NAME;ASSIGNOR:RAKUTEN, INC.;REEL/FRAME:058314/0657 Effective date: 20210901 |
|
AS | Assignment |
Owner name: RAKUTEN GROUP, INC., JAPAN Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE REMOVE PATENT NUMBERS 10342096;10671117; 10716375; 10716376;10795407;10795408; AND 10827591 PREVIOUSLY RECORDED AT REEL: 58314 FRAME: 657. ASSIGNOR(S) HEREBY CONFIRMS THE ASSIGNMENT;ASSIGNOR:RAKUTEN, INC.;REEL/FRAME:068066/0103 Effective date: 20210901 |