EP1325464A1 - Halftone watermarking and related applications - Google Patents
Halftone watermarking and related applicationsInfo
- Publication number
- EP1325464A1 EP1325464A1 EP01979700A EP01979700A EP1325464A1 EP 1325464 A1 EP1325464 A1 EP 1325464A1 EP 01979700 A EP01979700 A EP 01979700A EP 01979700 A EP01979700 A EP 01979700A EP 1325464 A1 EP1325464 A1 EP 1325464A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- image
- watermark
- halftone
- signal
- values
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/0021—Image watermarking
- G06T1/005—Robust watermarking, e.g. average attack or collusion attack resistant
- G06T1/0071—Robust watermarking, e.g. average attack or collusion attack resistant using multiple or alternating watermarks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N17/00—Diagnosis, testing or measuring for television systems or their details
- H04N17/004—Diagnosis, testing or measuring for television systems or their details for digital television systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/442—Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
- H04N21/4425—Monitoring of client processing errors or hardware failure
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/835—Generation of protective data, e.g. certificates
- H04N21/8358—Generation of protective data, e.g. certificates involving watermark
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2201/00—General purpose image data processing
- G06T2201/005—Image watermarking
- G06T2201/0051—Embedding of the watermark in the spatial domain
Definitions
- the invention relates to multimedia signal processing, and in particular relates to image watermarking methods and related applications.
- Digital watermarking is a process for modifying physical or electronic media to embed a machine-readable code into the media.
- the media may be modified such that the embedded code is imperceptible or nearly imperceptible to the user, yet may be detected through an automated detection process.
- digital watermarking is applied to media signals such as images, audio signals, and video signals.
- documents e.g., through line, word or character shifting
- software e.g., multi-dimensional graphics models, and surface textures of objects.
- Digital watermarking systems typically have two primary components: an encoder that embeds the watermark in a host media signal, and a decoder that detects and reads the embedded watermark from a signal suspected of containing a watermark (a suspect signal).
- the encoder embeds a watermark by altering the host media signal.
- the reading component analyzes a suspect signal to detect whether a watermark is present, hi applications where the watermark encodes information, the reader extracts this information from the detected watermark.
- the invention provides halftone image watermark methods and systems.
- One aspect of the invention is a method of halftone image watermarking.
- This method assigns a set of halftone watermark values to locations within a halftone image. It diffuses error associated with the halftone watermark values to neighboring locations within the halftone image. The error is characterized as a difference between a multilevel pixel value at a location of a halftone watermark dot, and the halftone watermark value of the halftone watermark dot.
- This method may be used in conjunction with other watermark embedding stages. For example, a robust watermark carrying a key for the halftone watermark may be embedded in the image before embedding the halftone watermark.
- Another aspect of the invention is a method of decoding a halftone watermark from an image.
- the decoding method uses a key to identify locations and values of a halftone watermark, and analyzes pixel values at the locations to determine whether the values at the locations correspond to the values specified by the key.
- a robust watermark embedded in the image carries the key used to decode the halftone watermark from the image.
- a decoder reads the robust watermark from a scan of the halftone watermarked image, optionally compensating for geometric distortion of the scanned image using an orientation signal.
- the decoder then extracts the key from the robust watermark and passes it to a verifier, which in turn, examines a high resolution scan of the suspect image to determine whether the halftone watermark is present at locations specified by the key.
- Another aspect of the invention is another method of embedding a watermark in a halftone image.
- This method computes a watermark image comprising an array of values corresponding to pixel locations in a halftone image. It embeds the watermark image in the halftone image by using the values of the watermark image to modulate thresholds at the pixel locations. The thresholds are used in a halftone process to convert multilevel pixel values in a multilevel per pixel image into halftone pixel values of the halftone image.
- Another aspect of the invention is yet another method of embedding a watermark in a halftone image. This method computes a watermark image comprising an array of values corresponding to pixel locations in a target halftone image at a halftone resolution.
- the decoder includes a watermark detector and reader.
- the detector analyzes portions of an image to detect a watermark signal embedded in the image.
- the image is scanned from a halftone printed image at a sufficiently high resolution to discern a watermark image embedded at a resolution of the halftone image.
- the watermark reader reads a watermark signal from the portions of the image and decodes an auxiliary message comprising one or more symbols.
- Another aspect of the invention is a method of embedding a digital watermark into a halftone image. The method redundantly encodes a multi-bit message, and transforms the encoded message to a multilevel per pixel watermark image. It then derives halftone thresholds from the multilevel per pixel watermark image.
- thresholds are then used to convert target images into watermarked halftone images.
- the multilevel pixels in the target image are used to select corresponding halftone thresholds from the halftone thresholds derived from the watermark image.
- the selected thresholds are applied to corresponding multilevel pixels in the watermark image to create the watermarked halftone image of the target image.
- Another aspect of the invention is a method of measuring digital watermark strength.
- a watermarked signal is processed to extract estimates of error correction encoded bits embedded into the watermarked signal.
- the error correction encoded bits are decoded to compute a message payload.
- the message payload is re-encoded to compute error correction encoded bits.
- a measure of watermark strength is computed from the error correction encoded bits and the estimates of error correction encoded bits.
- the soft bit estimates decoded from the watermarked signal are multiplied by corresponding recomputed error correction encoded bits and summed to get a measure of the watermark signal strength.
- This measurement may be compared with a threshold to detect tampering with the watermarked signal, such as compression, scanning and re-printing, photo-copying, etc.
- a threshold to detect tampering with the watermarked signal, such as compression, scanning and re-printing, photo-copying, etc.
- the embedded digital watermark may be used to carry printer information that is later decoded by a watermark detector and used to examine a digital scan of a printed object and determine whether the printed object is authentic.
- Fig. 1 is a diagram illustrating an example of an error diffusion mask used in creating halftone images from multilevel per pixel images.
- Fig. 2 is a diagram illustrating another example of an error diffusion mask used in creating halftone images from multilevel per pixel images.
- Fig. 3 is a diagram illustrating another example of an error diffusion mask used in creating halftone images from multilevel per pixel images.
- Fig. 4 is a diagram of an application of a halftone watermark application.
- Fig. 5 is a diagram illustrating a method of creating a watermarked halftone image using a threshold mask.
- Fig. 6 is a diagram illustrating a method using the halftone screen as an orientation signal to determine geometric distortion of a watermarked image signal.
- error diffusion used in processing halftone images is called Floyd- Steinberg error diffusion.
- this error diffusion method takes a 0-255 level input image at a first resolution (e.g., 200 pixels per inch) and produces a binary image with a higher resolution (e.g., 600 dots per inch).
- This description details an implementation for an image plane where each pixel has a corresponding multilevel value, such as a luminance value, or some other color channel like cyan, magenta, yellow, etc. The description applies to color images with more than one multilevel value per pixel. In such cases, a halftone process operates on each of the color channels per pixel and creates a halftone image for each channel.
- the input image is upsampled to the higher resolution.
- One approach for producing the binary 600 dots per inche (dpi) image from the upsampled 0-255 level 600 pixel per inch (ppi) image is to threshold the pixel values so that a pixel value less than 128 produces a zero in the corresponding location of the binary image; otherwise the corresponding location would be set to a one.
- Error diffusion takes this general approach in a raster-scan order, but achieves improved performance by "diffusing" the error at each location to nearby locations yet to be processed. This error diffusion is guided by a weighting mask, shown in Fig. 1.
- the "X" represents the pixel location currently being processed, and the adjoining cells show how the error from that location is diffused.
- the sum of the diffused errors is equal to the total error at location X.
- the accuracy of this method is no better than the simple thresholding approach. However, if a pixel from the original 200 ppi image is compared with the corresponding 3x3 binary region, the number of cells with a one is closely correlated with the multilevel pixel value.
- Pi(m, ri) upsampled input intensity, 0 -255 p'(m, n) : modified input intensity p 0 (m, ri) : output binary value e(m, n) : error at location (m, n) w kl : error diffusion weighting coefficients
- the weighting mask shown in Fig. 1 can be modified to show which errors are diffused into a given location.
- Fig. 2 shows an example of such modifications.
- a modified error diffusion method that embeds a watermark comprising a set of binary values at specified dot locations in a binary image. This method starts with an upsampled binary host image, a list of dot locations in a binary image and corresponding binary values for a watermark. This method assigns to these locations the corresponding values of the watermark and tries to improve the image with an error diffusion algorithm at the other (non- watermark) locations. It is assumed that the fraction of total pixels occupied by the watermark is small.
- the error for a given location is always calculated after the algorithm has processed all preceding locations.
- the algorithm scans a rectangular image comprising scanline rows of pixels starting from the top row and scanning from left to right across each row.
- the modified error diffusion method calculates error values for all locations at the start and modifies them as the method proceeds.
- the error of all locations not covered by the watermark are set to zero.
- W(m, ri) is the value of the watermark at location (m, ri) .
- a new set of error diffusion weights is used; the pattern of error diffusion into location X is shown in Fig.
- the diffusion of errors takes place in two directions. Errors from the watermarked locations are diffused in one direction (backwards in this case), and errors from processed locations are diffused in another direction (forwards). As each location is processed, the error for that location is updated for later diffusion. The calculation of the output binary value is unchanged from the error diffusion algorithm, with the exception at watermark locations, where p 0 (m, ri) is set to W(m, ri) .
- the appearance of the watermark can be improved by arranging locations of watermark dots in a pseudorandom pattern subject to some additional human visual system frequency response criteria.
- One approach for arranging the locations of the watermark dots is to use a minimum visual cost technique. In this approach the watermark is chosen by minimizing a visual cost function such as
- H(f x ,f y ) is the frequency spectrum of the watermark and V(f x ,f y ) is a frequency response model of the human visual system.
- V(f x ,f y ) is a frequency response model of the human visual system.
- This cost function is to weight the frequency content of the watermark by the human visual ability to detect it. In this manner, watermark patterns that move the frequency content to a region of the frequency spectrum where the human visual system is less sensitive will have a lower cost.
- This method of selecting locations and values of watermark dots provides a perceptually more even distribution of the dots in solid regions (e.g., white regions in a grayscale image). Searching for a watermark with a low cost can be done by a variety of methods; one such method is simulated annealing.
- the encoder may repeat the watermark in blocks of the host image, such as contiguous blocks of pixels throughout the image. Each block may use the same or a different key. If the encoder uses different keys per block, each key may be related to another key by a secret function, such as a cryptographic function.
- a watermark decoder uses a key specifying where the watermark dots are located. It then determines the binary values of those dots.
- a scanner or camera with sufficiently high resolution to read the halftone dots creates a digital image from which the decoder extracts the watermark.
- this type of watermark may be used to carry a message, including usage control instructions or other metadata like an identifier of the image owner or an index to a database record storing related information. It may also convey a fixed pattern used to determine whether a watermarked image (e.g., a printed halftone watermarked image) is authentic or has been altered. In such an application, the decoder compares the extracted watermark pattern with the known pattern, and based on this comparison determines whether the watermarked image has been altered (e.g., copied, scanned and re-printed, compressed, etc.). It can also specify where the image has been altered by showing locations in the scanned image where the halftone watermark is not present.
- a watermarked image e.g., a printed halftone watermarked image
- the decoder compares the extracted watermark pattern with the known pattern, and based on this comparison determines whether the watermarked image has been altered (e.g., copied, scanned and re-printed, compressed, etc.). It can also specify where the image has been altered
- the watermark is applied to halftone images it may be applied as part of the printing process where a multilevel per pixel digital image is converted to a halftone image. For example, it may be incorporated into the halftoning process implemented in a printer device or printer driver executing in a computer that sends the image to the printer for printing. More generally, the watermarking method may be applied in applications where halftone images are printed, including commercial printing presses as well as personal printing devices, such as ink jet printers.
- the host image interferes with the watermark.
- the host image provides a communication channel for the watermark. In decoding the watermark from a host image, the host image can be considered noise that interferes with the ability of the watermark decoder to accurately extract the watermark signal.
- the watermark to be embedded takes values from 0- 255.
- the error diffusion process is changed so that the threshold used to calculate the binary output values is modulated by the watermark signal:
- T(m, ri) 128 - I(W(m, ri) - 128)
- the intensity level I controls how heavily the watermark is embedded in the image.
- the watermark may be embedded without modifying the halftoning process. For example, a multilevel per pixel watermark signal is created at the resolution of a target halftone image.
- the watermark encoder produces the multilevel per pixel watermark signal at the desired resolution of the halftone image, or at some other resolution and up or down samples it to match the resolution of a target halftone image.
- This watermarked signal is then added to the host image at the same spatial resolution to create a composite, watermarked image.
- the error diffusion process or some other type of halftone process may then be applied directly to this composite image to generate a watermarked halftone image.
- This technique applies to a variety of halftone processes including ordered dithering (e.g., blue noise masks, clustered dot halftones, etc.) as well as error diffusion halftone processes.
- the watermark signal may be generated in a variety of ways.
- One approach is to take an auxiliary message comprising binary or M-ary symbols, apply error correction coding to it, and then spread spectrum modulate the error correction encoded message.
- One way to spread spectrum modulate the message is to spread each binary symbol in the message over a pseudorandom number, using an exclusive OR operation or multiplication operation. The resulting binary message elements in the spread spectrum modulated message signal are then mapped to spatial image locations.
- the watermark signal may be expressed in a binary antipodal form, where binary symbols are either positive or negative.
- the spread spectrum modulated message signal may be repeated throughout the host image, by for example, embedding the message signal in several blocks of the host image.
- the watermark encoder may embed instances of the watermark signal into contiguous blocks of pixels throughout a portion of the host image or throughout the entire host image.
- Perceptual modeling may be applied to the host image to calculate a gain vector with gain values that correspond to the message signal elements. For example, in the case where the upsampled watermarked signal is added to the host signal, the gain values may be used to scale binary antipodal values of the message signal before adding them to the host signal. Each gain value may be a function of desired watermark visibility and detectability constraints.
- the perceptual model analyzes the image to determine the extent to which it can hide a corresponding element of the watermark image.
- One type of an analysis is to compute local contrast in a neighborhood around each pixel (e.g., signal activity) and select gain for pixel as a function of local contrast.
- a detectability model analyzes the host signal to determine the extent to which pixel values are biased toward the value of the watermark signal at the corresponding pixel locations. It then adjusts the gain up or down depending on the extent to which the host image pixels are biased towards the watermark signal.
- This type of watermark may be read from the watermarked halftone image or other image representations of that watermarked image, such as a multilevel per pixel representation of the image at a resolution sufficiently high to represent the watermark signal.
- a watermark decoder detects the presence and orientation of the watermark in the watermarked image. It then performs an inverse of the embedding function to extract an estimate watermark message signal.
- the message signal is robustly encoded using a combination of the following processes:
- the watermark decoder reconstructs an embedded message from the estimated watermark signal by:
- the decoder uses an orientation signal component of the watermark to detect its presence and orientation in the watermarked image. It then performs a predictive filtering on the image sample values to estimate the original un- watermarked signal, and subtracts the estimate of the original from the watermarked signal to produce an estimate of the watermark signal. It performs spread spectrum demodulation and error correction decoding to reconstruct an auxiliary message embedded in the watermarked signal.
- the watermark includes an orientation watermark signal component.
- the watermark message signal and the orientation watermark signal form the watermark signal.
- Both of these components may be added to a host image at the resolution of the halftone image before the host image is converted to a the halftone image. Alternatively, these components may be combined to form the watermark signal used in modulating the error diffusion threshold used in an error diffusion type halftone process.
- One type of watermark orientation signal is an image signal that comprises a set of impulse functions in the Fourier magnitude domain, each with pseudorandom phase.
- the watermark decoder converts the image to the Fourier magnitude domain and then performs a log polar resampling of the Fourier magnitude image.
- a generalized matched filter correlates the known orientation signal with the re-sampled watermarked signal to find the rotation and scale parameters providing the highest correlation.
- the watermark decoder performs additional correlation operations between the phase information of the known orientation signal and the watermarked signal to determine translation parameters, which identify the origin of the watermark message signal. Having determined the rotation, scale and translation of the watermark signal, the reader then adjusts the image data to compensate for this distortion, and extracts the watermark message signal as described above.
- the halftone watermarks described above may be used in combination with one or more other watermarks.
- a robust watermark is used to carry a key that specifies the dot locations of a halftone watermark.
- the robust watermark's message payload carries a key that identifies specific dots (the high-resolution binary values) that were turned on or off in a specific pattern. These binary valued bits act as a secondary fragile watermark that can be verified by close inspection of the image.
- Fig. 4 illustrates an implementation of this application.
- a watermark encoder 100 operates on an input image 102 to embed a robust watermark that survives printing, scanning, and geometric distortion.
- An example of this type of watermark is described above and in the patent and patent application incorporated by reference.
- At least part of the message payload of this robust watermark carries a key that is used to decode a halftone watermark at specified halftone dot locations in a halftone image.
- the key includes a seed to a random number generator
- a halftone converter 106 (e.g., error diffusion, blue noise masking, etc.) then converts the robustly watermarked image into a halftone image while ensuring that the halftone watermark dots are assigned the correct values based on the output of the random number generator.
- This process is the modified error diffusion method described above in which the halftone converter diffuses the error introduced by halftone watermark dots to a spatial neighborhood around their respective locations while ensuring that those watermark dot values remain fixed. This approach reduces the perceptible impact of the halftone watermark.
- halftone image 108 may be printed using conventional printer technology, such as ink jet printing, etc. (110).
- the watermark embedding process (100) may be implemented in a printer or software driver for a printer.
- a scanner 120 captures a digital image from a printed image 122. Since the robust watermark survives printing and then scanning by a low resolution (e.g., 100 dpi) scanner or digital camera, it may be recovered from a digital image captured from the printed image 122.
- a watermark reader 124 uses the decoding operations outlined above, detects the robust watermark, determines its orientation, and then reads the watermark payload, including the key. It supplies the key to a random number generator 126, which provides the halftone dot locations and values of the halftone watermark.
- a secondary watermark verifier 128 then examines a suitably high resolution scan of the printed image to determine whether the halftone watermark is present.
- a "suitably high resolution scan" of the printed image is one in which the halftone dots of the printed image are readable. This high resolution scan may be the same image captured by the scanner 120 if it is at a sufficient resolution. Alternatively, a separate image capture device 130 may be used to capture a high resolution image depicting the halftone dots.
- the verifier 128 provides a signal indicating the extent to which the halftone watermark is present.
- a number of actions can be taken. Some actions include recording identifying information about the user (e.g., a device ID of the user's computer or imaging device, address (e.g., network address), user ID, etc.), sending this information and the result of the verification operation to a remote device via communication link, displaying information about rules governing use of the image, connecting the decoding system to a licensing or electronic transaction server (e.g., a web server) enabling the user to license or purchase related rights, products or services, etc. electronically.
- identifying information about the user e.g., a device ID of the user's computer or imaging device, address (e.g., network address), user ID, etc.
- a remote device via communication link
- displaying information about rules governing use of the image connecting the decoding system to a licensing or electronic transaction server (e.g., a web server) enabling the user to license or purchase related rights, products or services, etc. electronically.
- a licensing or electronic transaction server e.g., a
- Both the robust and fragile halftone watermark may also carry other information. They may be used to carry a message, including usage control instructions or other metadata like an identifier of the image owner, a computer address for establishing a remote connection (e.g., an IP address, URL, etc.) or an index to a database record storing related information.
- the message carries an identifier that is used to fetch related information to the image.
- the watermark decoder communicates the identifier to a database management system in the form of a request.
- the database management system and underlying database records may be implemented in a remote device connected to the watermark decoder device via a network connection (e.g., a web server on the Internet connected via a TCP/IP connection) or within the system containing the watermark decoder (e.g., a local database executing within the same device as the decoder or within a computer locally connected to the decoding device via a port, such as a serial, parallel, infrared, Bluetooth wireless or other peripheral port).
- a network connection e.g., a web server on the Internet connected via a TCP/IP connection
- the database looks up information related to the identifier or identifiers extracted from the robust watermark.
- the database either returns the information to the watermark decoding device (e.g., a personal computer, personal digital assistant, Internet appliance, scanner, printer, etc.), or forwards it to one or more other devices along with an address of the decoding device, which in turn, return related information to the decoding device.
- the watermark decoding device e.g., a personal computer, personal digital assistant, Internet appliance, scanner, printer, etc.
- One example is to use an identifier or address encoded in the robust watermark to fetch information from a web site related to the watermarked image.
- the calibration signal may be integrally related to the watermark message signal, such as a carrier signal or synchronization code. Alternatively, it may be a distinct signal, such as an imperceptible registration template.
- the following watermarking method uses a halftone image screen as an orientation signal, avoiding the need to embed a separate calibration signal.
- the halftone screen may be one normally used to print un- watermarked halftone images, or one specifically adapted to create and print watermarked halftone images.
- a threshold mask is a pattern which can be tiled in a repetitive fashion over an image.
- the threshold mask is a square matrix of numbers.
- the threshold mask may be 128x128, and the image may be comprised of an array of 8-bit pixels.
- the threshold mask contains numbers ranging from 0 to 255, corresponding to the 256 possible levels in an 8 bit pixel value.
- the threshold mask is usually designed to include an equal number of occurrences of each level.
- the threshold mask is used to create the binary pattern of ink dots used to represent the printed version of an image. If an image with 200 pixels per inch is to be printed using 2400 dots per inch of resolution, the image is first upsampled to 2400 pixels per inch. Then the threshold mask is tiled and applied to the upsampled image to construct the halftone image. Each pixel of the halftone image is calculated from the corresponding pixel of the upsampled image and the value of the tiled threshold mask at that location. If the threshold mask value is less than or equal to the upsampled image pixel, then the halftone image pixel is set to 1 ; otherwise it is set to 0.
- Fig. 5 is a diagram illustrating a method of creating a watermarked halftone image using a threshold mask.
- the watermark encoding process begins by transforming an auxiliary message into a watermark image signal (150).
- the method shown in Fig. 5 can be used with a variety of methods of constructing a watermark image signal.
- Fig. 5 cites an example using a spatial spread spectrum watermark. In this case, a watermark message payload is spread spectrum modulated over an area coextensive with the threshold mask.
- Alternative watermark encoding functions may be used, such as frequency domain techniques, where the watermark signal modulates frequency coefficients to encode message symbols, or statistical feature modulation where the watermark modulates features of the signal, such as its autocorrelation, power, amplitude, signal peaks, etc.
- Each of these embedding functions can be characterized as adding a spatial watermark image signal with corresponding elements in the host image.
- the encoder may modulate the watermark image signal with a gain to make the watermark message more likely to be recovered, less visible, or some balancing of recoverability and imperceptibility.
- the encoder converts the message to a binary antipodal signal (150) and a gain modulator adjusts the value of each element of that signal (152).
- the spatial resolution of the watermark image signal corresponds to the target halftone image.
- the watermark image signal may be tiled, so that the same payload is repeated over the image, or it may be varied, so that different information is embedded in different areas of the image.
- the encoder takes a multilevel per pixel form of the image (154) and converts it to the halftone resolution (156). This step typically involves upsampling the multilevel image to a higher resolution (e.g., upsampling a 200 dpi, 8 bit per pixel image to a 2400 dpi per pixel image).
- One method for inserting the watermark is to add a spatial domain representation of the watermark image signal with the multilevel image signal (158), each at the halftone image resolution, to create a composite image.
- the encoder applies a threshold mask (160, 162) to the resulting composite image.
- the halftone screen process converts pixel levels below or equal to the corresponding threshold mask level to zero, otherwise it sets them to one.
- the result is a watermarked halftone image.
- the image may then be printed using a variety of halftone printing process, including ink jet printers and printing presses.
- Fig. 6 is a diagram illustrating a method using the halftone screen as an orientation signal to determine geometric distortion of a watermarked image signal.
- this method is used to calculate the orientation of the host signal at the time of watermark embedding to facilitate recovery of a watermark message.
- a scanner or camera or other imaging device captures a digital version of the printed, watermarked image 170 at sufficiently high resolution (172) to distinguish the halftone pixels.
- a watermark decoder obtains blocks of a digital image 174, each having an area approximately corresponding to the size of the threshold mask (e.g., 128x128). Each digital image pixel is approximately equal in size to a halftone image pixel.
- the decoder filters the image block using a low pass filtering technique
- the decoder applies the same threshold mask used to create the halftone image for printing (178, 180). This process results in a target orientation signal having orientation attributes similar to the original image.
- the decoder To determine the geometric distortion of the received image relative to the original, watermarked image, the decoder performs a series of correlation operations between the target orientation signal and the received image. In the method shown in
- the decoder converts both the received image block and the target orientation signal derived from it to a correlation domain (182). In particular, it performs a
- 128x128 Fast Fourier Transform and resamples the resulting Fourier magnitude representation into a log-polar coordinate system to get a Fourier-Mellin representation of both images.
- Correlation operators 184 such as a generalized matched filter, correlate the Fourier-Mellin representation of the target signal (the reference signal) with a Fourier-Mellin representation of the received image. The correlation operation produces an estimate of the scale and rotation parameters of the digital image.
- the decoder transforms the received spatial digital image by the inverse of the estimated rotation and scale.
- the correlation operator correlates the resulting spatial image with the spatial target orientation signal to obtain an estimate of the translation of the image with respect to the threshold mask.
- a watermark message reader (186) demodulates the spread spectrum watermark signal from the received image.
- Each of the halftone watermark methods described above may be used in
- fragment watermark or “semi-fragile” watermark applications, where the watermark is analyzed to detect and characterize alterations to a watermarked image.
- a fragile watermark refers to a watermark that degrades or becomes unreadable when the host image is subjected to certain types of distortion.
- a semi-fragile watermark is a variant on this theme where the watermark survives some types of distortion, but not others.
- One approach is to design the watermark so that it degrades or becomes unreadable in response to certain types of distortions, like printing, scanning on consumer grade scanners, image compression, etc.
- the decoder If the decoder is unable to detect or read the watermark, or the measured degradation exceeds a threshold, then the decoder provides output indicating that the image has been altered (e.g., is not authentic).
- the degradation can be measured by the extent of recovery of the watermark signal, such as the extent of correlation between a known watermark signal and one extracted from a received image.
- the gain values of the halftone watermark methods previously described may be chosen to provide an explicit balance between ease of reading and fragility with respect to certain types of distortions.
- An enhanced approach is to design the watermark such that the type of alteration can be distinguished.
- the type of alteration can be characterized by the type of degradation it causes to the watermark signal.
- a decoder can match the observed degradation to a particular type of alteration.
- the performance of such fragile watermarking applications can be improved by choosing the frequency distribution of the embedded watermark to differentiate between different forms of distortion. There are two main considerations in designing the watermark signal attributes.
- the first is to put part of the watermark's energy in frequencies that are likely to be degraded by a particular type of degradation to be detected (such as print and scan operations); the second is to put energy in the frequencies that are less likely to be degraded by distortion that the watermark should survive (such as smudges, and normal wear and tear).
- a particular type of degradation to be detected such as print and scan operations
- the second is to put energy in the frequencies that are less likely to be degraded by distortion that the watermark should survive (such as smudges, and normal wear and tear).
- a digital watermark may be embedded into a halftone image by using the digital watermark signal as a halftone screen.
- the following discussion illustrates an example of the process :
- the watermark signal may include an orientation signal, such as a pseudorandom carrier signal that yields a pattern of signal peaks when transformed into the autocorrelation domain or an orientation signal that yields a pattern when transformed into a frequency domain, such as the Fourier domain.
- an orientation signal such as a pseudorandom carrier signal that yields a pattern of signal peaks when transformed into the autocorrelation domain or an orientation signal that yields a pattern when transformed into a frequency domain, such as the Fourier domain.
- the resulting watermark signal is a multi-level per pixel gray scale image or array of luminance values. It has a known block size, such as 64 by 64, 128 by 128, 256 by 256 pixel elements that can be tiled contiguously to create an image of varying sizes.
- halftone threshold levels for corresponding gray values (or luminance values). This example uses gray levels, but the same technique applies to luminance values as well as other color channels of color images. These halftone threshold levels are later used to convert a multilevel pixel value to a binary state of "on” or "off of a halftone pixel by comparing the multilevel pixel value to the threshold and setting the halftone pixel at that location to a zero or one depending on whether the multilevel pixel value is less or greater than the threshold.
- the method picks a threshold level that, when applied to the multilevel pixel values in the watermark image block, the resulting halftone image has a tone density that achieves the desired gray value.
- the thresholds set by this process may be used over and over to create watermarked halftone images with the same embedded signal.
- the process of creating the halftone image operates:
- Each halftone dot in the watermarked halftone image is either ink (minimum luminance, ) or no ink (maximum luminance).
- a high luminance value in the target host image sets a threshold such that a corresponding pixel in the watermark image is more likely to be set to the "no ink” state.
- a low luminance value in the target image sets a threshold such that a corresponding pixel in the watermark image is more likely to be set to an "ink present" state.
- This process of converting an image to a watermarked halftone image may be applied to one or more color planes in a color halftone image.
- the target host image is a simple gray level image with only a few different gray levels
- the above technique can be used to set thresholds for each of the gray levels in the host image.
- these gray levels can be divided into ranges, where each range is assigned a single halftone threshold.
- Masks can then be calculated to define the areas of the host image that correspond to each of the halftone thresholds.
- the halftone threshold corresponding to the particular mask is applied to the watermark signal areas covered by the mask to create a watermarked halftone image.
- the spread signal values are binary, e.g., ⁇ 1, 0 ⁇ , or ⁇ 1,-1 ⁇ representing increases or decreases in corresponding sample values (e.g., luminance, gray level, intensity, etc.) Each binary value corresponds to one or more neighboring pixels in the watermark signal.
- These binary values may be converted to multilevel values by adjusting a midlevel pixel value (e.g., mid level gray value of 128 in an 8 bit pixel) up or down corresponding to the value of the spread signal.
- the gain of the watermark signal may be adjusted by applying a scale factor to the spread signal. Further, the changes from each bit of the spread signal may be made to smoothly vary over the corresponding pixels in the neighborhood.
- the carrier may be used for synchronizing the detector with the watermark in a watermarked signal.
- an orientation signal may be added to the spread signal to assist in calibration.
- a watermark detector operates as follows:
- error correction encoded bit values may be "soft" bit values weighted by the extent of correlation with the carrier. In other words, rather than representing each estimate of an error correction encoded bit as hard one or zero, they are represented by some weighted probability between values corresponding to a binary one or zero (e.g., between -1 and 1, -128 and 128, etc.)
- the process of reading the watermark may also be used to detect whether the watermarked halftone image has been altered. In particular, it may be used to determine whether the printed halftone image has been copied, e.g., photocopied, or scanned and re-printed.
- the detected strength of the watermark signal may be compared to a copy detection threshold to determine whether the printed object has been copied.
- a copy detection threshold One approach for measuring the strength of the watermark signal is as follows:
- the watermark message payload may be used to identify the printer model and/or resolution of the watermarked halftone image. Then, to verify a printed object is authentic, the watermark detector extracts the message payload and determines the printer model and or printed resolution. A forensic scan of the printed object is than analyzed to verify that the object was printed at the resolution specified in the watermark payload. One specific way to analyze the resolution of the forensic scan is examine its frequency content to determine if it was likely printed at the specified resolution. The technique described above of detecting signal tampering by measuring symbol errors can be applied to two or more different watermarks embedded at different spatial resolutions in the host media signal. Each of the watermarks may have the same or different message payloads.
- the message extracted from one of the watermarks may be used to measure bit errors in each of the other watermarks.
- the message payload from a robust watermark embedded at a low spatial resolution may be used to measure the bit errors from a less robust watermark at a higher spatial resolution.
- error detection bits such as CRC bits, can be used in each message payload to ensure that the message is accurately decoded before re-creating the original, embedded bit sequence.
- Using two or more different watermarks enables a threshold to be set based on the ratio of the signal strength of the watermarks relative to each other.
- the signal strength of a first watermark at a high resolution (600-1200 dpi) is divided by the signal strength of a second watermark at a lower resolution (75-100 dpi).
- the signal strength is measured using a measure of symbol errors or some other measure (e.g., correlation measure). If the measured strength exceeds a threshold, the detector deems the watermark signal to be authentic and generates an authentication signal.
- This signal may be a simple binary value indicating whether or not the object is authentic, or a more complex image signal indicating where bit errors were detected in the scanned image.
- the watermark and host signal can be particularly tailored to detect copying by photo duplication and printing/re-scanning of the printed object. This entails embedding the watermark with selected screening structures at particular spatial frequencies/resolutions that are likely to generate message symbol errors when the object is re-printed.
- This detection process has an additional advantages in that it enables automatic authentication, it can be used with lower quality camera devices such as web cameras and common image scanners, and it allows the watermark to serve the functions of determining authenticity as well as carrying a message payload useful for a variety of applications.
- the message payload can include an identifier or index to a database that stores information about the object or a link to a network resource (e.g., a web page on the Internet).
- the payload may also include a covert trace identifier associated with a particular authentic item, batch of items, printer, or distributor. This enables a counterfeit object, or authentic object that has been printed without authority to be detected and traced to a particular source, such as its printer, distributor or batch number.
- the payload may also carry printer characteristics or printer type information that enables the watermark reader to adapt its detection routines to printer types that generated the authentic object.
- the payload may carry an identifier that specifies the type of halftoning used to create the authentic image, and more specifically, the attributes of the halftone screen.
- the reader can check authenticity by determining whether features associated with the halftone screen exist in the printed object.
- the reader can check for halftone screen attributes that indicate that a different halftone screen process has been used (e.g., a counterfeit has been created using a different halftone screen).
- One specific example is a payload that identifies the halftone screen type and paper type.
- the reader extracts this payload from a robust watermark payload and then analyzes the halftone screen and paper attributes to see if they match the halftone type and paper type indicated in the watermark payload.
- the halftone type can specify the type of halftone screen patterns used to create an authentic image. If this halftone screen pattern is not detected (e.g., by absence of a watermark embedded using that particular type of halftone screen), then the image is considered to be a fake.
- a related approach for analyzing halftone type is to look for halftone attributes, like tell-tale signs of stochastic halftone screens vs. ordered dither matrix type screens.
- Dither matrix screens used in low end printers tend to generate tell tale patterns, such as a pattern of peaks in the Fourier domain that differentiate the halftone process from a stochastic screen, such as an error diffusion process, which does not generate such telltale peaks. If the reader finds peaks where none were anticipated, then the image is deemed a fake. Likewise, if the reader finds no peaks where peaks were anticipated, then the image is also deemed a fake. Before performing such analysis, it is preferable to use the embedded digital watermark to re-align the image to its original orientation at the time of printing.
- Attributes due to the halftone screen can then be evaluated in a proper spatial frame of reference. For example, if the original ordered dither matrix printer created an array of peaks in the Fourier domain, then the peak locations can be checked more accurately after the image is realigned.
- auxiliary data encoding processes may be implemented in a programmable computer or a special purpose digital circuit.
- auxiliary data decoding may be implemented in software, firmware, hardware, or combinations of software, firmware and hardware.
- the methods and processes described above may be implemented in programs executed from a system's memory (a computer readable medium, such as an electronic, optical or magnetic storage device).
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Networks & Wireless Communication (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Biology (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Computer Security & Cryptography (AREA)
- Image Processing (AREA)
- Editing Of Facsimile Originals (AREA)
- Facsimile Image Signal Circuits (AREA)
- Accessory Devices And Overall Control Thereof (AREA)
- Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
Abstract
One method for halftone image watermarking assigns halftone watermark dot values to pseudorandom locations in an image and diffuses the error (106) of the watermark to neighboring pixel locations of these dots. Another method modulates halftone thresholds to convert multilevel pixels to watermarked halftone pixels. This method may be used in conjunction with a robust watermark (110) spread throughout image before embedding the halftone watermark. In a compatible watermark reader (124), digital watermark signal strength is measured by analyzing bit errors in the extracted watermark message signal.
Description
Halftone Watermarking and Related Applications
Technical Field
The invention relates to multimedia signal processing, and in particular relates to image watermarking methods and related applications.
Background and Summary
Digital watermarking is a process for modifying physical or electronic media to embed a machine-readable code into the media. The media may be modified such that the embedded code is imperceptible or nearly imperceptible to the user, yet may be detected through an automated detection process. Most commonly, digital watermarking is applied to media signals such as images, audio signals, and video signals. However, it may also be applied to other types of media objects, including documents (e.g., through line, word or character shifting), software, multi-dimensional graphics models, and surface textures of objects.
Digital watermarking systems typically have two primary components: an encoder that embeds the watermark in a host media signal, and a decoder that detects and reads the embedded watermark from a signal suspected of containing a watermark (a suspect signal). The encoder embeds a watermark by altering the host media signal. The reading component analyzes a suspect signal to detect whether a watermark is present, hi applications where the watermark encodes information, the reader extracts this information from the detected watermark.
Several particular watermarking techniques have been developed. The reader is presumed to be familiar with the literature in this field. Particular techniques for embedding and detecting imperceptible watermarks in media signals are detailed in the assignee's co-pending application serial number 09/503,881 and US Patent 5,862,260, which are hereby incorporated by reference.
The invention provides halftone image watermark methods and systems.
One aspect of the invention is a method of halftone image watermarking. This method assigns a set of halftone watermark values to locations within a halftone image. It diffuses error associated with the halftone watermark values to neighboring locations
within the halftone image. The error is characterized as a difference between a multilevel pixel value at a location of a halftone watermark dot, and the halftone watermark value of the halftone watermark dot. This method may be used in conjunction with other watermark embedding stages. For example, a robust watermark carrying a key for the halftone watermark may be embedded in the image before embedding the halftone watermark.
Another aspect of the invention is a method of decoding a halftone watermark from an image. The decoding method uses a key to identify locations and values of a halftone watermark, and analyzes pixel values at the locations to determine whether the values at the locations correspond to the values specified by the key. In one application, a robust watermark embedded in the image carries the key used to decode the halftone watermark from the image. A decoder reads the robust watermark from a scan of the halftone watermarked image, optionally compensating for geometric distortion of the scanned image using an orientation signal. The decoder then extracts the key from the robust watermark and passes it to a verifier, which in turn, examines a high resolution scan of the suspect image to determine whether the halftone watermark is present at locations specified by the key.
Another aspect of the invention is another method of embedding a watermark in a halftone image. This method computes a watermark image comprising an array of values corresponding to pixel locations in a halftone image. It embeds the watermark image in the halftone image by using the values of the watermark image to modulate thresholds at the pixel locations. The thresholds are used in a halftone process to convert multilevel pixel values in a multilevel per pixel image into halftone pixel values of the halftone image. Another aspect of the invention is yet another method of embedding a watermark in a halftone image. This method computes a watermark image comprising an array of values corresponding to pixel locations in a target halftone image at a halftone resolution. It then combines the array of values of the watermark image with corresponding multilevel pixel values of a multilevel per pixel image to create a watermarked multilevel per pixel image at the resolution of the target halftone image.
Finally, it performs a halftone process to convert the watermarked multilevel per pixel image to a watermarked halftone image.
Another aspect of the invention is a watermark decoder. The decoder includes a watermark detector and reader. The detector analyzes portions of an image to detect a watermark signal embedded in the image. The image is scanned from a halftone printed image at a sufficiently high resolution to discern a watermark image embedded at a resolution of the halftone image. The watermark reader reads a watermark signal from the portions of the image and decodes an auxiliary message comprising one or more symbols. Another aspect of the invention is a method of embedding a digital watermark into a halftone image. The method redundantly encodes a multi-bit message, and transforms the encoded message to a multilevel per pixel watermark image. It then derives halftone thresholds from the multilevel per pixel watermark image. These thresholds are then used to convert target images into watermarked halftone images. The multilevel pixels in the target image are used to select corresponding halftone thresholds from the halftone thresholds derived from the watermark image. The selected thresholds are applied to corresponding multilevel pixels in the watermark image to create the watermarked halftone image of the target image.
Another aspect of the invention is a method of measuring digital watermark strength. In this method, a watermarked signal is processed to extract estimates of error correction encoded bits embedded into the watermarked signal. Then, the error correction encoded bits are decoded to compute a message payload. The message payload is re-encoded to compute error correction encoded bits. A measure of watermark strength is computed from the error correction encoded bits and the estimates of error correction encoded bits. In particular, in one implementation the soft bit estimates decoded from the watermarked signal are multiplied by corresponding recomputed error correction encoded bits and summed to get a measure of the watermark signal strength. This measurement may be compared with a threshold to detect tampering with the watermarked signal, such as compression, scanning and re-printing, photo-copying, etc.
For printing applications, the embedded digital watermark may be used to carry printer information that is later decoded by a watermark detector and used to examine a digital scan of a printed object and determine whether the printed object is authentic.
Further features will become apparent with reference to the following detailed description and accompanying drawings.
Brief Description of the Drawings
Fig. 1 is a diagram illustrating an example of an error diffusion mask used in creating halftone images from multilevel per pixel images. Fig. 2 is a diagram illustrating another example of an error diffusion mask used in creating halftone images from multilevel per pixel images.
Fig. 3 is a diagram illustrating another example of an error diffusion mask used in creating halftone images from multilevel per pixel images.
Fig. 4 is a diagram of an application of a halftone watermark application. Fig. 5 is a diagram illustrating a method of creating a watermarked halftone image using a threshold mask.
Fig. 6 is a diagram illustrating a method using the halftone screen as an orientation signal to determine geometric distortion of a watermarked image signal.
Detailed Description The following description details methods for watermarking halftone images and related applications. While the watermarking methods apply to other forms of halftoning, the description provides specific examples applicable to error diffusion techniques used to create halftone images.
One form of error diffusion used in processing halftone images is called Floyd- Steinberg error diffusion. In a typical implementation for 8 bit per pixel images, this error diffusion method takes a 0-255 level input image at a first resolution (e.g., 200 pixels per inch) and produces a binary image with a higher resolution (e.g., 600 dots per inch). This description details an implementation for an image plane where each pixel has a corresponding multilevel value, such as a luminance value, or some other color
channel like cyan, magenta, yellow, etc. The description applies to color images with more than one multilevel value per pixel. In such cases, a halftone process operates on each of the color channels per pixel and creates a halftone image for each channel. As a first step, the input image is upsampled to the higher resolution. One approach for producing the binary 600 dots per inche (dpi) image from the upsampled 0-255 level 600 pixel per inch (ppi) image is to threshold the pixel values so that a pixel value less than 128 produces a zero in the corresponding location of the binary image; otherwise the corresponding location would be set to a one. Error diffusion takes this general approach in a raster-scan order, but achieves improved performance by "diffusing" the error at each location to nearby locations yet to be processed. This error diffusion is guided by a weighting mask, shown in Fig. 1.
The "X" represents the pixel location currently being processed, and the adjoining cells show how the error from that location is diffused. For this algorithm, the sum of the diffused errors is equal to the total error at location X. For any specific binary location, the accuracy of this method is no better than the simple thresholding approach. However, if a pixel from the original 200 ppi image is compared with the corresponding 3x3 binary region, the number of cells with a one is closely correlated with the multilevel pixel value.
A more prescise description of the basic error diffusion algorithm uses three equations. See "Digital Color Halftoning", p. 359, by Kang (co-published by The International Society for Optical Engineering and IEEE Press 1999), which is incorporated by reference:
(m, n) : binary (upsampled) pixel location
Pi(m, ri) : upsampled input intensity, 0 -255 p'(m, n) : modified input intensity p0 (m, ri) : output binary value e(m, n) : error at location (m, n) wkl : error diffusion weighting coefficients
p'(m, ri) = pt(m, ri) + ^ wkle(m ~k, n — T) kl
e(m, ri) = p'(m, ri) - 255 p0(m, ri)
The weighting mask shown in Fig. 1 can be modified to show which errors are diffused into a given location. Fig. 2 shows an example of such modifications. In the following sections, we describe a modified error diffusion method that embeds a watermark comprising a set of binary values at specified dot locations in a binary image. This method starts with an upsampled binary host image, a list of dot locations in a binary image and corresponding binary values for a watermark. This method assigns to these locations the corresponding values of the watermark and tries to improve the image with an error diffusion algorithm at the other (non- watermark) locations. It is assumed that the fraction of total pixels occupied by the watermark is small.
In the error diffusion algorithm introduced above, the error for a given location is always calculated after the algorithm has processed all preceding locations. Typically, the algorithm scans a rectangular image comprising scanline rows of pixels starting from the top row and scanning from left to right across each row. The modified error diffusion method calculates error values for all locations at the start and modifies them as the method proceeds. At the start, the error of all locations not covered by the watermark are set to zero. For a location covered by the watermark, the error is calculated as
e(m, ri) = Pi (m, ri) - 255W(m, ri)
where W(m, ri) is the value of the watermark at location (m, ri) . A new set of error diffusion weights is used; the pattern of error diffusion into location X is shown in Fig.
3.
In this method, the diffusion of errors takes place in two directions. Errors from the watermarked locations are diffused in one direction (backwards in this case), and errors from processed locations are diffused in another direction (forwards). As each location is processed, the error for that location is updated for later diffusion. The calculation of the output binary value is unchanged from the error diffusion algorithm, with the exception at watermark locations, where p0 (m, ri) is set to W(m, ri) .
The appearance of the watermark can be improved by arranging locations of watermark dots in a pseudorandom pattern subject to some additional human visual system frequency response criteria. One approach for arranging the locations of the watermark dots is to use a minimum visual cost technique. In this approach the watermark is chosen by minimizing a visual cost function such as
where H(fx,fy) is the frequency spectrum of the watermark and V(fx,fy) is a frequency response model of the human visual system. One possible such model is due to Sullivan et al. and given in Kang, section 5.6. The effect of this cost function is to weight the frequency content of the watermark by the human visual ability to detect it. In this manner, watermark patterns that move the frequency content to a region of the frequency spectrum where the human visual system is less sensitive will have a lower cost. This method of selecting locations and values of watermark dots provides a perceptually more even distribution of the dots in solid regions (e.g., white regions in a grayscale image). Searching for a watermark with a low cost can be done by a variety of methods; one such method is simulated annealing.
The encoder may repeat the watermark in blocks of the host image, such as contiguous blocks of pixels throughout the image. Each block may use the same or a
different key. If the encoder uses different keys per block, each key may be related to another key by a secret function, such as a cryptographic function.
To read the watermark, a watermark decoder uses a key specifying where the watermark dots are located. It then determines the binary values of those dots. In cases where the watermark is embedded in a printed image, a scanner or camera with sufficiently high resolution to read the halftone dots creates a digital image from which the decoder extracts the watermark.
There are many applications of this type of watermark. It may be used to carry a message, including usage control instructions or other metadata like an identifier of the image owner or an index to a database record storing related information. It may also convey a fixed pattern used to determine whether a watermarked image (e.g., a printed halftone watermarked image) is authentic or has been altered. In such an application, the decoder compares the extracted watermark pattern with the known pattern, and based on this comparison determines whether the watermarked image has been altered (e.g., copied, scanned and re-printed, compressed, etc.). It can also specify where the image has been altered by showing locations in the scanned image where the halftone watermark is not present.
Since the watermark is applied to halftone images it may be applied as part of the printing process where a multilevel per pixel digital image is converted to a halftone image. For example, it may be incorporated into the halftoning process implemented in a printer device or printer driver executing in a computer that sends the image to the printer for printing. More generally, the watermarking method may be applied in applications where halftone images are printed, including commercial printing presses as well as personal printing devices, such as ink jet printers. In some types of image watermarking, the host image interferes with the watermark. The host image provides a communication channel for the watermark. In decoding the watermark from a host image, the host image can be considered noise that interferes with the ability of the watermark decoder to accurately extract the watermark signal. To increase the chances of accurate recovery of the watermark, it is spread throughout a large portion (perhaps the entire image) of the host image.
The type of watermark described in the previous paragraph can be embedded directly into a halftone image. The following discussion details an error diffusion method for directly embedding this type of watermark into a halftone image.
In this method, the watermark to be embedded, W (m, ri) , takes values from 0- 255. The error diffusion process is changed so that the threshold used to calculate the binary output values is modulated by the watermark signal:
T(m, ri) = 128 - I(W(m, ri) - 128)
The intensity level I controls how heavily the watermark is embedded in the image. As an alternative to modulating error diffusion thresholds, the watermark may be embedded without modifying the halftoning process. For example, a multilevel per pixel watermark signal is created at the resolution of a target halftone image. The watermark encoder produces the multilevel per pixel watermark signal at the desired resolution of the halftone image, or at some other resolution and up or down samples it to match the resolution of a target halftone image. This watermarked signal is then added to the host image at the same spatial resolution to create a composite, watermarked image. The error diffusion process or some other type of halftone process may then be applied directly to this composite image to generate a watermarked halftone image. This technique applies to a variety of halftone processes including ordered dithering (e.g., blue noise masks, clustered dot halftones, etc.) as well as error diffusion halftone processes.
There are a variety of ways to generate the watermark signal. One approach is to take an auxiliary message comprising binary or M-ary symbols, apply error correction coding to it, and then spread spectrum modulate the error correction encoded message. One way to spread spectrum modulate the message is to spread each binary symbol in the message over a pseudorandom number, using an exclusive OR operation or multiplication operation. The resulting binary message elements in the spread
spectrum modulated message signal are then mapped to spatial image locations. The watermark signal may be expressed in a binary antipodal form, where binary symbols are either positive or negative. To increase robustness, the spread spectrum modulated message signal may be repeated throughout the host image, by for example, embedding the message signal in several blocks of the host image. In particular, the watermark encoder may embed instances of the watermark signal into contiguous blocks of pixels throughout a portion of the host image or throughout the entire host image.
Perceptual modeling may be applied to the host image to calculate a gain vector with gain values that correspond to the message signal elements. For example, in the case where the upsampled watermarked signal is added to the host signal, the gain values may be used to scale binary antipodal values of the message signal before adding them to the host signal. Each gain value may be a function of desired watermark visibility and detectability constraints. In particular, the perceptual model analyzes the image to determine the extent to which it can hide a corresponding element of the watermark image. One type of an analysis is to compute local contrast in a neighborhood around each pixel (e.g., signal activity) and select gain for pixel as a function of local contrast. A detectability model analyzes the host signal to determine the extent to which pixel values are biased toward the value of the watermark signal at the corresponding pixel locations. It then adjusts the gain up or down depending on the extent to which the host image pixels are biased towards the watermark signal.
This type of watermark may be read from the watermarked halftone image or other image representations of that watermarked image, such as a multilevel per pixel representation of the image at a resolution sufficiently high to represent the watermark signal. To decode the watermark, a watermark decoder detects the presence and orientation of the watermark in the watermarked image. It then performs an inverse of the embedding function to extract an estimate watermark message signal.
The message signal is robustly encoded using a combination of the following processes:
1. repetitively encoding instances of a message signal at several locations (e.g., blocks of the image);
2. spread spectrum modulation of the message, including modulation techniques using M sequences and gold codes; and
3. error correction coding, such as convolution coding, turbo coding, BCH coding, Reed Solomon coding, etc. The watermark decoder reconstructs an embedded message from the estimated watermark signal by:
1. aggregating estimates of the same message element in repetitively encoded instances of the message;
2. performing spread spectrum demodulation, and 3. error correction decoding.
In one implementation, the decoder uses an orientation signal component of the watermark to detect its presence and orientation in the watermarked image. It then performs a predictive filtering on the image sample values to estimate the original un- watermarked signal, and subtracts the estimate of the original from the watermarked signal to produce an estimate of the watermark signal. It performs spread spectrum demodulation and error correction decoding to reconstruct an auxiliary message embedded in the watermarked signal.
For more details about embedding an image watermark, and detecting and reading the watermark from a digitized version of the image after printing and scanning see assignee's co-pending application serial number 09/503,881 and US Patent 5,862,260, which are hereby incorporated by reference. In order to make the watermark robust to geometric distortion, the watermark includes an orientation watermark signal component. Together, the watermark message signal and the orientation watermark signal form the watermark signal. Both of these components may be added to a host image at the resolution of the halftone image before the host image is converted to a the halftone image. Alternatively, these components may be combined to form the watermark signal used in modulating the error diffusion threshold used in an error diffusion type halftone process.
One type of watermark orientation signal is an image signal that comprises a set of impulse functions in the Fourier magnitude domain, each with pseudorandom phase. To detect rotation and scale of the watermarked image (e.g., after printing and scanning
of the watermarked image), the watermark decoder converts the image to the Fourier magnitude domain and then performs a log polar resampling of the Fourier magnitude image. A generalized matched filter correlates the known orientation signal with the re-sampled watermarked signal to find the rotation and scale parameters providing the highest correlation. The watermark decoder performs additional correlation operations between the phase information of the known orientation signal and the watermarked signal to determine translation parameters, which identify the origin of the watermark message signal. Having determined the rotation, scale and translation of the watermark signal, the reader then adjusts the image data to compensate for this distortion, and extracts the watermark message signal as described above.
The halftone watermarks described above may be used in combination with one or more other watermarks. In one application, for example, a robust watermark is used to carry a key that specifies the dot locations of a halftone watermark. In particular, the robust watermark's message payload carries a key that identifies specific dots (the high-resolution binary values) that were turned on or off in a specific pattern. These binary valued bits act as a secondary fragile watermark that can be verified by close inspection of the image.
Fig. 4 illustrates an implementation of this application. On the watermark embedding side, a watermark encoder 100 operates on an input image 102 to embed a robust watermark that survives printing, scanning, and geometric distortion. An example of this type of watermark is described above and in the patent and patent application incorporated by reference. At least part of the message payload of this robust watermark carries a key that is used to decode a halftone watermark at specified halftone dot locations in a halftone image. In this implementation, the key includes a seed to a random number generator
104 that identifies locations of the halftone watermark dots in the halftone image and also specifies the binary values (on or off) of the dots at those locations. A halftone converter 106 (e.g., error diffusion, blue noise masking, etc.) then converts the robustly watermarked image into a halftone image while ensuring that the halftone watermark dots are assigned the correct values based on the output of the random number generator.
One example of this process is the modified error diffusion method described above in which the halftone converter diffuses the error introduced by halftone watermark dots to a spatial neighborhood around their respective locations while ensuring that those watermark dot values remain fixed. This approach reduces the perceptible impact of the halftone watermark. Other halftone methods may be used as well as long as they ensure that the halftone watermark dots are set as specified by the key. The result is a halftone image 108 that may be printed using conventional printer technology, such as ink jet printing, etc. (110). In fact, the watermark embedding process (100) may be implemented in a printer or software driver for a printer. To verify the authenticity of the printed image, a scanner 120 captures a digital image from a printed image 122. Since the robust watermark survives printing and then scanning by a low resolution (e.g., 100 dpi) scanner or digital camera, it may be recovered from a digital image captured from the printed image 122. A watermark reader 124, using the decoding operations outlined above, detects the robust watermark, determines its orientation, and then reads the watermark payload, including the key. It supplies the key to a random number generator 126, which provides the halftone dot locations and values of the halftone watermark.
A secondary watermark verifier 128 then examines a suitably high resolution scan of the printed image to determine whether the halftone watermark is present. A "suitably high resolution scan" of the printed image is one in which the halftone dots of the printed image are readable. This high resolution scan may be the same image captured by the scanner 120 if it is at a sufficient resolution. Alternatively, a separate image capture device 130 may be used to capture a high resolution image depicting the halftone dots. The verifier 128 provides a signal indicating the extent to which the halftone watermark is present.
Based on the output of the verifier, a number of actions can be taken. Some actions include recording identifying information about the user (e.g., a device ID of the user's computer or imaging device, address (e.g., network address), user ID, etc.), sending this information and the result of the verification operation to a remote device via communication link, displaying information about rules governing use of the image, connecting the decoding system to a licensing or electronic transaction server (e.g., a
web server) enabling the user to license or purchase related rights, products or services, etc. electronically.
Both the robust and fragile halftone watermark may also carry other information. They may be used to carry a message, including usage control instructions or other metadata like an identifier of the image owner, a computer address for establishing a remote connection (e.g., an IP address, URL, etc.) or an index to a database record storing related information. In one application, the message carries an identifier that is used to fetch related information to the image. In particular, the watermark decoder communicates the identifier to a database management system in the form of a request. The database management system and underlying database records may be implemented in a remote device connected to the watermark decoder device via a network connection (e.g., a web server on the Internet connected via a TCP/IP connection) or within the system containing the watermark decoder (e.g., a local database executing within the same device as the decoder or within a computer locally connected to the decoding device via a port, such as a serial, parallel, infrared, Bluetooth wireless or other peripheral port). In response to the request, the database looks up information related to the identifier or identifiers extracted from the robust watermark. The database either returns the information to the watermark decoding device (e.g., a personal computer, personal digital assistant, Internet appliance, scanner, printer, etc.), or forwards it to one or more other devices along with an address of the decoding device, which in turn, return related information to the decoding device. One example is to use an identifier or address encoded in the robust watermark to fetch information from a web site related to the watermarked image. For more information on this use of the robust watermark, see US Patent No. 5,841,978 and co-pending applications 09/571 ,422, 09/563,664, and 09/597,209, which are hereby incorporated by reference.
As noted previously, for many image watermark applications, it is necessary to, as part of the watermark detection and reading process, determine the orientation (scale, rotation, etc.) of the watermarked image prior to the process of reading the watermark. One way to determine orientation of a watermarked image relative to its orientation at the time of embedding the watermark is to use some form of calibration signal as part
of the watermark. The calibration signal may be integrally related to the watermark message signal, such as a carrier signal or synchronization code. Alternatively, it may be a distinct signal, such as an imperceptible registration template.
The following watermarking method uses a halftone image screen as an orientation signal, avoiding the need to embed a separate calibration signal. The halftone screen may be one normally used to print un- watermarked halftone images, or one specifically adapted to create and print watermarked halftone images.
This method is suitable for image halftoning methods in which the halftoning is implemented through the use of threshold masks. A threshold mask is a pattern which can be tiled in a repetitive fashion over an image. Most commonly, the threshold mask is a square matrix of numbers. For example, the threshold mask may be 128x128, and the image may be comprised of an array of 8-bit pixels. In this case, the threshold mask contains numbers ranging from 0 to 255, corresponding to the 256 possible levels in an 8 bit pixel value. In cases where the number of elements in the mask exceeds the number of levels per pixel, the threshold mask is usually designed to include an equal number of occurrences of each level. For example, in the current example where there are 16384 elements in a 128x128 mask, there are 64 entries of the threshold mask for each level (16384/256 = 64). It is possible to use a 16x16 mask with each element corresponding to a different level from 0 to 255. However, when tiled across the image, such a mask may introduce more visual artifacts than the larger mask. Of course, these masks are just examples, and there are many alternative combinations of mask sizes, dimensions, and levels for threshold masks used in halftone screen processes.
In a typical halftone screen process, the threshold mask is used to create the binary pattern of ink dots used to represent the printed version of an image. If an image with 200 pixels per inch is to be printed using 2400 dots per inch of resolution, the image is first upsampled to 2400 pixels per inch. Then the threshold mask is tiled and applied to the upsampled image to construct the halftone image. Each pixel of the halftone image is calculated from the corresponding pixel of the upsampled image and the value of the tiled threshold mask at that location. If the threshold mask value is less
than or equal to the upsampled image pixel, then the halftone image pixel is set to 1 ; otherwise it is set to 0.
Error diffusion halftoning may not be implemented in this way. Fig. 5 is a diagram illustrating a method of creating a watermarked halftone image using a threshold mask. The watermark encoding process begins by transforming an auxiliary message into a watermark image signal (150). The method shown in Fig. 5 can be used with a variety of methods of constructing a watermark image signal. For the purpose of illustration, Fig. 5 cites an example using a spatial spread spectrum watermark. In this case, a watermark message payload is spread spectrum modulated over an area coextensive with the threshold mask. Alternative watermark encoding functions may be used, such as frequency domain techniques, where the watermark signal modulates frequency coefficients to encode message symbols, or statistical feature modulation where the watermark modulates features of the signal, such as its autocorrelation, power, amplitude, signal peaks, etc. Each of these embedding functions can be characterized as adding a spatial watermark image signal with corresponding elements in the host image.
The encoder may modulate the watermark image signal with a gain to make the watermark message more likely to be recovered, less visible, or some balancing of recoverability and imperceptibility. In the case of the spatial spread spectrum watermark image, the encoder converts the message to a binary antipodal signal (150) and a gain modulator adjusts the value of each element of that signal (152). The spatial resolution of the watermark image signal corresponds to the target halftone image.
The watermark image signal may be tiled, so that the same payload is repeated over the image, or it may be varied, so that different information is embedded in different areas of the image.
To prepare the host image for the watermark, the encoder takes a multilevel per pixel form of the image (154) and converts it to the halftone resolution (156). This step typically involves upsampling the multilevel image to a higher resolution (e.g., upsampling a 200 dpi, 8 bit per pixel image to a 2400 dpi per pixel image). One method for inserting the watermark is to add a spatial domain representation of the
watermark image signal with the multilevel image signal (158), each at the halftone image resolution, to create a composite image.
Next, the encoder applies a threshold mask (160, 162) to the resulting composite image. The halftone screen process converts pixel levels below or equal to the corresponding threshold mask level to zero, otherwise it sets them to one. The result is a watermarked halftone image. The image may then be printed using a variety of halftone printing process, including ink jet printers and printing presses.
Fig. 6 is a diagram illustrating a method using the halftone screen as an orientation signal to determine geometric distortion of a watermarked image signal. In particular, this method is used to calculate the orientation of the host signal at the time of watermark embedding to facilitate recovery of a watermark message.
First, a scanner or camera or other imaging device captures a digital version of the printed, watermarked image 170 at sufficiently high resolution (172) to distinguish the halftone pixels. In the continuing example, a watermark decoder obtains blocks of a digital image 174, each having an area approximately corresponding to the size of the threshold mask (e.g., 128x128). Each digital image pixel is approximately equal in size to a halftone image pixel.
Next, the decoder filters the image block using a low pass filtering technique
176 like computing the average pixel value and replacing each pixel with the average. Next, the decoder applies the same threshold mask used to create the halftone image for printing (178, 180). This process results in a target orientation signal having orientation attributes similar to the original image.
To determine the geometric distortion of the received image relative to the original, watermarked image, the decoder performs a series of correlation operations between the target orientation signal and the received image. In the method shown in
Fig. 6, the decoder converts both the received image block and the target orientation signal derived from it to a correlation domain (182). In particular, it performs a
128x128 Fast Fourier Transform and resamples the resulting Fourier magnitude representation into a log-polar coordinate system to get a Fourier-Mellin representation of both images. Correlation operators 184, such as a generalized matched filter, correlate the Fourier-Mellin representation of the target signal (the reference signal)
with a Fourier-Mellin representation of the received image. The correlation operation produces an estimate of the scale and rotation parameters of the digital image.
The decoder transforms the received spatial digital image by the inverse of the estimated rotation and scale. The correlation operator correlates the resulting spatial image with the spatial target orientation signal to obtain an estimate of the translation of the image with respect to the threshold mask.
Having estimates for the scale, rotation, and translation parameters of the digital version of the watermarked image, a watermark message reader (186) demodulates the spread spectrum watermark signal from the received image. Each of the halftone watermark methods described above may be used in
"fragile watermark" or "semi-fragile" watermark applications, where the watermark is analyzed to detect and characterize alterations to a watermarked image. A fragile watermark refers to a watermark that degrades or becomes unreadable when the host image is subjected to certain types of distortion. A semi-fragile watermark is a variant on this theme where the watermark survives some types of distortion, but not others. One approach is to design the watermark so that it degrades or becomes unreadable in response to certain types of distortions, like printing, scanning on consumer grade scanners, image compression, etc. If the decoder is unable to detect or read the watermark, or the measured degradation exceeds a threshold, then the decoder provides output indicating that the image has been altered (e.g., is not authentic). The degradation can be measured by the extent of recovery of the watermark signal, such as the extent of correlation between a known watermark signal and one extracted from a received image. In such fragile or semi-fragile applications the gain values of the halftone watermark methods previously described may be chosen to provide an explicit balance between ease of reading and fragility with respect to certain types of distortions.
An enhanced approach is to design the watermark such that the type of alteration can be distinguished. For example, the type of alteration can be characterized by the type of degradation it causes to the watermark signal. By quantifying the degradation to different aspects of the watermark signal, a decoder can match the observed degradation to a particular type of alteration.
The performance of such fragile watermarking applications can be improved by choosing the frequency distribution of the embedded watermark to differentiate between different forms of distortion. There are two main considerations in designing the watermark signal attributes. The first is to put part of the watermark's energy in frequencies that are likely to be degraded by a particular type of degradation to be detected (such as print and scan operations); the second is to put energy in the frequencies that are less likely to be degraded by distortion that the watermark should survive (such as smudges, and normal wear and tear). By analyzing the frequency distribution of the watermark, the decoder can distinguish between forms of alteration. This approach is particularly useful to determine whether a printed image is genuine (e.g., whether it has been reproduced through a scan and print operation), as opposed to being merely soiled and worn through ordinary use.
A digital watermark may be embedded into a halftone image by using the digital watermark signal as a halftone screen. The following discussion illustrates an example of the process :
1. Create a digital watermark signal, for example, using spread spectrum techniques described above. Optionally, the watermark signal may include an orientation signal, such as a pseudorandom carrier signal that yields a pattern of signal peaks when transformed into the autocorrelation domain or an orientation signal that yields a pattern when transformed into a frequency domain, such as the Fourier domain.
The resulting watermark signal is a multi-level per pixel gray scale image or array of luminance values. It has a known block size, such as 64 by 64, 128 by 128, 256 by 256 pixel elements that can be tiled contiguously to create an image of varying sizes.
2. Calculate a histogram of the watermark image block.
3. Use the histogram to set halftone threshold levels for corresponding gray values (or luminance values). This example uses gray levels, but the same technique applies to luminance values as well as other color channels of color images. These halftone threshold levels are later used to convert a multilevel pixel value to a binary state of
"on" or "off of a halftone pixel by comparing the multilevel pixel value to the threshold and setting the halftone pixel at that location to a zero or one depending on whether the multilevel pixel value is less or greater than the threshold.
For each discrete gray value between 0-255, for example, the method picks a threshold level that, when applied to the multilevel pixel values in the watermark image block, the resulting halftone image has a tone density that achieves the desired gray value.
If the same watermark signal is to be embedded in several halftone images, the thresholds set by this process may be used over and over to create watermarked halftone images with the same embedded signal. Here's how the process of creating the halftone image operates:
1. Upsample or otherwise convert the target host multilevel per pixel image to the halftone dot resolution. Also, upsample or otherwise convert the multilevel per pixel watermark block to the halftone dot resolution, and if necessary, tile the blocks in a contiguous pattern across the target image so that the watermark signal is coextensive with the target image.
2. For each multilevel pixel value in the upsampled image, look-up the corresponding halftone threshold level that corresponds to that level.
3. Apply the threshold to the corresponding multilevel per pixel value in the watermark signal to get a halftone dot value that is either a zero or one. When the halftone conversion process of stages 2 and 3 is complete, the result is a watermarked halftone image that may be printed on paper or other objects.
Each halftone dot in the watermarked halftone image is either ink (minimum luminance, ) or no ink (maximum luminance). As such, a high luminance value in the target host image sets a threshold such that a corresponding pixel in the watermark image is more likely to be set to the "no ink" state. Conversely, a low luminance value
in the target image sets a threshold such that a corresponding pixel in the watermark image is more likely to be set to an "ink present" state.
This process of converting an image to a watermarked halftone image may be applied to one or more color planes in a color halftone image. If the target host image is a simple gray level image with only a few different gray levels, then the above technique can be used to set thresholds for each of the gray levels in the host image. Also, while the host image may have several gray levels, these gray levels can be divided into ranges, where each range is assigned a single halftone threshold. Masks can then be calculated to define the areas of the host image that correspond to each of the halftone thresholds. Finally, the halftone threshold corresponding to the particular mask is applied to the watermark signal areas covered by the mask to create a watermarked halftone image.
Consider the following example where the host image has three gray levels, each with a corresponding mask defining the areas of the image where those gray levels reside. This method now thresholds the multilevel per pixel watermark signal, coextensive with the target image, based on three different masks that correspond to areas of a particular tonal density in the target image: mask 1 has tonal density Dl, and uses threshold TI mask 2 has tonal density D2, and uses threshold T2 mask 3 has tonal density D3, and uses threshold T3.
There are a variety of ways to create the digital watermark signal in this technique. One way is:
1. Take a desired watermark message payload comprising an N bit binary string. 2. Perform error correction encoding of the message, such as convolution coding, turbo coding, etc.
3. Spread each bit of the error correction encoded message over a pseudorandom carrier signal by, for example, taking the XOR of the bit value with each value in the pseudorandom carrier; 4. Map the spread signal values to pixel locations in a watermark image block.
5. Convert spread signal to multilevel per pixel watermark signal. At this stage, the spread signal values are binary, e.g., {1, 0}, or {1,-1} representing increases or decreases in corresponding sample values (e.g., luminance, gray level, intensity, etc.) Each binary value corresponds to one or more neighboring pixels in the watermark signal. These binary values may be converted to multilevel values by adjusting a midlevel pixel value (e.g., mid level gray value of 128 in an 8 bit pixel) up or down corresponding to the value of the spread signal. The gain of the watermark signal may be adjusted by applying a scale factor to the spread signal. Further, the changes from each bit of the spread signal may be made to smoothly vary over the corresponding pixels in the neighborhood.
There are a variety of techniques to create the spread signal, such as convolving the message with the carrier, multiplying a binary antipodal message signal with a binary antipodal carrier, etc. The carrier may be used for synchronizing the detector with the watermark in a watermarked signal. Also, as detailed above, an orientation signal may be added to the spread signal to assist in calibration.
To recover the watermark message, a watermark detector operates as follows:
1. Capture a digital version of the halftone image (e.g., from a web cam or scanner capture of the printed halftone image).
2. Detect and align the image using any one of the techniques described previously.
3. Predict the watermark signal at each pixel in the aligned image from neighboring pixels; e.g., using a filter to de-correlate the host signal from the watermark signal.
4. Correlate the watermark signal with the carrier to recover the error correction encoded bit values. These may be "soft" bit values weighted by the extent of correlation with the carrier. In other words, rather than representing each estimate of an error correction encoded bit as hard one or zero, they are represented by some weighted
probability between values corresponding to a binary one or zero (e.g., between -1 and 1, -128 and 128, etc.)
5. Perform error correction decoding on the soft bit values, e.g., using a convolution or turbo coding scheme compatible with the embedded message to recover the message payload.
The process of reading the watermark may also be used to detect whether the watermarked halftone image has been altered. In particular, it may be used to determine whether the printed halftone image has been copied, e.g., photocopied, or scanned and re-printed.
The detected strength of the watermark signal may be compared to a copy detection threshold to determine whether the printed object has been copied. One approach for measuring the strength of the watermark signal is as follows:
1. Use the message payload read from the watermark to re-create the original embedded bit sequence (including redundantly encoded bits from error correction coding) used for the watermark.
2. Convert the original bit sequence so that a zero is represented by -1 and a one is represented by 1.
3. Multiply (element-wise) the soft- valued bit sequence used to decode the watermark by the sequence of step 2.
4. Create one or more measures of watermark strength from the sequence resulting in the previous step. One such measure is the sum of the squares of the values in the sequence. Another measure is the square of the sum of the values in the sequence. Other measurements are possible as well. For example, soft bits associated with high frequency components of the watermark signal may be analyzed to get' a strength measure attributed to high frequency components. Such high frequencies are likely to be more sensitive to degradation due to photocopying, digital to analog and analog to digital conversion, scanning and re-printing, etc.
5. Compare the strength measures to thresholds to decide if the suspect image has been captured from an original or a copy of the printed object. The threshold is derived by evaluating the difference in measured watermark strength of copied vs. original printed objects on the subject printer platform used to create the original, and a variety of copiers, scanners and printers used to create copies.
As an additional method of detecting copying, the watermark message payload may be used to identify the printer model and/or resolution of the watermarked halftone image. Then, to verify a printed object is authentic, the watermark detector extracts the message payload and determines the printer model and or printed resolution. A forensic scan of the printed object is than analyzed to verify that the object was printed at the resolution specified in the watermark payload. One specific way to analyze the resolution of the forensic scan is examine its frequency content to determine if it was likely printed at the specified resolution. The technique described above of detecting signal tampering by measuring symbol errors can be applied to two or more different watermarks embedded at different spatial resolutions in the host media signal. Each of the watermarks may have the same or different message payloads. In the first case where the watermarks have the same message payloads, the message extracted from one of the watermarks may be used to measure bit errors in each of the other watermarks. For example, the message payload from a robust watermark embedded at a low spatial resolution may be used to measure the bit errors from a less robust watermark at a higher spatial resolution. If the watermarks carry different message payloads, then error detection bits, such as CRC bits, can be used in each message payload to ensure that the message is accurately decoded before re-creating the original, embedded bit sequence.
Using two or more different watermarks enables a threshold to be set based on the ratio of the signal strength of the watermarks relative to each other. In particular, the signal strength of a first watermark at a high resolution (600-1200 dpi) is divided by the signal strength of a second watermark at a lower resolution (75-100 dpi). In each case, the signal strength is measured using a measure of symbol errors or some other measure (e.g., correlation measure).
If the measured strength exceeds a threshold, the detector deems the watermark signal to be authentic and generates an authentication signal. This signal may be a simple binary value indicating whether or not the object is authentic, or a more complex image signal indicating where bit errors were detected in the scanned image. The watermark and host signal can be particularly tailored to detect copying by photo duplication and printing/re-scanning of the printed object. This entails embedding the watermark with selected screening structures at particular spatial frequencies/resolutions that are likely to generate message symbol errors when the object is re-printed. This detection process has an additional advantages in that it enables automatic authentication, it can be used with lower quality camera devices such as web cameras and common image scanners, and it allows the watermark to serve the functions of determining authenticity as well as carrying a message payload useful for a variety of applications.
The message payload can include an identifier or index to a database that stores information about the object or a link to a network resource (e.g., a web page on the Internet). The payload may also include a covert trace identifier associated with a particular authentic item, batch of items, printer, or distributor. This enables a counterfeit object, or authentic object that has been printed without authority to be detected and traced to a particular source, such as its printer, distributor or batch number.
The payload may also carry printer characteristics or printer type information that enables the watermark reader to adapt its detection routines to printer types that generated the authentic object. For example, the payload may carry an identifier that specifies the type of halftoning used to create the authentic image, and more specifically, the attributes of the halftone screen. With this information, the reader can check authenticity by determining whether features associated with the halftone screen exist in the printed object. Similarly, the reader can check for halftone screen attributes that indicate that a different halftone screen process has been used (e.g., a counterfeit has been created using a different halftone screen). One specific example is a payload that identifies the halftone screen type and paper type. The reader extracts this payload from a robust watermark payload and then analyzes the halftone screen and paper
attributes to see if they match the halftone type and paper type indicated in the watermark payload. For example, the halftone type can specify the type of halftone screen patterns used to create an authentic image. If this halftone screen pattern is not detected (e.g., by absence of a watermark embedded using that particular type of halftone screen), then the image is considered to be a fake.
A related approach for analyzing halftone type is to look for halftone attributes, like tell-tale signs of stochastic halftone screens vs. ordered dither matrix type screens. Dither matrix screens used in low end printers tend to generate tell tale patterns, such as a pattern of peaks in the Fourier domain that differentiate the halftone process from a stochastic screen, such as an error diffusion process, which does not generate such telltale peaks. If the reader finds peaks where none were anticipated, then the image is deemed a fake. Likewise, if the reader finds no peaks where peaks were anticipated, then the image is also deemed a fake. Before performing such analysis, it is preferable to use the embedded digital watermark to re-align the image to its original orientation at the time of printing. Attributes due to the halftone screen can then be evaluated in a proper spatial frame of reference. For example, if the original ordered dither matrix printer created an array of peaks in the Fourier domain, then the peak locations can be checked more accurately after the image is realigned.
Concluding Remarks
Having described and illustrated the principles of the technology with reference to specific implementations, it will be recognized that the technology can be implemented in many other, different, forms. To provide a comprehensive disclosure without unduly lengthening the specification, applicants incorporate by reference the patents and patent applications referenced above.
The methods, processes, and systems described above may be implemented in hardware, software or a combination of hardware and software. For example, the auxiliary data encoding processes may be implemented in a programmable computer or a special purpose digital circuit. Similarly, auxiliary data decoding may be implemented in software, firmware, hardware, or combinations of software, firmware
and hardware. The methods and processes described above may be implemented in programs executed from a system's memory (a computer readable medium, such as an electronic, optical or magnetic storage device).
The particular combinations of elements and features in the above-detailed embodiments are exemplary only; the interchanging and substitution of these teachings with other teachings in this and the incorporated-by-reference patents/applications are also contemplated.
Claims
1. A method of halftone image watermarking comprising: assigning a set of halftone watermark values to locations within a halftone image; and diffusing error associated with the halftone watermark values to neighboring locations within the halftone image, where the error is characterized as a difference between multilevel pixel value at a location of a halftone watermark dot, and the halftone watermark value of the halftone watermark dot.
2. The method of claim 1 wherein error associated with the halftone watermark values is diffused on one direction in the image, and error associated with converting multilevel pixel values in the image to a halftone image is diffused in another direction.
3. The method of claim 1 wherein the locations and values of halftone watermark dots are specified by a key.
4. The method of claim 3 wherein the key is carried in a separate watermark embedded in the image.
5. The method of claim 3 wherein the key is used to seed a pseudorandom process that specifies the values and locations of the halftone watermark.
6. The method of claim 1 comprising: embedding a watermark into a multilevel pixel representation of an image before converting the image to the halftone image.
7. The method of claim 6 wherein the watermark embedded in the multilevel pixel representation of the image includes a watermark orientation signal used to detect the watermark in an image after undergoing geometric distortion.
8. The method of claim 6 wherein the watermark in the multilevel pixel representation carries a message including a key specifying locations of the halftone watermark.
9. A computer readable medium on which is stored software for performing the method of claim 1.
10. A method of decoding a halftone watermark from an image comprising: using a key to identify locations and values of a halftone watermark; and analyzing pixel values at the locations to determine whether the values at the locations correspond to the values specified by the key.
11. The method of claim 10 including: scanning a printed version of the image at a suitably high resolution to read halftone watermark dots embedded into the printed version of the image; and then performing the actions of claim 10.
12. The method of claim 10 including deriving the key from data embedded in the image.
13. The method of claim 12 wherein the key is derived from a watermark embedded in the image.
14. The method of claim 13 wherein the watermark from which the key is derived is embedded so as to survive printing and scanning.
15. The method of claim 13 including: detecting an orientation signal and using the orientation signal to determined orientation parameters of the watermark; and using the orientation parameters to read a message embedded in the watermark, the message including the key.
16. A computer readable medium having software for performing the method of claim 10.
17. A method of embedding a watermark in a halftone image comprising: computing a watermark image comprising an array of values corresponding to pixel locations in a halftone image; embedding the watermark image in the halftone image by using the values of the watermark image to modulate thresholds at the pixel locations, where the thresholds are used to convert multilevel pixel values in a multilevel per pixel image into halftone pixel values of the halftone image.
18. The method of claim 17 wherein the watermark image comprises an orientation watermark signal enabling a watermark decoder to compensate for rotation, scale and translation.
19. The method of claim 17 wherein the watermark image comprises a watermark message signal carrying a message of two or more symbols.
20. A computer readable medium on which is stored software for performing the method of claim 17.
21. A method of embedding a watermark in a halftone image comprising: computing a watermark image comprising an array of values corresponding to pixel locations in a target halftone image at a resolution; combining the array of values of the watermark image with corresponding multilevel pixel values of a multilevel per pixel image to create a watermarked multilevel per pixel image at the resolution of the target halftone image; and performing a halftone process to convert the watermarked multilevel per pixel image to a watermarked halftone image.
22. The method of claim 21 wherein combining comprises adding the array of values of the watermark image with the corresponding pixel values of the multilevel per pixel image.
23. The method of claim 21 wherein performing the halftone process includes performing an error diffusion process on the watermarked multilevel per pixel image.
24. The method of claim 21 wherein performing the halftone process includes performing an ordered dithering process on the watermarked multilevel per pixel image.
25. The method of claim 21 wherein the watermark image comprises an orientation watermark signal enabling a watermark decoder to compensate for rotation, scale and translation.
26. The method of claim 21 wherein the watermark image comprises a watermark message signal carrying a message of two or more symbols.
27. A computer readable medium on which is stored software for performing the method of claim 21.
28. A watermark decoder comprising: a watermark detector for analyzing portions of an image to detect a watermark signal embedded in the image, the image having been scanned from a halftone printed image at a sufficiently high resolution to discern a watermark image embedded at a resolution of the halftone image; and a watermark reader for reading a watermark signal from the portions of the image.
29. The watermark decoder of claim 28 wherein the decoder is operable to use a key to identify locations and values of a halftone watermark, and is operable to detect alteration of the image by analyzing pixel values at the locations to determine whether the values at the locations correspond to the values specified by the key.
30. The watermark decoder of claim 29 wherein the key is decoded from a primary watermark in the image and the halftone watermark is a secondary watermark in the image.
31. The watermark decoder of claim 28 wherein the watermark signal is encoded by computing a watermark image comprising an array of values corresponding to pixel locations in a halftone image, and embedding the watermark image in the halftone image by using the values of the watermark image to modulate thresholds at the pixel locations, where the thresholds are used to convert multilevel pixel values in a multilevel per pixel image into halftone pixel values of the halftone image.
32. The method of claim 31 wherein the watermark decoder is operable to detect alteration of the image by analyzing the watermark signal.
33. The watermark decoder of claim 28 wherein the watermark signal is encoded by computing a watermark image comprising an array of values corresponding to pixel locations in a target halftone image at a resolution, combining the array of values of the watermark image with corresponding multilevel pixel values of a multilevel per pixel image to create a watermarked multilevel per pixel image at the resolution of the target halftone image, and performing a halftone process to convert the watermarked multilevel per pixel image to a watermarked halftone image.
34. The watermark decoder of claim 33 wherein the watermark decoder is operable to detect alteration of the image by analyzing the watermark signal.
35. An image processor for determining geometric distortion of an image; the process comprising: a halftone screen threshold analyzer for creating a target orientation signal by applying a halftone screen threshold mask to image data from a received image; and a correlation operator for correlating the target orientation signal with the received image in a correlation domain, the correlation operator producing one or more orientation parameters estimating geometric distortion of the image.
36. The processor of claim 35 wherein the orientation parameter is used to realign the image to facilitate decoding of a watermark from the re-aligned image.
37. The processor of claim 36 wherein the watermark carries a message of one or more symbols.
38. The processor of claim 36 wherein the watermark is analyzed to detect alteration of the image.
39. The method of claim 1 wherein values and locations of the halftone watermark are chosen using a human visual system model.
40. A method of embedding a digital watermark into a halftone image comprising: redundantly encoding a multi-bit message; transforming the encoded message to a multilevel per pixel watermark image; deriving halftone thresholds from the multilevel per pixel watermark image; converting a multilevel per pixel target image to a watermarked halftone image by: using the multilevel pixels in the target image to select corresponding halftone thresholds from the halftone thresholds derived from the watermark image, and applying the selected thresholds to corresponding multilevel pixels in the watermark image to create the watermarked halftone image of the target image.
41. The method of claim 40 wherein the multi-bit message includes information about a printer to be used in printing the watermarked halftone image.
42. The method of claim 41 wherein the information about the printer includes resolution of the printer.
43. The method of claim 41 wherein the information about the printer includes information identifying a model of the printer.
44. The method of claim 40 wherein transforming the encoded message includes modulating the message with a pseudo random carrier.
45. The method of claim 40 wherein deriving the halftone thresholds includes creating a histogram of the watermark image; and from the histogram, associated multilevel pixel values with halftone thresholds such that when a halftone threshold is applied to the watermark image, a halftone image is created with a tonal density that corresponds to the multilevel pixel value associated with the halftone threshold.-
46. A method of measuring digital watermark strength comprising: processing a watermarked signal to extract estimates of error correction encoded bits embedded into the watermarked signal; decoding the error correction encoded bits to compute a message payload; re-encoding the message payload to compute error correction encoded bits; computing a measure of watermark strength from the error correction encoded bits and the estimates of error correction encoded bits.
47. The method of claim 46 wherein the measure of watermark strength is compared with a strength threshold to detect degradation of the watermark signal.
48. The method of claim 47 wherein the watermarked signal is an image captured of a printed object, and the strength threshold is used to detect whether the printed object is an original or a copy.
49. The method of claim 46 wherein the message payload includes printer information about a printer used to print the watermarked signal; and the printer information is used to determine whether a printed object bearing the watermarked signal is an original or a copy.
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/689,226 US6694041B1 (en) | 2000-10-11 | 2000-10-11 | Halftone watermarking and related applications |
US689226 | 2000-10-11 | ||
US840016 | 2001-04-20 | ||
US09/840,016 US6760464B2 (en) | 2000-10-11 | 2001-04-20 | Halftone watermarking and related applications |
US09/938,870 US7246239B2 (en) | 2001-01-24 | 2001-08-23 | Digital watermarks for checking authenticity of printed objects |
US938870 | 2001-08-23 | ||
PCT/US2001/031784 WO2002031752A1 (en) | 2000-10-11 | 2001-10-10 | Halftone watermarking and related applications |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1325464A1 true EP1325464A1 (en) | 2003-07-09 |
Family
ID=27418513
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP01979700A Withdrawn EP1325464A1 (en) | 2000-10-11 | 2001-10-10 | Halftone watermarking and related applications |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP1325464A1 (en) |
JP (2) | JP4199540B2 (en) |
KR (1) | KR20030051712A (en) |
AU (1) | AU2002211634A1 (en) |
CA (1) | CA2422412A1 (en) |
WO (1) | WO2002031752A1 (en) |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4554358B2 (en) * | 2002-05-14 | 2010-09-29 | メディアセック テクノロジーズ ゲーエムべーハー | Visible authentication pattern for printed documents |
KR100814029B1 (en) * | 2006-03-30 | 2008-03-14 | 순천향대학교 산학협력단 | Method for digital watermarking |
US7925045B2 (en) * | 2007-09-10 | 2011-04-12 | Konica Minolta Systems Laboratory, Inc. | Determining document authenticity in a closed-loop process |
US8451501B2 (en) * | 2010-12-13 | 2013-05-28 | Xerox Corporation | Watermark decoding via spectral analysis of pixel spacing |
US8594453B2 (en) | 2011-08-18 | 2013-11-26 | Hewlett-Packard Development Company, L.P. | Method of robust alignment and payload recovery for data-bearing images |
US8970910B2 (en) * | 2011-12-07 | 2015-03-03 | Xerox Corporation | Visible and invisible watermarking of printed images via 2nd generation stochastic seed frequency modulation |
WO2013119233A1 (en) * | 2012-02-09 | 2013-08-15 | Hewlett-Packard Development Company, L.P. | Forensic verification utilizing halftone boundaries |
US9373032B2 (en) | 2012-02-09 | 2016-06-21 | Hewlett-Packard Development Company, L.P. | Forensic verification utilizing forensic markings inside halftones |
EP2812849B1 (en) * | 2012-02-09 | 2019-12-18 | Hewlett-Packard Development Company, L.P. | Forensic verification from halftone images |
US20150169928A1 (en) | 2012-03-01 | 2015-06-18 | Sys-Tech Solutions, Inc. | Methods and a system for verifying the identity of a printed item |
US20150379321A1 (en) | 2012-03-01 | 2015-12-31 | Sys-Tech Solutions, Inc. | Methods and a system for verifying the authenticity of a mark |
WO2013130946A1 (en) | 2012-03-01 | 2013-09-06 | Sys-Tech Solutions, Inc. | Unique identification information from marked features |
KR101817153B1 (en) * | 2015-01-28 | 2018-01-11 | 한국전자통신연구원 | Information insertion method, information extraction method and information extraction apparatus using dot based information robust to geometric distortion |
US9940572B2 (en) | 2015-02-17 | 2018-04-10 | Sys-Tech Solutions, Inc. | Methods and a computing device for determining whether a mark is genuine |
KR101889265B1 (en) | 2015-06-16 | 2018-08-16 | 시스-테크 솔루션스 인코포레이티드 | Methods for determining if a mark is genuine, |
KR101978109B1 (en) | 2016-03-14 | 2019-05-13 | 시스-테크 솔루션스 인코포레이티드 | Methods for determining if a mark is genuine, |
US9986202B2 (en) | 2016-03-28 | 2018-05-29 | Microsoft Technology Licensing, Llc | Spectrum pre-shaping in video |
JP2022126367A (en) | 2021-02-18 | 2022-08-30 | キヤノン株式会社 | Image processing device, image processing method, and program |
JP2022126334A (en) | 2021-02-18 | 2022-08-30 | キヤノン株式会社 | Image processing device, image processing method, and program |
JP2022139238A (en) | 2021-03-11 | 2022-09-26 | キヤノン株式会社 | Image processing system and image processing method |
JP2022139237A (en) | 2021-03-11 | 2022-09-26 | キヤノン株式会社 | Information processing apparatus and program and image processing method |
JP2022139239A (en) | 2021-03-11 | 2022-09-26 | キヤノン株式会社 | Information processing apparatus and program and image processing method |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5875249A (en) * | 1997-01-08 | 1999-02-23 | International Business Machines Corporation | Invisible image watermark for image verification |
US6252971B1 (en) * | 1998-04-29 | 2001-06-26 | Xerox Corporation | Digital watermarking using phase-shifted stoclustic screens |
-
2001
- 2001-10-10 KR KR10-2003-7005154A patent/KR20030051712A/en not_active Application Discontinuation
- 2001-10-10 JP JP2002535062A patent/JP4199540B2/en not_active Expired - Lifetime
- 2001-10-10 EP EP01979700A patent/EP1325464A1/en not_active Withdrawn
- 2001-10-10 WO PCT/US2001/031784 patent/WO2002031752A1/en not_active Application Discontinuation
- 2001-10-10 CA CA002422412A patent/CA2422412A1/en not_active Abandoned
- 2001-10-10 AU AU2002211634A patent/AU2002211634A1/en not_active Abandoned
-
2006
- 2006-03-24 JP JP2006083301A patent/JP4187749B2/en not_active Expired - Lifetime
Non-Patent Citations (1)
Title |
---|
See references of WO0231752A1 * |
Also Published As
Publication number | Publication date |
---|---|
WO2002031752A1 (en) | 2002-04-18 |
CA2422412A1 (en) | 2002-04-18 |
JP4187749B2 (en) | 2008-11-26 |
JP4199540B2 (en) | 2008-12-17 |
JP2006270972A (en) | 2006-10-05 |
AU2002211634A1 (en) | 2002-04-22 |
KR20030051712A (en) | 2003-06-25 |
JP2004511938A (en) | 2004-04-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6760464B2 (en) | Halftone watermarking and related applications | |
US7020349B2 (en) | Halftone watermarking and related applications | |
JP4187749B2 (en) | Halftone watermarking and related applications | |
US8006092B2 (en) | Digital watermarks for checking authenticity of printed objects | |
EP1485863B1 (en) | Authenticating printed objects using digital watermarks associated with multidimensional quality metrics | |
US7277468B2 (en) | Measuring quality of service of broadcast multimedia signals using digital watermark analyses | |
US8051295B2 (en) | Benchmarks for digital watermarking | |
US8130811B2 (en) | Assessing quality of service using digital watermark information | |
US7369678B2 (en) | Digital watermark and steganographic decoding | |
US6721440B2 (en) | Low visibility watermarks using an out-of-phase color | |
US7970166B2 (en) | Steganographic encoding methods and apparatus | |
US20080298632A1 (en) | Correcting image capture distortion | |
JP3884891B2 (en) | Image processing apparatus and method, and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20030311 |
|
AK | Designated contracting states |
Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK RO SI |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20040504 |