US20010041016A1 - Method and apparatus for near lossless digital image compression - Google Patents
- Publication number
- US20010041016A1 (US 09/904,829)
- Authority
- US
- United States
- Prior art keywords
- level
- quantization table
- quantizer
- information
- sample
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/005—Statistical coding, e.g. Huffman, run length coding
Definitions
- the present invention pertains to the field of digital image compression. More particularly, this invention pertains to algorithms for near-lossless digital video compression.
- a video signal comprises a sequence of frames or images which when displayed at a given frame rate (e.g., 15 to 30 frames per second) simulates the appearance of motion to a human observer.
- Each frame of the video image comprises a matrix of picture elements, known as “pixels” or “pels.”
- a pixel is the minimum unit of the picture which may be assigned a luminance intensity and a color.
- 3 bytes of data can be used to define visual information for a pixel.
- One popular data format assigns a luminance intensity represented by 1 byte of data to each pixel and further assigns a color represented by 2 bytes of data to groups of 4 pixels each. This format results in an average use of 12 data bits to represent each pixel. Therefore, a single frame at a resolution of 320×240 can be represented by about 120,000 bytes of data.
- Digital video cameras must transfer frame data to a video memory system for display. Multiple frames are transferred over a period of time. The number of frames transferred and displayed per second is referred to as the frame rate. In general, greater frame rates contribute to heightened appearances of motion, while lower frame rates contribute to the observer being able to perceive individual frames, thus destroying the appearance of motion.
- a frame rate of 15 frames per second (fps) with a resolution of 320×240 and each pixel being represented by an average of 12 bits requires that about 14 Mbits of data must be transferred between the digital video camera and the video memory each second.
- The Universal Serial Bus (USB) data transfer rate of 8 Mbits/second supports a wide variety of desktop peripherals, from modems, printers, microphones and speakers to graphics tablets, game controls, joysticks, scanners, and digital cameras.
- the USB data rate of 8 Mbits/second is insufficient to support a frame rate of 15 fps for 320×240 video.
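The figures quoted above can be checked with a few lines of arithmetic. This is only a sketch of the calculation; the variable names are illustrative, and the use of decimal megabits (10^6 bits) is an assumption, since the text does not say which convention it uses.

```python
# Figures taken from the text: 1 byte of luminance per pixel, 2 bytes of
# color shared by each group of 4 pixels, 320x240 frames, 15 fps, 8 Mbit/s bus.
WIDTH, HEIGHT = 320, 240
FPS = 15
USB_MBITS_PER_S = 8

# 8 luminance bits per pixel plus 16 color bits spread over 4 pixels.
bits_per_pixel = 8 + (2 * 8) / 4
frame_bytes = WIDTH * HEIGHT * bits_per_pixel / 8
# Assumption: 1 Mbit = 1,000,000 bits.
mbits_per_s = frame_bytes * 8 * FPS / 1e6

print(bits_per_pixel)                  # 12.0
print(frame_bytes)                     # 115200.0 (the text rounds to about 120,000)
print(round(mbits_per_s, 1))           # 13.8 (the text rounds to about 14)
print(mbits_per_s > USB_MBITS_PER_S)   # True: uncompressed video does not fit
```

Under these assumptions the stream has to shrink by a factor of at least about 1.7 just to fit the bus, and by more if other devices share it.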
- Interframe compression allows an image or video segment to be transferred and stored in substantially fewer bytes of data than required for uncompressed frames.
- Many methods of digital video compression are based on the idea of eliminating redundant information from frame to frame in a digitized video segment. This is referred to as “interframe compression.” Interframe compression methods exploit the temporal redundancy that exists between digital video frames from the same scene recorded moments apart in time. This reduces the required data needed to encode each frame.
- Interframe compression is not ideally suited to the USB environment. This is due to the fact that the USB architecture will not guarantee that every frame of digital video data will be transferred. It is possible that some frames may be dropped. Since interframe compression techniques depend on frame-by-frame redundancies, dropped frames can have a significant impact on picture quality. Therefore, it is desirable to provide a method and apparatus for compressing digital video images that will allow digital cameras to operate on peripheral busses such as the USB.
- a method and apparatus for encoding a digital image is disclosed.
- a difference is calculated between a first sample and a second sample.
- a quantization table level is selected from a quantization table that includes at least one level, and information identifying the selected level is placed into an information stream.
- a quantizer is selected from the quantization table level based on the calculated difference and information corresponding to the selected quantizer is placed into the information stream.
- FIG. 1 shows a flow chart of a method for encoding data representing a component of a picture element implemented in accordance with one embodiment of the invention.
- FIG. 2 depicts a flow chart of a method for encoding data representing a component of each picture element of a digital image implemented in accordance with one embodiment of the invention.
- FIG. 3 shows data representing two scan lines from an example Y-plane of a frame of digital video.
- FIG. 4 depicts an example quantization table implemented in accordance with one embodiment of the invention.
- FIG. 5 a illustrates a high quality, 4-bit encoding example implemented in accordance with one embodiment of the invention.
- FIG. 5 b depicts an example decoding result from the high quality, 4-bit encoding example illustrated in FIG. 5 a implemented in accordance with one embodiment of the invention.
- FIG. 6 a illustrates a high compression, 3-bit encoding example implemented in accordance with one embodiment of the invention.
- FIG. 6 b depicts an example decoding result from the high compression, 3-bit encoding example illustrated in FIG. 6 a implemented in accordance with one embodiment of the invention.
- FIG. 7 shows an example computer system including an imaging device implemented in accordance with one embodiment of the invention.
- FIG. 8 a depicts an example 4-bit encode table with no under-correction implemented in accordance with one embodiment of the invention.
- FIG. 8 b depicts an example 4-bit encode table with 50% under-correction implemented in accordance with one embodiment of the invention.
- FIG. 8 c depicts an example 4-bit encode table implemented in accordance with one embodiment of the invention.
- the invention solves the problem of providing digital image compression that results in an ordinary viewer of the resulting image perceiving the image as “non-lossy” and further that does not rely on interframe redundancies, thereby allowing high quality digital video signals to be transferred across a peripheral bus such as the USB.
- the invention accomplishes this by using a compression method that takes advantage of redundancies between two scan lines within a given frame of digital video.
- the compression algorithm of the invention uses Differential Pulse Code Modulation (DPCM) with varying levels or tables of quantizers.
- a difference is calculated between a first sample from a current scan line and a corresponding second sample from a previous scan line.
- a quantization table level is selected from a quantization table that includes at least one level, and information identifying the selected level is placed into an information stream.
- a quantizer is selected from the quantization table level based on the calculated difference and information corresponding to the selected quantizer is placed into the information stream.
- the invention provides “near-lossless” digital image compression.
- Near-lossless as used herein means that while some information corresponding to an image is not preserved during the compression process, an ordinary viewer of the resulting decoded and displayed image would not notice any degradation in quality.
- each picture element has a luminance intensity component (the Y component) and two color components (the C R and C B components).
- the totality of Y components for a given frame is referred to as the Y plane for that frame.
- the totality of the C R and C B components for a given frame can be referred to as C R and C B planes, respectively.
- These luminance and color components conform to the International Telecommunications Union—Radio Sector (ITU-R) BT.601 standard.
- Other video color systems, such as RGB may also be used with the invention.
- FIG. 1 shows a flow chart of a method for encoding data representing a component of a picture element implemented in accordance with one embodiment of the invention.
- a difference is calculated between a sample value for a picture element from a current scan line and a sample value for a picture element from a previous scan line.
- a quantization table level is selected at step 120 .
- the quantization table may include any number of levels, with each level including any number of quantizers.
- An example quantization table is shown in FIG. 4, discussed below. The level selection is based on the value of the calculated difference.
- a determination is made at step 125 as to whether a level switch occurs.
- the term “level switch” is more fully discussed below in connection with FIG. 2. If there is a level switch, information identifying the selected level is placed into an information stream at step 130 .
- the information stream may be a bit-stream that is transferred across a peripheral bus such as the USB.
- the information stream may also include information being transferred across buses with data path widths greater than 1 bit.
- the information stream is received by a device or system that will perform decoding functions.
- a quantizer from the selected level is selected at step 140 .
- the term quantizer as used herein is defined as an index into the quantization table. The quantizer selection is based on the value of the calculated difference. Following quantizer selection, information identifying the selected quantizer is placed in the information stream at step 150 .
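The per-sample flow of FIG. 1 can be sketched as follows. The table values are hypothetical, because FIG. 4 is not reproduced in the text, and both the level-selection rule (first level whose range covers the difference) and the nearest-value quantizer choice are assumptions made for illustration. The `("level", n)` and `("quantizer", n)` tuples are a stand-in for whatever the real information stream carries.

```python
# Hypothetical two-level table: index = quantizer, entry = quantization value.
# The real values are in FIG. 4, which is not reproduced in the text.
TABLE = [
    [0, 1, -1, 2, -2, 4, -4, 7, -7],           # fine level, range 0 to +/-7
    [0, 8, -8, 24, -24, 64, -64, 181, -181],   # coarse level, range 0 to +/-181
]

def select_level(diff):
    """Assumed rule: pick the first level whose range covers the difference."""
    for level, values in enumerate(TABLE):
        if abs(diff) <= max(values):
            return level
    return len(TABLE) - 1

def select_quantizer(level, diff):
    """Return the index of the quantization value closest to the difference."""
    values = TABLE[level]
    return min(range(len(values)), key=lambda q: abs(values[q] - diff))

def encode_sample(current, previous, active_level, stream):
    diff = current - previous                    # difference between scan lines
    level = select_level(diff)                   # step 120
    if level != active_level:                    # step 125: level switch?
        stream.append(("level", level))          # step 130
    quantizer = select_quantizer(level, diff)    # step 140
    stream.append(("quantizer", quantizer))      # step 150
    return level

stream = []
level = encode_sample(current=100, previous=103, active_level=0, stream=stream)
level = encode_sample(current=40, previous=100, active_level=level, stream=stream)
print(stream)  # [('quantizer', 4), ('level', 1), ('quantizer', 6)]
```

The second sample differs from its predictor by -60, which the fine level cannot represent, so the sketch emits a level identifier before the quantizer.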
- FIG. 2 depicts a flow chart of a method for encoding data representing a component of each picture element of a digital image implemented in accordance with one embodiment of the invention.
- each plane is processed independently, although other embodiments are possible where the planes are not processed independently.
- the method depicted in FIG. 2 begins with step 202 where the 0 th scan line is sampled.
- the sample values for scan line 0 are stored in a buffer or some other storage device at step 204 .
- the sample values are placed in an information stream at step 206 .
- the 0 th scan line of each plane is not quantized or encoded at all.
- the 0 th line serves as a predictor for the following scan line, as discussed below.
- At step 208, the 0 th picture element of the next scan line (line 1) is sampled. After sampling the 0 th picture element of scan line 1, a difference is calculated at step 210 between the sample value of the 0 th picture element of line 1 and the sample value of the 0 th picture element from the 0 th scan line.
- the present embodiment uses a quantization table that has multiple levels. For example, one level may have quantizers ranging in value from 0 to +/−7 while another level may have quantizers ranging in value from 0 to +/−181, as shown in the example quantization table of FIG. 4 which is discussed below.
- a default level, known to both the encoder and decoder, is used at the beginning of each scan line that is encoded. Since the default level is known to both the encoder and the decoder, there is no need to communicate the selection of the default level to the decoder.
- the invention may also be practiced without a default level. In this case, a level must be selected at the beginning of processing each scan line, and the selection of the new level must be communicated to the decoder.
- a quantizer is selected at step 214 .
- the quantizer is selected based on the difference calculated between the sample value of the 0 th picture element of line 1 and the sample value of the 0 th picture element from the 0 th scan line at step 210 .
- the quantizer is selected from the default quantization table level. Following quantizer selection, information identifying the selected quantizer is placed into the information stream at step 216 .
- the quantizer is decoded and written back to a buffer for use as a predictor by the 0 th element of the next scan line.
- the decoding occurs at step 218 .
- the decoding step 218 uses the quantizer to look up a quantization value in the currently selected quantization table level.
- the quantization value is placed in a buffer or other storage device at step 220 .
- the next picture element in the current scan line (presently line 1) is then processed.
- the next picture element is sampled at step 222 , and a difference between the sample value of the current picture element and the sample value for the corresponding picture element from the previous scan line is calculated at step 224 .
- a quantization table level is selected. The selection is based on the difference calculated at step 224 . Specific example embodiments of level selection algorithms are discussed below in connection with FIG. 4.
- level switch means that a different quantization table level will be used for the current quantizer selection than was used for the previous quantizer selection.
- the invention may be practiced by allowing level switches every time a level is selected at step 226 , or the invention may be practiced by restricting the frequency of level switches. For example, it is possible to only allow level switches every four times a picture element is processed. Thus, a single level is used for at least four contiguous samples before a level switch can occur.
- Level shift maximum can be set to equal 10% of the byte size of one raw (uncompressed) video frame, plus one. The plus one is there in order to handle the case where the 10% limit is reached part way through a scan line. The one extra level switch can be used to switch to the default level, and the default level will then be used for the remainder of the frame. Level shift maximums other than 10% are also possible. Further, the level shift maximum may be varied at any time.
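As a worked instance of the 10% rule, the sketch below computes the limit for a 320×240 frame at an average of 12 bits per pixel. The function name and the integer-percent parameter are illustrative choices, not taken from the source.

```python
def level_shift_maximum(raw_frame_bytes, percent=10):
    """Percentage of the raw frame size in bytes, plus one spare switch
    reserved for returning to the default level."""
    return raw_frame_bytes * percent // 100 + 1

# 320x240 pixels at an average of 12 bits per pixel.
RAW_FRAME_BYTES = 320 * 240 * 12 // 8

limit = level_shift_maximum(RAW_FRAME_BYTES)
print(RAW_FRAME_BYTES)  # 115200
print(limit)            # 11521
```

Integer arithmetic is used so that the result does not depend on floating-point rounding.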
- Step 230 may include placing an escape code into the information stream to let the decoding device know that information identifying a new level follows. Following step 230 is step 232 .
- At step 232, a quantizer is selected from the proper quantization table level and information identifying the selected quantizer is placed into the information stream.
- the current sample is decoded at step 234 and the decoded current sample is placed in a buffer or other storage device at step 236 .
- step 222 follows step 238 and another picture element is processed at steps 222 through 236 . These steps are repeated until the final picture element for the current scan line has been processed.
- At step 240, a determination is made as to whether the current scan line is the last scan line in the frame. If the current scan line is the last scan line, then the processing of the frame (or at least one plane of the frame) has completed. Otherwise, the processing of the next scan line begins at step 208 . Steps 208 through 240 are repeated until the last scan line has been processed.
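The scan-line loop of FIG. 2 can be sketched in a simplified form that uses a single level and omits level switches. Several points are assumptions: the table values are hypothetical, the decoded sample is taken to be the predictor plus the looked-up quantization value, and results are clamped to the 0 to 255 range of a 1-byte sample. The point of the sketch is the write-back: the encoder predicts from decoded values, so a decoder that sees only the stream reconstructs exactly the same lines.

```python
# Hypothetical single-level table (index = quantizer, entry = value).
LEVEL = [0, 2, -2, 6, -6, 14, -14, 30, -30, 60, -60, 110, -110, 181, -181]

def nearest_quantizer(diff):
    return min(range(len(LEVEL)), key=lambda q: abs(LEVEL[q] - diff))

def clamp(value):
    # Assumption: samples are 1-byte values.
    return max(0, min(255, value))

def encode_plane(plane):
    """Return (stream, reconstruction) for a list of scan lines."""
    stream = list(plane[0])          # line 0 is sent as-is (step 206)
    predictor = list(plane[0])       # and buffered as the predictor (step 204)
    reconstruction = [list(predictor)]
    for line in plane[1:]:
        for x, sample in enumerate(line):
            diff = sample - predictor[x]
            quantizer = nearest_quantizer(diff)
            stream.append(quantizer)
            # Decode right away and overwrite the predictor so that the
            # next scan line is predicted from what the decoder will see.
            predictor[x] = clamp(predictor[x] + LEVEL[quantizer])
        reconstruction.append(list(predictor))
    return stream, reconstruction

def decode_plane(stream, width, height):
    lines = [list(stream[:width])]
    for row in range(1, height):
        codes = stream[row * width:(row + 1) * width]
        lines.append([clamp(p + LEVEL[q]) for p, q in zip(lines[-1], codes)])
    return lines

plane = [[100, 102, 104, 106], [101, 100, 90, 200], [101, 101, 91, 198]]
stream, reconstruction = encode_plane(plane)
decoded = decode_plane(stream, width=4, height=3)
print(decoded == reconstruction)  # True
```

Because quantization errors are folded into the predictor, they do not accumulate from one scan line to the next.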
- FIG. 3 shows an example Y-plane 300 consisting of two scan lines and eight sampled picture elements per scan line.
- the Y-plane 300 is not intended to represent a realistic sampled Y-plane, but is disclosed in order to provide a simple example.
- the Y-plane 300 is discussed below in connection with FIGS. 5 a through 6 b.
- FIG. 4 depicts an example quantization table 400 implemented in accordance with one embodiment of the invention.
- the table 400 consists of 5 levels with levels 0 through 2 including 15 quantizers each and levels 3 and 4 including 7 quantizers each.
- One embodiment of the invention has two modes of operation: 1) a high quality, 4-bit encoding mode; and 2) a high compression, 3-bit encoding mode.
- the quantizers delineated by the dashed line are used for the high quality mode, while the quantizers delineated by the solid line are used for the high compression mode.
- the quantization values included in the table 400 are examples. It is possible to practice the invention using different numbers of levels, embodying different numbers of quantizers and using different quantizer values.
- the invention may also be practiced with more than one quantization table. It is possible to switch between two or more quantization tables as applications require. For example, one table could be used to provide standard encoding while another is used to provide under correction. Under correction has the effect of both smoothing out quantization noise (the decoded and displayed images look subtly blurred, but this tends to look “better”), and reducing the number of level shifts.
- One method of applying under correction involves multiplying the differences calculated in steps 210 and 224 in FIG. 2 by a factor of, for example, between 0.5 and 1.
- Another method of applying under correction is to scale the quantization table values by a factor of, for example, between 1 and 2. A factor of 1 in each method of applying under correction is the same as having no under correction.
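The two under-correction methods can be compared in a small sketch. It assumes that the encoder picks the quantization value nearest to the difference, and it uses hypothetical table values. Under that assumption, scaling the difference by 0.5 and scaling the encode table by 2 select the same quantizer, since the two distance comparisons differ only by a constant factor.

```python
# Hypothetical level (index = quantizer, entry = quantization value).
VALUES = [0, 2, -2, 6, -6, 14, -14, 30, -30]

def nearest(values, diff):
    return min(range(len(values)), key=lambda q: abs(values[q] - diff))

def encode_scaled_difference(diff, factor):
    """Method 1: multiply the difference by a factor between 0.5 and 1."""
    return nearest(VALUES, diff * factor)

def encode_scaled_table(diff, factor):
    """Method 2: scale the encode table values by a factor between 1 and 2."""
    return nearest([v * factor for v in VALUES], diff)

# With reciprocal factors the two methods agree for every difference tried.
agree = all(
    encode_scaled_difference(d, 0.5) == encode_scaled_table(d, 2)
    for d in range(-40, 41)
)
print(agree)  # True

# A difference of 12 is encoded as if it were 6, so decoding with the
# unscaled table corrects only half of the difference.
quantizer = encode_scaled_difference(12, 0.5)
print(VALUES[quantizer])  # 6
```

Scaling the table rather than each difference moves the multiplication out of the per-sample path, which may be why a separate, unscaled decode table is useful.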
- one table could be used to provide no under correction while another table could be used to provide under correction. Multiple tables could be used to provide varying degrees of under correction. Further, while one table is being used for encoding, another table may be updated or its values changed.
- One embodiment of the invention also provides a separate decode table. This allows an under correction factor to be applied to the encode table(s) only.
- the decode table could include the same quantization values as the encode table(s).
- FIG. 5 a illustrates a high quality, 4-bit encoding example implemented in accordance with one embodiment of the invention.
- This example uses the example Y-plane 300 shown in FIG. 3 and the quantization table 400 shown in FIG. 4.
- the difference between the 0 th sample of the current line and the 0 th sample of the previous line is calculated.
- the difference has a value of −1.
- a “best” level is selected.
- the method for selecting the “best” level for this example embodiment is as follows:
- level 0 is not used when encoding the Y-plane and level 2 is not used when encoding the C R and C B planes.
- the present embodiment implements a policy of restricting level shifts to once every four samples.
- One possibility, implemented in the present embodiment, is to determine the “best” level for four contiguous samples, then use the highest level for all four samples. As mentioned above, the default level for the beginning of each scan line for the Y-plane is level 1. Thus, the first four samples in the present example use level 1.
- the difference between the 4 th sample of the previous scan line and the 4 th sample of the current line is −119.
- the “best” level for the 4 th sample is 2.
- the highest “best” level for samples 4 through 7 is level 2.
- level 2 will be used for samples 4 through 7.
- an escape code of 15 is placed in the information stream to signal to the decoding device that a new level follows.
- a value of 15 followed by a value of 2 are placed into the information stream.
- the difference of −119 for the 4 th sample matches most closely in level 2 with the quantization value −129 which corresponds with quantizer 12. Therefore, a value of 12 is placed into the information stream.
- Samples 5 through 7 are processed in similar fashion, each using level 2.
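The 4-bit example can be traced with the sketch below. Only the differences −1 (sample 0) and −119 (sample 4), the escape code 15, the default level 1, and the pairing of quantizer 12 with −129 in level 2 come from the text. All other table values and differences are hypothetical, and the `best_level` rule is an assumption, because the selection method itself is not reproduced here.

```python
ESCAPE = 15          # 4-bit escape code from the example
DEFAULT_LEVEL = 1    # Y-plane default level in the 4-bit mode
GROUP = 4            # level shifts allowed once every four samples

# Hypothetical magnitudes per level; FIG. 4 holds the real values.
MAGNITUDES = {
    1: [0, 1, 3, 6, 11, 18, 28, 40],
    2: [0, 4, 12, 28, 52, 86, 129, 181],
}

def values_for(level):
    """Expand magnitudes to signed values: odd index positive, even negative."""
    out = [0]
    for magnitude in MAGNITUDES[level][1:]:
        out += [magnitude, -magnitude]
    return out

def best_level(diff):
    """Assumed rule: lowest level whose largest value covers the difference."""
    for level in sorted(MAGNITUDES):
        if abs(diff) <= MAGNITUDES[level][-1]:
            return level
    return max(MAGNITUDES)

def encode_differences(diffs):
    stream, level = [], DEFAULT_LEVEL
    for start in range(0, len(diffs), GROUP):
        group = diffs[start:start + GROUP]
        # The first group of a scan line always uses the default level.
        wanted = DEFAULT_LEVEL if start == 0 else max(best_level(d) for d in group)
        if wanted != level:
            stream += [ESCAPE, wanted]   # escape code, then the new level
            level = wanted
        table = values_for(level)
        for d in group:
            stream.append(min(range(len(table)), key=lambda q: abs(table[q] - d)))
    return stream

# Samples 0 and 4 use the differences from the text; the rest are made up.
stream = encode_differences([-1, 3, 0, -3, -119, 5, -25, 60])
print(stream)  # [2, 3, 0, 4, 15, 2, 12, 1, 6, 7]
```

The sequence 15, 2, 12 in the middle of the output corresponds to the escape code, the new level, and the quantizer for sample 4 described above.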
- FIG. 5 b depicts an example decoding result from the high quality, 4-bit encoding example illustrated in FIG. 5 a implemented in accordance with one embodiment of the invention.
- the values in FIG. 5 b are the result of decoding the information placed into the information stream in the example shown in FIG. 5 a .
- the decoding table used to determine the values in FIG. 5 b includes the same quantization table values as the table 400 in FIG. 4.
- FIG. 6 a illustrates a high compression, 3-bit encoding example implemented in accordance with one embodiment of the invention.
- This example also uses the example Y-plane 300 shown in FIG. 3 and the quantization table 400 shown in FIG. 4.
- the difference between the 0 th sample of the current line and the 0 th sample of the previous line is calculated.
- the difference has a value of −1.
- a “best” level is selected.
- the method for selecting the “best” level for this example embodiment is as follows:
- levels 0 and 1 are not used when encoding the Y-plane and levels 3 and 4 are not used when encoding the C B and C R planes.
- This example also implements the policy of restricting level shifts to once every four samples, and in particular, the policy of using the highest “best” level for four contiguous samples.
- the default level for the beginning of each scan line for the Y-plane is level 2.
- the first four samples in the present example use level 2.
- For sample 4, the difference between the 4 th sample of the previous scan line and the 4 th sample of the current line is −119.
- the “best” level for the 4 th sample is 4.
- level 4 will be used for samples 4 through 7.
- an escape code of 7 is placed in the information stream to signal to the decoding device that a new level follows.
- a value 7 followed by a value 4 are placed into the information stream.
- the difference of −119 for the 4 th sample matches most closely in level 4 with the quantization value −128 which corresponds with quantizer 6. Therefore, a value 6 is placed into the information stream.
- Samples 5 through 7 are processed in similar fashion, each using level 4.
- FIG. 6 b depicts an example decoding result from the high compression, 3-bit encoding example illustrated in FIG. 6 a implemented in accordance with one embodiment of the invention.
- the values in FIG. 6 b are the result of decoding the information placed into the information stream in the example shown in FIG. 6 a .
- the decoding table used to determine the values in FIG. 6 b includes the same quantization table values as the table 400 in FIG. 4.
- FIG. 7 depicts an exemplary computer system 700 including an imaging device 730 having an encoder 740 implemented in accordance with one embodiment of the invention.
- Computer system 700 typically includes a bus 702 for communicating information, such as instructions and data.
- the system further includes a processor 704 , coupled to the bus 702 , for processing information according to programmed instructions, a main memory 706 coupled to the bus 702 for storing information for processor 704 , and an information storage device 708 coupled with the bus 702 for storing information.
- the above components are typically located within a chassis (not shown).
- the processor 704 could be an 80960, 80386, 80486, Pentium® processor, Pentium® processor with MMX™ technology, Pentium® Pro processor or Pentium® II processor made by Intel Corp., among others, including processors that are compatible with those listed above.
- the processor 704 typically includes a plurality of pipelines for parallel and overlapping execution of operations.
- the main memory 706 could be a random access memory (RAM) to store dynamic information for processor 704 , a read-only memory (ROM) to store static information and instructions for processor 704 , or a combination of both types of memory.
- the processor 704 executes instructions that cause the output of the imaging device 730 to be decoded.
- the processor may either perform calculations in order to decode the output of the imaging device 730 or may use a decode table stored in the main memory 706 .
- the results of the decoding may be displayed on a display device 710 or stored for later display in the information storage device 708 or in the main memory 706 .
- the information storage device 708 could be any medium for storage of computer readable information. Suitable candidates include a read-only memory (ROM), a hard disk drive, a disk drive with removable media (e.g., a floppy magnetic disk or an optical disk), or a tape drive with removable media (e.g., magnetic tape), synchronous DRAM or a flash memory (i.e., a disk-like storage device implemented with flash semiconductor memory). A combination of these, or other devices that support reading or writing computer readable media, could be used.
- the input/output devices of the computer system 700 typically comprise the display device 710 , an alphanumeric input device 712 , a position input device 714 , a communications interface 716 , and the imaging device 730 , each of which is coupled to the bus 702 .
- the imaging device may comprise a digital camera. If the data storage device 708 supports removable media, such as a floppy disk, it may also be considered an input/output device.
- the imaging device 730 includes the encoder 740 which comprises an encoder quantization table select register 742 , a level shift maximum register 748 , a scan line buffer 744 , encoder quantization tables 745 and 747 , and a decoder table 749 .
- the table select register 742 is written to by the processor 704 or other system device in order to select which quantization table is to be used for encoding.
- the tables 745 and 747 may also be written to by the processor or other system device in order to update the values contained in the tables. Although two encoding tables are shown, any number of encoding tables may be used with the invention.
- the level shift maximum register 748 is used to store a value that determines the maximum number of level shifts allowed per frame.
- This register may also be written to by the processor 704 or other system device.
- the scan line buffer 744 is used to store at least one previously sampled decoded scan line which is to be used as a predictor for a subsequent scan line.
- the decoder table 749 is used when decoding scan lines for storage in the line buffer 744 .
- Example tables that may be used to implement tables 745 , 747 , and 749 are shown in FIGS. 8 a , 8 b , and 8 c.
- FIG. 8 a depicts an example 4-bit encode table with no under-correction.
- This example table may be used to implement encoder quantization tables 745 and/or 747 shown in FIG. 7.
- the table shown in FIG. 8 a uses only absolute values of the calculated difference to be encoded.
- This example table contains similar information as the table shown in FIG. 4 for 4-bit encoding, but since only absolute values are used, the table is smaller and therefore more efficient to implement in hardware.
- the absolute difference value must be less than the value in the quantizer (limit) column, with the exception of the “Last” column.
- the “Last” column contains the upper limits (Max) for the Y, C R and C B planes.
- the quantizer from the “Last” column is selected. For all of the columns, if the sign of the calculated difference to be encoded is positive, then the appropriate odd numbered quantizer is selected. If the sign of the calculated difference to be encoded is negative, then the appropriate even numbered quantizer is selected.
- FIG. 8 b shows an example 4-bit encode table that functions in a manner similar to the table depicted in FIG. 8 a .
- the table of FIG. 8 b has 50% under correction applied. Notice that the table values are twice the values contained in the table of FIG. 8 a .
- This example table may also be used to implement encoder quantization tables 745 and/or 747 shown in FIG. 7. Further, the table of FIG. 8 a may be used to implement one of the tables 745 and 747 while the table of FIG. 8 b may be used to implement the other.
- FIG. 8 c depicts an example 4-bit decode table that may be used in connection with the tables shown in FIGS. 8 a and 8 b .
- the present example decode table may be used to implement the decoder table 749 as shown in FIG. 7.
- the decode table of FIG. 8 c also uses absolute values. If the quantizer to be decoded is an even number, then the sign of the decoded value will be negative. If the quantizer to be decoded is an odd number, then the sign of the decoded value will be positive. If the quantizer to be decoded is zero, then the decoded value will be zero.
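The sign-by-parity convention can be sketched as follows. The magnitudes are hypothetical apart from 129, which is placed so that quantizer 12 decodes to −129 as in the 4-bit example. The encoder here picks the nearest magnitude, which is a simplification swapped in for the limit-column comparison that FIG. 8 a describes.

```python
# Hypothetical absolute-value row for one level: slot k holds the magnitude
# shared by quantizers 2k - 1 (positive) and 2k (negative).
MAGNITUDES = [0, 4, 12, 28, 52, 86, 129, 181]

def decode(quantizer):
    """Zero decodes to zero, odd quantizers to positive, even to negative."""
    if quantizer == 0:
        return 0
    magnitude = MAGNITUDES[(quantizer + 1) // 2]
    return magnitude if quantizer % 2 == 1 else -magnitude

def encode(diff):
    """Nearest magnitude (an assumption), then odd index for a positive
    difference and even index for a negative one."""
    slot = min(range(len(MAGNITUDES)), key=lambda k: abs(MAGNITUDES[k] - abs(diff)))
    if slot == 0:
        return 0
    return 2 * slot - 1 if diff > 0 else 2 * slot

print(decode(12))                  # -129
print(decode(11))                  # 129
print(encode(-119))                # 12
print(decode(encode(-119)))        # -129
```

Storing 8 magnitudes instead of 15 signed values is what makes the absolute-value table smaller than the signed table of FIG. 4.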
- Although the tables of FIGS. 8 a , 8 b , and 8 c are shown as having particular values, formats, and sizes, the tables may be implemented with varying numbers of levels and quantizers, and may also be implemented with different values.
- the tables of FIGS. 8 a , 8 b , and 8 c may be implemented as 3-bit encode or decode tables, and also may implement varying amounts of under-correction.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression Of Band Width Or Redundancy In Fax (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
A method and apparatus for encoding a digital image is disclosed. A difference is calculated between a first sample from a current scan line and a corresponding second sample from a previous scan line. A quantization table level is selected from a quantization table that includes at least one level, and information identifying the selected level is placed into an information stream. A quantizer is selected from the quantization table level based on the calculated difference and information corresponding to the selected quantizer is placed into the information stream.
Description
- 1. Field of the Invention
- The present invention pertains to the field of digital image compression. More particularly, this invention pertains to algorithms for near-lossless digital video compression.
- 2. Background of the Related Art
- A video signal comprises a sequence of frames or images which when displayed at a given frame rate (e.g., 15 to 30 frames per second) simulates the appearance of motion to a human observer. Each frame of the video image comprises a matrix of picture elements, known as “pixels” or “pels.” A pixel is the minimum unit of the picture which may be assigned a luminance intensity and a color. In a computer, depending on the data format used, as many as 3 bytes of data can be used to define visual information for a pixel. One popular data format assigns a luminance intensity represented by 1 byte of data to each pixel and further assigns a color represented by 2 bytes of data to groups of 4 pixels each. This format results in an average use of 12 data bits to represent each pixel. Therefore, a single frame at a resolution of 320×240 can be represented by about 120,000 bytes of data.
- Digital video cameras must transfer frame data to a video memory system for display. Multiple frames are transferred over a period of time. The number of frames transferred and displayed per second is referred to as the frame rate. In general, greater frame rates contribute to heightened appearances of motion, while lower frame rates contribute to the observer being able to perceive individual frames, thus destroying the appearance of motion. A frame rate of 15 frames per second (fps) with a resolution of 320×240 and each pixel being represented by an average of 12 bits requires that about 14 Mbits of data must be transferred between the digital video camera and the video memory each second.
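- The figures quoted above follow from simple arithmetic. The short Python sketch below reproduces them; the variable names are illustrative only and do not appear in the specification.

```python
# Raw data-rate arithmetic for the format described above.
WIDTH, HEIGHT = 320, 240
LUMA_BITS = 8                 # 1 byte of luminance per pixel
CHROMA_BITS_PER_GROUP = 16    # 2 bytes of color shared by a group of 4 pixels
GROUP_SIZE = 4
FRAME_RATE = 15               # frames per second

bits_per_pixel = LUMA_BITS + CHROMA_BITS_PER_GROUP // GROUP_SIZE
bits_per_frame = WIDTH * HEIGHT * bits_per_pixel
bytes_per_frame = bits_per_frame // 8
bits_per_second = bits_per_frame * FRAME_RATE

print(bits_per_pixel)           # 12
print(bytes_per_frame)          # 115200, i.e. "about 120,000 bytes"
print(bits_per_second / 1e6)    # 13.824, i.e. "about 14 Mbits" per second
```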
- Digital video cameras are typically connected to a computer system via a peripheral bus. One peripheral bus that is gaining wide acceptance in the computer industry is the Universal Serial Bus (USB). The USB data transfer rate of 8 Mbits/second supports a wide variety of desktop peripherals, from modems, printers, microphones and speakers to graphics tablets, game controls, joysticks, scanners, and digital cameras. However, the USB data rate of 8 Mbits/second is insufficient to support a frame rate of 15 fps for 320×240 video. Further, since several devices may reside on the USB, it is desirable that a digital camera not use the entire 8 Mbits/second.
- Data compression allows an image or video segment to be transferred and stored in substantially fewer bytes of data than required for uncompressed frames. Many methods of digital video compression are based on the idea of eliminating redundant information from frame to frame in a digitized video segment. This is referred to as “interframe compression.” Interframe compression methods exploit the temporal redundancy that exists between digital video frames from the same scene recorded moments apart in time. This reduces the required data needed to encode each frame.
- Interframe compression is not ideally suited to the USB environment. This is due to the fact that the USB architecture will not guarantee that every frame of digital video data will be transferred. It is possible that some frames may be dropped. Since interframe compression techniques depend on frame-by-frame redundancies, dropped frames can have a significant impact on picture quality. Therefore, it is desirable to provide a method and apparatus for compressing digital video images that will allow digital cameras to operate on peripheral busses such as the USB.
- Further, it is desirable to provide a method and apparatus for compressing digital video images that is not perceived as “lossy” by the human visual system. Video compression techniques that humans perceive as “lossy” do not fully preserve all the information originally captured in an image. In general, for “lossy” techniques, as the compression of the video data is increased, the quality of the video suffers.
- A method and apparatus for encoding a digital image is disclosed. A difference is calculated between a first sample and a second sample. A quantization table level is selected from a quantization table that includes at least one level, and information identifying the selected level is placed into an information stream. A quantizer is selected from the quantization table level based on the calculated difference and information corresponding to the selected quantizer is placed into the information stream.
- FIG. 1 shows a flow chart of a method for encoding data representing a component of a picture element implemented in accordance with one embodiment of the invention.
- FIG. 2 depicts a flow chart of a method for encoding data representing a component of each picture element of a digital image implemented in accordance with one embodiment of the invention.
- FIG. 3 shows data representing two scan lines from an example Y-plane of a frame of digital video.
- FIG. 4 depicts an example quantization table implemented in accordance with one embodiment of the invention.
- FIG. 5a illustrates a high quality, 4-bit encoding example implemented in accordance with one embodiment of the invention.
- FIG. 5b depicts an example decoding result from the high quality, 4-bit encoding example illustrated in FIG. 5a implemented in accordance with one embodiment of the invention.
- FIG. 6a illustrates a high compression, 3-bit encoding example implemented in accordance with one embodiment of the invention.
- FIG. 6b depicts an example decoding result from the high compression, 3-bit encoding example illustrated in FIG. 6a implemented in accordance with one embodiment of the invention.
- FIG. 7 shows an example computer system including an imaging device implemented in accordance with one embodiment of the invention.
- FIG. 8a depicts an example 4-bit encode table with no under-correction implemented in accordance with one embodiment of the invention.
- FIG. 8b depicts an example 4-bit encode table with 50% under-correction implemented in accordance with one embodiment of the invention.
- FIG. 8c depicts an example 4-bit decode table implemented in accordance with one embodiment of the invention.
- A method and apparatus for near-lossless digital video compression is disclosed. In the following description, for the purposes of explanation, specific details are set forth to provide a thorough understanding of the invention. However, it will be obvious to one skilled in the art that these specific details are not required to practice the invention. In other instances, well known methods, devices, and structures are not described in particular detail in order to avoid obscuring the invention.
- Overview
- The invention solves the problem of providing digital image compression that results in an ordinary viewer of the resulting image perceiving the image as “non-lossy” and further that does not rely on interframe redundancies, thereby allowing high quality digital video signals to be transferred across a peripheral bus such as the USB. The invention accomplishes this by using a compression method that takes advantage of redundancies between two scan lines within a given frame of digital video. The compression algorithm of the invention uses Differential Pulse Code Modulation (DPCM) with varying levels or tables of quantizers. In general, and in accordance with one embodiment of the invention, a difference is calculated between a first sample from a current scan line and a corresponding second sample from a previous scan line. A quantization table level is selected from a quantization table that includes at least one level, and information identifying the selected level is placed into an information stream. A quantizer is selected from the quantization table level based on the calculated difference and information corresponding to the selected quantizer is placed into the information stream.
- The invention provides “near-lossless” digital image compression. The term “near-lossless” as used herein means that while some information corresponding to an image is not preserved during the compression process, an ordinary viewer of the resulting decoded and displayed image would not notice any degradation in quality.
- One Embodiment of the Invention
- For this embodiment, each picture element has a luminance intensity component (the Y component) and two color components (the CR and CB components). The totality of Y components for a given frame is referred to as the Y plane for that frame. Likewise, the totality of the CR and CB components for a given frame can be referred to as CR and CB planes, respectively. These luminance and color components conform to the International Telecommunications Union—Radio Sector (ITU-R) BT.601 standard. Other video color systems, such as RGB, may also be used with the invention.
- FIG. 1 shows a flow chart of a method for encoding data representing a component of a picture element implemented in accordance with one embodiment of the invention. At step 110, a difference is calculated between a sample value for a picture element from a current scan line and a sample value for a picture element from a previous scan line.
- Following step 110, a quantization table level is selected at step 120. The quantization table may include any number of levels, with each level including any number of quantizers. An example quantization table is shown in FIG. 4, discussed below. The level selection is based on the value of the calculated difference.
- If the selected level differs from a previously selected level, a level switch occurs. This determination is made at step 125. The term “level switch” is more fully discussed below in connection with FIG. 2. If there is a level switch, information identifying the selected level is placed into an information stream at step 130. The information stream may be a bit-stream that is transferred across a peripheral bus such as the USB. The information stream may also include information being transferred across buses with data path widths greater than 1 bit. The information stream is received by a device or system that will perform decoding functions.
- After the information identifying the selected level is placed in the information stream, a quantizer from the selected level is selected at step 140. The term quantizer as used herein is defined as an index into the quantization table. The quantizer selection is based on the value of the calculated difference. Following quantizer selection, information identifying the selected quantizer is placed in the information stream at step 150.
- FIG. 2 depicts a flow chart of a method for encoding data representing a component of each picture element of a digital image implemented in accordance with one embodiment of the invention. For this embodiment, each plane is processed independently, although other embodiments are possible where the planes are not processed independently. The method depicted in FIG. 2 begins with step 202 where the 0th scan line is sampled. The sample values for scan line 0 are stored in a buffer or some other storage device at step 204. Next, the sample values are placed in an information stream at step 206. Note that the 0th scan line of each plane is not quantized or encoded at all. The 0th line serves as a predictor for the following scan line, as discussed below.
- Following step 206 is step 208 where the 0th picture element of the next scan line (line 1) is sampled. After sampling the 0th picture element of scan line 1, a difference is calculated at step 210 between the sample value of the 0th picture element of line 1 and the sample value of the 0th picture element from the 0th scan line.
- The present embodiment uses a quantization table that has multiple levels. For example, one level may have quantizers ranging in value from 0 to +/−7 while another level may have quantizers ranging in value from 0 to +/−181, as shown in the example quantization table of FIG. 4 which is discussed below. For the present embodiment, a default level, known to both the encoder and decoder, is used at the beginning of each scan line that is encoded. Since the default level is known to both the encoder and the decoder, there is no need to communicate the selection of the default level to the decoder. The invention may also be practiced without a default level. In this case, a level must be selected at the beginning of processing each scan line, and the selection of the new level must be communicated to the decoder.
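- The per-sample steps of FIG. 1 (steps 110 through 150) can be sketched in Python as shown below. This is a minimal illustration under stated assumptions, not the disclosed implementation: the two-level `TABLE`, the threshold in `select_level`, the tuple-based stream entries, and the sign convention (current minus previous) are all hypothetical, since the values of FIG. 4 are not reproduced in this text.

```python
# Hypothetical two-level quantization table; each list position is a quantizer
# (an index) and each entry is the quantization value it stands for.
TABLE = {
    0: [0, 1, -1, 3, -3, 7, -7],
    1: [0, 8, -8, 32, -32, 96, -96],
}

def select_level(diff):
    """Step 120: pick a level from the size of the difference (assumed rule)."""
    return 0 if abs(diff) < 8 else 1

def select_quantizer(level, diff):
    """Step 140: index of the quantization value closest to the difference."""
    values = TABLE[level]
    return min(range(len(values)), key=lambda q: abs(values[q] - diff))

def encode_sample(current, previous, current_level, stream):
    diff = current - previous                 # step 110 (assumed sign convention)
    level = select_level(diff)                # step 120
    if level != current_level:                # step 125: level switch?
        stream.append(("level", level))       # step 130
    quantizer = select_quantizer(level, diff) # step 140
    stream.append(("quantizer", quantizer))   # step 150
    return level

stream = []
level = encode_sample(100, 101, 0, stream)    # small difference, level unchanged
level = encode_sample(50, 150, level, stream) # large difference, level switch
print(stream)  # [('quantizer', 2), ('level', 1), ('quantizer', 6)]
```

Passing the level in and out of `encode_sample` lets the caller start each scan line at the default level, so no level information is emitted until the first switch.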
- Following step 210, a quantizer is selected at step 214. The quantizer is selected based on the difference calculated between the sample value of the 0th picture element of line 1 and the sample value of the 0th picture element from the 0th scan line at step 210. The quantizer is selected from the default quantization table level. Following quantizer selection, information identifying the selected quantizer is placed into the information stream at step 216.
- Once the sample value of the 0th picture element has been encoded, that is, after the difference has been calculated and after the quantizer has been selected, the quantizer is decoded and written back to a buffer for use as a predictor by the 0th element of the next scan line. The decoding occurs at step 218. The decoding step 218 uses the quantizer to look up a quantization value in the currently selected quantization table level. The quantization value is placed in a buffer or other storage device at step 220.
- After processing the 0th picture element of line 1 in steps 208 through 220, the next picture element is sampled at step 222, and a difference between the sample value of the current picture element and the sample value for the corresponding picture element from the previous scan line is calculated at step 224.
- At step 226, a quantization table level is selected. The selection is based on the difference calculated at step 224. Specific example embodiments of level selection algorithms are discussed below in connection with FIG. 4.
- Following quantization table level selection at step 226, a determination is made at step 228 on whether a level switch should occur. The term “level switch” as used herein means that a different quantization table level will be used for the current quantizer selection than was used for the previous quantizer selection. The invention may be practiced by allowing level switches every time a level is selected at step 226, or the invention may be practiced by restricting the frequency of level switches. For example, it is possible to only allow level switches every four times a picture element is processed. Thus, a single level is used for at least four contiguous samples before a level switch can occur. Restricting the frequency of level switches has the benefits of reducing traffic on the information stream, helping to provide an adequate compression ratio, and allowing easier and speedier decoding. Further, it is possible to limit the total number of times a level shift can occur for an entire frame. The Y, CR and CB planes can be considered separately or in combination when determining a level shift maximum. As an example, the maximum number of level shifts can be set to equal 10% of the byte size of one raw (uncompressed) video frame, plus one. The plus one is there in order to handle the case where the 10% limit is reached part way through a scan line. The one extra level switch can be used to switch to the default level, and the default level will then be used for the remainder of the frame. Level shift maximums other than 10% are also possible. Further, the level shift maximum may be varied at any time.
- If a determination is made at step 228 that a level switch is required, information identifying the new level is placed into the information stream at step 230. Step 230 may include placing an escape code into the information stream to let the decoding device know that information identifying a new level follows. Following step 230 is step 232.
- If no level switch is required or allowed, control passes to step 232 following step 228. At step 232, a quantizer is selected from the proper quantization table level and information identifying the selected quantizer is placed into the information stream. Following step 232, the current sample is decoded at step 234 and the decoded current sample is placed in a buffer or other storage device at step 236.
- If previous steps 222 through 236 processed the last picture element in the current scan line, then control passes to step 240. Otherwise, step 222 follows step 238 and another picture element is processed at steps 222 through 236. These steps are repeated until the final picture element for the current scan line has been processed.
- At step 240, a determination is made on whether the current scan line is the last scan line in the frame. If the current scan line is the last scan line, then the processing of the frame (or at least one plane of the frame) has completed. Otherwise, the processing of the next scan line begins at step 208. Steps 208 through 240 are repeated until the last scan line has been processed.
- FIGS. 3 through 6b set forth a couple of examples of how the invention may be practiced. FIG. 3 shows an example Y-plane 300 consisting of two scan lines and eight sampled picture elements per scan line. The Y-plane 300 is not intended to represent a realistic sampled Y-plane, but is disclosed in order to provide a simple example. The Y-plane 300 is discussed below in connection with FIGS. 5a through 6b.
- FIG. 4 depicts an example quantization table 400 implemented in accordance with one embodiment of the invention. The table 400 consists of 5 levels with levels 0 through 2 including 15 quantizers each and levels
- The invention may also be practiced with more than one quantization table. It is possible to switch between two or more quantization tables as applications require. For example, one table could be used to provide standard encoding while another is used to provide under correction. Under correction has the effect of both smoothing out quantization noise (the decoded and displayed images look subtly blurred, but this tends to look “better”), and reducing the number of level shifts. One method of applying under correction involves multiplying the differences calculated in steps
- One embodiment of the invention also provides a separate decode table. This allows an under correction factor to be applied to the encode table(s) only. Alternatively, the decode table could include the same quantization values as the encode table(s).
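- The scan-line loop of FIG. 2, in which each encoded sample is decoded and written back to a buffer as the predictor for the next scan line, can be sketched as follows. The single-level `VALUES` table, the function names, and the reconstruction rule (predictor plus quantization value) are assumptions made for illustration; level switching and clamping to the sample range are omitted to keep the sketch short.

```python
# Hypothetical single-level table: position = quantizer, entry = quantization value.
VALUES = [0, 2, -2, 6, -6, 16, -16]

def nearest_quantizer(diff):
    """Index of the quantization value closest to the difference."""
    return min(range(len(VALUES)), key=lambda q: abs(VALUES[q] - diff))

def encode_plane(plane):
    """Encode a plane given as a list of scan lines (lists of sample values)."""
    stream = list(plane[0])        # line 0 is sent unencoded (steps 202-206)
    predictor = list(plane[0])     # buffer holding the previous decoded line
    for line in plane[1:]:
        for x, sample in enumerate(line):
            quantizer = nearest_quantizer(sample - predictor[x])
            stream.append(quantizer)
            # Decode and write back, so the encoder predicts from the same
            # values the decoder will have (steps 218-220 and 234-236).
            predictor[x] += VALUES[quantizer]
    return stream, predictor

plane = [[10, 20], [12, 15], [12, 16]]
stream, last_decoded_line = encode_plane(plane)
print(stream)             # [10, 20, 1, 4, 0, 1]
print(last_decoded_line)  # [12, 16]
```

In this run the second sample of line 1 is reconstructed as 14 rather than 15. Because the buffer stores the decoded value, the next line encodes its difference against 14, so the quantization error does not accumulate from line to line.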
- FIG. 5a illustrates a high quality, 4-bit encoding example implemented in accordance with one embodiment of the invention. This example uses the example Y-plane 300 shown in FIG. 3 and the quantization table 400 shown in FIG. 4. First, the difference between the 0th sample of the current line and the 0th sample of the previous line is calculated. The difference has a value of −1. Next, a “best” level is selected. The method for selecting the “best” level for this example embodiment is as follows:
- For all Y-plane data
- if (absolute value of the calculated difference) < 65
- best level = 1; (default level at the beginning of each scan line)
- else
- best level = 2;
- For all CR and CB plane data
- if (absolute value of calculated difference) < 8
- best level = 0; (default level at the beginning of each scan line)
- else
- best level = 1.
- Note that level 0 is not used when encoding the Y-plane and level 2 is not used when encoding the CR and CB planes. Also, the present embodiment implements a policy of restricting level shifts to once every four samples. One possibility, implemented in the present embodiment, is to determine the “best” level for four contiguous samples, then use the highest level for all four samples. As mentioned above, the default level for the beginning of each scan line for the Y-plane is level 1. Thus, the first four samples in the present example use level 1.
- Once the “best” level is determined, the difference (−1 in the case of sample 0) is matched to the closest quantization table 400 value. The closest quantization table 400 value in level 1 for sample 0 is 0. Quantization value 0 corresponds to quantizer 0, thus 0 is output into the information stream. Samples 1 through 3 are likewise processed, with level 1 used in each case.
- For sample 4, the difference between the 4th sample of the previous scan line and the 4th sample of the current line is −119. Using the “best” level algorithm disclosed above, the “best” level for the 4th sample is 2. Note that the highest “best” level for samples 4 through 7 is level 2. Thus, level 2 will be used for samples 4 through 7. In the present example, an escape code of 15 is placed in the information stream to signal to the decoding device that a new level follows. Thus, a value of 15 followed by a value of 2 are placed into the information stream. The difference of −119 for the 4th sample matches most closely in level 2 with the quantization value −129 which corresponds with quantizer 12. Therefore, a value of 12 is placed into the information stream. Samples 5 through 7 are processed in similar fashion, each using level 2.
- FIG. 5b depicts an example decoding result from the high quality, 4-bit encoding example illustrated in FIG. 5a implemented in accordance with one embodiment of the invention. The values in FIG. 5b are the result of decoding the information placed into the information stream in the example shown in FIG. 5a. The decoding table used to determine the values in FIG. 5b includes the same quantization table values as the table 400 in FIG. 4.
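- The 4-bit “best” level rule and the four-sample grouping policy described above can be sketched in Python as shown below. The thresholds (65 and 8) and the differences −1 and −119 come from the text; the function names and the remaining difference values are hypothetical, since FIG. 3 is not reproduced here.

```python
def best_level_4bit(diff, plane="Y"):
    """'Best' level for one difference in the 4-bit example."""
    if plane == "Y":
        return 1 if abs(diff) < 65 else 2
    return 0 if abs(diff) < 8 else 1     # CR and CB planes

def levels_for_line(diffs, plane="Y", group=4):
    """Use the highest 'best' level within each group of contiguous samples."""
    levels = []
    for start in range(0, len(diffs), group):
        chunk = diffs[start:start + group]
        highest = max(best_level_4bit(d, plane) for d in chunk)
        levels.extend([highest] * len(chunk))
    return levels

# Only the differences for samples 0 (-1) and 4 (-119) are taken from the text.
diffs = [-1, 3, -2, 0, -119, 4, -20, 7]
print(levels_for_line(diffs))  # [1, 1, 1, 1, 2, 2, 2, 2]
```

Taking the highest level in each group means the largest difference in the group still falls within the range of the chosen level, at the cost of coarser quantization for the smaller differences that share the group.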
- FIG. 6a illustrates a high compression, 3-bit encoding example implemented in accordance with one embodiment of the invention. This example also uses the example Y-plane 300 shown in FIG. 3 and the quantization table 400 shown in FIG. 4. First, the difference between the 0th sample of the current line and the 0th sample of the previous line is calculated. The difference has a value of −1. Next, a “best” level is selected. The method for selecting the “best” level for this example embodiment is as follows:
- For all Y-plane data
- if (absolute value of the calculated difference) < 33
- best level = 2; (default level at the beginning of each scan line)
- else if (absolute value of the calculated difference) < 65
- best level = 3;
- else
- best level = 4;
- For all CR and CB plane data
- if (absolute value of calculated difference) < 11
- best level = 0; (default level at the beginning of each scan line)
- else if (absolute value of calculated difference) < 21
- best level = 1;
- else
- best level = 2.
- Note that levels 0 and 1 are not used when encoding the Y-plane and levels 3 and 4 are not used when encoding the CR and CB planes. The default level for the beginning of each scan line for the Y-plane is level 2. Thus, the first four samples in the present example use level 2.
- Once the “best” level is determined, the difference (−1 in the case of sample 0) is matched to the closest quantization table 400 value. The closest quantization table 400 value in level 2 for sample 0 is 0. Quantization value 0 corresponds to quantizer 0, thus 0 is output into the information stream. Samples 1 through 3 are likewise processed, with level 2 used in each case.
- For sample 4, the difference between the 4th sample of the previous scan line and the 4th sample of the current line is −119. Using the “best” level algorithm disclosed above, the “best” level for the 4th sample is 4. Note that the highest “best” level for samples 4 through 7 is level 4. Thus, level 4 will be used for samples 4 through 7. In the present example, an escape code of 7 is placed in the information stream to signal to the decoding device that a new level follows. Thus, a value of 7 followed by a value of 4 are placed into the information stream. The difference of −119 for the 4th sample matches most closely in level 4 with the quantization value −128 which corresponds with quantizer 6. Therefore, a value of 6 is placed into the information stream. Samples 5 through 7 are processed in similar fashion, each using level 4.
- FIG. 6b depicts an example decoding result from the high compression, 3-bit encoding example illustrated in FIG. 6a implemented in accordance with one embodiment of the invention. The values in FIG. 6b are the result of decoding the information placed into the information stream in the example shown in FIG. 6a. The decoding table used to determine the values in FIG. 6b includes the same quantization table values as the table 400 in FIG. 4.
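- On the receiving side, a decoder has to separate level information from quantizers in the 3-bit stream. The Python sketch below shows one way this could be done for a single Y-plane scan line, assuming the escape code 7 and default level 2 of the example above. The function name and the quantizer values for samples 1 through 3 and 5 through 7 are hypothetical; only the leading 0 and the sequence 7, 4, 6 for sample 4 come from the text.

```python
ESCAPE_3BIT = 7       # escape code used in the 3-bit example
DEFAULT_Y_LEVEL = 2   # default Y-plane level at the start of each scan line

def split_tokens(tokens, default_level=DEFAULT_Y_LEVEL, escape=ESCAPE_3BIT):
    """Return (level, quantizer) pairs for one scan line of 3-bit tokens."""
    level = default_level
    pairs = []
    it = iter(tokens)
    for token in it:
        if token == escape:
            level = next(it)   # the token after the escape code names the new level
            continue
        pairs.append((level, token))
    return pairs

tokens = [0, 1, 2, 0, 7, 4, 6, 1, 0, 2]
print(split_tokens(tokens))
# [(2, 0), (2, 1), (2, 2), (2, 0), (4, 6), (4, 1), (4, 0), (4, 2)]
```

Reserving one code as the escape leaves codes 0 through 6 available as quantizers in each 3-bit level.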
- FIG. 7 depicts an exemplary computer system 700 including an imaging device 730 having an encoder 740 implemented in accordance with one embodiment of the invention. Computer system 700 typically includes a bus 702 for communicating information, such as instructions and data. The system further includes a processor 704, coupled to the bus 702, for processing information according to programmed instructions, a main memory 706 coupled to the bus 702 for storing information for processor 704, and an information storage device 708 coupled with the bus 702 for storing information. In the case of a desk-top design for computer system 700, the above components are typically located within a chassis (not shown).
- The processor 704 could be an 80960, 80386, 80486, Pentium® processor, Pentium® processor with MMX™ technology, Pentium® Pro processor or Pentium® II processor made by Intel Corp., among others, including processors that are compatible with those listed above. The processor 704 typically includes a plurality of pipelines for parallel and overlapping execution of operations. The main memory 706 could be a random access memory (RAM) to store dynamic information for processor 704, a read-only memory (ROM) to store static information and instructions for processor 704, or a combination of both types of memory. The processor 704 executes instructions that cause the output of the imaging device 730 to be decoded. The processor may either perform calculations in order to decode the output of the imaging device 730 or may use a decode table stored in the main memory 706. The results of the decoding may be displayed on a display device 710 or stored for later display in the information storage device 708 or in the main memory 706.
- In alternative designs for the computer system 700, the information storage device 708 could be any medium for storage of computer readable information. Suitable candidates include a read-only memory (ROM), a hard disk drive, a disk drive with removable media (e.g., a floppy magnetic disk or an optical disk), or a tape drive with removable media (e.g., magnetic tape), synchronous DRAM or a flash memory (i.e., a disk-like storage device implemented with flash semiconductor memory). A combination of these, or other devices that support reading or writing computer readable media, could be used.
- The input/output devices of the computer system 700 typically comprise the display device 710, an alphanumeric input device 712, a position input device 714, a communications interface 716, and the imaging device 730, each of which is coupled to the bus 702. The imaging device may comprise a digital camera. If the data storage device 708 supports removable media, such as a floppy disk, it may also be considered an input/output device.
- The imaging device 730 includes the encoder 740 which comprises an encoder quantization table select register 742, a level shift maximum register 748, a scan line buffer 744, encoder quantization tables 745 and 747, and a decoder table 749. The table select register 742 is written to by the processor 704 or other system device in order to select which quantization table is to be used for encoding. The tables 745 and 747 may also be written to by the processor or other system device in order to update the values contained in the tables. Although two encoding tables are shown, any number of encoding tables may be used with the invention. The level shift maximum register 748 is used to store a value that determines the maximum number of level shifts allowed per frame. This register may also be written to by the processor 704 or other system device. The scan line buffer 744 is used to store at least one previously sampled decoded scan line which is to be used as a predictor for a subsequent scan line. The decoder table 749 is used when decoding scan lines for storage in the line buffer 744. Example tables that may be used to implement tables 745, 747, and 749 are shown in FIGS. 8a, 8b, and 8c.
- It will be clear to one skilled in the art that the invention can operate upon a wide range of programmable computer systems, not just the example computer system 700.
- FIG. 8a depicts an example 4-bit encode table with no under-correction. This example table may be used to implement encoder quantization tables 745 and/or 747 shown in FIG. 7. The table shown in FIG. 8a uses only absolute values of the calculated difference to be encoded. This example table contains similar information as the table shown in FIG. 4 for 4-bit encoding, but since only absolute values are used, the table is smaller and therefore more efficient to implement in hardware. The absolute difference value must be less than the value in the quantizer (limit) column, with the exception of the “Last” column. The “Last” column contains the upper limits (Max) for the Y, CR and CB planes. Since the “Last” column contains these upper limits, no “less than” test occurs. When an upper limit is reached, the quantizer from the “Last” column is selected. For all of the columns, if the sign of the calculated difference to be encoded is positive, then the appropriate odd numbered quantizer is selected. If the sign of the calculated difference to be encoded is negative, then the appropriate even numbered quantizer is selected.
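- The lookup described for FIG. 8a (compare the absolute difference against a row of limits, then pick an odd or even quantizer from the sign) can be sketched in Python as follows. The `LIMITS` values are hypothetical, because FIG. 8a is not reproduced in this text; they were chosen only so that a difference of −119 maps to quantizer 12, as in the 4-bit example. The function name is likewise illustrative.

```python
from bisect import bisect_right

# Hypothetical "less than" limits for one level. Column 0 is quantizer 0,
# then one column per odd/even quantizer pair; anything at or above the
# final limit falls into the "Last" column (quantizers 13 and 14).
LIMITS = [2, 5, 11, 23, 47, 95, 191]

def encode_difference(diff, limits=LIMITS):
    """Map a signed difference to a 4-bit quantizer (0 through 14)."""
    # Number of limits <= |diff|, i.e. the first column whose limit exceeds |diff|.
    column = bisect_right(limits, abs(diff))
    if column == 0:
        return 0
    # Odd quantizer for a positive difference, even for a negative one.
    return 2 * column - 1 if diff > 0 else 2 * column

print(encode_difference(0))     # 0
print(encode_difference(3))     # 1
print(encode_difference(-3))    # 2
print(encode_difference(-119))  # 12
print(encode_difference(500))   # 13 ("Last" column, positive)
print(encode_difference(-500))  # 14 ("Last" column, negative)
```

Storing only magnitudes halves the number of table entries, which is the hardware saving the paragraph above refers to. Code 15 remains free to serve as the escape code.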
- FIG. 8b shows an example 4-bit encode table that functions in a manner similar to the table depicted in FIG. 8a. The table of FIG. 8b, however, has 50% under correction applied. Notice that the table values are twice the values contained in the table of FIG. 8a. This example table may also be used to implement encoder quantization tables 745 and/or 747 shown in FIG. 7. Further, the table of FIG. 8a may be used to implement one of the tables 745 and 747 while the table of FIG. 8b may be used to implement the other.
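- One way to read the doubled table values is that comparing a difference against doubled limits selects the same column as comparing half the difference against the original limits. The Python sketch below checks that equivalence for a hypothetical row of limits; `BASE_LIMITS` and the function name are assumptions, not values from FIG. 8a or FIG. 8b.

```python
from bisect import bisect_right

BASE_LIMITS = [2, 5, 11, 23, 47, 95, 191]            # hypothetical FIG. 8a-style row
UNDER_CORRECTED = [2 * limit for limit in BASE_LIMITS]  # doubled, as described for FIG. 8b

def column_for(limits, magnitude):
    """First column whose limit is greater than the magnitude."""
    return bisect_right(limits, magnitude)

# Doubling every limit gives the same column as halving the difference.
same = all(
    column_for(UNDER_CORRECTED, m) == column_for(BASE_LIMITS, m / 2)
    for m in range(512)
)
print(same)  # True
```

With an unchanged decode table, each decoded step is therefore about half the size the uncorrected table would have produced for the same difference.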
- FIG. 8c depicts an example 4-bit decode table that may be used in connection with the tables shown in FIGS. 8a and 8b. The present example decode table may be used to implement the decoder table 749 as shown in FIG. 7. The decode table of FIG. 8c also uses absolute values. If the quantizer to be decoded is an even number, then the sign of the decoded value will be negative. If the quantizer to be decoded is an odd number, then the sign of the decoded value will be positive. If the quantizer to be decoded is zero, then the decoded value will be zero.
- Although the tables discussed above in connection with FIGS. 8a, 8b, and 8c are shown as having particular values, formats, and sizes, the tables may be implemented with varying numbers of levels and quantizers, and may also be implemented with different values. In particular, the tables of FIGS. 8a, 8b, and 8c may be implemented as 3-bit encode or decode tables, and also may implement varying amounts of under-correction.
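- The sign rule for decoding can be sketched in Python as shown below. The `MAGNITUDES` row is hypothetical, since FIG. 8c is not reproduced here; the entry 129 was placed so that quantizer 12 decodes to −129, echoing the 4-bit example, and the function name is illustrative.

```python
# Hypothetical absolute decode values for one level: entry 0 is for quantizer 0,
# entry k is shared by the odd/even quantizer pair (2k - 1, 2k).
MAGNITUDES = [0, 3, 8, 16, 32, 64, 129, 181]

def decode_quantizer(quantizer, magnitudes=MAGNITUDES):
    """Odd quantizers decode to positive values, even ones to negative, zero to zero."""
    if quantizer == 0:
        return 0
    magnitude = magnitudes[(quantizer + 1) // 2]
    return magnitude if quantizer % 2 == 1 else -magnitude

print(decode_quantizer(0))   # 0
print(decode_quantizer(1))   # 3
print(decode_quantizer(2))   # -3
print(decode_quantizer(12))  # -129
print(decode_quantizer(13))  # 181
```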
- In the foregoing specification the invention has been described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modifications and changes may be made thereto without departing from the broader spirit and scope of the invention as set forth in the appended claims. The specification and drawings are accordingly to be regarded in an illustrative rather than in a restrictive sense.
Claims (11)
1. A method for encoding a digital image, the method comprising the steps of:
calculating a difference between a first sample and a second sample;
selecting a quantization table level from the first quantization table which includes at least one level;
placing information identifying the selected quantization table level into an information stream;
selecting a quantizer from the quantization table level based on the difference; and
placing information corresponding to the quantizer into the information stream.
2. The method of claim 1, wherein the step of calculating a difference includes the step of calculating a difference between a first sample from a current scan line and a corresponding second sample from a previous scan line.
3. The method of claim 1, further including the steps of:
providing a second quantization table which includes at least one level; and
selecting one of the first and second quantization tables for use when performing the step of selecting a quantization table level.
4. The method of claim 1, wherein the steps of calculating a difference, providing a first quantization table, selecting a quantization table level, placing information identifying the selected quantization table level into an information stream, selecting a quantizer from the quantization table level, and placing information corresponding to the quantizer into the information stream are performed for each sample in the current scan line.
5. The method of claim 4, wherein the step of placing information identifying the selected quantization table level into an information stream is performed, then the steps of calculating a difference, providing a first quantization table, selecting a quantizer from the quantization table level, and placing information corresponding to the quantizer into the information stream are performed a predefined number of times before the step of placing information identifying the selected quantization table level into an information stream is again performed.
6. The method of claim 5, wherein the step of placing information identifying the selected quantization table level into an information stream is performed not more than a predefined maximum number of times.
7. A method for decoding a digital image, the method comprising the steps of:
providing a decoder quantization table which includes at least one level;
receiving information identifying a quantization table level, the table level information being received through an information stream;
selecting a quantization table level;
receiving information corresponding to a quantizer, the quantizer information previously generated using an encoder quantization table; and
selecting a value from the quantization table level, the value generally representing a difference between a first sample and a second sample.
8. The method of claim 7, wherein the decoder quantization table contains at least one value that differs from the encoder quantization table.
9. A computer system, comprising:
a processor coupled to a bus;
a main memory coupled to the bus and having stored therein a decoder table; and
an imaging device coupled to the bus, the imaging device having an encoder that includes a first encoder quantization table which has at least one level, calculates a difference between a first sample and a second sample, selects a quantization table level,
places information corresponding to the selected quantization table level into the information stream, the information stream delivered over the bus,
selects a quantizer from the quantization table level based on the difference, and
places information corresponding to the quantizer into the information stream.
10. The computer system of claim 9, wherein the encoder further comprises:
a second encoder quantization table; and
a quantization table select register.
11. The computer system of claim 9, wherein the decoder table contains at least one value that differs from the encoder quantization table.
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/904,829 US6771830B2 (en) | 1997-09-29 | 2001-07-12 | Differential pulse code modulation image compression with varying levels of quantizers |
US10/736,950 US7127116B2 (en) | 1997-09-29 | 2003-12-16 | Image data compression |
US10/739,631 US6941021B2 (en) | 1997-09-29 | 2003-12-16 | Video conferencing |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/939,321 US6295379B1 (en) | 1997-09-29 | 1997-09-29 | DPCM image compression with plural quantization table levels |
US09/904,829 US6771830B2 (en) | 1997-09-29 | 2001-07-12 | Differential pulse code modulation image compression with varying levels of quantizers |
Related Parent Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/939,321 Continuation US6295379B1 (en) | 1997-09-29 | 1997-09-29 | DPCM image compression with plural quantization table levels |
US08/939,321 Continuation-In-Part US6295379B1 (en) | 1997-09-29 | 1997-09-29 | DPCM image compression with plural quantization table levels |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/736,950 Division US7127116B2 (en) | 1997-09-29 | 2003-12-16 | Image data compression |
US10/739,631 Division US6941021B2 (en) | 1997-09-29 | 2003-12-16 | Video conferencing |
Publications (2)
Publication Number | Publication Date |
---|---|
US20010041016A1 (en) | 2001-11-15 |
US6771830B2 (en) | 2004-08-03 |
Family ID: 25472956
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/939,321 Expired - Lifetime US6295379B1 (en) | 1997-09-29 | 1997-09-29 | DPCM image compression with plural quantization table levels |
US09/904,829 Expired - Fee Related US6771830B2 (en) | 1997-09-29 | 2001-07-12 | Differential pulse code modulation image compression with varying levels of quantizers |
US10/739,631 Expired - Fee Related US6941021B2 (en) | 1997-09-29 | 2003-12-16 | Video conferencing |
US10/736,950 Expired - Fee Related US7127116B2 (en) | 1997-09-29 | 2003-12-16 | Image data compression |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/939,321 Expired - Lifetime US6295379B1 (en) | 1997-09-29 | 1997-09-29 | DPCM image compression with plural quantization table levels |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/739,631 Expired - Fee Related US6941021B2 (en) | 1997-09-29 | 2003-12-16 | Video conferencing |
US10/736,950 Expired - Fee Related US7127116B2 (en) | 1997-09-29 | 2003-12-16 | Image data compression |
Country Status (1)
Country | Link |
---|---|
US (4) | US6295379B1 (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109923860A (en) * | 2016-09-08 | 2019-06-21 | 威诺瓦国际有限公司 | Quantization parameter determination and layered code |
GB2593522A (en) * | 2020-03-26 | 2021-09-29 | Imagination Tech Ltd | Image data decompression |
US12008791B2 (en) | 2020-03-26 | 2024-06-11 | Imagination Technologies Limited | Image data compression |
US12148188B2 (en) | 2020-03-26 | 2024-11-19 | Imagination Technologies Limited | Image data decompression |
Families Citing this family (57)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6295379B1 (en) * | 1997-09-29 | 2001-09-25 | Intel Corporation | DPCM image compression with plural quantization table levels |
US8059673B2 (en) | 2003-05-01 | 2011-11-15 | Genesis Microchip Inc. | Dynamic resource re-allocation in a packet based video display interface |
US7839860B2 (en) | 2003-05-01 | 2010-11-23 | Genesis Microchip Inc. | Packet based video display interface |
US8068485B2 (en) * | 2003-05-01 | 2011-11-29 | Genesis Microchip Inc. | Multimedia interface |
US8204076B2 (en) * | 2003-05-01 | 2012-06-19 | Genesis Microchip Inc. | Compact packet based multimedia interface |
US7405719B2 (en) * | 2003-05-01 | 2008-07-29 | Genesis Microchip Inc. | Using packet transfer for driving LCD panel driver electronics |
US20040228545A1 (en) * | 2003-05-12 | 2004-11-18 | Kwang-Bo Cho | Multisampling with reduced bit samples |
US10554985B2 (en) | 2003-07-18 | 2020-02-04 | Microsoft Technology Licensing, Llc | DC coefficient signaling at small quantization step sizes |
US8218624B2 (en) * | 2003-07-18 | 2012-07-10 | Microsoft Corporation | Fractional quantization step sizes for high bit rates |
US7738554B2 (en) | 2003-07-18 | 2010-06-15 | Microsoft Corporation | DC coefficient signaling at small quantization step sizes |
US7602851B2 (en) * | 2003-07-18 | 2009-10-13 | Microsoft Corporation | Intelligent differential quantization of video coding |
US7580584B2 (en) | 2003-07-18 | 2009-08-25 | Microsoft Corporation | Adaptive multiple quantization |
US20050055483A1 (en) * | 2003-08-22 | 2005-03-10 | Devlin William Lee | System and method for file comperssion |
US7634090B2 (en) | 2003-09-26 | 2009-12-15 | Genesis Microchip Inc. | Packet based high definition high-bandwidth digital content protection |
US7801383B2 (en) | 2004-05-15 | 2010-09-21 | Microsoft Corporation | Embedded scalar quantizers with arbitrary dead-zone ratios |
US20060188151A1 (en) * | 2005-02-23 | 2006-08-24 | Lexmark International, Inc. | Method for processing data for use with a video display of an imaging apparatus |
EP1878262A1 (en) * | 2005-04-27 | 2008-01-16 | Koninklijke Philips Electronics N.V. | Dpcm coding method of video signals |
US8422546B2 (en) | 2005-05-25 | 2013-04-16 | Microsoft Corporation | Adaptive video encoding using a perceptual model |
US8503536B2 (en) | 2006-04-07 | 2013-08-06 | Microsoft Corporation | Quantization adjustments for DC shift artifacts |
US7974340B2 (en) | 2006-04-07 | 2011-07-05 | Microsoft Corporation | Adaptive B-picture quantization control |
US7995649B2 (en) | 2006-04-07 | 2011-08-09 | Microsoft Corporation | Quantization adjustment based on texture level |
US8059721B2 (en) | 2006-04-07 | 2011-11-15 | Microsoft Corporation | Estimating sample-domain distortion in the transform domain with rounding compensation |
US8130828B2 (en) | 2006-04-07 | 2012-03-06 | Microsoft Corporation | Adjusting quantization to preserve non-zero AC coefficients |
US8711925B2 (en) | 2006-05-05 | 2014-04-29 | Microsoft Corporation | Flexible quantization |
US7893990B1 (en) | 2006-07-31 | 2011-02-22 | Cisco Technology, Inc. | Digital video camera with retractable data connector and resident software application |
US8238424B2 (en) | 2007-02-09 | 2012-08-07 | Microsoft Corporation | Complexity-based adaptive preprocessing for multiple-pass video compression |
US8498335B2 (en) | 2007-03-26 | 2013-07-30 | Microsoft Corporation | Adaptive deadzone size adjustment in quantization |
US8243797B2 (en) | 2007-03-30 | 2012-08-14 | Microsoft Corporation | Regions of interest for quality adjustments |
KR101503227B1 (en) * | 2007-04-11 | 2015-03-16 | 레드.컴 인코포레이티드 | Video camera |
US8237830B2 (en) | 2007-04-11 | 2012-08-07 | Red.Com, Inc. | Video camera |
US8442337B2 (en) | 2007-04-18 | 2013-05-14 | Microsoft Corporation | Encoding adjustments for animation content |
US8331438B2 (en) | 2007-06-05 | 2012-12-11 | Microsoft Corporation | Adaptive selection of picture-level quantization parameters for predicted video pictures |
KR101394151B1 (en) * | 2007-10-04 | 2014-05-14 | 삼성전자주식회사 | Apparatus and method for encoding image using a psycho-visual characteristic |
US8189101B2 (en) * | 2007-10-26 | 2012-05-29 | Cisco Technology, Inc. | Data connector for an electronics device |
US20090164575A1 (en) * | 2007-11-26 | 2009-06-25 | Haivision Systems Inc. | Method and system for the establishment of complex network telepresence conference |
US8189933B2 (en) | 2008-03-31 | 2012-05-29 | Microsoft Corporation | Classifying and controlling encoding quality for textured, dark smooth and smooth video content |
US8897359B2 (en) | 2008-06-03 | 2014-11-25 | Microsoft Corporation | Adaptive quantization for enhancement layer video coding |
TW201012222A (en) * | 2008-09-12 | 2010-03-16 | Primax Electronics Ltd | Method for producing internet video images |
US8761268B2 (en) * | 2009-04-06 | 2014-06-24 | Intel Corporation | Selective local adaptive wiener filter for video coding and decoding |
US8156238B2 (en) | 2009-05-13 | 2012-04-10 | Stmicroelectronics, Inc. | Wireless multimedia transport method and apparatus |
US8429440B2 (en) | 2009-05-13 | 2013-04-23 | Stmicroelectronics, Inc. | Flat panel display driver method and system |
KR101379186B1 (en) * | 2009-08-21 | 2014-04-10 | 에스케이 텔레콤주식회사 | Inrtra Prediction Enoding/Decoding Method and Apparatus |
USD627380S1 (en) | 2009-10-08 | 2010-11-16 | Cisco Technology, Inc. | Digital video camera with a connector |
US10178406B2 (en) * | 2009-11-06 | 2019-01-08 | Qualcomm Incorporated | Control of video encoding based on one or more video capture parameters |
US8837576B2 (en) * | 2009-11-06 | 2014-09-16 | Qualcomm Incorporated | Camera parameter-assisted video encoding |
US8671234B2 (en) | 2010-05-27 | 2014-03-11 | Stmicroelectronics, Inc. | Level shifting cable adaptor and chip system for use with dual-mode multi-media device |
WO2011159270A1 (en) * | 2010-06-18 | 2011-12-22 | Eu3C Company, Limited | Digital camera scanner in a desk lamp |
EP2448265A1 (en) | 2010-10-26 | 2012-05-02 | Google, Inc. | Lip synchronization in a video conference |
US9210302B1 (en) | 2011-08-10 | 2015-12-08 | Google Inc. | System, method and apparatus for multipoint video transmission |
US8749612B1 (en) | 2011-12-01 | 2014-06-10 | Google Inc. | Reduced bandwidth usage in video conferencing |
US8917309B1 (en) | 2012-03-08 | 2014-12-23 | Google, Inc. | Key frame distribution in video conferencing |
US8791982B1 (en) | 2012-06-27 | 2014-07-29 | Google Inc. | Video multicast engine |
WO2014127153A1 (en) | 2013-02-14 | 2014-08-21 | Red. Com, Inc. | Video camera |
US20140333779A1 (en) * | 2013-05-13 | 2014-11-13 | Electronics And Telecommunications Research Institute | Apparatus for distributing bus traffic of multiple camera inputs of automotive system on chip and automotive system on chip using the same |
US9626981B2 (en) * | 2014-06-25 | 2017-04-18 | Cirrus Logic, Inc. | Systems and methods for compressing a digital signal |
US9609275B2 (en) | 2015-07-08 | 2017-03-28 | Google Inc. | Single-stream transmission method for multi-user video conferencing |
WO2019010233A1 (en) | 2017-07-05 | 2019-01-10 | Red. Com, Llc | Video image data processing in electronic devices |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4540973A (en) * | 1983-02-23 | 1985-09-10 | Siemens Aktiengesellschaft | Quantizer for a DPCM encoder |
US4725885A (en) * | 1986-12-22 | 1988-02-16 | International Business Machines Corporation | Adaptive graylevel image compression system |
US4953214A (en) * | 1987-07-21 | 1990-08-28 | Matushita Electric Industrial Co., Ltd. | Signal encoding and decoding method and device |
US5509089A (en) * | 1993-09-09 | 1996-04-16 | Intel Corporation | Method and system for encoding images using temporal filtering |
US6295379B1 (en) * | 1997-09-29 | 2001-09-25 | Intel Corporation | DPCM image compression with plural quantization table levels |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5225904A (en) * | 1987-10-05 | 1993-07-06 | Intel Corporation | Adaptive digital video compression system |
US6124882A (en) | 1992-02-19 | 2000-09-26 | 8×8, Inc. | Videocommunicating apparatus and method therefor |
US5317397A (en) * | 1991-05-31 | 1994-05-31 | Kabushiki Kaisha Toshiba | Predictive coding using spatial-temporal filtering and plural motion vectors |
US5926208A (en) * | 1992-02-19 | 1999-07-20 | Noonen; Michael | Video compression and decompression arrangement having reconfigurable camera and low-bandwidth transmission capability |
US6121998A (en) | 1992-02-19 | 2000-09-19 | 8×8, Inc. | Apparatus and method for videocommunicating having programmable architecture permitting data revisions |
US5655032A (en) * | 1992-08-20 | 1997-08-05 | Canon Kabushiki Kaisha | Coding method and apparatus therefor |
JP3703195B2 (en) * | 1995-03-27 | 2005-10-05 | ヒューレット・パッカード・カンパニー | Color image transmission device |
US5835149A (en) * | 1995-06-06 | 1998-11-10 | Intel Corporation | Bit allocation in a coded video sequence |
US5812790A (en) * | 1996-09-09 | 1998-09-22 | Sonic Solutions | Variable encoding rate plan generation |
US6833863B1 (en) * | 1998-02-06 | 2004-12-21 | Intel Corporation | Method and apparatus for still image capture during video streaming operations of a tethered digital camera |
US5995150A (en) * | 1998-02-20 | 1999-11-30 | Winbond Electronics Corporation America | Dual compressed video bitstream camera for universal serial bus connection |
US6359560B1 (en) | 1998-11-12 | 2002-03-19 | Smith Micro Software | Computer system with motion-triggered alarm procedure |
US6446155B1 (en) * | 1999-06-30 | 2002-09-03 | Logitech Europe S. A. | Resource bus interface |
US6859557B1 (en) * | 2000-07-07 | 2005-02-22 | Microsoft Corp. | System and method for selective decoding and decompression |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109923860A (en) * | 2016-09-08 | 2019-06-21 | 威诺瓦国际有限公司 | Quantization parameter determination and layered code |
US11012088B2 (en) | 2016-09-08 | 2021-05-18 | V-Nova International Limited | Data processing apparatuses, methods, computer programs and computer-readable media |
US11955994B2 (en) | 2016-09-08 | 2024-04-09 | V-Nova International Limited | Data processing apparatuses, methods, computer programs and computer-readable media |
GB2593522A (en) * | 2020-03-26 | 2021-09-29 | Imagination Tech Ltd | Image data decompression |
GB2593522B (en) * | 2020-03-26 | 2023-02-22 | Imagination Tech Ltd | Image data decompression |
US11663743B2 (en) | 2020-03-26 | 2023-05-30 | Imagination Technologies Limited | Image data decompression |
US12008791B2 (en) | 2020-03-26 | 2024-06-11 | Imagination Technologies Limited | Image data compression |
US12148188B2 (en) | 2020-03-26 | 2024-11-19 | Imagination Technologies Limited | Image data decompression |
Also Published As
Publication number | Publication date |
---|---|
US6771830B2 (en) | 2004-08-03 |
US6941021B2 (en) | 2005-09-06 |
US20040131270A1 (en) | 2004-07-08 |
US20040179593A1 (en) | 2004-09-16 |
US6295379B1 (en) | 2001-09-25 |
US7127116B2 (en) | 2006-10-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6295379B1 (en) | DPCM image compression with plural quantization table levels | |
US7856147B2 (en) | Method and apparatus for processing a compressed image in an order other than the order of which it was compressed | |
US8285037B2 (en) | Compression format and apparatus using the new compression format for temporarily storing image data in a frame memory | |
US7483486B2 (en) | Method and apparatus for encoding high dynamic range video | |
US5253078A (en) | System for compression and decompression of video data using discrete cosine transform and coding techniques | |
US5377018A (en) | Video compression and decompression using block selection and subdivision | |
EP0272763B1 (en) | Image display apparatus | |
US7526124B2 (en) | Match MSB digital image compression | |
JPH05207517A (en) | Method and device for compressing and extending multiple bit digital pixel data | |
US20070053429A1 (en) | Color video codec method and system | |
WO1998027736A3 (en) | Pixel block compression apparatus in an image processing system | |
WO2007036608A1 (en) | Compression of images for computer graphics | |
US8675732B2 (en) | Method and apparatus of compressing image data | |
JP2003517796A (en) | How to reduce the "uneven picture" effect | |
US7414632B1 (en) | Multi-pass 4:2:0 subpicture blending | |
JP3167631B2 (en) | Interpolation circuit, graphics controller circuit, compressed pixel data generation circuit, compressed pixel data expansion circuit, computer system, upscale method, and compressed pixel data generation method | |
US20090073178A1 (en) | Compressing scheme using qualifier watermarking and apparatus using the compression scheme for temporarily storing image data in a frame memory | |
KR100803402B1 (en) | Method, apparatus and couputer readable medium for run length encoding of a quantized digital image | |
US5768507A (en) | Method and apparatus for overcoming a slope overload condition while using differential pulse code modulation scheme | |
JPH06315143A (en) | Image processor | |
JPH0787487A (en) | Method and device for coding and decoding picture data | |
JPH04352591A (en) | Image data compressing circuit | |
JPH11242738A (en) | Method device for compressing and expanding picture, method/device for extending picture and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
CC | Certificate of correction | ||
FPAY | Fee payment |
Year of fee payment: 4 |
|
REMI | Maintenance fee reminder mailed | ||
FPAY | Fee payment |
Year of fee payment: 8 |
|
REMI | Maintenance fee reminder mailed | ||
LAPS | Lapse for failure to pay maintenance fees | ||
STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20160803 |