[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

US20100061449A1 - Programmable quantization dead zone and threshold for standard-based h.264 and/or vc1 video encoding - Google Patents

Programmable quantization dead zone and threshold for standard-based h.264 and/or vc1 video encoding Download PDF

Info

Publication number
US20100061449A1
US20100061449A1 US12/568,878 US56887809A US2010061449A1 US 20100061449 A1 US20100061449 A1 US 20100061449A1 US 56887809 A US56887809 A US 56887809A US 2010061449 A1 US2010061449 A1 US 2010061449A1
Authority
US
United States
Prior art keywords
quantization
value
values
quantized
coefficient
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/568,878
Inventor
Guy Cote
Elliot N. Linzer
Lowell L. Winger
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US12/568,878 priority Critical patent/US20100061449A1/en
Publication of US20100061449A1 publication Critical patent/US20100061449A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B1/00Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
    • H04B1/66Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • H04N19/126Details of normalisation or weighting functions, e.g. normalisation matrices or variable uniform quantisers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • H04N19/137Motion inside a coding unit, e.g. average field, frame or block difference
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • H04N19/14Coding unit complexity, e.g. amount of activity or edge presence estimation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/147Data rate or code amount at the encoder output according to rate distortion criteria
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/159Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/186Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Definitions

  • the present invention relates to video encoding generally and, more particularly, to a programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding.
  • Aliasing refers to defects or distortion in a video picture due to sampling limitations. The defects commonly appear as jagged edges on diagonal lines and twinkling or brightening (beating/pulsing) in picture detail.
  • H.264 In the H.264 specification, I-frame beating/pulsing is a significant problem, more so than with other video compression standards.
  • the main source of the problem in H.264 is poor DC quantization in the coded bit stream. Poor DC quantization occurs because the H.264 specification does not provide explicit syntax to support finer DC quantization (i.e., quantization matrices and/or DC quantization).
  • the VC1 specification has separate quantization for AC and DC coefficients. However, VC1 does not have separately controllable quantization for every separate frequency component. The quantization of lower frequency AC coefficient can be relatively poor in VC1.
  • the present invention concerns a video encoder including an encoder circuit, a quantizer circuit and a control circuit.
  • the encoder circuit may be configured to generate a plurality of coefficient values and motion vectors in response to a video stream, a first control signal, a second control signal, and a number of quantized values.
  • the control circuit may be configured to (i) generate the first control signal, the second control signal, and a quantizer index signal and (ii) set two or more quantization dead zones and two or more offsets to different values, where the quantization dead zones and the offsets are independently programmable.
  • the quantizer circuit may be configured to generate the number of quantized values in response to the coefficient values, the quantizer index signal, the two or more quantization dead zones and the two or more offsets.
  • a first coefficient value may be quantized using a first quantization dead zone and a first offset.
  • a second coefficient value may be quantized using a second quantization dead zone and a second offset.
  • the first and second quantization dead zones generally have different values.
  • the first and second offsets generally have different values. Quantization calculations determining each quantized value take into account respective quantization dead zones and offsets.
  • the objects, features and advantages of the present invention include providing programmable quantization dead zones and thresholds for standard-based H.264 and/or VC1 video encoding that may (i) adjust quantization dead zones, (ii) adjust quantization decision levels (or thresholds), (iii) improve I-frame visual quality, (iv) reduce I-frame beating and/or pulsing, (v) improve quantization of lower frequency AC coefficients in VC1, (vi) improve rate-distortion performance and/or (vii) improve overall visual quality.
  • FIG. 1 is a block diagram illustrating various components of a compressed video system in accordance with a preferred embodiment of the present invention
  • FIG. 2 is a more detailed block diagram illustrating a video encoder in accordance with a preferred embodiment of the present invention.
  • FIG. 3 is a diagram illustrating an example of forward quantization parameters.
  • FIG. 1 a block diagram of a system 100 is shown illustrating components of a compressed video system in accordance with a preferred embodiment of the present invention.
  • a content provider 102 presents video image, audio or other data to be compressed and transmitted in a data stream 104 to an input of an encoder 106 .
  • the encoder 106 may be configured to generate a compressed bit stream 108 in response to the input stream 104 .
  • the encoder 106 maybe configured to encode the data stream 104 according to one or more encoding standards (e.g., MPEG-1, MPEG-2, MPEG-4, WMV, VC-9, VC-1, H.262, H.263, H.264, H.264/JVC/AVC/MPEG-4 part 10 , AVS 1.0 and/or other standards for compression of audio-video data).
  • the encoder 106 may be further configured to generate the bit stream 108 using a quantization process implemented with a programmable dead zone and thresholds.
  • the compressed bit stream 108 from the encoder 106 may be presented to an encoder transport system 110 .
  • An output of the encoder transport system 110 generally presents a signal 112 to a transmitter 114 .
  • the transmitter 114 transmits the compressed data via a transmission medium 116 .
  • the content provider 102 may comprise a video broadcast, DVD, or any other source of video data stream.
  • the transmission medium 116 may comprise, for example, a broadcast, cable, satellite, network, DVD, hard drive, or any other medium implemented to carry, transfer, and/or store a compressed bit stream.
  • a receiver 118 On a receiving side of the system 100 , a receiver 118 generally receives the compressed data bit stream from the transmission medium 116 .
  • the receiver 118 presents an encoded bit stream 120 to a decoder transport system 122 .
  • the decoder transport system 122 generally presents the encoded bit stream via a link 124 to a decoder 126 .
  • the decoder 126 generally decompresses (decodes) the data bit stream and presents the data via a link 128 to an end user hardware block (or circuit) 130 .
  • the end user hardware block 130 may comprise a television, a monitor, a computer, a projector, a hard drive, a personal video recorder (PVR), an optical disk recorder (e.g., DVD), or any other medium implemented to carry, transfer, present, display and/or store the uncompressed bit stream (e.g., decoded video signal).
  • PVR personal video recorder
  • optical disk recorder e.g., DVD
  • the circuit 106 may be implemented as a video encoder.
  • the circuit 106 may be compliant with one or more compression standards (e.g., H.264, VC1, and/or other specifications).
  • the present invention generally relates to encoding rules for forward quantization.
  • the present invention generally provides a flexible design for a quantization process that may achieve better video compression in standard based video encoding.
  • the circuit 106 may comprise a circuit (or block) 132 , a circuit (or block) 134 , a circuit (or block) 136 and a circuit (or block) 138 .
  • the block 132 may be implemented as an encoder/decoder (CODEC) circuit.
  • the block 134 may be implemented as a quantizer circuit.
  • the block 136 may be implemented as a coding control circuit.
  • the block 138 may be implemented as an entropy coding block.
  • the circuit 132 may have an input 140 that may receive a signal (e.g., VIDEO IN), an output 142 that may present a signal (e.g., COEFF), an output 144 that may present a signal (e.g., MV) and an input 146 that may receive a signal (e.g., QUANTIZED VALUES).
  • the signal VIDEO IN may comprise a video bit stream.
  • the signal QUANTIZED VALUES may comprise a number of quantized values for transform coefficients.
  • the signal MV may comprise a number of motion vectors.
  • the block 132 may be configured to generate the signal COEFF and the signal MV in response to the signal VIDEO IN, the signal QUANTIZED VALUES and one or more control signals (e.g., CTR1 and INTER/INTRA) received from the block 136 .
  • the block 132 may be implemented with conventional techniques.
  • the circuit 134 may have a first input that may receive the signal COEFF, a second input that may receive a signal (e.g., DEAD ZONES/THRESHOLDS), a third input that may receive a signal (e.g., QUANTIZER INDEX) and an output that may present the signal QUANTIZED VALUES.
  • the signal DEAD ZONES/THRESHOLDS may be configured to signal (or set) quantization dead zones and thresholds of the circuit 134 .
  • the signal DEAD ZONES/THRESHOLDS may comprise information for programming one or more quantization dead zones and/or thresholds. In one example, the signal DEAD ZONES/THRESHOLDS may be configured to select between a number of predetermined quantization parameters.
  • the circuit 134 may be configured to generate the signal QUANTIZED VALUES in response to the signal COEFF, the signal DEAD ZONES/THRESHOLDS and the signal QUANTIZER INDEX.
  • the circuit 136 may have a first output that may present the control signal CTR1, a second output that may present the signal DEAD ZONES/THRESHOLDS, a third output that may present the signal QUANTIZER INDEX, a fourth output that may present the signal INTER/INTRA and a fifth output that may present a signal (e.g., TRANSMITTED OR NOT).
  • the signal INTER/INTRA may be implemented, in one example, as a decision flag.
  • the signal TRANSMITTED OR NOT may be implemented, in one example, as a status flag.
  • the signal CTR1 may be configured, in one example, to control a transformation process of the block 132 . For example, the signal CTR1 may be configured to select between 8 ⁇ 8 and 4 ⁇ 4 transforms.
  • the circuit 136 may be configured, in one example, to generate the signals CTR1, INTER/INTRA, TRANSMITTED OR NOT, and QUANTIZER INDEX according to conventional techniques.
  • the circuit 136 may be further configured, in one example, to generate the signal DEAD ZONES/THRESHOLDS in accordance with the present invention.
  • the circuit 136 may be configured to generate the signal DEAD ZONE/THRESHOLDS using a number of different algorithms.
  • the signal DEAD ZONES/THRESHOLDS may be varied for each macroblock, or may be fixed.
  • statistics e.g., mean, variance, etc.
  • the offset and threshold values may be determined empirically (e.g., by experimentation).
  • fixed values may be used for intra and inter, or fixed for a specific sequence type.
  • a video sequence classification may be used to determine the offset and/or threshold values.
  • quantization alone may be performed repeatedly with different values.
  • a measure of rate-distortion performance may be used in determining offset and threshold values. For example, a set of offset and threshold values may be selected for determining final quantized coefficients from a plurality of sets based on obtaining the best rate-distortion performance.
  • the block 138 may be implemented using conventional techniques.
  • the block 138 may be configured to generate a compressed bit stream using one or more of variable length codes (VLC), context-adaptive variable length coding (CAVLC) and/or context-adaptive binary arithmetic coding (CABAC).
  • VLC variable length codes
  • CAVLC context-adaptive variable length coding
  • CABAC context-adaptive binary arithmetic coding
  • the block 138 may be configured to generate the compressed bit stream in response to the signals INTER/INTRA, TRANSMITTED OR NOT, QUANTIZER INDEX, QUANTIZED VALUES and MV.
  • the circuit 132 may comprise a block (or circuit) 160 , a block (or circuit) 162 , a block (or circuit) 164 , a block (or circuit) 166 , a block (or circuit) 168 , a block (or circuit) 170 , a block (or circuit) 172 and a block (or circuit) 174 .
  • the block 160 may be implemented as a subtractor.
  • the block 162 may be implemented as a multiplexer (or selector) circuit.
  • the block 164 may be implemented as a transform block. In one example, the block 164 may be configured to perform a discrete cosine transform (DCT).
  • the block 166 may be implemented as an inverse quantizer block.
  • the block 168 may be implemented as an inverse transform block.
  • the block 168 may be implemented as an inverse discrete cosine transform (IDCT) block.
  • the block 170 may be implemented as an adder.
  • the block 172 may be implemented as a picture memory.
  • the block 174 may be implemented as a multiplexer (or selector) circuit. Other blocks or circuits may be implemented accordingly to meet the design criteria of a particular implementation.
  • the signal VIDEO_IN may be presented to a first input of the block 160 and a first input of the block 162 .
  • An output of the block 160 may present a signal to a second input of the block 162 .
  • the block 162 may be configured to couple either the first input or the second input to an output in response to the signal INTER/INTRA.
  • the output of the block 162 may be presented to an input of the block 164 .
  • the block 164 may be configured to transform the signal received from the block 162 based upon the signal CTR1.
  • An output of the block 164 may present the signal COEFF.
  • the block 166 may have an input that may receive the signal QUANTIZED VALUES. An output of the block 166 may be presented to an input of the block 168 . An output of the block 168 may be presented to a first input of the block 170 . An output of the block 170 may be presented to a first input of the block 172 .
  • the block 172 may have a first output that may present the signal MV, a second input that may receive the signal VIDEO IN and a second output that may present a signal to a second input of the block 160 and a first input of the block 174 . An output of the block 174 may be presented to a second input of the block 170 .
  • the blocks 160 - 174 may be implemented using conventional techniques for block-based video encoders. Motion estimation and compensation may be implemented with conventional techniques. However, the implementation of motion estimation and compensation may vary substantially from one encoder to another encoder to meet the design criteria of a particular implementation.
  • Video compression standards (e.g., MPEG2, MPEG4, H.264, VC1, etc.) generally define a video decoder syntax and decoding process. As part of the decoding process, inverse quantization is generally defined in the standard. However, video encoding implementation is generally more flexible. For example, different encoders may implement forward quantization rules differently. Quantization as used herein generally refers to the process of assigning a range of coefficient levels to a predetermined reconstructed level that may be obtained from a quantization parameter.
  • a diagram 200 is shown illustrating an example relationship between quantization parameters and coefficient levels.
  • coefficient values falling within a pair of decision thresholds e.g., [D n , D n+1 ]
  • a corresponding quantized value e.g., Q n
  • Conventional implementations of forward quantization have fixed decision thresholds.
  • the present invention generally provides a process for forward quantization that allows for varying the decision thresholds D 1 -D n (also called “offsets”) and the decision threshold D 0 around zero (also called “dead zone”) independently for (i) all coefficients, (ii) luminance blocks and/or (iii) chrominance blocks.
  • a different quantization dead zone and a different quantization offset may be selected for different coefficients in a block.
  • a first coefficient in a block may be quantized using a first offset threshold value and a first dead zone threshold value.
  • a second coefficient in the block may be quantized using a second offset threshold value and/or a second dead zone threshold value.
  • the first and second offset values may be independently programmed.
  • the first and second dead zone values may be independently programmed.
  • the dead zone threshold values and offset threshold values may be independently programmed to different values.
  • the present invention may be implemented in an H.264 compliant encoder/decoder.
  • H.264 uses a 4 ⁇ 4 transform.
  • the 4 ⁇ 4 transform produces 16 coefficients per block.
  • H.264 also uses one luminance (luma or Y) and two chrominance (Cb an Cr) blocks for a 4 ⁇ 4 pixel area.
  • H.264 may also implement 16 ⁇ 16 blocks.
  • the DC transformed coefficients for a 16 ⁇ 16 block are generally grouped into a 4 ⁇ 4 block of DC coefficients, and transformed and quantized again.
  • a different set of offsets and dead zone may be implemented for the latter step.
  • the present invention may be implemented in a VC-1 compliant encoder/decoder. Examples of a transform/quantization process in accordance with preferred embodiments of the present invention are presented below.
  • an encoder in accordance with the present invention may implement H.264 forward transform and quantization similar to that described in the Joint Video Team (JVT) document JVT-B039.doc entitled “Low Complexity Transform and Quantization—Part II: Extensions” (which is hereby incorporated by reference in its entirety) with the exception of (i) material regarding residuals>9 bits, (ii) proposed changes to the standard that were not adopted and (iii) implementation of programmable offsets and thresholds.
  • a// 2 b sign( a ) ⁇ [( abs ( a )+2 b ⁇ 1 )>> b]
  • Quantization in accordance with the present invention may be implemented, in one example, with 126-bits for programmable parameter sets (e.g., 18 3-bit offsets and 18 4-bit deadzones).
  • the present invention may implement 16 parameter sets for 8 ⁇ 8 or 4 ⁇ 4 luma and chroma quantization (e.g., O ij and d ij ), 1 parameter set for 2 ⁇ 2 chroma DC quantization (e.g., O CDC and d CDC ), and 1 parameter set for 4 ⁇ 4 luma DC quantization (e.g., O LDC and d LDC ).
  • a forward transform for 4 ⁇ 4 luma and chroma may be defined by the following equation:
  • Quantization for the 4 ⁇ 4 luma and chroma may be performed according to the following equation:
  • the quantization coefficients Q may be set as follows:
  • the sixteen 3-bit programmable offsets O ij for 4 ⁇ 4 luma and chroma quantization may be utilized according to the following equation:
  • O ij 0 . . . 7.
  • SAD sum of absolute differences
  • B-frames non-reference blocks
  • O 1 may have the following value:
  • the matrix O 1 may be implemented for intra blocks as follows:
  • a more sophisticated scheme may be implemented where the matrix is changed based on reference/non-reference, intra/inter, or use of the macroblock variances and residuals variances directly.
  • the sixteen 3-bit programmable offsets O ij for 8 ⁇ 8 luma and chroma quantization may be utilized according to the following equation:
  • the sixteen 4-bit programmable dead zones d ij for 4 ⁇ 4 luma and chroma quantization may be utilized according to the following equation:
  • D 1 D 1 .
  • D 1 may be set as follows:
  • the intermediate values inside the square brackets in the above equations may have, in one example, a 32-bit range. However, the value Y may be implemented to fit in 16 bits.
  • the sixteen 4-bit programmable dead zones d ij for 8 ⁇ 8 luma and chroma quantization may be utilized according to the following equation:
  • the luma DC coefficients of a 16 ⁇ 16 block are grouped into a 4 ⁇ 4 block and further transformed, for intra frames, to improve compression.
  • the forward transform for 4 ⁇ 4 luma DC transformation may be implemented as follows.
  • the input matrix X D may be formed by picking out DC coefficients from the 16 transformed 4 ⁇ 4 blocks.
  • DC coefficients may be transformed, for example, using a symmetric Hadamard transform.
  • the symmetric Hadamard transform generally leads to essentially the same performance as the DCT-like transform in TML).
  • the forward transform may be expressed by the following equation:
  • the value p is generally related to a bitdepth of the input video. For example for 8-bit video the value k 9 is implemented. However, other values may be implemented for 10-bit and/or 12-bit input video.
  • the 4 ⁇ 4 luma DC quantization may be performed according to the following equation:
  • O LDC 0 . . . 7
  • the 2 ⁇ 2 chroma DC transform and quantization may be implemented as follows.
  • the forward transform may be implemented by adding the chroma DC transform on top of the chroma transform.
  • the input matrix may be formed by picking out DC coefficients from the 4 transformed 4 ⁇ 4 blocks. For example, denoting the coefficients as X D , the transform may be computed according to the following equation.
  • the 2 ⁇ 2 chroma DC quantization may be performed according to the following equation:
  • the value in each cell generally corresponds to a 9-bit input residual.
  • the present invention may be implemented in a VC1 (a Society of Motion Picture and Television Engineers (SMPTE) standard based on Microsoft WMV9) compliant encoder/decoder.
  • the present invention may modify forward transform and quantization specifications of the VC1 standard to provide for implementation of programmable offsets and dead zones.
  • the present invention may be implemented as a stand alone device.
  • the present invention may provide an implementation that re-uses MPEG2 or MPEG4 hardware.
  • the forward quantization of a given coefficient at an index (e.g., Idx) of a block may be implemented according to the following pseudocode:
  • QuantMat[Idx] may be set to 16 and Qrnd may be set to a predetermined fixed value.
  • the value Qrnd may be either 5 or 8 for intra pictures (I-pictures) and 0 or 1 for inter pictures. The particular value of Qrnd implemented may be determined based on user preference.
  • a programmable dead zone for VC1 may be implemented by varying the parameter Qrnd between a first predetermined value and a second predetermined value.
  • Qrnd may be varied between 0 and 2 ⁇ 4 (or between 0 and 15 inclusive).
  • Programmable independent coefficient offsets for VC1 may be implemented by varying the parameter QuantMat[Idx] between a first predetermined value and a second predetermined value.
  • the value of QuantMat[Idx] may be implemented, in one example, having (i) a range of up to 16 bits and (ii) a positive sign (e.g., 0 to (2 ⁇ 16) ⁇ 1). In one example, the value of QuantMat[Idx] may be varied between 0 and 31 inclusive.
  • the present invention may apply conventional MPEG-2 or MPEG-4 very large scale integrated (VLSI) implementations of quantization for standard-based VC1 encoding.
  • MPEG-2 parameters may be varied to apply programmable dead zones and offsets.
  • MPEG-2 quantization matrices may be utilized in an alternative implementation of programmable offsets for VC1 quantization.
  • the present invention has been illustrated with an 8-bit video input, other bitdepths (e.g., 10-bit, 12-bit, etc.) of input video streams may be implemented in accordance with the present invention.
  • the present invention may also be implemented with other video formats (e.g., 4:2:2, 4:4:4, etc.).
  • the present invention may provide advantages for Main Profile, High Profile and VC1 by matching quantization decisions to the coefficient distributions to improve rate-distortion performance of encoded bitstreams.
  • the function performed by the present invention may be implemented using a conventional general purpose digital computer programmed according to the teachings of the present specification, as will be apparent to those skilled in the relevant art(s).
  • Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will also be apparent to those skilled in the relevant art(s).
  • the present invention may also be implemented by the preparation of ASICs, FPGAs, or by interconnecting an appropriate network of conventional component circuits, as is described herein, modifications of which will be readily apparent to those skilled in the art(s).
  • the present invention thus may also include a computer product which may be a storage medium including instructions which can be used to program a computer to perform a process in accordance with the present invention.
  • the storage medium can include, but is not limited to, any type of disk including floppy disk, optical disk, CD-ROM, magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, Flash memory, magnetic or optical cards, or any type of media suitable for storing electronic instructions.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A video encoder including an encoder circuit, a quantizer circuit and a control circuit. The encoder circuit may be configured to generate a plurality of coefficient values and motion vectors in response to a video stream, a first control signal, a second control signal, and a number of quantized values. The control circuit may be configured to (i) generate the first control signal, the second control signal, and a quantizer index signal and (ii) set two or more quantization dead zones and two or more offsets to different values, where the quantization dead zones and the offsets are independently programmable. The quantizer circuit may be configured to generate the number of quantized values in response to the coefficient values, the quantizer index signal, the two or more quantization dead zones and the two or more offsets. A first coefficient value may be quantized using a first quantization dead zone and a first offset. A second coefficient value may be quantized using a second quantization dead zone and a second offset. The first and second quantization dead zones generally have different values. The first and second offsets generally have different values. Quantization calculations determining each quantized value take into account respective quantization dead zones and offsets.

Description

  • This is a continuation of U.S. Ser. No. 11/010,029, filed Dec. 10, 2004, which is incorporated by reference.
  • FIELD OF THE INVENTION
  • The present invention relates to video encoding generally and, more particularly, to a programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding.
  • BACKGROUND OF THE INVENTION
  • Video picture quality can be decreased by aliasing. Aliasing refers to defects or distortion in a video picture due to sampling limitations. The defects commonly appear as jagged edges on diagonal lines and twinkling or brightening (beating/pulsing) in picture detail.
  • In the H.264 specification, I-frame beating/pulsing is a significant problem, more so than with other video compression standards. The main source of the problem in H.264 is poor DC quantization in the coded bit stream. Poor DC quantization occurs because the H.264 specification does not provide explicit syntax to support finer DC quantization (i.e., quantization matrices and/or DC quantization).
  • The VC1 specification has separate quantization for AC and DC coefficients. However, VC1 does not have separately controllable quantization for every separate frequency component. The quantization of lower frequency AC coefficient can be relatively poor in VC1.
  • It would be desirable to implement a video encoder with a programmable quantization dead zone and thresholds for standard-based H.264 and/or VC1 video encoding.
  • SUMMARY OF THE INVENTION
  • The present invention concerns a video encoder including an encoder circuit, a quantizer circuit and a control circuit. The encoder circuit may be configured to generate a plurality of coefficient values and motion vectors in response to a video stream, a first control signal, a second control signal, and a number of quantized values. The control circuit may be configured to (i) generate the first control signal, the second control signal, and a quantizer index signal and (ii) set two or more quantization dead zones and two or more offsets to different values, where the quantization dead zones and the offsets are independently programmable. The quantizer circuit may be configured to generate the number of quantized values in response to the coefficient values, the quantizer index signal, the two or more quantization dead zones and the two or more offsets. A first coefficient value may be quantized using a first quantization dead zone and a first offset. A second coefficient value may be quantized using a second quantization dead zone and a second offset. The first and second quantization dead zones generally have different values. The first and second offsets generally have different values. Quantization calculations determining each quantized value take into account respective quantization dead zones and offsets.
  • The objects, features and advantages of the present invention include providing programmable quantization dead zones and thresholds for standard-based H.264 and/or VC1 video encoding that may (i) adjust quantization dead zones, (ii) adjust quantization decision levels (or thresholds), (iii) improve I-frame visual quality, (iv) reduce I-frame beating and/or pulsing, (v) improve quantization of lower frequency AC coefficients in VC1, (vi) improve rate-distortion performance and/or (vii) improve overall visual quality.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • These and other objects, features and advantages of the present invention will be apparent from the following detailed description and the appended claims and drawings in which:
  • FIG. 1 is a block diagram illustrating various components of a compressed video system in accordance with a preferred embodiment of the present invention;
  • FIG. 2 is a more detailed block diagram illustrating a video encoder in accordance with a preferred embodiment of the present invention; and
  • FIG. 3 is a diagram illustrating an example of forward quantization parameters.
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • Referring to FIG. 1, a block diagram of a system 100 is shown illustrating components of a compressed video system in accordance with a preferred embodiment of the present invention. In general, a content provider 102 presents video image, audio or other data to be compressed and transmitted in a data stream 104 to an input of an encoder 106. The encoder 106 may be configured to generate a compressed bit stream 108 in response to the input stream 104. In one example, the encoder 106 maybe configured to encode the data stream 104 according to one or more encoding standards (e.g., MPEG-1, MPEG-2, MPEG-4, WMV, VC-9, VC-1, H.262, H.263, H.264, H.264/JVC/AVC/MPEG-4 part 10, AVS 1.0 and/or other standards for compression of audio-video data). The encoder 106 may be further configured to generate the bit stream 108 using a quantization process implemented with a programmable dead zone and thresholds.
  • The compressed bit stream 108 from the encoder 106 may be presented to an encoder transport system 110. An output of the encoder transport system 110 generally presents a signal 112 to a transmitter 114. The transmitter 114 transmits the compressed data via a transmission medium 116. In one example, the content provider 102 may comprise a video broadcast, DVD, or any other source of video data stream. The transmission medium 116 may comprise, for example, a broadcast, cable, satellite, network, DVD, hard drive, or any other medium implemented to carry, transfer, and/or store a compressed bit stream.
  • On a receiving side of the system 100, a receiver 118 generally receives the compressed data bit stream from the transmission medium 116. The receiver 118 presents an encoded bit stream 120 to a decoder transport system 122. The decoder transport system 122 generally presents the encoded bit stream via a link 124 to a decoder 126. The decoder 126 generally decompresses (decodes) the data bit stream and presents the data via a link 128 to an end user hardware block (or circuit) 130. The end user hardware block 130 may comprise a television, a monitor, a computer, a projector, a hard drive, a personal video recorder (PVR), an optical disk recorder (e.g., DVD), or any other medium implemented to carry, transfer, present, display and/or store the uncompressed bit stream (e.g., decoded video signal).
  • Referring to FIG. 2, a more detailed block diagram of the circuit 106 of FIG. 1 implemented in accordance with a preferred embodiment of the present invention is shown. The circuit 106 may be implemented as a video encoder. In one example, the circuit 106 may be compliant with one or more compression standards (e.g., H.264, VC1, and/or other specifications). The present invention generally relates to encoding rules for forward quantization. The present invention generally provides a flexible design for a quantization process that may achieve better video compression in standard based video encoding.
  • The circuit 106 may comprise a circuit (or block) 132, a circuit (or block) 134, a circuit (or block) 136 and a circuit (or block) 138. The block 132 may be implemented as an encoder/decoder (CODEC) circuit. The block 134 may be implemented as a quantizer circuit. The block 136 may be implemented as a coding control circuit. the block 138 may be implemented as an entropy coding block.
  • The circuit 132 may have an input 140 that may receive a signal (e.g., VIDEO IN), an output 142 that may present a signal (e.g., COEFF), an output 144 that may present a signal (e.g., MV) and an input 146 that may receive a signal (e.g., QUANTIZED VALUES). The signal VIDEO IN may comprise a video bit stream. The signal QUANTIZED VALUES may comprise a number of quantized values for transform coefficients. The signal MV may comprise a number of motion vectors. The block 132 may be configured to generate the signal COEFF and the signal MV in response to the signal VIDEO IN, the signal QUANTIZED VALUES and one or more control signals (e.g., CTR1 and INTER/INTRA) received from the block 136. In general, the block 132 may be implemented with conventional techniques.
  • The circuit 134 may have a first input that may receive the signal COEFF, a second input that may receive a signal (e.g., DEAD ZONES/THRESHOLDS), a third input that may receive a signal (e.g., QUANTIZER INDEX) and an output that may present the signal QUANTIZED VALUES. The signal DEAD ZONES/THRESHOLDS may be configured to signal (or set) quantization dead zones and thresholds of the circuit 134. The signal DEAD ZONES/THRESHOLDS may comprise information for programming one or more quantization dead zones and/or thresholds. In one example, the signal DEAD ZONES/THRESHOLDS may be configured to select between a number of predetermined quantization parameters. The circuit 134 may be configured to generate the signal QUANTIZED VALUES in response to the signal COEFF, the signal DEAD ZONES/THRESHOLDS and the signal QUANTIZER INDEX.
  • The circuit 136 may have a first output that may present the control signal CTR1, a second output that may present the signal DEAD ZONES/THRESHOLDS, a third output that may present the signal QUANTIZER INDEX, a fourth output that may present the signal INTER/INTRA and a fifth output that may present a signal (e.g., TRANSMITTED OR NOT). The signal INTER/INTRA may be implemented, in one example, as a decision flag. The signal TRANSMITTED OR NOT may be implemented, in one example, as a status flag. The signal CTR1 may be configured, in one example, to control a transformation process of the block 132. For example, the signal CTR1 may be configured to select between 8×8 and 4×4 transforms.
  • The circuit 136 may be configured, in one example, to generate the signals CTR1, INTER/INTRA, TRANSMITTED OR NOT, and QUANTIZER INDEX according to conventional techniques. The circuit 136 may be further configured, in one example, to generate the signal DEAD ZONES/THRESHOLDS in accordance with the present invention. In one example, the circuit 136 may be configured to generate the signal DEAD ZONE/THRESHOLDS using a number of different algorithms.
  • For example, the signal DEAD ZONES/THRESHOLDS may be varied for each macroblock, or may be fixed. In one example, statistics (e.g., mean, variance, etc.) of the original macroblocks (e.g., for intra or inter) or residual macroblocks (e.g., for inter) may be utilized to determine offset and threshold values. In another example, the offset and threshold values may be determined empirically (e.g., by experimentation). In one example, fixed values may be used for intra and inter, or fixed for a specific sequence type. For example, a video sequence classification may be used to determine the offset and/or threshold values. Also, quantization alone may be performed repeatedly with different values. In one example a measure of rate-distortion performance may be used in determining offset and threshold values. For example, a set of offset and threshold values may be selected for determining final quantized coefficients from a plurality of sets based on obtaining the best rate-distortion performance.
  • The block 138 may be implemented using conventional techniques. In one example, the block 138 may be configured to generate a compressed bit stream using one or more of variable length codes (VLC), context-adaptive variable length coding (CAVLC) and/or context-adaptive binary arithmetic coding (CABAC). The block 138 may be configured to generate the compressed bit stream in response to the signals INTER/INTRA, TRANSMITTED OR NOT, QUANTIZER INDEX, QUANTIZED VALUES and MV.
  • In one example, the circuit 132 may comprise a block (or circuit) 160, a block (or circuit) 162, a block (or circuit) 164, a block (or circuit) 166, a block (or circuit) 168, a block (or circuit) 170, a block (or circuit) 172 and a block (or circuit) 174. The block 160 may be implemented as a subtractor. The block 162 may be implemented as a multiplexer (or selector) circuit. The block 164 may be implemented as a transform block. In one example, the block 164 may be configured to perform a discrete cosine transform (DCT). The block 166 may be implemented as an inverse quantizer block. The block 168 may be implemented as an inverse transform block. In one example, the block 168 may be implemented as an inverse discrete cosine transform (IDCT) block. The block 170 may be implemented as an adder. The block 172 may be implemented as a picture memory. The block 174 may be implemented as a multiplexer (or selector) circuit. Other blocks or circuits may be implemented accordingly to meet the design criteria of a particular implementation.
  • The signal VIDEO_IN may be presented to a first input of the block 160 and a first input of the block 162. An output of the block 160 may present a signal to a second input of the block 162. The block 162 may be configured to couple either the first input or the second input to an output in response to the signal INTER/INTRA. The output of the block 162 may be presented to an input of the block 164. The block 164 may be configured to transform the signal received from the block 162 based upon the signal CTR1. An output of the block 164 may present the signal COEFF.
  • The block 166 may have an input that may receive the signal QUANTIZED VALUES. An output of the block 166 may be presented to an input of the block 168. An output of the block 168 may be presented to a first input of the block 170. An output of the block 170 may be presented to a first input of the block 172. The block 172 may have a first output that may present the signal MV, a second input that may receive the signal VIDEO IN and a second output that may present a signal to a second input of the block 160 and a first input of the block 174. An output of the block 174 may be presented to a second input of the block 170. The blocks 160-174 may be implemented using conventional techniques for block-based video encoders. Motion estimation and compensation may be implemented with conventional techniques. However, the implementation of motion estimation and compensation may vary substantially from one encoder to another encoder to meet the design criteria of a particular implementation.
  • Video compression standards (e.g., MPEG2, MPEG4, H.264, VC1, etc.) generally define a video decoder syntax and decoding process. As part of the decoding process, inverse quantization is generally defined in the standard. However, video encoding implementation is generally more flexible. For example, different encoders may implement forward quantization rules differently. Quantization as used herein generally refers to the process of assigning a range of coefficient levels to a predetermined reconstructed level that may be obtained from a quantization parameter.
  • Referring to FIG. 3, a diagram 200 is shown illustrating an example relationship between quantization parameters and coefficient levels. In general, coefficient values falling within a pair of decision thresholds (e.g., [Dn, Dn+1]) are generally assigned a corresponding quantized value (e.g., Qn). Conventional implementations of forward quantization have fixed decision thresholds. The present invention generally provides a process for forward quantization that allows for varying the decision thresholds D1-Dn (also called “offsets”) and the decision threshold D0 around zero (also called “dead zone”) independently for (i) all coefficients, (ii) luminance blocks and/or (iii) chrominance blocks.
  • In one example, a different quantization dead zone and a different quantization offset may be selected for different coefficients in a block. For example, a first coefficient in a block may be quantized using a first offset threshold value and a first dead zone threshold value. A second coefficient in the block may be quantized using a second offset threshold value and/or a second dead zone threshold value. The first and second offset values may be independently programmed. The first and second dead zone values may be independently programmed. The dead zone threshold values and offset threshold values may be independently programmed to different values.
  • In one example, the present invention may be implemented in an H.264 compliant encoder/decoder. In general, H.264 uses a 4×4 transform. The 4×4 transform produces 16 coefficients per block. H.264 also uses one luminance (luma or Y) and two chrominance (Cb an Cr) blocks for a 4×4 pixel area. H.264 may also implement 16×16 blocks. In H.264, the DC transformed coefficients for a 16×16 block are generally grouped into a 4×4 block of DC coefficients, and transformed and quantized again. In a preferred embodiment of the present invention, a different set of offsets and dead zone may be implemented for the latter step. In another example, the present invention may be implemented in a VC-1 compliant encoder/decoder. Examples of a transform/quantization process in accordance with preferred embodiments of the present invention are presented below.
  • In one example, an encoder in accordance with the present invention may implement H.264 forward transform and quantization similar to that described in the Joint Video Team (JVT) document JVT-B039.doc entitled “Low Complexity Transform and Quantization—Part II: Extensions” (which is hereby incorporated by reference in its entirety) with the exception of (i) material regarding residuals>9 bits, (ii) proposed changes to the standard that were not adopted and (iii) implementation of programmable offsets and thresholds. As used herein, the symbol // denotes division with rounding to the nearest integer:

  • a//2b=sign(a)×[(abs(a)+2b−1)>>b]
  • Quantization in accordance with the present invention may be implemented, in one example, with 126-bits for programmable parameter sets (e.g., 18 3-bit offsets and 18 4-bit deadzones). In one example, the present invention may implement 16 parameter sets for 8×8 or 4×4 luma and chroma quantization (e.g., Oij and dij), 1 parameter set for 2×2 chroma DC quantization (e.g., OCDC and dCDC), and 1 parameter set for 4×4 luma DC quantization (e.g., OLDC and dLDC).
  • In one example, a forward transform for 4×4 luma and chroma may be defined by the following equation:
  • Y = [ 1 1 1 1 2 1 - 1 - 2 1 - 1 - 1 1 1 - 2 2 - 1 ] [ x 00 x 01 x 02 x 03 x 10 x 11 x 12 x 13 x 20 x 21 x 22 x 23 x 30 x 31 x 32 x 33 ] [ 1 2 1 1 1 1 - 1 - 2 1 - 1 - 1 2 1 - 2 1 - 1 ]
  • Quantization for the 4×4 luma and chroma may be performed according to the following equation:
  • Y QQ ( i , j ) = [ Y ( i , j ) · Q ( QP %6 , i , j ) + ( ( ( Off ( i , j ) + 1 ) · 2 17 + QP / 6 ) / 16 ) ] / 2 17 - 3 + QP / 6 , i , j = 0 3 Y Q ( i , j ) = { 0 , if abs ( Y QQ ( i , j ) ) 7 + DZ ( i , j ) Y QQ ( i , j ) / 8 , else
  • where Y represents the transformed coefficients, Q represents the quantization coefficients, Off(i,j) has the same sign as the coefficient that is being quantized and a value +1 is added to the parameter Off(i,j). In one example, the quantization coefficients Q may be set as follows:
      • Q[QP %6][i][j]=quantMat[QP %6][0] for (i,j)={(0,0), (0,2), (2,0), (2,2)},
      • Q[QP %6][i][j]=quantMat[QP %6][1] for (i,j)={(1,1), (1,3), (3,1), (3,3)},
      • Q[QP %6][i][j]=quantMat[QP %6][2] otherwise.
      • R[QP %6][i][j]=dequantMat[QP %6][0] for (i,j)={(0,0), (0,2), (2,0), (2,2)},
      • R[QP %6][i][j]=dequantMat[QP %6][1] for (i,j)={(1,1), (1,3), (3,1), (3,3)},
      • R[QP %6][i][j]=dequantMat[QP %6][2] otherwise.
      • quantMat[6][3]={{13107, 5243, 8066}, {11916, 4660, 7490}, {10082, 4194, 6554}, {9362, 3647, 5825}, {8192, 3355, 5243}, {7282, 2893, 4559}};
      • dequantMat[6][3]={{10, 16, 13}, {11, 18, 14}, {13, 20, 16}, {14, 23, 18}, {16, 25, 20}, {18, 29, 23}}.
  • The sixteen 3-bit programmable offsets Oij for 4×4 luma and chroma quantization may be utilized according to the following equation:
  • Off ( i , j ) = sgn ( Y ( i , j ) ) · O 1 ( i , j ) , where O 1 = [ o 00 o 01 o 02 o 03 o 10 o 11 o 12 o 13 o 20 o 21 o 22 o 23 o 30 o 31 o 32 o 33 ]
  • where Oij=0 . . . 7. For example, for smaller sum of absolute differences (SAD) blocks and/or non-reference blocks (e.g. B-frames), O1 may be implemented as follows:
  • O 1 = [ 7 3 3 1 3 3 3 1 3 3 3 1 1 1 1 1 ]
  • while for all other cases, O1 may have the following value:
  • O 1 = [ 7 5 5 3 5 5 5 3 5 5 5 3 3 3 3 1 ]
  • where variance and significance change for non reference blocks. In an example where variance differs for intra blocks, the matrix O1 may be implemented for intra blocks as follows:
  • O 1 = [ 7 6 5 4 6 5 4 3 5 4 3 2 4 3 2 2 ]
  • and for inter blocks as follows:
  • O 1 = [ 4 4 3 3 4 3 3 2 3 3 2 1 3 2 1 1 ]
  • A more sophisticated scheme may be implemented where the matrix is changed based on reference/non-reference, intra/inter, or use of the macroblock variances and residuals variances directly.
  • The sixteen 3-bit programmable offsets Oij for 8×8 luma and chroma quantization may be utilized according to the following equation:
  • Off ( i , j ) = sgn ( Y ( i , j ) ) · O 1 ( i , j ) , where O 1 = [ o 00 o 01 o 02 o 02 o 03 o 03 o 03 o 03 o 10 o 11 o 12 o 12 o 13 o 13 o 13 o 13 o 20 o 21 o 22 o 22 o 23 o 23 o 23 o 23 o 20 o 21 o 22 o 22 o 23 o 23 o 23 o 23 o 30 o 31 o 32 o 32 o 33 o 33 o 33 o 33 o 30 o 31 o 32 o 32 o 33 o 33 o 33 o 33 o 30 o 31 o 32 o 32 o 33 o 33 o 33 o 33 o 30 o 31 o 32 o 32 o 33 o 33 o 33 o 33 ]
  • where the values in the square brackets are the same as in the 4×4 example above, but i and j=0 . . . 7.
  • The sixteen 4-bit programmable dead zones dij for 4×4 luma and chroma quantization may be utilized according to the following equation:
  • DZ ( i , j ) = D 1 ( i , j ) , where D 1 = [ d 00 d 01 d 02 d 03 d 10 d 11 d 12 d 13 d 20 d 21 d 22 d 23 d 30 d 31 d 32 d 33 ] ,
  • where dij=0 . . . 15. In one example, for smaller SAD blocks and/or non-reference blocks (e.g., B-frames), the value of D1 may be set as follows:
  • D 1 = [ 0 0 0 2 0 0 2 5 0 2 3 5 2 5 5 10 ]
  • Otherwise, the value of D1 may be set as follows:
  • D 1 = [ 0 0 0 1 0 0 1 3 0 1 2 3 1 3 3 6 ]
  • The intermediate values inside the square brackets in the above equations may have, in one example, a 32-bit range. However, the value Y may be implemented to fit in 16 bits.
  • The sixteen 4-bit programmable dead zones dij for 8×8 luma and chroma quantization may be utilized according to the following equation:
  • DZ ( i , j ) = D 1 ( i , j ) , where D 1 = [ d 00 d 00 d 00 d 00 d 01 d 01 d 02 d 03 d 00 d 00 d 00 d 00 d 01 d 01 d 02 d 03 d 00 d 00 d 00 d 00 d 01 d 01 d 02 d 03 d 00 d 00 d 00 d 00 d 01 d 01 d 02 d 03 d 10 d 10 d 10 d 10 d 11 d 11 d 12 d 13 d 10 d 10 d 10 d 10 d 11 d 11 d 12 d 13 d 20 d 20 d 20 d 20 d 21 d 21 d 22 d 23 d 30 d 30 d 30 d 30 d 31 d 31 d 32 d 33 ]
  • where the values in the square brackets are the same as in the 4×4 example above, but i and j=0 . . . 7.
  • For 4×4 luma DC transformation and quantization, the luma DC coefficients of a 16×16 block are grouped into a 4×4 block and further transformed, for intra frames, to improve compression. The forward transform for 4×4 luma DC transformation may be implemented as follows. The input matrix XD may be formed by picking out DC coefficients from the 16 transformed 4×4 blocks. DC coefficients may be transformed, for example, using a symmetric Hadamard transform. The symmetric Hadamard transform generally leads to essentially the same performance as the DCT-like transform in TML). The forward transform may be expressed by the following equation:
  • Y D = ( [ 1 1 1 1 1 1 - 1 - 1 1 - 1 - 1 1 1 - 1 1 - 1 ] [ x D 00 x D 01 x D 02 x D 03 x D 10 x D 11 x D 12 x D 13 x D 20 x D 12 x D 22 x D 23 x D 30 x D31 x D 32 x D 33 ] [ 1 1 1 1 1 1 - 1 - 1 1 - 1 - 1 1 1 - 1 1 - 1 ] ) // k p
  • with kp=k9=2. The value p is generally related to a bitdepth of the input video. For example for 8-bit video the value k9 is implemented. However, other values may be implemented for 10-bit and/or 12-bit input video.
  • The 4×4 luma DC quantization may be performed according to the following equation:
  • Y QQD ( i , j ) = [ Y D ( i , j ) · Q ( QP % 6 , 0 , 0 ) + ( ( sgn ( Y D ( i , j ) ) · ( o LDC + 1 ) · 2 17 + QP / 6 ) / 16 ) ] 2 17 - 3 + QP / 6 , i , j = 0 3 Y QD ( i , j ) = { 0 , abs ( Y QQD ( i , j ) ) 7 + d LDC Y QQD ( i , j ) / 8 , else
  • where OLDC=0 . . . 7, dLDC=0 . . . 15 (e.g., OLDC=7, dLDC=0).
  • The 2×2 chroma DC transform and quantization may be implemented as follows. In one example, the forward transform may be implemented by adding the chroma DC transform on top of the chroma transform. The input matrix may be formed by picking out DC coefficients from the 4 transformed 4×4 blocks. For example, denoting the coefficients as XD, the transform may be computed according to the following equation.
  • Y D = ( [ 1 1 1 - 1 ] [ x D 00 x D 01 x D 10 x D 11 ] [ 1 1 1 - 1 ] ) // k p
  • with kp=k9=1.
  • The 2×2 chroma DC quantization may performed according to the following equation:
  • Y QQD ( i , j ) = [ Y D ( i , j ) · Q ( QP % 6 , 0 , 0 ) + ( ( sgn ( Y D ( i , j ) ) · ( o CDC + 1 ) · 2 17 + QP / 6 ) / 16 ) ] 2 17 - 3 + QP / 6 , i , j = 0 3 Y QD ( i , j ) = { 0 , abs ( Y QQD ( i , j ) ) 7 + d CDC Y QQD ( i , j ) / 8 , else
  • where OCDC=0 . . . 7, dCDC=0 . . . 15 (e.g., OCDC=7, dCDC=0).
  • The dynamic range specifications for each stage may be summarized in the following TABLE 1:
  • TABLE 1
    Precision
    Input bits Output bits
    Luma and chroma transform and quantization
    Forward transform 9 15
    Quantization 15 10
    De-quantization 10 15
    Inverse transform 15 9
    Luma DC transform and quantization
    transform 13 16
    Quantization 16 12
    Inverse transform 12 12
    De-quantization 12 15
    Chroma DC transform and quantization
    transform 13 15
    Quantization 15 11
    Inverse transform 11 11
    De-quantization 11 15
  • The value in each cell generally corresponds to a 9-bit input residual.
  • In another example, the present invention may be implemented in a VC1 (a Society of Motion Picture and Television Engineers (SMPTE) standard based on Microsoft WMV9) compliant encoder/decoder. The present invention may modify forward transform and quantization specifications of the VC1 standard to provide for implementation of programmable offsets and dead zones. In one example, the present invention may be implemented as a stand alone device. In another example, the present invention may provide an implementation that re-uses MPEG2 or MPEG4 hardware.
  • In one example, the forward quantization of a given coefficient at an index (e.g., Idx) of a block may be implemented according to the following pseudocode:
  • Quantizer = (InvQuantScale * QuantMat[Idx] + 0x8000) >> 16;
    Tmp = Coeffs[Idx] * Quantizer;
    /* Handle rounding */
    if (Tmp < 0)
    {
      Tmp += ((~Qrnd & 0xf) << 12 | 0xfff;
    }
    else
    {
      Tmp += Qrnd << 12;
    }
    QCoeffs[Idx] = Tmp >> 16;

    where Quantizer represents the forward quantizer, InvQuantScale represents a scale factor for the block, QuantMat represents the forward quantization matrix, Coeffs represents the blocks of coefficients, Qrnd represents a rounding factor and QCoeffs represents a resulting quantized block. In one example, QuantMat[Idx] may be set to 16 and Qrnd may be set to a predetermined fixed value. In one example, the value Qrnd may be either 5 or 8 for intra pictures (I-pictures) and 0 or 1 for inter pictures. The particular value of Qrnd implemented may be determined based on user preference.
  • A programmable dead zone for VC1 may be implemented by varying the parameter Qrnd between a first predetermined value and a second predetermined value. In one example, Qrnd may be varied between 0 and 2̂4 (or between 0 and 15 inclusive). Programmable independent coefficient offsets for VC1 may be implemented by varying the parameter QuantMat[Idx] between a first predetermined value and a second predetermined value. The value of QuantMat[Idx] may be implemented, in one example, having (i) a range of up to 16 bits and (ii) a positive sign (e.g., 0 to (2̂16)−1). In one example, the value of QuantMat[Idx] may be varied between 0 and 31 inclusive.
  • The present invention may apply conventional MPEG-2 or MPEG-4 very large scale integrated (VLSI) implementations of quantization for standard-based VC1 encoding. For example, MPEG-2 parameters may be varied to apply programmable dead zones and offsets. In one example, MPEG-2 quantization matrices may be utilized in an alternative implementation of programmable offsets for VC1 quantization. Although the present invention has been illustrated with an 8-bit video input, other bitdepths (e.g., 10-bit, 12-bit, etc.) of input video streams may be implemented in accordance with the present invention. The present invention may also be implemented with other video formats (e.g., 4:2:2, 4:4:4, etc.). The present invention may provide advantages for Main Profile, High Profile and VC1 by matching quantization decisions to the coefficient distributions to improve rate-distortion performance of encoded bitstreams.
  • The function performed by the present invention may be implemented using a conventional general purpose digital computer programmed according to the teachings of the present specification, as will be apparent to those skilled in the relevant art(s). Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will also be apparent to those skilled in the relevant art(s).
  • The present invention may also be implemented by the preparation of ASICs, FPGAs, or by interconnecting an appropriate network of conventional component circuits, as is described herein, modifications of which will be readily apparent to those skilled in the art(s).
  • The present invention thus may also include a computer product which may be a storage medium including instructions which can be used to program a computer to perform a process in accordance with the present invention. The storage medium can include, but is not limited to, any type of disk including floppy disk, optical disk, CD-ROM, magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, Flash memory, magnetic or optical cards, or any type of media suitable for storing electronic instructions.
  • While the invention has been particularly shown and described with reference to the preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made without departing from the spirit and scope of the invention.

Claims (20)

1. An apparatus comprising:
an encoder circuit configured to generate a plurality of coefficient values and motion vectors in response to a video stream, a first control signal, a second control signal, and a number of quantized values;
a control circuit configured to (i) generate said first control signal, said second control signal, and a quantizer index signal and (ii) set two or more quantization dead zones and two or more offsets to different values, wherein said quantization dead zones and said offsets are independently programmable; and
a quantizer circuit configured to generate said number of quantized values in response to said coefficient values, said quantizer index signal, said two or more quantization dead zones and said two or more offsets, wherein a first coefficient value is quantized using a first quantization dead zone and a first offset, a second coefficient value is quantized using a second quantization dead zone and a second offset, said first and second quantization dead zones have different values, said first and second offsets have different values, and quantization calculations determining each quantized value take into account respective quantization dead zones and offsets.
2. The apparatus according to claim 1, wherein:
generating each quantized value comprises (i) scaling a corresponding coefficient value based upon a respective quantizer coefficient and a respective offset value, (ii) setting said quantized value to the scaled coefficient value when a magnitude of said scaled coefficient value is greater than a threshold based upon a respective dead zone value, and setting said quantized value to zero when the magnitude of said scaled coefficient value is not greater than the threshold value based upon the respective dead zone value.
3. The apparatus according to claim 1, wherein:
4×4 luma and chroma quantization is performed according to a first equation comprising
Y QQ ( i , j ) = [ Y ( i , j ) · Q ( QP % 6 , i , j ) + ( ( ( Off ( i , j ) + 1 ) · 2 17 + QP / 6 ) / 16 ) ] 2 17 - 3 + QP / 6 , i , j = 0 3 Y Q ( i , j ) = { 0 , if abs ( Y QQD ( i , j ) ) 7 + DZ ( i , j ) Y QQD ( i , j ) / 8 , else
where Y represents the coefficient values, Q represents the quantization coefficients, Off(i,j) represents the offset values and DZ(i,j) represents the quantization dead zone values;
4×4 luma DC quantization is performed according to a second equation comprising
Y QQD ( i , j ) = [ Y D ( i , j ) · Q ( QP % 6 , 0 , 0 ) + ( ( sgn ( Y D ( i , j ) ) · ( o LDC + 1 ) · 2 17 + QP / 6 ) / 16 ) ] 2 17 - 3 + QP / 6 , i , j = 0 3 Y QD ( i , j ) = { 0 , abs ( Y QQD ( i , j ) ) 7 + d LDC Y QQD ( i , j ) / 8 , else
where OLDC=0 . . . 7 has a value ranging from zero through 7 and dLDC has a value ranging from zero through 15; and
2×2 chroma DC quantization is performed according to a third equation comprising
Y QQD ( i , j ) = [ Y D ( i , j ) · Q ( QP % 6 , 0 , 0 ) + ( ( sgn ( Y D ( i , j ) ) · ( o CDC + 1 ) · 2 17 + QP / 6 ) / 16 ) ] 2 17 - 3 + QP / 6 , i , j = 0 3 Y QD ( i , j ) = { 0 , abs ( Y QQD ( i , j ) ) 7 + d CDC Y QQD ( i , j ) / 8 , else
where OCDC has a value ranging from zero through 7 and dCDC has a value ranging from zero through 15.
4. The apparatus according to claim 1, wherein said control circuit is further configured to provide a plurality of dead zone threshold values and offset threshold values.
5. The apparatus according to claim 1, wherein said control circuit is configured to provide a plurality of parameter sets.
6. The apparatus according to claim 5, wherein said plurality of parameter sets comprises a plurality of parameter pairs.
7. The apparatus according to claim 6, wherein said plurality of parameter pairs each comprise an offset parameter and a dead zone parameter.
8. The apparatus according to claim 5, wherein said plurality of parameter sets comprises 16 parameter sets configured for either 8×8 or 4×4 luma and chroma quantization, one parameter set configured for 2×2 chroma DC quantization and one parameter set configured for 4×4 luma DC quantization.
9. The apparatus according to claim 1, wherein said encoder circuit comprises a VC1 compliant encoder/decoder (CODEC) and said quantizer circuit is configured to implement a variable dead zone comprising a rounding factor configured to vary between a first predetermined value and a second predetermined value.
10. The apparatus according to claim 1, wherein said quantizer circuit is further configured to implement a programmable offset for standard-based VC1 encoding, wherein a forward quantization matrix parameter is variable between a first predetermined value and a second predetermined value.
11. A video encoder comprising:
means for generating a plurality of coefficient values and motion vectors in response to a video stream, a first control signal, a second control signal, and a number of quantized values;
means for (i) generating said first control signal, said second control signal, and a quantizer index signal and (ii) setting two or more quantization dead zones and two or more offsets to different values, wherein said two or more quantization dead zones and said two or more offsets are independently programmable;
means for generating said number of quantized values in response to said coefficient values, said quantizer index signal, said two or more quantization dead zones and said two or more offsets, wherein a first coefficient value is quantized using a first quantization dead zone and a first offset, a second coefficient value is quantized using a second quantization dead zone and a second offset, said first and second quantization dead zones have different values, said first and second offsets have different values, and generating each quantized value comprises (i) scaling a corresponding coefficient value based upon a respective quantizer coefficient and a respective offset value, (ii) setting said quantized value to the scaled coefficient value when a magnitude of said scaled coefficient value is greater than a threshold based upon a respective dead zone value, and setting said quantized value to zero when the magnitude of said scaled coefficient value is not greater than the threshold value based upon the respective dead zone value.
12. A method for encoding video comprising:
generating a plurality of coefficient values and motion vectors with a video encoder in response to a video stream, a first control signal, a second control signal, and a number of quantized values;
generating said first control signal, said second control signal, and a quantizer index signal;
setting two or more quantization dead zones and two or more offsets to different values, wherein said two or more quantization dead zones and said two or more offsets are independently programmable; and
generating said number of quantized values in said video encoder in response to said coefficient values, said quantizer index signal, said two or more quantization dead zones and said two or more offsets, wherein a first coefficient value is quantized using a first quantization dead zone and a first offset, a second coefficient value is quantized using a second quantization dead zone and a second offset, said first and second quantization dead zones have different values, said first and second offsets have different values, and quantization calculations determining each quantized value take into account respective quantization dead zones and offsets.
13. The method according to claim 12, further comprising:
generating each quantized value by (i) scaling a corresponding coefficient value based upon a respective quantizer coefficient and a respective offset value, (ii) setting said quantized value to the scaled coefficient value when a magnitude of said scaled coefficient value is greater than a threshold based upon a respective dead zone value, and setting said quantized value to zero when the magnitude of said scaled coefficient value is not greater than the threshold value based upon the respective dead zone value.
14. The method according to claim 12, further comprising:
providing a plurality of parameter sets.
15. The method according to claim 14, wherein said plurality of parameter sets comprises a plurality of parameter pairs, wherein said plurality of parameter pairs each comprise an offset parameter and a dead zone parameter.
16. The method according to claim 14, wherein said plurality of parameter sets comprises 16 parameter sets configured for either 8×8 or 4×4 luma and chroma quantization, one parameter set configured for 2×2 chroma DC quantization and one parameter set configured for 4×4 luma DC quantization.
17. The method according to claim 12, wherein said video encoder is implemented as a VC1 compliant encoder/decoder (CODEC) and a quantizer circuit configured to implement one or more of (a) a variable dead zone comprising a rounding factor configured to vary between a first predetermined value and a second predetermined value, (b) a programmable offset for standard-based VC1 encoding, wherein a forward quantization matrix parameter is variable between a first predetermined value and a second predetermined value and (c) a variable dead zone comprising a rounding factor configured to vary between a first predetermined value and a second predetermined value, and a programmable offset for standard base VC1 encoding, wherein a forward quantization matrix parameter is variable between a third predetermined value and a fourth predetermined value.
18. The method according to claim 12, wherein:
4×4 luma and chroma quantization is performed according to a first equation comprising
Y QQ ( i , j ) = [ Y ( i , j ) · Q ( QP % 6 , i , j ) + ( ( ( Off ( i , j ) + 1 ) · 2 17 + QP / 6 ) / 16 ) ] 2 17 - 3 + QP / 6 , i , j = 0 3 Y Q ( i , j ) = { 0 , if abs ( Y QQD ( i , j ) ) 7 + DZ ( i , j ) Y QQD ( i , j ) / 8 , else
where Y represents the coefficient values, Q represents the quantization coefficients, Off(i,j) represents the offset values and DZ(i,j) represents the quantization dead zone values;
4×4 luma DC quantization is performed according to a second equation comprising
Y QQD ( i , j ) = [ Y D ( i , j ) · Q ( QP % 6 , 0 , 0 ) + ( ( sgn ( Y D ( i , j ) ) · ( o LDC + 1 ) · 2 17 + QP / 6 ) / 16 ) ] 2 17 - 3 + QP / 6 , i , j = 0 3 Y QD ( i , j ) = { 0 , abs ( Y QQD ( i , j ) ) 7 + d LDC Y QQD ( i , j ) / 8 , else
where OLDC=0 . . . 7 has a value ranging from zero through 7 and dLDC has a value ranging from zero through 15; and
2×2 chroma DC quantization is performed according to a third equation comprising
Y QQD ( i , j ) = [ Y D ( i , j ) · Q ( QP % 6 , 0 , 0 ) + ( ( sgn ( Y D ( i , j ) ) · ( o CDC + 1 ) · 2 17 + QP / 6 ) / 16 ) ] 2 17 - 3 + QP / 6 , i , j = 0 3 Y QD ( i , j ) = { 0 , abs ( Y QQD ( i , j ) ) 7 + d CDC Y QQD ( i , j ) / 8 , else
where OCDC has a value ranging from zero through 7 and dCDC has a value ranging from zero through 15.
19. The method according to claim 18, wherein:
sixteen 3-bit programmable offsets Oij for 4×4 luma and chroma quantization are utilized according to a first equation
Off ( i , j ) = sgn ( Y ( i , j ) ) · O 1 ( i , j ) , where O 1 = [ o 00 o 01 o 02 o 03 o 10 o 11 o 12 o 13 o 20 o 21 o 22 o 23 o 30 o 31 o 32 o 33 ]
where Oij has a value ranging from zero through seven; and
sixteen 4-bit programmable dead zones dij for 4×4 luma and chroma quantization are utilized according to a second equation
D Z ( i , j ) = D 1 ( i , j ) , where D 1 = [ d 00 d 01 d 02 d 03 d 10 d 11 d 12 d 13 d 20 d 21 d 22 d 23 d 30 d 31 d 32 d 33 ] ,
where dij has a value ranging from zero through fifteen.
20. The method according to claim 18, wherein:
sixteen 3-bit programmable offsets Oij for 8×8 luma and chroma quantization are utilized according to a third equation
Off ( i , j ) = sgn ( Y ( i , j ) ) · O 1 ( i , j ) , where O 1 = [ o 00 o 01 o 02 o 02 o 03 o 03 o 03 o 03 o 10 o 11 o 12 o 12 o 13 o 13 o 13 o 13 o 20 o 21 o 22 o 22 o 23 o 23 o 23 o 23 o 20 o 21 o 22 o 22 o 23 o 23 o 23 o 23 o 30 o 31 o 32 o 32 o 33 o 33 o 33 o 33 o 30 o 31 o 32 o 32 o 33 o 33 o 33 o 33 o 30 o 31 o 32 o 32 o 33 o 33 o 33 o 33 o 30 o 31 o 32 o 32 o 33 o 33 o 33 o 33 ]
and sixteen 4-bit programmable dead zones dij for 8×8 luma and chroma quantization are utilized according to a fourth equation
DZ ( i , j ) = D 1 ( i , j ) , where D 1 = [ d 00 d 00 d 00 d 00 d 01 d 01 d 02 d 03 d 00 d 00 d 00 d 00 d 01 d 01 d 02 d 03 d 00 d 00 d 00 d 00 d 01 d 01 d 02 d 03 d 00 d 00 d 00 d 00 d 01 d 01 d 02 d 03 d 10 d 10 d 10 d 10 d 11 d 11 d 12 d 13 d 10 d 10 d 10 d 10 d 11 d 11 d 12 d 13 d 20 d 20 d 20 d 20 d 21 d 21 d 22 d 23 d 30 d 30 d 30 d 30 d 31 d 31 d 32 d 33 ]
where the values in the square brackets correspond to the respective values used for 4×4 luma and chroma quantization, but i and j range from zero to seven.
US12/568,878 2004-12-10 2009-09-29 Programmable quantization dead zone and threshold for standard-based h.264 and/or vc1 video encoding Abandoned US20100061449A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/568,878 US20100061449A1 (en) 2004-12-10 2009-09-29 Programmable quantization dead zone and threshold for standard-based h.264 and/or vc1 video encoding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/010,029 US7620103B2 (en) 2004-12-10 2004-12-10 Programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding
US12/568,878 US20100061449A1 (en) 2004-12-10 2009-09-29 Programmable quantization dead zone and threshold for standard-based h.264 and/or vc1 video encoding

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/010,029 Continuation US7620103B2 (en) 2004-12-10 2004-12-10 Programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding

Publications (1)

Publication Number Publication Date
US20100061449A1 true US20100061449A1 (en) 2010-03-11

Family

ID=36583798

Family Applications (2)

Application Number Title Priority Date Filing Date
US11/010,029 Active 2027-09-20 US7620103B2 (en) 2004-12-10 2004-12-10 Programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding
US12/568,878 Abandoned US20100061449A1 (en) 2004-12-10 2009-09-29 Programmable quantization dead zone and threshold for standard-based h.264 and/or vc1 video encoding

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US11/010,029 Active 2027-09-20 US7620103B2 (en) 2004-12-10 2004-12-10 Programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding

Country Status (1)

Country Link
US (2) US7620103B2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110188573A1 (en) * 2010-02-04 2011-08-04 Canon Kabushiki Kaisha Method and Device for Processing a Video Sequence
TWI685245B (en) * 2013-04-08 2020-02-11 新力股份有限公司 Data encoding and decoding
US11490083B2 (en) 2020-02-05 2022-11-01 Qualcomm Incorporated Learned low-complexity adaptive quantization for video compression
US12058348B2 (en) 2020-04-17 2024-08-06 Qualcomm Incorporated Parallelized rate-distortion optimized quantization using deep learning

Families Citing this family (79)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
PL2384002T3 (en) 2004-01-30 2016-07-29 Panasonic Ip Corp America Moving picture decoding method using additional quantization matrices
US7620103B2 (en) * 2004-12-10 2009-11-17 Lsi Corporation Programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding
US8902989B2 (en) * 2005-04-27 2014-12-02 Broadcom Corporation Decoder system for decoding multi-standard encoded video
US8422546B2 (en) * 2005-05-25 2013-04-16 Microsoft Corporation Adaptive video encoding using a perceptual model
CN100370834C (en) * 2005-08-08 2008-02-20 北京中星微电子有限公司 Coefficient pantagraph calculating module in multi-mode image encoding and decoding chips
ES2383611T3 (en) * 2005-08-15 2012-06-22 Nds Limited Video cheat mode system
US7778476B2 (en) * 2005-10-21 2010-08-17 Maxim Integrated Products, Inc. System and method for transform coding randomization
US20070147496A1 (en) * 2005-12-23 2007-06-28 Bhaskar Sherigar Hardware implementation of programmable controls for inverse quantizing with a plurality of standards
US8059721B2 (en) 2006-04-07 2011-11-15 Microsoft Corporation Estimating sample-domain distortion in the transform domain with rounding compensation
US7995649B2 (en) 2006-04-07 2011-08-09 Microsoft Corporation Quantization adjustment based on texture level
US20070237237A1 (en) * 2006-04-07 2007-10-11 Microsoft Corporation Gradient slope detection for video compression
US8503536B2 (en) 2006-04-07 2013-08-06 Microsoft Corporation Quantization adjustments for DC shift artifacts
US8711925B2 (en) 2006-05-05 2014-04-29 Microsoft Corporation Flexible quantization
JP4250638B2 (en) * 2006-06-30 2009-04-08 株式会社東芝 Video encoding apparatus and method
US7885476B2 (en) * 2006-12-14 2011-02-08 Sony Corporation System and method for effectively performing an adaptive encoding procedure
US8238424B2 (en) 2007-02-09 2012-08-07 Microsoft Corporation Complexity-based adaptive preprocessing for multiple-pass video compression
US8498335B2 (en) * 2007-03-26 2013-07-30 Microsoft Corporation Adaptive deadzone size adjustment in quantization
US20080240257A1 (en) * 2007-03-26 2008-10-02 Microsoft Corporation Using quantization bias that accounts for relations between transform bins and quantization bins
US8243797B2 (en) 2007-03-30 2012-08-14 Microsoft Corporation Regions of interest for quality adjustments
US8442337B2 (en) 2007-04-18 2013-05-14 Microsoft Corporation Encoding adjustments for animation content
US8331438B2 (en) * 2007-06-05 2012-12-11 Microsoft Corporation Adaptive selection of picture-level quantization parameters for predicted video pictures
US8189933B2 (en) 2008-03-31 2012-05-29 Microsoft Corporation Classifying and controlling encoding quality for textured, dark smooth and smooth video content
US8897359B2 (en) 2008-06-03 2014-11-25 Microsoft Corporation Adaptive quantization for enhancement layer video coding
US8687702B2 (en) * 2008-10-27 2014-04-01 Advanced Micro Devices, Inc. Remote transmission and display of video data using standard H.264-based video codecs
JP5749710B2 (en) * 2009-05-16 2015-07-15 トムソン ライセンシングThomson Licensing Method and apparatus for adjusting improved quantization rounding offset for video encoding and decoding
US8294781B2 (en) * 2009-10-20 2012-10-23 Apple Inc. System and method for sharpening image data
US8593483B2 (en) * 2009-10-20 2013-11-26 Apple Inc. Temporal filtering techniques for image signal processing
US8638342B2 (en) * 2009-10-20 2014-01-28 Apple Inc. System and method for demosaicing image data using weighted gradients
US8259198B2 (en) * 2009-10-20 2012-09-04 Apple Inc. System and method for detecting and correcting defective pixels in an image sensor
US8472712B2 (en) * 2009-10-20 2013-06-25 Apple Inc. System and method for applying lens shading correction during image processing
US8705619B2 (en) 2010-04-09 2014-04-22 Sony Corporation Directional discrete wavelet transform (DDWT) for video compression applications
US8929440B2 (en) 2010-04-09 2015-01-06 Sony Corporation QP adaptive coefficients scanning and application
JP2011259362A (en) * 2010-06-11 2011-12-22 Sony Corp Image processing system and method of the same
US8525895B2 (en) 2010-07-29 2013-09-03 Apple Inc. Binning compensation filtering techniques for image signal processing
US8493482B2 (en) 2010-08-18 2013-07-23 Apple Inc. Dual image sensor image processing system and method
US8922704B2 (en) 2010-09-01 2014-12-30 Apple Inc. Techniques for collection of auto-focus statistics
US8605167B2 (en) 2010-09-01 2013-12-10 Apple Inc. Flexible color space selection for auto-white balance processing
US8531542B2 (en) 2010-09-01 2013-09-10 Apple Inc. Techniques for acquiring and processing statistics data in an image signal processor
US9398205B2 (en) 2010-09-01 2016-07-19 Apple Inc. Auto-focus control using image statistics data with coarse and fine auto-focus scores
US8736700B2 (en) 2010-09-30 2014-05-27 Apple Inc. Techniques for synchronizing audio and video data in an image signal processing system
US8508621B2 (en) 2010-09-30 2013-08-13 Apple Inc. Image sensor data formats and memory addressing techniques for image signal processing
US8471932B2 (en) 2010-09-30 2013-06-25 Apple Inc. Spatial filtering for image signal processing
US8508612B2 (en) 2010-09-30 2013-08-13 Apple Inc. Image signal processor line buffer configuration for processing ram image data
US8488055B2 (en) 2010-09-30 2013-07-16 Apple Inc. Flash synchronization using image sensor interface timing signal
US8629913B2 (en) 2010-09-30 2014-01-14 Apple Inc. Overflow control techniques for image signal processing
US8786625B2 (en) 2010-09-30 2014-07-22 Apple Inc. System and method for processing image data using an image signal processor having back-end processing logic
KR20130054408A (en) 2010-10-05 2013-05-24 제너럴 인스트루먼트 코포레이션 Coding and decoding utilizing adaptive context model selection with zigzag scan
GB2486692B (en) * 2010-12-22 2014-04-16 Canon Kk Method for encoding a video sequence and associated encoding device
US8938001B1 (en) 2011-04-05 2015-01-20 Google Inc. Apparatus and method for coding using combinations
US8891616B1 (en) * 2011-07-27 2014-11-18 Google Inc. Method and apparatus for entropy encoding based on encoding cost
US8885706B2 (en) 2011-09-16 2014-11-11 Google Inc. Apparatus and methodology for a video codec system with noise reduction capability
US20130077673A1 (en) * 2011-09-23 2013-03-28 Cisco Technology, Inc. Multi-processor compression system
US20130114688A1 (en) * 2011-11-07 2013-05-09 Louis Joseph Kerofsky Video Decoder with Constrained Dynamic Range
US9167261B2 (en) 2011-11-07 2015-10-20 Sharp Laboratories Of America, Inc. Video decoder with constrained dynamic range
US9247257B1 (en) 2011-11-30 2016-01-26 Google Inc. Segmentation based entropy encoding and decoding
US9131073B1 (en) 2012-03-02 2015-09-08 Google Inc. Motion estimation aided noise reduction
US11039138B1 (en) 2012-03-08 2021-06-15 Google Llc Adaptive coding of prediction modes using probability distributions
WO2013154028A1 (en) * 2012-04-13 2013-10-17 ソニー株式会社 Image processing device, and method
US9105078B2 (en) 2012-05-31 2015-08-11 Apple Inc. Systems and methods for local tone mapping
US8953882B2 (en) 2012-05-31 2015-02-10 Apple Inc. Systems and methods for determining noise statistics of image data
US9077943B2 (en) 2012-05-31 2015-07-07 Apple Inc. Local image statistics collection
US8917336B2 (en) 2012-05-31 2014-12-23 Apple Inc. Image signal processing involving geometric distortion correction
US9743057B2 (en) 2012-05-31 2017-08-22 Apple Inc. Systems and methods for lens shading correction
US8872946B2 (en) 2012-05-31 2014-10-28 Apple Inc. Systems and methods for raw image processing
US9142012B2 (en) 2012-05-31 2015-09-22 Apple Inc. Systems and methods for chroma noise reduction
US9025867B2 (en) 2012-05-31 2015-05-05 Apple Inc. Systems and methods for YCC image processing
US8817120B2 (en) 2012-05-31 2014-08-26 Apple Inc. Systems and methods for collecting fixed pattern noise statistics of image data
US9031319B2 (en) 2012-05-31 2015-05-12 Apple Inc. Systems and methods for luma sharpening
US9014504B2 (en) 2012-05-31 2015-04-21 Apple Inc. Systems and methods for highlight recovery in an image signal processor
US9332239B2 (en) 2012-05-31 2016-05-03 Apple Inc. Systems and methods for RGB image processing
US11089247B2 (en) 2012-05-31 2021-08-10 Apple Inc. Systems and method for reducing fixed pattern noise in image data
US9774856B1 (en) 2012-07-02 2017-09-26 Google Inc. Adaptive stochastic entropy coding
US9344729B1 (en) 2012-07-11 2016-05-17 Google Inc. Selective prediction signal filtering
US9509998B1 (en) 2013-04-04 2016-11-29 Google Inc. Conditional predictive multi-symbol run-length coding
US9154782B2 (en) * 2013-09-16 2015-10-06 Magnum Semiconductor, Inc. Apparatuses and methods for adjusting coefficients using dead zones
US9392288B2 (en) 2013-10-17 2016-07-12 Google Inc. Video coding using scatter-based scan tables
US9179151B2 (en) 2013-10-18 2015-11-03 Google Inc. Spatial proximity context entropy coding
US10102613B2 (en) 2014-09-25 2018-10-16 Google Llc Frequency-domain denoising
KR102636100B1 (en) * 2016-12-16 2024-02-13 삼성전자주식회사 Encoder performing quantization based on deadzone and video processing system comprising the same

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4969040A (en) * 1989-10-26 1990-11-06 Bell Communications Research, Inc. Apparatus and method for differential sub-band coding of video signals
US5214502A (en) * 1991-01-11 1993-05-25 Sony Broadcast & Communications Limited Compression of video signals
US7620103B2 (en) * 2004-12-10 2009-11-17 Lsi Corporation Programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040179608A1 (en) * 2003-02-27 2004-09-16 Intel Corporation Multiple-description coding methods and apparatus
US7738554B2 (en) * 2003-07-18 2010-06-15 Microsoft Corporation DC coefficient signaling at small quantization step sizes

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4969040A (en) * 1989-10-26 1990-11-06 Bell Communications Research, Inc. Apparatus and method for differential sub-band coding of video signals
US5214502A (en) * 1991-01-11 1993-05-25 Sony Broadcast & Communications Limited Compression of video signals
US7620103B2 (en) * 2004-12-10 2009-11-17 Lsi Corporation Programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110188573A1 (en) * 2010-02-04 2011-08-04 Canon Kabushiki Kaisha Method and Device for Processing a Video Sequence
TWI685245B (en) * 2013-04-08 2020-02-11 新力股份有限公司 Data encoding and decoding
US11490083B2 (en) 2020-02-05 2022-11-01 Qualcomm Incorporated Learned low-complexity adaptive quantization for video compression
US12058348B2 (en) 2020-04-17 2024-08-06 Qualcomm Incorporated Parallelized rate-distortion optimized quantization using deep learning

Also Published As

Publication number Publication date
US7620103B2 (en) 2009-11-17
US20060126724A1 (en) 2006-06-15

Similar Documents

Publication Publication Date Title
US7620103B2 (en) Programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding
US20210344965A1 (en) Image processing device and image processing method
US9942570B2 (en) Resource efficient video processing via prediction error computational adjustments
US8184699B2 (en) Method of reducing computations in intra-prediction and mode decision processes in a digital video encoder
US7949044B2 (en) Method for coefficient bitdepth limitation, encoder and bitstream generation apparatus
US20180352238A1 (en) Dc coefficient signaling at small quantization step sizes
US7792193B2 (en) Image encoding/decoding method and apparatus therefor
US6259741B1 (en) Method of architecture for converting MPEG-2 4:2:2-profile bitstreams into main-profile bitstreams
US7957600B2 (en) Methods and systems for rate-distortion optimized quantization of transform blocks in block transform video coding
US7272181B2 (en) Method and apparatus for estimating and controlling the number of bits output from a video coder
US8077769B2 (en) Method of reducing computations in transform and scaling processes in a digital video encoder using a threshold-based approach
US7373009B2 (en) Method and apparatus for efficient transmission and decoding of quantization matrices
JPH08256341A (en) Image signal coding method, image signal coder, image signal recording medium and image signal decoder
US11671609B2 (en) DC coefficient signaling at small quantization step sizes
JP2006191253A (en) Rate converting method and rate converter
US9071844B2 (en) Motion estimation with motion vector penalty
JP2003230142A (en) Method and apparatus for coding image information, and program and recording medium
JP4292659B2 (en) Image information conversion apparatus and image information conversion method
JP2002262293A (en) Moving image decoder and moving image decoding method
JP4292658B2 (en) Image information conversion apparatus and image information conversion method
Westwater et al. The MPEG Video Compression Standard
Pan Digital Video Coding–Techniques and Standards
STANDARD THE MPEG VroEO COMPRESSION STANDARD
Reed Improvement of MPEG-2 compression by position-dependent encoding
Joshi et al. Proposed H. 264/AVC for Real Time Applications in DVB-H Sever

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION