WO2010150486A1 - 画像符号化方法及び画像符号化装置 - Google Patents
画像符号化方法及び画像符号化装置 Download PDFInfo
- Publication number
- WO2010150486A1 WO2010150486A1 PCT/JP2010/003985 JP2010003985W WO2010150486A1 WO 2010150486 A1 WO2010150486 A1 WO 2010150486A1 JP 2010003985 W JP2010003985 W JP 2010003985W WO 2010150486 A1 WO2010150486 A1 WO 2010150486A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- encoding
- target block
- block
- condition
- value
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 95
- 230000000694 effects Effects 0.000 claims abstract description 314
- 230000033001 locomotion Effects 0.000 claims description 138
- 238000012545 processing Methods 0.000 claims description 71
- 239000013598 vector Substances 0.000 claims description 66
- 238000004364 calculation method Methods 0.000 claims description 46
- 238000006243 chemical reaction Methods 0.000 claims description 15
- 230000008859 change Effects 0.000 claims description 3
- 238000011156 evaluation Methods 0.000 description 71
- 238000010586 diagram Methods 0.000 description 27
- 230000008569 process Effects 0.000 description 26
- 230000006866 deterioration Effects 0.000 description 17
- 230000003287 optical effect Effects 0.000 description 17
- 230000005236 sound signal Effects 0.000 description 14
- 239000000872 buffer Substances 0.000 description 13
- 238000000605 extraction Methods 0.000 description 8
- 230000005540 biological transmission Effects 0.000 description 7
- 230000000903 blocking effect Effects 0.000 description 7
- 230000015556 catabolic process Effects 0.000 description 7
- 238000006731 degradation reaction Methods 0.000 description 7
- 238000004891 communication Methods 0.000 description 6
- 238000013139 quantization Methods 0.000 description 6
- 230000006835 compression Effects 0.000 description 5
- 238000007906 compression Methods 0.000 description 5
- 238000009826 distribution Methods 0.000 description 5
- 239000004065 semiconductor Substances 0.000 description 5
- 238000001228 spectrum Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 238000012854 evaluation process Methods 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 238000000638 solvent extraction Methods 0.000 description 3
- 230000001360 synchronised effect Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000005192 partition Methods 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000001678 irradiating effect Effects 0.000 description 1
- 239000010410 layer Substances 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 239000002356 single layer Substances 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/24—Systems for the transmission of television signals using pulse code modulation
- H04N7/52—Systems for transmission of a pulse code modulated video signal with one or more other pulse code modulated signals, e.g. an audio signal or a synchronizing signal
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/14—Coding unit complexity, e.g. amount of activity or edge presence estimation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/593—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Definitions
- the present invention relates to an image encoding method and an image encoding apparatus, and more particularly to an image encoding method and an image encoding apparatus that determine an encoding condition for each block and encode a target block according to the determined encoding condition.
- Non-Patent Document 1 an uncompressed original picture is divided into a plurality of M ⁇ N coding units.
- M represents the number of samples in the width
- N represents the number of samples in the height.
- the values of M and N are 16 for example.
- the obtained encoding unit is encoded sequentially from the next to the next.
- the encoding unit is usually called a “macroblock”.
- “macroblock” is hereinafter used as a term indicating an M ⁇ N coding unit in the image / video coding algorithm.
- Each macroblock is further divided into a plurality of K ⁇ L sub-macroblocks.
- K represents the number of samples in the width
- L represents the number of samples in the height.
- the values of K and L are, for example, 8 respectively.
- the encoding process is performed in order from the next to the next, but may be performed independently.
- sub-macroblock division for encoding in units of sub-macroblocks include a transform block size, an intra prediction block size, and a motion compensation block size.
- it may be divided into sub-macroblocks of different sizes and differently encoded.
- the above-described conventional technique has a problem that an increase in calculation amount and deterioration in image quality cannot be sufficiently suppressed.
- the encoding cost is composed of a rate component (a bit amount necessary for storing the encoding information) and a distortion component (a difference amount between the reconstructed sample and the uncompressed original image sample).
- this method has a problem that encoding artifacts such as ringing artifacts and blocking artifacts are not usually reflected in the encoding cost. Therefore, the best image quality may not be obtained using the selected sub-macroblock coding tool.
- Ringing artifacts and blocking artifacts are artifacts that stand out in compressed / moving images. This is particularly noticeable in an image / video coding algorithm such as the ISO / IEC 14496-2 part 10 international standard that achieves a high compression rate.
- a conventional technique for selecting a sub-macroblock coding tool to perform using an exhaustive selection process based on the coding cost based on the relationship between coding rate and distortion is not suitable for the human visual system.
- the encoding cost is calculated without considering or predicting the encoding effect. Therefore, the determined sub-macroblock coding tool does not efficiently reduce ringing artifacts or blocking artifacts.
- the exhaustive selection process in the prior art requires a large amount of operations in the encoder.
- an object of the present invention is to provide an image encoding method and an image encoding apparatus that can suppress deterioration in image quality.
- an image encoding method is an image encoding method for encoding an original image, the original image being divided into a plurality of blocks, A coding condition of a target block that is one of the blocks is determined, the target block is coded according to the determined coding condition, and in determining the coding condition of the target block, a value indicating the complexity of the target block.
- the encoding condition of the target block is determined based on the spatial activity value, and the determination of the encoding condition based on the spatial activity value is a value indicating the complexity of at least a partial area of the target block.
- a first spatial activity value is calculated, the calculated first spatial activity value is compared with a predetermined first threshold value, and the first spatial activity value is If it is smaller than one threshold, a first encoding condition indicating that the target block is encoded for each sub-block of the first division size is determined as an encoding condition of the target block, and the first spatial activity value is When it is equal to or greater than the first threshold, the second encoding condition indicating that the target block is encoded for each sub-block of the second division size larger than the first division size is the encoding condition of the target block. Determine as.
- the encoding condition (the encoding condition for subdivision) is determined so that the target block is encoded for each sub-block having a small division size.
- the encoding condition for subdivision is determined so that the target block is encoded for each sub-block having a small division size.
- a complex image even if distortion occurs, it is not noticeable. Therefore, if the target block is a complex image, encoding is performed so that the target block is encoded for each sub-block having a large division size.
- a plurality of small block clusters each including at least a partial region of the target block are formed based on an extended block including at least the target block, and the plurality of small blocks are formed.
- a small block cluster space activity value that is a value indicating the complexity of each block cluster is calculated, and a minimum value among the calculated small block cluster space activity values may be calculated as the first spatial activity value. Good.
- the spatial activity of a plurality of regions in the target block is calculated, and the calculated minimum value of the plurality of spatial activity values is compared with the threshold value, that is, the flattest region among the plurality of regions in the target block is It can be determined whether the distortion is conspicuous. Since the subdivision encoding condition is selected when the distortion is conspicuous, deterioration in image quality can be suppressed.
- the target block is further divided into a plurality of large blocks, and a large block space activity value that is a value indicating the complexity of each of the plurality of large blocks is calculated and calculated.
- a maximum value among a plurality of large block space activity values is calculated as a second space activity value, the calculated second space activity value is compared with a predetermined second threshold value, and the second space activity value is calculated. If the second threshold is less than or equal to the second threshold, the second encoding condition is determined as the encoding condition of the target block, and if the second spatial activity value is greater than the second threshold, the first spatial activity value of the first spatial activity value
- the encoding condition of the target block may be determined by performing the calculation and the comparison.
- the coding condition for large division is determined, so that the coding efficiency can be improved.
- extension block may be larger than the target block.
- the target block not only the target block but also the image information around the target block is used to determine whether or not the target block includes a flat area, thereby suppressing image quality degradation and improving encoding efficiency. And can be achieved appropriately.
- the value indicating the complexity of the target block may be a value indicating a change in sample values of a plurality of samples included in the target block.
- the encoding condition of the target block is further determined based on the motion of the target block.
- the motion vector of the target block is determined.
- the second encoding condition is set to the target block. If the absolute value of the motion vector is less than or equal to the third threshold value, the encoding condition of the target block may be determined according to the determination result of the encoding condition based on the spatial activity value. Good.
- a predicted block of the target block is generated by performing motion compensation using the calculated motion vector, and the target block, the predicted block, and the motion
- a cost value indicating the coding efficiency of the target block is calculated using a vector, and when the absolute value of the motion vector is equal to or less than the third threshold value, the cost value and a predetermined fourth threshold value are When the cost value is smaller than the fourth threshold, the second coding condition is determined as the coding condition of the target block, and when the cost value is equal to or larger than the fourth threshold, the spatial activity
- the encoding condition of the target block may be determined according to the determination result of the encoding condition based on the value.
- the coding efficiency can be further increased by determining the coding conditions for the large division.
- the cost value may be calculated based on a sum of absolute differences between the target block and the prediction block.
- the cost value can be easily calculated by the calculation method using the sample value.
- the sub-block of the first division size or the second division size may be a processing unit in which at least one process of intra prediction, motion compensation, and frequency conversion is executed.
- An image encoding method is an image encoding method for encoding an original image, wherein the original image is divided into a plurality of blocks and the target is one of the plurality of blocks.
- the encoding condition of the block is determined, the target block is encoded according to the determined encoding condition, and in determining the encoding condition of the target block, the motion vector of the target block is calculated, and the absolute value of the calculated motion vector is determined.
- the second coding condition indicating that encoded for each sub-block of the split size may be determined as the encoding condition for the current block.
- the present invention can be realized not only as an image encoding method but also as an image encoding device including a processing unit that realizes processing included in the image encoding method. Moreover, you may implement
- a communication network such as the Internet.
- the system LSI is an ultra-multifunctional LSI manufactured by integrating a plurality of components on a single chip, and specifically includes a microprocessor, ROM, RAM (Random Access Memory), and the like.
- Computer system is an ultra-multifunctional LSI manufactured by integrating a plurality of components on a single chip, and specifically includes a microprocessor, ROM, RAM (Random Access Memory), and the like.
- an increase in the amount of computation can be suppressed, and deterioration in image quality can be sufficiently suppressed.
- FIG. 1 is a block diagram showing an example of a configuration of an image encoding device according to Embodiment 1 of the present invention.
- FIG. 2 is a block diagram showing an example of the configuration of the coding condition determination unit according to Embodiment 1 of the present invention.
- FIG. 3 is a schematic diagram for explaining the effect of reducing ringing artifacts by the image coding apparatus according to Embodiment 1 of the present invention.
- FIG. 4 is a schematic diagram for explaining possible positions of the encoding tool indicator signal in the compressed video bitstream according to Embodiment 1 of the present invention.
- FIG. 5 is a flowchart showing an example of the operation of the image coding apparatus according to Embodiment 1 of the present invention.
- FIG. 1 is a block diagram showing an example of a configuration of an image encoding device according to Embodiment 1 of the present invention.
- FIG. 2 is a block diagram showing an example of the configuration of the coding condition determination unit according to Embodiment 1 of the
- FIG. 6 is a flowchart showing an example of the operation of the coding condition determination unit according to Embodiment 1 of the present invention.
- FIG. 7 is a block diagram showing an example of the configuration of the image coding apparatus according to Embodiment 2 of the present invention.
- FIG. 8 is a schematic diagram showing an example of forming a picture sample according to Embodiment 2 of the present invention.
- FIG. 9 is a block diagram showing an example of the configuration of the spatial activity condition evaluation unit of the image coding apparatus according to Embodiment 2 of the present invention.
- FIG. 10 is a block diagram showing an example of the configuration of the motion activity condition evaluation unit of the image coding apparatus according to Embodiment 2 of the present invention.
- FIG. 10 is a block diagram showing an example of the configuration of the motion activity condition evaluation unit of the image coding apparatus according to Embodiment 2 of the present invention.
- FIG. 11 is a flowchart showing an example of the operation of the image coding apparatus according to Embodiment 2 of the present invention.
- FIG. 12 is a flowchart showing an example of the operation of the spatial activity condition evaluation unit according to Embodiment 2 of the present invention.
- FIG. 13 is a flowchart showing a detailed example of the space activity condition evaluation unit according to Embodiment 2 of the present invention.
- FIG. 14 is a flowchart showing an example of the operation of the motion activity condition evaluation unit according to Embodiment 2 of the present invention.
- FIG. 15 is a flowchart showing an example of the operation of the coding condition determination unit according to the modification of the embodiment of the present invention.
- FIG. 12 is a flowchart showing an example of the operation of the spatial activity condition evaluation unit according to Embodiment 2 of the present invention.
- FIG. 13 is a flowchart showing a detailed example of the space activity condition evaluation unit according to Embodiment 2 of the present invention.
- FIG. 14 is
- FIG. 16 is a schematic diagram illustrating an example of the overall configuration of a content supply system that implements a content distribution service.
- FIG. 17 is a diagram illustrating an appearance of a mobile phone.
- FIG. 18 is a block diagram illustrating a configuration example of a mobile phone.
- FIG. 19 is a schematic diagram illustrating an example of the overall configuration of a digital broadcasting system.
- FIG. 20 is a block diagram illustrating a configuration example of a television.
- FIG. 21 is a block diagram illustrating a configuration example of an information reproducing / recording unit that reads and writes information from and on a recording medium that is an optical disk.
- FIG. 22 is a diagram illustrating a structure example of a recording medium that is an optical disk.
- FIG. 23 is a block diagram illustrating a configuration example of an integrated circuit that realizes the image encoding method and the image decoding method according to each embodiment.
- the image encoding method according to Embodiment 1 of the present invention is an encoding method for encoding an original image, which divides the original image into a plurality of blocks, and encodes a target block that is one of the plurality of blocks. Encoding conditions are determined, and the target block is encoded according to the determined encoding conditions. In the determination of the encoding condition of the target block, the encoding condition of the target block is determined based on the spatial activity value that is a value indicating the complexity of the target block.
- a first spatial activity value that is a value indicating the complexity of at least a partial area of the target block is calculated, and the first spatial activity value is compared with a predetermined first threshold.
- a first encoding condition indicating that the target block is encoded for each sub-block of the first division size is determined as the encoding condition of the target block.
- a second encoding condition indicating that the target block is encoded for each sub-block having a second division size larger than the first division size is set to the target block. The encoding condition is determined.
- FIG. 1 is a block diagram showing an example of a configuration of an image encoding device 100 according to Embodiment 1 of the present invention.
- the image encoding device 100 generates an encoded image 205 by encoding an uncompressed original image 201.
- the image encoding device 100 includes a dividing unit 110, an encoding condition determining unit 120, and an encoding unit 130. Note that the image encoding device 100 encodes the original image 201 in units of blocks.
- the dividing unit 110 divides the input original image 201 into a plurality of blocks.
- the original image 201 is one picture included in the moving image or one still image.
- the plurality of blocks are, for example, a plurality of macro blocks.
- the macroblock includes a 16 ⁇ 16 luminance value sample and an 8 ⁇ 8 chromaticity value sample for each color component.
- the target block 203 that is one of the plurality of divided blocks is output to the encoding unit 130. Further, the extended block 202 including at least the target block 203 is output to the encoding condition determining unit 120.
- extension block 202 may be the same size as the target block 203, that is, the extension block 202 may be the target block 203.
- the extension block 202 includes the target block 203 and is a block having a size larger than the target block 203.
- the extended block 202 is an extended macroblock including at least the target macroblock, and includes a 24 ⁇ 24 luminance value sample and a 12 ⁇ 12 chromaticity value sample for each color component.
- the encoding condition determination unit 120 determines the encoding condition 204 of the target block that is one of a plurality of blocks. In other words, the encoding condition determination unit 120 determines an encoding condition (encoding tool) for each block. A detailed configuration of the encoding condition determination unit 120 will be described later with reference to FIG.
- the encoding unit 130 encodes the target block 203 in accordance with the encoding condition 204 determined by the encoding condition determining unit 120. Specifically, the encoding unit 130 generates the encoded image 205 by encoding the target block 203 sequentially input from the dividing unit 110.
- FIG. 2 is a block diagram showing an example of the configuration of the coding condition determining unit 120 according to Embodiment 1 of the present invention.
- the encoding condition determination unit 120 determines the encoding condition of the target block 203 based on a spatial activity value that is a value indicating the complexity of the target block 203.
- the spatial activity value which is a value indicating the complexity of the target block 203, is a value indicating a change in the sample value of the target block 203.
- the spatial activity value is a value indicating the complexity of a block that is a set of a plurality of sample values, and is a value indicating a variation of a plurality of sample values included in the block.
- the spatial activity value is a variance of multiple sample values.
- the spatial activity value may be a sum of absolute differences between the average value of a plurality of sample values and each sample value, or may be a sum of absolute differences between adjacent samples.
- a small spatial activity value of a block means that the block is a flat image, for example, an image having a constant luminance value (such as a computer graphics image).
- a large spatial activity value of a certain block means that the block is a complex image, for example, a natural image captured by a camera or the like.
- the encoding condition determination unit 120 includes a spatial activity value calculation unit 121, a comparison unit 122, and a determination unit 123.
- the spatial activity value calculation unit 121 calculates a first spatial activity value 211 that is a value indicating the complexity of at least a partial area of the target block 203. For example, the spatial activity value calculation unit 121 forms a plurality of small block clusters based on the extended block 202 including the target block 203. Each of the plurality of small block clusters includes a partial region of the target block 203.
- the space activity value calculation unit 121 calculates a small block cluster space activity value indicating the complexity of each of the plurality of small block clusters. Then, the space activity value calculation unit 121 calculates the smallest value among the plurality of calculated small block cluster space activity values as the first space activity value 211.
- the comparison unit 122 compares the first spatial activity value 211 calculated by the spatial activity value calculation unit 121 with a predetermined first threshold value. Then, the comparison unit 122 outputs a comparison result 212 indicating which of the first spatial activity value 211 and the first threshold value is smaller to the determination unit 123.
- the determining unit 123 determines the encoding condition of the target block 203 based on the comparison result 212. Specifically, when the first spatial activity value 211 is smaller than the first threshold, the first encoding condition indicating that the target block 203 is encoded for each sub-block of the first division size is set to the code of the target block 203. It is determined as the conversion condition.
- the sub-block of the first encoding condition is a processing unit in which at least one process of intra prediction, motion compensation, and frequency conversion is executed.
- the sub-block of the first encoding condition is composed of 4 ⁇ 4 sample values or 8 ⁇ 8 sample values.
- the encoding unit 130 divides the target block into sub-blocks having the division size indicated by the first encoding condition, and each intra block is divided into intra blocks.
- the target block is encoded by performing processing such as prediction.
- the determination unit 123 indicates that when the first spatial activity value 211 is equal to or greater than the first threshold, the target block 203 is encoded for each sub-block having a second division size larger than the first division size.
- the encoding condition is determined as the encoding condition for the target block 203.
- the sub-block of the second coding condition is a processing unit in which at least one process of intra prediction, motion compensation, and frequency conversion is executed.
- the sub-block of the second encoding condition is composed of 16 ⁇ 16 sample values.
- the encoding unit 130 divides the target block into sub-blocks having a division size indicated by the second encoding condition, and each intra block is divided into intra blocks.
- the target block is encoded by performing processing such as prediction. Note that the size of the sub-block of the second encoding condition may be the same as the size of the target block.
- image coding apparatus 100 encodes a target block for each sub-block of a small division size when the first spatial activity value is smaller than the first threshold.
- the coding conditions that are to be converted are determined.
- the image encoding device 100 determines an encoding condition for encoding the target block for each sub-block having a large division size.
- the image encoding apparatus 100 divides the target block into sub-blocks with a small division size, and when the target block is a complex image, The encoding condition is determined so that the block is divided into sub-blocks having a large division size. Thereby, it is possible to suppress the occurrence of ringing artifacts or blocking artifacts in a flat image.
- FIG. 3 is a schematic diagram for explaining the effect of reducing ringing artifacts by the image coding apparatus 100 according to Embodiment 1 of the present invention.
- the image coding apparatus 100 according to Embodiment 1 of the present invention has a small size when the first spatial activity value, which is a value indicating the complexity of at least a partial area of the target block, is smaller than the first threshold.
- a division size for example, a 4 ⁇ 4 division size is selected.
- the image coding apparatus 100 selects a small division size when a flat region is included in a part of the target block. Therefore, according to the image coding apparatus 100 according to Embodiment 1 of the present invention, it is possible to suppress deterioration in image quality as shown in FIG.
- FIG. 4 is a diagram for explaining possible positions of the encoding tool indicator in the compressed image / moving image bitstream.
- the selected encoding tool may be shown on the indicator collectively for each macroblock group.
- Examples of the collective indicator in the case of ISO / IEC 14496-2 Part 10 international standard include a picture header and a slice header.
- the encoding tool selected for each macroblock may be indicated in the macroblock header in the case of ISO / IEC 14496-2 part 10 international standard.
- FIG. 5 is a flowchart showing an example of the operation of the image coding apparatus 100 according to Embodiment 1 of the present invention.
- the dividing unit 110 divides the input original image into a plurality of blocks (S110).
- the encoding condition determination unit 120 determines the encoding condition of the target block (S120). Specific processing will be described later with reference to FIG.
- the encoding unit 130 encodes the target block according to the determined encoding condition (S130).
- the determination of the encoding condition of the target block (S120) and the encoding of the target block (S130) are repeated when there is a block to be processed. That is, determination of encoding conditions and encoding are repeated by using, as a new target block, a block that has not been encoded among a plurality of blocks generated by dividing the original image.
- FIG. 6 is a flowchart showing an example of the operation of the coding condition determining unit 120 according to Embodiment 1 of the present invention.
- the spatial activity value calculation unit 121 calculates a first spatial activity value that is a value indicating the complexity of at least a part of the target block (S210). Specifically, first, the spatial activity value calculation unit 121 forms a plurality of small block clusters based on the extended blocks including the target block. Then, the space activity value calculation unit 121 calculates a small block cluster space activity value that is a space activity value for each of the plurality of small block clusters. The spatial activity value calculation unit 121 calculates the smallest spatial activity value as the first spatial activity value among the calculated small block cluster spatial activity values.
- the comparison unit 122 compares the first spatial activity value with the first threshold value (S220). Specifically, the comparison unit 122 determines whether the first spatial activity value is smaller than the first threshold value.
- the determination unit 123 determines the first encoding condition for subdivision as the encoding condition of the target block (S230). That is, the determination unit 123 determines the first encoding condition indicating that the target block is encoded for each sub-block of the first division size.
- the determination unit 123 determines the second coding condition for large partitioning as the coding condition of the target block (S240). That is, the determination unit 123 determines a second encoding condition indicating that the target block is encoded for each sub-block of the second division size.
- the target block when a part of the target block is a flat image, the target block is divided into sub-blocks of a small division size.
- the encoding condition is determined so that the target block is divided into sub-blocks having a large division size. Thereby, it is possible to suppress the occurrence of ringing artifacts or blocking artifacts in a flat image.
- the distortion when the distortion is not conspicuous, it is possible to increase the compression ratio by determining the encoding condition of the large division size, and when the distortion is conspicuous, the encoding condition of the small division size is determined. By doing so, deterioration of image quality can be suppressed.
- the image coding method according to the second embodiment of the present invention performs not only the evaluation of the spatial activity condition as shown in the first embodiment, but also the motion activity condition, and the code of the target block is based on the evaluation result. It is characterized by determining the conversion condition.
- the evaluation of the spatial activity condition the spatial activity of a plurality of large blocks included in the target block and the spatial activity of a plurality of small block clusters included in the extended block including the target block are evaluated.
- FIG. 7 is a block diagram showing an example of the configuration of the image encoding device 300 according to Embodiment 2 of the present invention.
- the image encoding apparatus 300 includes a sample extraction unit 310, an encoding condition determination unit 320, and an encoding unit 330.
- the sample extraction unit 310 is an example of a division unit according to the present invention, and divides an original image into a plurality of blocks. Specifically, the sample extraction unit 310 acquires the original picture 402 to be processed as the input image Vin, and divides the acquired original picture 402 into a plurality of blocks. Then, the sample extraction unit 310 outputs an original picture sample 404 having an M ⁇ N arrangement as a target block which is one of a plurality of blocks. Further, the sample extraction unit 310 extracts and outputs an original image sample 403 having an O ⁇ P arrangement as an extended block including the target block.
- the values of O and P are 24, and the values of M and N are 16, respectively.
- the original sample 403 having the O ⁇ P arrangement surrounds the target macroblock and includes all of the M ⁇ N samples in the target macroblock.
- the original image sample 403 is the same as the original image sample 404. That is, the extension macroblock may be the same as the target macroblock.
- the encoding condition determination unit 320 includes a spatial activity condition evaluation unit 321, a prediction type evaluation unit 322, a switch unit 323, a motion activity condition evaluation unit 324, and an OR gate unit 325.
- the spatial activity condition evaluation unit 321 determines the encoding condition of the target block based on the spatial activity value that is a value indicating the complexity of the target block. Specifically, the spatial activity condition evaluation unit 321 acquires the extracted original image sample 403 having an O ⁇ P arrangement, and evaluates the spatial activity value of the input original image sample. Then, the spatial activity condition evaluation unit 321 is to be used by the encoding unit 330 to encode the target macroblock, either the “small division encoding tool” or the “large division encoding tool”. An encoding tool determination signal 405 indicating that is output.
- the encoding tool determination signal 405 is, for example, a binary value, 0 represents a “small division encoding tool”, and 1 represents a “large division encoding tool”.
- the “subdivision encoding tool” is an example of an encoding tool according to the first encoding condition, and is an encoding tool for dividing the target macroblock into sub-blocks of the first division size.
- the “large-division encoding tool” is an example of an encoding tool according to the second encoding condition, and is an encoding tool for dividing the target macroblock into sub-blocks of the second division size.
- FIG. 9 is a block diagram showing an example of the configuration of the spatial activity condition evaluation unit 321 according to Embodiment 2 of the present invention. As shown in FIG. 7, the spatial activity condition evaluation unit 321 receives the original picture sample 403 that is an extended macroblock and outputs an encoding tool determination signal 405.
- the spatial activity condition evaluation unit 321 includes a large block dividing unit 501, a large block space activity value calculating unit 502, a maximum value calculating unit 503, a spatial activity value comparing unit 504, and a small block cluster.
- a forming unit 505, a small block cluster space activity value calculating unit 506, a minimum value calculating unit 507, a space activity value comparing unit 508, and an encoding tool determining unit 509 are provided.
- the large block dividing unit 501 acquires the original image sample 403 having the O ⁇ P arrangement extracted by the sample extracting unit 310 as the input sample Oin.
- the large block dividing unit 501 divides the M ⁇ N partial array corresponding to the target macroblock into a plurality of large G ⁇ H blocks, and outputs these large block samples 601.
- FIG. 8 An example of the large block sample 601 is shown in FIG. In the example shown in FIG. 8, G and H are 8, and the 16 ⁇ 16 target macroblock is divided into four 8 ⁇ 8 large block samples 601.
- the large block space activity value calculation unit 502 receives a plurality of large block samples 601 and calculates the space activity value of each of the received large block samples 601.
- the spatial activity value calculated at this time is an example of a large block space activity value that is a value indicating the complexity of the large block according to the present invention.
- the large block space activity value calculation unit 502 outputs a set including all large block space activity values 602 corresponding to the original image sample (target macroblock) in the M ⁇ N array. For example, in the example of FIG. 8, the large block space activity value calculation unit 502 calculates and outputs four large block space activity values 602.
- the maximum value calculation unit 503 specifies a maximum value among a set of a plurality of large block space activity values 602, and outputs the specified maximum value as a maximum space activity value 603.
- the maximum space activity value 603 is an example of a second space activity value.
- the spatial activity value comparison unit 504 acquires the maximum spatial activity value 603 and compares it with a predetermined second threshold value of the spatial activity value. Then, the spatial activity value comparison unit 504 outputs a control signal 604 indicating the comparison result to the encoding tool determination unit 509. If the maximum spatial activity value 603 is greater than the second threshold, the control signal 604 is set to 1. If the maximum spatial activity value 603 is less than or equal to the second threshold, the control signal 604 is set to zero.
- the small block cluster forming unit 505 acquires the OxP array original image sample 403 extracted by the sample extracting unit 310 as an input sample Oin. Then, the small block cluster forming unit 505 divides the original image sample 403 having the O ⁇ P arrangement into a plurality of E ⁇ F small blocks, and groups these small blocks into a plurality of small block clusters.
- FIG. 8 An example of the small block cluster sample 605 is shown in FIG. In the example shown in FIG. 8, E and F are 4, and a 24 ⁇ 24 extended macroblock is divided into 36 4 ⁇ 4 small blocks.
- the small block cluster forming unit 505 forms a plurality of I ⁇ J small block clusters having different sizes based on the 36 small blocks. In the example shown in FIG. 8, 25 8 ⁇ 8 small block clusters, 12 4 ⁇ 16 small block clusters, and 12 16 ⁇ 4 small block clusters are formed.
- the small block cluster forming unit 505 outputs the formed small block cluster sample 605 to the small block cluster space activity value calculating unit 506.
- the small block cluster space activity value calculation unit 506 receives the small block cluster sample 605 and calculates the space activity value of each of the received plurality of small block cluster samples 605.
- the spatial activity value calculated at this time is an example of a small block cluster spatial activity value that is a value indicating the complexity of the small block cluster according to the present invention.
- the small block cluster space activity value calculation unit 506 outputs a set including all of the small block cluster space activity values 606 corresponding to the original image sample (extended macroblock) of the O ⁇ P array. For example, in the example of FIG. 8, the small block cluster space activity value calculation unit 506 calculates and outputs 49 small block cluster space activity values 606 in total.
- the minimum value calculation unit 507 specifies the minimum value in the set of small block cluster space activity values 606, and outputs the specified minimum value as the minimum space activity value 607.
- the minimum space activity value 607 is an example of a first space activity value.
- the space activity value comparison unit 508 acquires the minimum space activity value 607 and compares it with a predetermined first threshold value of the space activity value. Then, the space activity value comparison unit 508 outputs a control signal 608 indicating the comparison result to the encoding tool determination unit 509. If the minimum spatial activity value 607 is less than the first threshold, the control signal 608 is set to 1. If the minimum space activity value 607 is greater than or equal to the first threshold, the control signal 608 is set to zero.
- the encoding tool determination unit 509 acquires the control signals 604 and 608 and generates an encoding tool determination signal 405 indicating which of the “small-division encoding tool” and the “large-division encoding tool” is determined. Output. Specifically, when both of the control signals 604 and 608 are 1, the encoding tool determination unit 509 sets the encoding tool determination signal 405 to “a subdivision encoding tool”. Otherwise (if either or both of the control signals 604 and 608 are not 1), the encoding tool determination unit 509 sets the encoding tool determination signal 405 to “enlargement encoding tool”.
- the encoding tool determination signal 405 by setting the value of the encoding tool determination signal 405 to 1, it is possible to indicate a “subdivision encoding tool”. Further, by setting the value of the encoding tool determination signal 405 to 0, it is possible to indicate a “large division encoding tool”. The encoding tool determination signal 405 is then transmitted as the output signal Dout.
- the prediction type evaluation unit 322 acquires the available prediction type information 401 and evaluates whether or not the inter prediction mode can be used as a prediction type candidate for the target macroblock. Then, the prediction type evaluation unit 322 outputs a control signal 406 indicating the evaluation result to the switch unit 323.
- the control signal 406 When the inter prediction type is available, the control signal 406 is set to 1. If the inter prediction type is not available, the control signal 406 is set to zero.
- the available prediction type information 401 is information indicating whether or not the target original picture 402 is a picture to be encoded as an I picture. When the original picture 402 is a picture to be encoded as an I picture, the inter prediction type cannot be used, and thus the prediction type evaluation unit 322 sets the control signal 406 to 0.
- the switch unit 323 determines whether to output the coding tool determination signal 405 to the motion activity condition evaluation unit 324 or the OR gate unit 325 according to the control signal 406. Specifically, when the inter prediction type is available, an encoding tool determination signal is output to the motion activity condition evaluation unit 324.
- the switch unit 323 when the control signal 406 is 1, the switch unit 323 outputs the encoding tool determination signal 405 to the motion activity condition evaluation unit 324 as the encoding tool determination signal 407.
- the encoding tool determination signal 405 is output to the OR gate unit 325 as the encoding tool determination signal 408.
- the motion activity condition evaluation unit 324 determines the encoding condition of the target block based on the motion of the target block. Specifically, the motion activity condition evaluation unit 324 receives the encoding tool determination signal 407, the available prediction type information 401, the extracted M ⁇ N array original image sample 404, and the reconstructed picture 414 as inputs. get. The motion activity condition evaluation unit 324 evaluates the motion activity condition and outputs an encoding tool determination signal 409 and an inter prediction sample 410 having an M ⁇ N arrangement.
- FIG. 10 is a block diagram showing an example of the configuration of the motion activity condition evaluation unit 324 according to Embodiment 2 of the present invention.
- the motion activity condition evaluation unit 324 includes a reference picture selection unit 701, a motion prediction unit 702, a motion compensation unit 703, a cost calculation unit 704, a cost comparison unit 705, and a motion vector comparison unit. 706, an encoding tool determination initialization unit 707, and two switch units 708 and 709.
- the reference picture selection unit 701 accepts the reconstructed picture 414 and the available prediction type information 401 as inputs, and outputs a set of one or more selected reference pictures 801 for use in the motion prediction process.
- the motion prediction unit 702 calculates a motion vector of the target block. Specifically, the motion prediction unit 702 acquires a set of reference pictures 801, performs motion prediction using these reference pictures 801, and outputs a set of reference picture indexes 802 and a set of motion vectors 803.
- the motion vector comparison unit 706 acquires a set of motion vectors 803 and compares it with a predetermined third threshold value. Then, the motion vector comparison unit 706 outputs a control signal 804 indicating the comparison result.
- the motion vector comparison unit 706 causes the control signal 804 to Is set to 1.
- the motion vector comparison unit 706 sets the control signal 804 to 0.
- the motion compensation unit 703 acquires a set of reference picture indexes 802, a set of motion vectors 803, and a set of reconstructed pictures 414.
- the motion compensation unit 703 outputs a prediction image of the target macroblock, specifically, an inter prediction sample 410 having an M ⁇ N arrangement.
- the cost calculation unit 704 acquires an M ⁇ N array inter prediction sample 410, an M ⁇ N array original image sample 404, and a set of motion vectors 803.
- the cost calculation unit 704 outputs a cost value 807 to the cost comparison unit 705. An example of the cost value 807 will be described later.
- the cost comparison unit 705 acquires the cost value 807, and compares the acquired cost value 807 with a predetermined fourth threshold value. Then, the cost comparison unit 705 outputs a control signal 808 indicating the comparison result to the switch unit 709.
- the cost comparison unit 705 sets the control signal 808 to 1. If the cost value is greater than or equal to the fourth threshold, the cost comparison unit 705 sets the control signal 808 to 0.
- the encoding tool determination initialization unit 707 generates an initialization signal 805 indicating “enlargement encoding tool”. As described above, since the signal indicating the “large division encoding tool” is indicated by a binary value of 1 (for example, the encoding tool determination signal 405), the initialization signal 805 is similarly 2 It can be represented by the decimal value 1.
- the switch unit 708 controls which of the encoding tool determination signal 407 and the initialization signal 805 is output as the encoding tool determination signal 806 according to the control signal 804. Specifically, the switch unit 708 outputs the initialization signal 805 as the encoding tool determination signal 806 when the absolute value of the motion vector is larger than the third threshold, that is, when the control signal 804 is 1. In other words, when the absolute value of the motion vector is larger than the third threshold, “enlargement encoding tool” is selected as the encoding tool indicated by the encoding tool determination signal 806.
- the switch unit 708 When the absolute value of the motion vector is equal to or smaller than the third threshold, that is, when the control signal 804 is 0, the switch unit 708 outputs the encoding tool determination signal 407 as the encoding tool determination signal 806. In other words, when the absolute value of the motion vector is equal to or smaller than the third threshold, the encoding tool determined by the spatial activity condition evaluation unit 321 is selected as the encoding tool indicated by the encoding tool determination signal 806.
- the switch unit 709 controls which of the encoding tool determination signal 806 and the initialization signal 805 is output as the encoding tool determination signal 409 according to the control signal 808. Specifically, the switch unit 709 outputs the initialization signal 805 as the encoding tool determination signal 409 when the cost value is less than the fourth threshold, that is, when the control signal 808 is 1. In other words, when the cost value is smaller than the fourth threshold, “enlargement encoding tool” is selected as the encoding tool indicated by the encoding tool determination signal 409.
- the switch unit 709 When the cost value is equal to or greater than the fourth threshold, that is, when the control signal 808 is 0, the switch unit 709 outputs the encoding tool determination signal 806 as the encoding tool determination signal 409.
- the encoding tool determination signal 806 is transmitted as the output signal Dout.
- the OR gate unit 325 outputs a final encoding tool determination signal 411 depending on whether the encoding tool determination signals 408 and 409 are available.
- the final encoding tool determination signal 411 includes the finally determined encoding tool information.
- the OR gate unit 325 outputs a final encoding tool determination signal 411 indicating 1 if at least one of the encoding tool determination signals 408 and 409 is 1. In other words, when “enlargement coding tool” is selected in at least one of the spatial activity condition evaluation unit 321 and the motion activity condition evaluation unit 324, the OR gate unit 325 selects “enlargement encoding tool”. As shown, the final encoding tool determination signal 411 is set to 1.
- the OR gate unit 325 outputs a final encoding tool determination signal 411 indicating 0.
- the OR gate unit 325 selects “encoding tool for subdivision”. As shown, the final encoding tool determination signal 411 is set to zero.
- the encoding unit 330 includes an image encoding unit 331, an image decoding unit 332, a picture memory unit 333, and an entropy encoding unit 334.
- the image encoding unit 331 uses the final encoding tool determination signal 411, the available prediction type information 401, and the M ⁇ N array original sample 404 as inputs. Further, in the present embodiment, the image encoding unit 331 acquires the M ⁇ N array inter prediction sample 410 as an additional input used to encode the target macroblock of the M ⁇ N array.
- the image encoding unit 331 performs the image encoding process on the target macroblock using one of the two predetermined encoding tools selected by the final encoding tool determination signal 411.
- the image encoding unit 331 generates a quantization residual 412 of an M ⁇ N array by performing processing such as intra prediction, difference calculation with a prediction sample, frequency conversion, and quantization. . Then, the image encoding unit 331 outputs an M ⁇ N quantization residual 412 to the entropy encoding unit 334 and the image decoding unit 332.
- the entropy encoding unit 334 generates the compressed bit stream 415 by entropy encoding the quantization residual 412 of the M ⁇ N array. Then, the entropy encoding unit 334 outputs the generated compressed bit stream 415 as the output signal Vout.
- the entropy encoding unit 334 is not limited to entropy encoding, and may perform other variable length encoding.
- the image decoding unit 332 decodes and reconstructs the quantization residual 412.
- the image decoding unit 332 may acquire the M ⁇ N array inter prediction sample 410 used to reconstruct the target macroblock sample of the M ⁇ N array.
- the image decoding unit 332 finally outputs the reconstructed sample 413 having the M ⁇ N arrangement, and stores the reconstructed sample 413 in the picture memory unit 333.
- the picture memory unit 333 is a memory for storing the reconstructed sample 413, so-called local decoded image.
- the reconstructed sample 413 stored in the picture memory unit 333 is referred to as a reference image by the motion activity condition evaluation unit 324.
- the image coding apparatus 300 includes a spatial activity condition evaluation unit 321 that evaluates a spatial activity condition, and a motion activity condition evaluation that evaluates a motion activity condition. Part 324. Then, the image encoding device 300 determines the encoding condition of the target block based on the evaluation result, and encodes the target block according to the determined encoding condition.
- the encoding condition is selected using the image feature such as the space activity condition and the motion activity condition, it is possible to suppress the deterioration of the image quality.
- FIG. 11 is a flowchart showing an example of the operation of the image coding apparatus 300 according to Embodiment 2 of the present invention.
- the sample extraction unit 310 extracts an OxP array original picture sample from the target uncompressed original picture (S310).
- O indicates the number of samples in the width, and O is greater than or equal to M.
- P indicates the number of samples at the height, and P is greater than or equal to N.
- the values of O and P are each 24, for example.
- the original sample of the O ⁇ P array includes the target macroblock, and includes all of the M ⁇ N samples in the target macroblock as shown in FIG.
- extended macroblock is used as a term indicating an original picture sample of an O ⁇ P arrangement surrounding the target macroblock.
- the sample extraction unit 310 outputs an original image sample of an M ⁇ N array that constitutes the target macroblock, in addition to the original image sample of the O ⁇ P array (that is, an extended macroblock).
- the spatial activity condition evaluation unit 321 evaluates the spatial activity condition as a first condition for selecting an optimal encoding tool for encoding the target macroblock (S320).
- the specific operation of the space activity condition evaluation unit 321 will be described in more detail with reference to FIG.
- FIG. 12 is a flowchart showing an example of the operation of the spatial activity condition evaluation unit 321 according to Embodiment 2 of the present invention.
- the spatial activity condition evaluation unit 321 calculates a second spatial activity value by performing processing in units of large blocks (S410).
- the second spatial activity value is the maximum value among the spatial activity values calculated for each large block. Specific processing will be described later with reference to FIG.
- the spatial activity value comparison unit 504 compares the second spatial activity value with the second threshold value (S420).
- the encoding tool determination unit 509 determines the second encoding condition for large division as the encoding condition used for the target macroblock ( S460).
- the spatial activity condition evaluation unit 321 calculates the spatial activity value of the target macroblock in units of large blocks. That is, the spatial activity condition evaluation unit 321 calculates the complexity (or flatness) of the target macroblock in units of large blocks.
- the space activity condition evaluation unit 321 determines whether or not the maximum value of the space activity value for each large block is larger than the threshold value. That is, the spatial activity condition evaluation unit 321 determines whether or not the complexity of the most complex block among the plurality of large blocks constituting the target macroblock is larger than the threshold value. When the complexity of the most complicated block is equal to or less than the threshold value, a coding tool for large division is selected.
- the fact that the complexity of the most complex block is below the threshold means that the flatness of the target macroblock is high. That is, it shows that the information amount of the target macroblock is small, and it is possible to save the code amount by increasing the compression rate by using the coding tool for large division. Further, since the image is complex, distortion is not conspicuous, and subjective image quality deterioration can be suppressed even when a large-division encoding tool is used.
- the spatial activity condition evaluation unit 321 calculates the first spatial activity value by performing processing in units of small block clusters (S430). As described above, the first spatial activity value is the smallest value among the spatial activity values calculated for each small block cluster. Specific processing will be described later with reference to FIG.
- the spatial activity value comparison unit 508 compares the first spatial activity value with the first threshold value (S440).
- the encoding tool determination unit 509 determines the first encoding condition for subdivision as the encoding condition used for the target macroblock (S450). ).
- the spatial activity condition evaluation unit 321 calculates the spatial activity value of the target macroblock in units of small block clusters. In other words, the spatial activity condition evaluation unit 321 calculates the complexity (or flatness) of the target macroblock in units of small block clusters.
- the spatial activity condition evaluation unit 321 determines whether or not the minimum value of the spatial activity value for each small block cluster is smaller than the threshold value. That is, the spatial activity condition evaluation unit 321 determines whether or not the complexity of the flatest small block cluster among the plurality of small block clusters each including a part of the target macroblock is smaller than the threshold value. When the complexity of the flattest small block cluster is smaller than the threshold, an encoding tool for subdivision is selected.
- the complexity of the flattest small block cluster is smaller than the threshold value means that a part of the target macroblock includes a region with high flatness. That is, it means that the target macroblock includes an area where subjective image quality degradation is conspicuous when distortion spreads over a flat area.
- the encoding tool determination unit 509 determines the second encoding condition for large partitioning as the encoding condition used for the target macroblock. (S460). That the first spatial activity value is equal to or greater than the first threshold means that the target macroblock does not include a region with high flatness. Therefore, even if distortion spreads within the target macroblock, subjective image quality degradation is not noticeable. For this reason, the code amount can be reduced by selecting the second encoding condition for large division.
- FIG. 13 is a flowchart showing an example of detailed operation of the spatial activity condition evaluation unit 321 according to Embodiment 2 of the present invention.
- the same operations as those shown in FIG. 12 are denoted by the same reference numerals.
- the large block dividing unit 501 divides the M ⁇ N target macroblock into a plurality of large G ⁇ H blocks (S411).
- G represents the number of samples in the width, and M can be divided by this G.
- H represents the number of samples at the height, and N can be divided by this H.
- the value of G and H is, for example, 8 respectively.
- FIG. 8 shows an example in which an M ⁇ N macroblock is divided into a plurality of large G ⁇ H blocks.
- the large block space activity value calculation unit 502 calculates the space activity value (large block space activity value) of each of the plurality of large blocks (S412).
- the spatial activity value indicates the amount of variation of the sample value in the large block.
- the spatial activity value is greater than or equal to zero.
- the calculation method of the large block spatial activity value (LargeBlockSpatialAct) is expressed, for example, by the following (Formula 1).
- the spatial activity value of the large block is obtained as the statistical variance of the sample values in the large block.
- x (h, g) represents a sample value at a position (h, g) in the large block.
- Another method for calculating the spatial activity value of the large block is represented by, for example, the following (Expression 2) and (Expression 3).
- SmallBlock represents a small block of E ⁇ F samples.
- E represents the number of samples in the width, and G (large block width) can be divided by this E.
- F represents the number of samples at the height, and H (height of the large block) can be divided by this F.
- the values of E and F are, for example, 4 respectively.
- SmallBlock SpatialAc represents a spatial activity value of a small block obtained as a statistical distribution of sample values in the small block.
- B represents the number of small blocks in the target large block
- x (f, e) represents a sample value at a position (f, e) in the small block
- min (•) is a minimum. Value / minimum value.
- FIG. 8 illustrates an example in which an O ⁇ P extended macroblock is divided into a plurality of E ⁇ F small blocks.
- Another method for calculating the spatial activity value of the large block is represented by, for example, the following (formula 4).
- SmallBlock SpatialAct represents a spatial activity value of a small block obtained as a statistical variance of sample values in the small block, as in (Expression 3).
- B represents the number of small blocks in the target large block
- min (•) represents a minimum value / minimum value
- max (•) represents a maximum value / maximum value.
- the maximum value calculation unit 503 After calculating the spatial activity value of the large block, the maximum value calculation unit 503 obtains the second spatial activity value that is the maximum spatial activity value among all large blocks in the target macroblock (S413).
- the spatial activity value comparison unit 504 compares the maximum spatial activity value in the large block with a predetermined second threshold related to the spatial activity value (S414).
- the predetermined second threshold value is, for example, an integer value.
- the encoding tool determination unit 509 determines and sets “enlargement encoding tool” as the encoding tool. (S460).
- the small block cluster forming unit 505 forms a plurality of I ⁇ J small block clusters based on the O ⁇ P extended macroblock including the M ⁇ N target macroblock (S431).
- I represents the number of samples in the width, and this I can be divided by E (the width of the small block).
- J represents the number of samples at the height, and this J can be divided by F (the height of the small block).
- each small block cluster includes one or more small blocks located within an M ⁇ N macroblock.
- an O ⁇ P extended macroblock is divided into a plurality of E ⁇ F small blocks, and a plurality of I ⁇ J small block clusters having different sizes are formed based on these E ⁇ F small blocks. An example is shown.
- the small block cluster space activity value calculation unit 506 calculates each space activity value (small block cluster space activity value) of the plurality of small block clusters (S432).
- This spatial activity value indicates the amount of variation of the sample value in the small block cluster.
- the calculation method of the spatial activity value of the small block cluster is expressed by, for example, the following (Formula 5).
- Another method for calculating the spatial activity value of the small block cluster is expressed by, for example, the following (formula 6).
- SmallBlockSpatialAct represents the spatial activity value of the small block obtained as the statistical variance of the sample values in the small block, as in (Equation 3).
- B indicates the number of small blocks in the target small block cluster.
- the minimum value calculation unit 507 After calculating the spatial activity value in the small block cluster, the minimum value calculation unit 507 obtains the first spatial activity value that is the minimum spatial activity value among all the small block clusters in the target extended macroblock (S433). .
- the spatial activity value comparison unit 508 compares the minimum spatial activity value in the small block cluster with a predetermined first threshold related to the spatial activity value (S434).
- the second threshold value is, for example, an integer value.
- the encoding tool determination unit 509 determines and sets “encoding tool for subdivision” as the encoding tool. (S450).
- the encoding tool determination unit 509 determines and sets “enlargement encoding tool” as the encoding tool.
- the spatial activity condition evaluation unit 321 determines the encoding tool for the target macroblock based on the spatial activity value of the target macroblock.
- spatial activity values are compared in units of large blocks, and only when the second spatial activity value is equal to or smaller than the second threshold value, in units of small block clusters. You may compare the spatial activity values at. Alternatively, the comparison of spatial activity values in units of large blocks and the comparison of spatial activity values in units of small block clusters may be performed in parallel processing.
- the prediction type evaluation unit 322 determines whether or not the inter prediction mode can be applied to the target macroblock (S330). Specifically, the prediction type evaluation unit 322 checks whether the target picture including the target macroblock is inter-coded. A picture to be inter-encoded is encoded using a prediction result (usually motion compensated) obtained from an encoded reference picture.
- the encoding unit 330 uses the encoding tool determined by evaluating the spatial activity condition to encode the target macroblock (S350).
- the motion activity condition evaluation unit 324 uses the motion activity as a second condition for selecting an optimal encoding tool for encoding the target macroblock. Conditions are evaluated (S340). Note that the coding tool determined by the spatial activity condition evaluation unit 321 may be changed by the movement activity condition evaluation unit 324 as shown in detail in FIG.
- FIG. 14 is a flowchart showing an example of the operation of the motion activity condition evaluation unit 324 according to Embodiment 2 of the present invention.
- the reference picture selection unit 701 selects a plurality of reference picture candidates from the reconstructed picture group stored in the picture memory unit 414 (S501).
- the picture in this case can be referred to as a frame picture or a field picture.
- the motion prediction unit 702 derives a plurality of sets of motion vectors for the uncompressed original picture macroblock of M ⁇ N samples by performing motion prediction on the reference picture candidate (S502). Each of the derived plurality of motion vectors is associated with a reference picture that is a reference destination.
- the motion compensation unit 703 derives a prediction block of M ⁇ N samples by performing motion compensation on a related reference picture using the derived set of motion vectors (S503).
- the cost calculation unit 704 calculates a cost value based on the difference in the sample values between the derived prediction block of M ⁇ N samples and the original block of M ⁇ N samples (S504).
- the formula for obtaining the cost value is expressed by, for example, the following (Formula 7).
- DiffCost is a cost component obtained according to the difference between the predicted sample and the original picture sample.
- DiffCost there is a sum of absolute differences between a predicted sample and an original picture sample.
- Lambda is a constant value.
- MVCost represents a cost value obtained according to the derived motion vector. Lambda may be a zero value, in which case the MVCost component may be ignored.
- the correlation between the target original picture and the preceding decoded picture is obtained by performing the above processing (S501 to S504). As shown in (Expression 7), the smaller the cost value, the higher the correlation.
- the motion vector comparison unit 706 compares the absolute value of the motion vector derived by the motion prediction unit 702 with a predetermined third threshold value of the motion vector (S505).
- the predetermined third threshold value of the motion vector is, for example, an integer value.
- the encoding tool is determined by the encoding tool determination initialization unit 707 as “the encoding tool for large division”. Is determined and set (S509). Then, the movement activity condition evaluation process ends.
- the motion vector is larger than the third threshold means that the motion between pictures is large and the continuity is small. Therefore, even if distortion occurs in the target macroblock, since the motion between pictures is large, subjective image quality degradation is suppressed. For this reason, when the motion is large, the encoding efficiency can be increased by selecting the “large division encoding tool”.
- the cost comparison unit 705 includes the cost value calculated by the cost calculation unit 704, The cost value is compared with a predetermined fourth threshold value (S507).
- the fourth threshold is, for example, an integer value.
- the encoding tool determination initialization unit 707 determines and sets the encoding tool as “enlargement encoding tool” (S509). ). Then, the movement activity condition evaluation process ends.
- the cost value is a value calculated based on the sum of absolute differences and the like, and the smaller the cost value, the better the encoding efficiency. That is, when the cost value is less than the fourth threshold value, the encoding efficiency is sufficiently excellent. Therefore, the encoding efficiency can be sufficiently increased by selecting the “large-division encoding tool”. .
- the coding tool determined by the spatial activity condition evaluation unit 321 is not changed, and the motion activity condition evaluation process ends. That is, the encoding condition is determined based on the determination result of the encoding condition by the spatial activity condition evaluation unit 321.
- the encoding unit 330 encodes the image of the target macroblock using the encoding tool selected as described above (S350).
- the image encoding device 300 evaluates the spatial activity condition and the motion activity condition, and encodes the target block based on the evaluation result. Determine the conditions.
- the image encoding device 300 encodes an encoding condition (for large partitioning) such that the target block is encoded for each subblock having a large partition size. (Encoding condition) is determined.
- Encoding condition it is preferable to prioritize the compression rate because the continuity between images (pictures) is small when the motion is large by selecting the coding condition for large division.
- the image coding apparatus 300 determines the coding condition for large division even when the absolute value of the motion vector is equal to or smaller than the third threshold and the cost value is smaller than the fourth threshold. Thereby, the fact that the cost value is smaller than the fourth threshold indicates that the target block can be effectively encoded, so that the encoding efficiency can be improved by determining the encoding condition for large division.
- the image encoding device 300 determines that the target block according to the evaluation result of the spatial activity condition of the target block and the extended block. Is determined.
- the image encoding device 300 calculates the spatial activity values of a plurality of large blocks included in the target block, and the second spatial activity value that is the maximum value of the calculated spatial activity values is equal to or less than the second threshold value.
- coding conditions for large division are determined.
- the target block is a block with a small amount of information. Therefore, the coding efficiency is effectively increased by using the coding condition for large division. be able to.
- the image coding apparatus 300 calculates the spatial activity values of a plurality of small block clusters included in the extended block when the second spatial activity value is larger than the second threshold value.
- the image encoding apparatus 300 encodes the target block for each sub-block having a small division size when the first spatial activity value, which is the minimum value of the calculated spatial activity value, is smaller than the first threshold.
- the encoding conditions (encoding conditions for subdivision) are determined.
- the second spatial activity value is larger than the second threshold value, the amount of information of the target block is large, so even if the coding condition for large division is determined, the second spatial activity value is less than or equal to the second threshold value. The effect of is not obtained. Therefore, the first spatial activity value is compared with the first threshold value, and if the first spatial activity value is smaller than the first threshold value, the encoding condition for subdivision is selected.
- the image encoding device 300 selects the encoding condition for subdivision when at least a part of the target block includes a flat region. Thereby, it is possible to suppress the occurrence of ringing artifacts and blocking artifacts in a flat image, and it is possible to suppress deterioration in image quality.
- the small block cluster is generated based on the extended block that includes the target block and is larger than the target block.
- Information can be used. For example, when the complexity of a partial area of the target block is low, that is, when the area is flat, the encoding condition is determined based on whether or not the area included in the surrounding blocks is flat. can do.
- target area If a part of the target block (target area) is flat and the surrounding blocks are also flat, when the distortion spreads over the target area, the distortion is noticeable and the image quality deteriorates. . Conversely, if the target area is flat and the surrounding blocks are complex, the surroundings are complex, so even if the distortion spreads over the target area, the distortion is inconspicuous and suppresses image quality degradation. Is done.
- the encoding condition is set based on the surrounding block image information. Selection can be made. Specifically, as described above, when the minimum value of the spatial activity value of the small block cluster is equal to or smaller than the first threshold, the first coding condition for subdivision is selected as the coding condition for the target block. .
- the surrounding image is also flat, encoding is performed with a small division size, so that distortion does not spread within the target block, and deterioration in image quality can be suppressed.
- the cost values need not be compared in the flowchart shown in FIG. Specifically, the calculation of the cost value (S504) and the comparison of the cost values (S507 and S508) may not be performed.
- the coding tool for large division is selected, and when the absolute value of the motion vector is equal to or smaller than the third threshold value, the coding condition based on the evaluation of the spatial activity condition
- the coding condition may be determined based on the determination result.
- the image coding apparatus 100 determines the coding condition based on the spatial activity condition as shown in FIG. Based on this, the encoding condition may be determined. Specifically, as shown in FIG. 15, the encoding condition may be determined based on the magnitude of the motion vector.
- the encoding condition determination unit calculates a motion vector of the target block (S610). Specifically, the encoding condition determination unit calculates a motion vector of the target block by selecting a reference picture and performing motion prediction.
- the encoding condition determination unit compares the calculated motion vector with a predetermined third threshold value (S620).
- the motion vector is equal to or smaller than the third threshold (No in S620)
- the first encoding condition for subdivision (“the subdivision encoding tool”) is determined as the encoding condition for the target block (S630).
- the second coding condition for large division (“large division coding tool”) is determined as the coding condition for the target block (S640).
- the encoding condition may be determined based only on the motion vector.
- encoding efficiency can be improved by selecting the “large-division encoding tool” when the motion is large.
- the “small division encoding tool” is selected, distortion is likely to spread in the target macroblock, but since the motion between pictures is large, subjective image quality degradation is suppressed.
- Embodiment 3 By recording a program for realizing the configuration of the image encoding method or the image decoding method described in the above embodiment on a storage medium, the processing described in the above embodiment can be easily performed in an independent computer system. It becomes possible.
- the storage medium may be any medium that can record a program, such as a magnetic disk, an optical disk, a magneto-optical disk, an IC card, and a semiconductor memory.
- FIG. 16 is a diagram showing an overall configuration of a content supply system ex100 that realizes a content distribution service.
- the communication service providing area is divided into desired sizes, and base stations ex106 to ex110, which are fixed radio stations, are installed in each cell.
- the content supply system ex100 includes a computer ex111, a PDA (Personal Digital Assistant) ex112, a camera ex113, a mobile phone ex114, a game machine via an Internet ex101, an Internet service provider ex102, a telephone network ex104, and base stations ex106 to ex110. Each device such as ex115 is connected.
- PDA Personal Digital Assistant
- each device may be directly connected to the telephone network ex104 without going through the base stations ex106 to ex110 which are fixed wireless stations.
- the devices may be directly connected to each other via short-range wireless or the like.
- the camera ex113 is a device that can shoot moving images such as a digital video camera
- the camera ex116 is a device that can shoot still images and movies such as a digital camera.
- the mobile phone ex114 is a GSM (Global System for Mobile Communications) method, a CDMA (Code Division Multiple Access) method, a W-CDMA (Wideband-Code Division Multiple Access L (Semiconductor), a W-CDMA (Wideband-Code Divition Multiple Access L) method.
- GSM Global System for Mobile Communications
- CDMA Code Division Multiple Access
- W-CDMA Wideband-Code Division Multiple Access L (Semiconductor)
- W-CDMA Wideband-Code Divition Multiple Access L
- a High Speed Packet Access) mobile phone or a PHS (Personal Handyphone System) may be used.
- the camera ex113 and the like are connected to the streaming server ex103 through the base station ex109 and the telephone network ex104, thereby enabling live distribution and the like.
- the content for example, music live video
- the streaming server ex103 streams the content data transmitted to the requested client.
- the client include a computer ex111, a PDA ex112, a camera ex113, a mobile phone ex114, a game machine ex115, and the like that can decode the encoded data.
- Each device that has received the distributed data decodes and reproduces the received data.
- the encoded processing of the captured data may be performed by the camera ex113, the streaming server ex103 that performs the data transmission processing, or may be performed in a shared manner.
- the decryption processing of the distributed data may be performed by the client, the streaming server ex103, or may be performed in a shared manner.
- still images and / or moving image data captured by the camera ex116 may be transmitted to the streaming server ex103 via the computer ex111.
- the encoding process in this case may be performed by any of the camera ex116, the computer ex111, and the streaming server ex103, or may be performed in a shared manner.
- these encoding processing and decoding processing are generally performed in a computer ex111 and an LSI (Large Scale Integration) ex500 included in each device.
- the LSI ex500 may be configured as a single chip or a plurality of chips.
- image encoding and image decoding software is incorporated into some recording media (CD-ROM, flexible disk, hard disk, etc.) that can be read by the computer ex111 and the like, and encoding processing and decoding processing are performed using the software. May be.
- moving image data acquired by the camera may be transmitted. The moving image data at this time is data encoded by the LSI ex500 included in the mobile phone ex114.
- the streaming server ex103 may be a plurality of servers or a plurality of computers, and may process, record, and distribute data in a distributed manner.
- the encoded data can be received and reproduced by the client.
- the information transmitted by the user can be received, decrypted and reproduced by the client in real time, and even a user who does not have special rights or facilities can realize personal broadcasting.
- the image encoding method or the image decoding method shown in the above embodiment may be used for encoding and decoding of each device constituting the content supply system.
- FIG. 17 is a diagram illustrating the mobile phone ex114 using the image encoding method and the image decoding method described in the above embodiment.
- the cellular phone ex114 includes an antenna ex601 for transmitting and receiving radio waves to and from the base station ex110, a video from a CCD camera, a camera unit ex603 capable of taking a still image, a video shot by the camera unit ex603, and an antenna ex601.
- a display unit ex602 such as a liquid crystal display that displays data obtained by decoding received video and the like, a main body unit composed of a group of operation keys ex604, an audio output unit ex608 such as a speaker for outputting audio, and a voice input Audio input unit ex605 such as a microphone, recorded moving image or still image data, received e-mail data, moving image data or still image data, etc., for storing encoded data or decoded data
- Recording media ex607 can be attached to media ex607 and mobile phone ex114 It has order of slot Ex606.
- the recording medium ex607 stores a flash memory element, which is a kind of EEPROM, which is a nonvolatile memory that can be electrically rewritten and erased, in a plastic case such as an SD card.
- the mobile phone ex114 has a power supply circuit ex710, an operation input control unit ex704, and an image encoding unit for a main control unit ex711 that is configured to control the respective units of the main body unit including the display unit ex602 and the operation key ex604.
- LCD Liquid Crystal Display
- the power supply circuit ex710 activates the camera-equipped digital mobile phone ex114 by supplying power from the battery pack to each unit. .
- the mobile phone ex114 converts the audio signal collected by the audio input unit ex605 in the audio call mode into digital audio data by the audio processing unit ex705 based on the control of the main control unit ex711 including a CPU, a ROM, a RAM, and the like.
- the modulation / demodulation circuit unit ex706 performs spread spectrum processing, the transmission / reception circuit unit ex701 performs digital analog conversion processing and frequency conversion processing, and then transmits the result via the antenna ex601.
- the cellular phone ex114 amplifies the received data received by the antenna ex601 in the voice call mode, performs frequency conversion processing and analog-digital conversion processing, performs spectrum despreading processing by the modulation / demodulation circuit unit ex706, and analog audio by the voice processing unit ex705. After the data is converted, it is output via the audio output unit ex608.
- text data of the e-mail input by operating the operation key ex604 on the main body is sent to the main control unit ex711 via the operation input control unit ex704.
- the main control unit ex711 performs spread spectrum processing on the text data in the modulation / demodulation circuit unit ex706, performs digital analog conversion processing and frequency conversion processing in the transmission / reception circuit unit ex701, and then transmits the text data to the base station ex110 via the antenna ex601.
- the image data captured by the camera unit ex603 is supplied to the image encoding unit ex712 via the camera interface unit ex703.
- the image data captured by the camera unit ex603 can be directly displayed on the display unit ex602 via the camera interface unit ex703 and the LCD control unit ex702.
- the image encoding unit ex712 is configured to include the image encoding device described in the present invention, and an encoding method using the image data supplied from the camera unit ex603 in the image encoding device described in the above embodiment. Is converted into encoded image data by compression encoding and sent to the demultiplexing unit ex708. At the same time, the mobile phone ex114 sends the sound collected by the sound input unit ex605 during imaging by the camera unit ex603 to the demultiplexing unit ex708 via the sound processing unit ex705 as digital sound data.
- the demultiplexing unit ex708 multiplexes the encoded image data supplied from the image encoding unit ex712 and the audio data supplied from the audio processing unit ex705 by a predetermined method, and the resulting multiplexed data is a modulation / demodulation circuit unit A spectrum spread process is performed at ex706, a digital-analog conversion process and a frequency conversion process are performed at the transmission / reception circuit unit ex701, and then the signal is transmitted via the antenna ex601.
- the received data received from the base station ex110 via the antenna ex601 is subjected to spectrum despreading processing by the modulation / demodulation circuit unit ex706, and the resulting multiplexing is obtained.
- Data is sent to the demultiplexing unit ex708.
- the demultiplexing unit ex708 separates the multiplexed data into a bit stream of image data and a bit stream of audio data, and a synchronization bus
- the encoded image data is supplied to the image decoding unit ex709 via ex713 and the audio data is supplied to the audio processing unit ex705.
- the image decoding unit ex709 is configured to include the image decoding device described in the present application, and is reproduced by decoding the bit stream of the image data with a decoding method corresponding to the encoding method described in the above embodiment.
- Moving image data is generated and supplied to the display unit ex602 via the LCD control unit ex702, thereby displaying, for example, moving image data included in a moving image file linked to a home page.
- the audio processing unit ex705 converts the audio data into analog audio data, and then supplies the analog audio data to the audio output unit ex608.
- the audio data included in the moving image file linked to the home page is reproduced.
- the digital broadcasting system also includes at least the image encoding device or the image according to the above embodiment.
- a decoding device can be incorporated.
- audio data, video data, or a bit stream obtained by multiplexing these data is transmitted to a communication or broadcasting satellite ex202 via radio waves.
- the broadcasting satellite ex202 transmits a radio wave for broadcasting, and a home antenna ex204 having a satellite broadcasting receiving facility receives the radio wave, and receives a TV (receiver) ex300 or a set top box (STB) ex217.
- the device decodes the bitstream and reproduces it.
- the reader / recorder ex218 that reads and decodes a bitstream in which image data and audio data recorded on recording media ex215 and ex216 such as CD and DVD as recording media are multiplexed is also shown in the above embodiment. It is possible to implement an image decoding device. In this case, the reproduced video signal is displayed on the monitor ex219. Further, a configuration in which an image decoding device is mounted in a set-top box ex217 connected to a cable ex203 for cable television or an antenna ex204 for satellite / terrestrial broadcasting, and this is reproduced on the monitor ex219 of the television is also conceivable. At this time, the image decoding apparatus may be incorporated in the television instead of the set top box.
- a car ex210 having an antenna ex205 can receive a signal from a satellite ex202 or a base station and reproduce a moving image on a display device such as a car navigation ex211 included in the car ex210.
- the image decoding apparatus or the image encoding apparatus described in the above embodiment can also be mounted on the reader / recorder ex218 that encodes the data and records the multiplexed data as multiplexed data.
- the reproduced video signal is displayed on the monitor ex219.
- the recording medium ex215 on which the encoded bitstream is recorded allows other devices, systems, and the like to reproduce the video signal.
- the other reproduction device ex212 can reproduce the video signal on the monitor ex213 using the recording medium ex214 on which the encoded bitstream is copied.
- an image decoding device may be mounted in a set-top box ex217 connected to a cable ex203 for cable television or an antenna ex204 for satellite / terrestrial broadcasting and displayed on the monitor ex219 of the television.
- the image decoding apparatus may be incorporated in the television instead of the set top box.
- FIG. 20 is a diagram illustrating a television (receiver) ex300 that uses the image decoding method and the image encoding method described in the above embodiment.
- the television ex300 obtains or outputs a bit stream of video information via the antenna ex204 or the cable ex203 that receives the broadcast, and a demodulator of the received encoded data or the generated encoding.
- a modulation / demodulation unit ex302 that modulates data for transmission to the outside, and a multiplexing / separation unit ex303 that separates demodulated video data and audio data, or multiplexes encoded video data and audio data Is provided.
- the television ex300 decodes each of the audio data and the video data, or encodes each information, an audio signal processing unit ex304, a signal processing unit ex306 having a video signal processing unit ex305, and the decoded audio signal. And an output unit ex309 including a display unit ex308 such as a display for displaying the decoded video signal.
- the television ex300 includes an interface unit ex317 including an operation input unit ex312 that receives an input of a user operation.
- the television ex300 includes a control unit ex310 that controls each unit in an integrated manner, and a power supply circuit unit ex311 that supplies power to each unit.
- the interface unit ex317 includes a bridge ex313 connected to an external device such as a reader / recorder ex218, a slot unit ex314 for enabling recording media ex216 such as an SD card, and an external recording such as a hard disk
- a driver ex315 for connecting to a medium, a modem ex316 for connecting to a telephone network, and the like may be included.
- the recording medium ex216 is capable of electrically recording information by using a nonvolatile / volatile semiconductor memory element to be stored.
- Each part of the television ex300 is connected to each other via a synchronous bus.
- the television ex300 receives a user operation from the remote controller ex220 or the like, and demultiplexes the video data and audio data demodulated by the modulation / demodulation unit ex302 by the multiplexing / separation unit ex303 based on the control of the control unit ex310 having a CPU or the like. . Furthermore, the television ex300 decodes the separated audio data by the audio signal processing unit ex304, and the separated video data is decoded by the video signal processing unit ex305 using the decoding method described in the above embodiment. The decoded audio signal and video signal are output to the outside from the output unit ex309.
- the television ex300 may read the encoded bitstream encoded from the recording media ex215 and ex216 such as a magnetic / optical disk and an SD card, not from broadcasting.
- the television ex300 encodes an audio signal and a video signal and transmits the signal to the outside or to a recording medium.
- the television ex300 receives a user operation from the remote controller ex220 or the like, and encodes an audio signal with the audio signal processing unit ex304 based on the control of the control unit ex310, and the video signal with the video signal processing unit ex305 in the above embodiment.
- Encoding is performed using the described encoding method.
- the encoded audio signal and video signal are multiplexed by the multiplexing / demultiplexing unit ex303 and output to the outside.
- these signals may be temporarily stored in the buffers ex320, ex321, etc. so that the audio signal and the video signal are synchronized.
- a plurality of buffers ex318 to ex321 may be provided as shown in the figure, or one or more buffers may be shared.
- data may be stored in the buffer as a buffer material that prevents system overflow and underflow, for example, between the modulation / demodulation unit ex302 and the multiplexing / demultiplexing unit ex303.
- the television ex300 In addition to acquiring audio data and video data from broadcasting and recording media, the television ex300 has a configuration for receiving AV input of a microphone and a camera, and can perform encoding processing on the data acquired from these. Good.
- the television ex300 has been described as a configuration that can perform the above-described encoding processing, multiplexing, and external output. However, all of these processing cannot be performed, and the reception, decoding processing, and external
- the configuration may be such that only one of the outputs is possible.
- the decoding process or the encoding process may be performed by either the television ex300 or the reader / recorder ex218.
- the television ex300 and the reader / recorder ex218 may be shared with each other.
- FIG. 21 shows the configuration of the information reproducing / recording unit ex400 when data is read from or written to an optical disk.
- the information reproducing / recording unit ex400 includes elements ex401 to ex407 described below.
- the optical head ex401 irradiates a laser spot on the recording surface of the recording medium ex215 that is an optical disc to write information, and detects information reflected from the recording surface of the recording medium ex215 to read the information.
- the modulation recording unit ex402 electrically drives a semiconductor laser built in the optical head ex401 and modulates the laser beam according to the recording data.
- the reproduction demodulator ex403 amplifies the reproduction signal obtained by electrically detecting the reflected light from the recording surface by the photodetector built in the optical head ex401, separates and demodulates the signal component recorded on the recording medium ex215, and is necessary. To play back information.
- the buffer ex404 temporarily holds information to be recorded on the recording medium ex215 and information reproduced from the recording medium ex215.
- the disk motor ex405 rotates the recording medium ex215.
- the servo control unit ex406 moves the optical head ex401 to a predetermined information track while controlling the rotational drive of the disk motor ex405, and performs a laser spot tracking process.
- the system control unit ex407 controls the entire information reproduction / recording unit ex400.
- the system control unit ex407 uses various types of information held in the buffer ex404, and generates and adds new information as necessary. This is realized by recording / reproducing information through the optical head ex401 while the unit ex403 and the servo control unit ex406 are cooperatively operated.
- the system control unit ex407 includes, for example, a microprocessor, and executes these processes by executing a read / write program.
- the optical head ex401 has been described as irradiating a laser spot, but it may be configured to perform higher-density recording using near-field light.
- FIG. 22 shows a schematic diagram of a recording medium ex215 that is an optical disk.
- Guide grooves grooves
- address information indicating the absolute position on the disc is recorded in advance on the information track ex230 by changing the shape of the groove.
- This address information includes information for specifying the position of the recording block ex231 which is a unit for recording data, and the recording and reproducing apparatus specifies the recording block by reproducing the information track ex230 and reading the address information. be able to.
- the recording medium ex215 includes a data recording area ex233, an inner peripheral area ex232, and an outer peripheral area ex234.
- the area used for recording user data is the data recording area ex233, and the inner circumference area ex232 and the outer circumference area ex234 arranged on the inner circumference or outer circumference of the data recording area ex233 are used for specific purposes other than user data recording. Used.
- the information reproducing / recording unit ex400 reads / writes encoded audio data, video data, or encoded data obtained by multiplexing these data to / from the data recording area ex233 of the recording medium ex215.
- an optical disk such as a single-layer DVD or BD has been described as an example.
- the present invention is not limited to these, and an optical disk having a multilayer structure and capable of recording other than the surface may be used.
- it has a structure that performs multi-dimensional recording / reproduction, such as recording information using light of various colors at the same location on the disk, and recording different information layers from various angles. It may be an optical disk.
- the car ex210 having the antenna ex205 can receive data from the satellite ex202 and the like, and the moving image can be reproduced on a display device such as the car navigation ex211 that the car ex210 has.
- the configuration of the car navigation ex211 may include a configuration in which a GPS receiving unit is added to the configuration illustrated in FIG. 20, and the same may be applied to the computer ex111, the mobile phone ex114, and the like.
- the transmission / reception terminal having both an encoder and a decoder there are three types of terminals such as the mobile phone ex114, such as a transmitting terminal having only an encoder and a receiving terminal having only a decoder, as in the television ex300. The implementation form of can be considered.
- the image encoding method or the image decoding method described in the above embodiment can be used in any of the above-described devices and systems, and by doing so, the effects described in the above embodiment can be obtained. be able to.
- FIG. 23 shows a configuration of an LSI ex500 that is made into one chip.
- the LSI ex500 includes elements ex501 to ex509 described below, and each element is connected via a bus ex510.
- the power supply circuit unit ex505 starts up to an operable state by supplying power to each unit when the power supply is in an on state.
- the LSI ex500 when performing the encoding process, inputs AV signals from the microphone ex117, the camera ex113, and the like by the AV I / Oex 509 based on the control of the control unit ex501 having the CPU ex502, the memory controller ex503, the stream controller ex504, and the like. Accept.
- the input AV signal is temporarily stored in an external memory ex511 such as SDRAM.
- the accumulated data is appropriately divided into a plurality of times according to the processing amount and the processing speed, and sent to the signal processing unit ex507.
- the signal processing unit ex507 performs encoding of the audio signal and / or encoding of the video signal.
- the encoding process of the video signal is the encoding process described in the above embodiment.
- the signal processing unit ex507 further performs processing such as multiplexing the encoded audio data and the encoded video data according to circumstances, and outputs the result from the stream I / Oex 506 to the outside.
- the output bit stream is transmitted to the base station ex107 or written to the recording medium ex215. It should be noted that data should be temporarily stored in the buffer ex508 so as to be synchronized when multiplexing.
- the LSI ex500 is obtained by reading from the encoded data obtained from the base station ex107 by the stream I / Oex 506 or the recording medium ex215 based on the control of the control unit ex501.
- the encoded data is temporarily stored in the memory ex511 or the like.
- the accumulated data is divided into a plurality of times as appropriate according to the processing amount and the processing speed and sent to the signal processing unit ex507.
- the signal processing unit ex507 performs decoding of audio data and / or decoding of video data.
- the decoding process of the video signal is the decoding process described in the above embodiment.
- each signal may be temporarily stored in the buffer ex508 or the like so that the decoded audio signal and the decoded video signal can be reproduced in synchronization.
- the decoded output signal is output from each output unit such as the mobile phone ex114, the game machine ex115, and the television ex300, through the memory ex511 and the like as appropriate.
- the memory ex511 has been described as an external configuration of the LSI ex500.
- a configuration included in the LSI ex500 may be used.
- the buffer ex508 is not limited to one, and a plurality of buffers may be provided.
- the LSI ex500 may be made into one chip or a plurality of chips.
- LSI LSI
- IC system LSI
- super LSI ultra LSI depending on the degree of integration
- the method of circuit integration is not limited to LSI, and may be realized by a dedicated circuit or a general-purpose processor.
- An FPGA that can be programmed after manufacturing the LSI or a reconfigurable processor that can reconfigure the connection and setting of circuit cells inside the LSI may be used.
- the present invention has an effect of suppressing an increase in calculation amount and sufficiently suppressing deterioration in image quality, and can be used for an image encoding device that encodes a still image and a moving image.
- the present invention can be used for various AV devices such as a mobile phone, a digital camera, a BD recorder, and a digital television.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
本発明の実施の形態1に係る画像符号化方法は、原画像を符号化する符号化方法であって、原画像を複数のブロックに分割し、複数のブロックの1つである対象ブロックの符号化条件を決定し、決定した符号化条件に従って、対象ブロックを符号化する。そして、対象ブロックの符号化条件の決定では、対象ブロックの複雑度を示す値である空間アクティビティ値に基づいて、対象ブロックの符号化条件を決定する。
本発明の実施の形態2に係る画像符号化方法は、実施の形態1に示すような空間アクティビティ条件の評価だけでなく、動きアクティビティ条件の評価を行い、評価結果に基づいて、対象ブロックの符号化条件を決定することを特徴とする。また、空間アクティビティ条件の評価については、対象ブロックに含まれる複数の大ブロックの空間アクティビティと、対象ブロックを含む拡張ブロックに含まれる複数の小ブロッククラスターの空間アクティビティとを評価する。
(実施の形態3)
上記実施の形態で示した画像符号化方法又は画像復号方法の構成を実現するためのプログラムを記憶メディアに記録することにより、上記実施の形態で示した処理を独立したコンピュータシステムにおいて簡単に実施することが可能となる。記憶メディアは、磁気ディスク、光ディスク、光磁気ディスク、ICカード、半導体メモリ等、プログラムを記録できるものであればよい。
上記各実施の形態で示した画像符号化方法及び装置、画像復号方法及び装置は、典型的には集積回路であるLSIで実現される。一例として、図23に1チップ化されたLSIex500の構成を示す。LSIex500は、以下に説明する要素ex501~ex509を備え、各要素はバスex510を介して接続している。電源回路部ex505は電源がオン状態の場合に各部に対して電力を供給することで動作可能な状態に起動する。
110 分割部
120、320 符号化条件決定部
121 空間アクティビティ値算出部
122 比較部
123 決定部
130、330 符号化部
201 原画像
202 拡張ブロック
203 対象ブロック
204 符号化条件
205 符号化画像
211 空間アクティビティ値
212 比較結果
310 サンプル抽出部
321 空間アクティビティ条件評価部
322 予測タイプ評価部
323、708、709 スイッチ部
324 動きアクティビティ条件評価部
325 ORゲート部
331 画像符号化部
332 画像復号部
333 ピクチャメモリ部
334 エントロピー符号化部
401 利用可能予測タイプ情報
402 原画ピクチャ
403、404 原画サンプル
405、806 符号化ツール決定信号
406、604、608 制御信号
407、408、409 符号化ツール決定信号
410 インター予測サンプル
411 最終符号化ツール決定信号
412 量子化残差
413 再構築サンプル
414 再構築ピクチャ
415 圧縮ビットストリーム
501 大ブロック分割部
502 大ブロック空間アクティビティ値算出部
503 最大値算出部
504、508 空間アクティビティ値比較部
505 小ブロッククラスター形成部
506 小ブロッククラスター空間アクティビティ値算出部
507 最小値算出部
509 符号化ツール決定部
601 大ブロックサンプル
602 大ブロック空間アクティビティ値
603 最大空間アクティビティ値
605 小ブロッククラスターサンプル
606 小ブロッククラスター空間アクティビティ値
607 最小空間アクティビティ値
701 参照ピクチャ選択部
702 動き予測部
703 動き補償部
704 コスト算出部
705 コスト比較部
706 動きベクトル比較部
707 符号化ツール決定初期化部
801 参照ピクチャ
802 参照ピクチャインデックス
803 動きベクトル
804、808 制御信号
805 初期化信号
807 コスト値
ex100 コンテンツ供給システム
ex101 インターネット
ex102 インターネットサービスプロバイダ
ex103 ストリーミングサーバ
ex104 電話網
ex106、ex107、ex108、ex109、ex110 基地局
ex111 コンピュータ
ex112 PDA
ex113、ex116 カメラ
ex114 カメラ付デジタル携帯電話(携帯電話)
ex115 ゲーム機
ex117 マイク
ex200 デジタル放送用システム
ex201 放送局
ex202 放送衛星(衛星)
ex203 ケーブル
ex204、ex205、ex601 アンテナ
ex210 車
ex211 カーナビゲーション(カーナビ)
ex212 再生装置
ex213、ex219 モニタ
ex214、ex215、ex216、ex607 記録メディア
ex217 セットトップボックス(STB)
ex218 リーダ/レコーダ
ex220 リモートコントローラ
ex230 情報トラック
ex231 記録ブロック
ex232 内周領域
ex233 データ記録領域
ex234 外周領域
ex300 テレビ
ex301 チューナ
ex302 変調/復調部
ex303 多重/分離部
ex304 音声信号処理部
ex305 映像信号処理部
ex306、ex507 信号処理部
ex307 スピーカ
ex308、ex602 表示部
ex309 出力部
ex310、ex501 制御部
ex311、ex505、ex710 電源回路部
ex312 操作入力部
ex313 ブリッジ
ex314、ex606 スロット部
ex315 ドライバ
ex316 モデム
ex317 インターフェース部
ex318、ex319、ex320、ex321、ex404、ex508 バッファ
ex400 情報再生/記録部
ex401 光ヘッド
ex402 変調記録部
ex403 再生復調部
ex405 ディスクモータ
ex406 サーボ制御部
ex407 システム制御部
ex500 LSI
ex502 CPU
ex503 メモリコントローラ
ex504 ストリームコントローラ
ex506 ストリームI/O
ex509 AV I/O
ex510 バス
ex511 メモリ
ex603 カメラ部
ex604 操作キー
ex605 音声入力部
ex608 音声出力部
ex701 送受信回路部
ex702 LCD制御部
ex703 カメラインターフェース部(カメラI/F部)
ex704 操作入力制御部
ex705 音声処理部
ex706 変復調回路部
ex707 記録再生部
ex708 多重分離部
ex709 画像復号部
ex711 主制御部
ex712 画像符号化部
ex713 同期バス
Claims (13)
- 原画像を符号化する画像符号化方法であって、
前記原画像を複数のブロックに分割し、
前記複数のブロックの1つである対象ブロックの符号化条件を決定し、
決定した符号化条件に従って、前記対象ブロックを符号化し、
前記対象ブロックの符号化条件の決定では、
前記対象ブロックの複雑度を示す値である空間アクティビティ値に基づいて、前記対象ブロックの符号化条件を決定し、
前記空間アクティビティ値に基づいた符号化条件の決定では、
前記対象ブロックの少なくとも一部の領域の複雑度を示す値である第1空間アクティビティ値を算出し、
算出した第1空間アクティビティ値と予め定められた第1閾値とを比較し、
前記第1空間アクティビティ値が前記第1閾値より小さい場合、前記対象ブロックを第1分割サイズのサブブロック毎に符号化することを示す第1符号化条件を、前記対象ブロックの符号化条件として決定し、
前記第1空間アクティビティ値が前記第1閾値以上である場合、前記対象ブロックを、前記第1分割サイズより大きい第2分割サイズのサブブロック毎に符号化することを示す第2符号化条件を、前記対象ブロックの符号化条件として決定する
画像符号化方法。 - 前記第1空間アクティビティ値の算出では、
前記対象ブロックを少なくとも含む拡張ブロックに基づいて、前記対象ブロックの少なくとも一部の領域をそれぞれが含む複数の小ブロッククラスターを形成し、
前記複数の小ブロッククラスターのそれぞれの複雑度を示す値である小ブロッククラスター空間アクティビティ値を算出し、算出した複数の小ブロッククラスター空間アクティビティ値のうち最小の値を、前記第1空間アクティビティ値として算出する
請求項1記載の画像符号化方法。 - 前記符号化条件の決定では、さらに、
前記対象ブロックを複数の大ブロックに分割し、
前記複数の大ブロックのそれぞれの複雑度を示す値である大ブロック空間アクティビティ値を算出し、算出した複数の大ブロック空間アクティビティ値のうち最大の値を、第2空間アクティビティ値として算出し、
算出した第2空間アクティビティ値と予め定められた第2閾値とを比較し、
前記第2空間アクティビティ値が前記第2閾値以下である場合、前記第2符号化条件を前記対象ブロックの符号化条件として決定し、
前記第2空間アクティビティ値が前記第2閾値より大きい場合、前記第1空間アクティビティ値の前記算出及び前記比較を行うことで、前記対象ブロックの符号化条件を決定する
請求項2記載の画像符号化方法。 - 前記拡張ブロックは、前記対象ブロックより大きい
請求項2記載の画像符号化方法。 - 前記対象ブロックの複雑度を示す値は、前記対象ブロックに含まれる複数のサンプルのサンプル値の変動を示す値である
請求項1記載の画像符号化方法。 - 前記符号化条件の決定では、さらに、
前記対象ブロックの動きに基づいて、前記対象ブロックの符号化条件を決定し、
前記動きに基づいた符号化条件の決定では、
前記対象ブロックの動きベクトルを算出し、
算出した動きベクトルの絶対値と予め定められた第3閾値とを比較し、
前記動きベクトルの絶対値が前記第3閾値より大きい場合、前記第2符号化条件を前記対象ブロックの符号化条件として決定し、
前記動きベクトルの絶対値が前記第3閾値以下である場合、前記空間アクティビティ値に基づいた符号化条件の決定結果に従って、前記対象ブロックの符号化条件を決定する
請求項1記載の画像符号化方法。 - 前記動きに基づいた符号化条件の決定では、さらに、
算出した動きベクトルを用いて動き補償を行うことで、前記対象ブロックの予測ブロックを生成し、
前記対象ブロックと前記予測ブロックと前記動きベクトルとを用いて、前記対象ブロックの符号化効率を示すコスト値を算出し、
前記動きベクトルの絶対値が前記第3閾値以下である場合、前記コスト値と予め定められた第4閾値との比較を行い、
前記コスト値が前記第4閾値より小さい場合、前記第2符号化条件を前記対象ブロックの符号化条件として決定し、
前記コスト値が前記第4閾値以上である場合、前記空間アクティビティ値に基づいた符号化条件の決定結果に従って、前記対象ブロックの符号化条件を決定する
請求項6記載の画像符号化方法。 - 前記コスト値は、前記対象ブロックと前記予測ブロックとの差分絶対値和に基づいて算出される
請求項6記載の画像符号化方法。 - 前記第1分割サイズ又は前記第2分割サイズのサブブロックは、イントラ予測、動き補償、及び、周波数変換の少なくとも1つの処理が実行される処理単位である
請求項1記載の画像符号化方法。 - 原画像を符号化する画像符号化方法であって、
前記原画像を複数のブロックに分割し、
前記複数のブロックの1つである対象ブロックの符号化条件を決定し、
決定した符号化条件に従って、前記対象ブロックを符号化し、
前記対象ブロックの符号化条件の決定では、
前記対象ブロックの動きベクトルを算出し、
算出した動きベクトルの絶対値と予め定められた閾値とを比較し、
前記動きベクトルの絶対値が前記閾値以下である場合、前記対象ブロックを第1分割サイズのサブブロック毎に符号化することを示す第1符号化条件を、前記対象ブロックの符号化条件として決定し、
前記動きベクトルの絶対値が前記閾値より大きい場合、前記対象ブロックを、前記第1分割サイズより大きい第2分割サイズのサブブロック毎に符号化することを示す第2符号化条件を前記対象ブロックの符号化条件として決定する
画像符号化方法。 - 原画像を符号化する画像符号化装置であって、
前記原画像を複数のブロックに分割する分割部と、
前記複数のブロックの1つである対象ブロックの符号化条件を決定する符号化条件決定部と、
前記符号化条件決定部によって決定された符号化条件に従って、前記対象ブロックを符号化する符号化部とを備え、
前記符号化条件決定部は、
前記対象ブロックの少なくとも一部の領域の複雑度を示す値である空間アクティビティ値を算出する空間アクティビティ値算出部と、
前記空間アクティビティ値算出部によって算出された空間アクティビティ値と予め定められた閾値とを比較する比較部と、
前記空間アクティビティ値が前記閾値より小さい場合、前記対象ブロックを第1分割サイズのサブブロック毎に符号化することを示す第1符号化条件を、前記対象ブロックの符号化条件として決定し、前記空間アクティビティ値が前記閾値以上である場合、前記対象ブロックを、前記第1分割サイズより大きい第2分割サイズのサブブロック毎に符号化することを示す第2符号化条件を、前記対象ブロックの符号化条件として決定する決定部とを備える
画像符号化装置。 - 原画像を符号化する画像符号化方法をコンピュータに実行させるためのプログラムであって、
前記画像符号化方法では、
前記原画像を複数のブロックに分割し、
前記複数のブロックの1つである対象ブロックの符号化条件を決定し、
決定した符号化条件に従って、前記対象ブロックを符号化し、
前記対象ブロックの符号化条件の決定では、
前記対象ブロックの少なくとも一部の領域の複雑度を示す値である空間アクティビティ値を算出し、
算出した空間アクティビティ値と予め定められた閾値とを比較し、
前記空間アクティビティ値が前記閾値より小さい場合、前記対象ブロックを第1分割サイズのサブブロック毎に符号化することを示す第1符号化条件を、前記対象ブロックの符号化条件として決定し、
前記空間アクティビティ値が前記閾値以上である場合、前記対象ブロックを、前記第1分割サイズより大きい第2分割サイズのサブブロック毎に符号化することを示す第2符号化条件を、前記対象ブロックの符号化条件として決定する
プログラム。 - 原画像を符号化する集積回路であって、
前記原画像を複数のブロックに分割する分割部と、
前記複数のブロックの1つである対象ブロックの符号化条件を決定する符号化条件決定部と、
前記符号化条件決定部によって決定された符号化条件に従って、前記対象ブロックを符号化する符号化部とを備え、
前記符号化条件決定部は、
前記対象ブロックの少なくとも一部の領域の複雑度を示す値である空間アクティビティ値を算出する空間アクティビティ値算出部と、
前記空間アクティビティ値算出部によって算出された空間アクティビティ値と予め定められた閾値とを比較する比較部と、
前記空間アクティビティ値が前記閾値より小さい場合、前記対象ブロックを第1分割サイズのサブブロック毎に符号化することを示す第1符号化条件を、前記対象ブロックの符号化条件として決定し、前記空間アクティビティ値が前記閾値以上である場合、前記対象ブロックを、前記第1分割サイズより大きい第2分割サイズのサブブロック毎に符号化することを示す第2符号化条件を、前記対象ブロックの符号化条件として決定する決定部とを備える
集積回路。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201080002356.9A CN102124741B (zh) | 2009-06-22 | 2010-06-16 | 图像编码方法及图像编码装置 |
JP2011519558A JP5588438B2 (ja) | 2009-06-22 | 2010-06-16 | 画像符号化方法及び画像符号化装置 |
US13/059,804 US8902985B2 (en) | 2009-06-22 | 2010-06-16 | Image coding method and image coding apparatus for determining coding conditions based on spatial-activity value |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2009148043 | 2009-06-22 | ||
JP2009-148043 | 2009-06-22 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2010150486A1 true WO2010150486A1 (ja) | 2010-12-29 |
Family
ID=43386274
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2010/003985 WO2010150486A1 (ja) | 2009-06-22 | 2010-06-16 | 画像符号化方法及び画像符号化装置 |
Country Status (4)
Country | Link |
---|---|
US (1) | US8902985B2 (ja) |
JP (1) | JP5588438B2 (ja) |
CN (1) | CN102124741B (ja) |
WO (1) | WO2010150486A1 (ja) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015012253A1 (ja) * | 2013-07-25 | 2015-01-29 | 日本電信電話株式会社 | 画像符号化装置及び方法、及び画像符号化プログラム |
RU2551473C2 (ru) * | 2011-03-09 | 2015-05-27 | Ниппон Телеграф Энд Телефон Корпорейшн | Устройство кодирования видео, способ кодирования видео и программа кодирования видео |
JP2015115903A (ja) * | 2013-12-13 | 2015-06-22 | キヤノン株式会社 | 撮像装置、撮像装置の制御方法、コンピュータプログラム |
JP2015211269A (ja) * | 2014-04-24 | 2015-11-24 | 富士通株式会社 | 動画像符号化装置、動画像符号化方法及び動画像符号化用コンピュータプログラム |
JP2016213615A (ja) * | 2015-05-01 | 2016-12-15 | 富士通株式会社 | 動画像符号化装置、動画像符号化方法及び動画像符号化用コンピュータプログラム |
KR101739822B1 (ko) | 2015-12-07 | 2017-05-25 | (주)넥스트칩 | 아날로그 영상 신호 압축 방법 및 장치 |
JP2017515374A (ja) * | 2014-04-15 | 2017-06-08 | クアルコム,インコーポレイテッド | ディスプレイストリーム圧縮(dsc)のための平坦度検出のためのシステムおよび方法 |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2674310C (en) * | 2007-01-16 | 2016-05-24 | Thomson Licensing | System and method for reducing artifacts in images |
KR101483497B1 (ko) * | 2008-09-25 | 2015-01-20 | 에스케이텔레콤 주식회사 | 임펄스 신호를 고려한 영상 부호화/복호화 장치 및 방법 |
JP6222576B2 (ja) * | 2012-03-21 | 2017-11-01 | サン パテント トラスト | 画像符号化方法、画像復号方法、画像符号化装置、画像復号装置、および画像符号化復号装置 |
JP6083973B2 (ja) * | 2012-08-08 | 2017-02-22 | 株式会社メガチップス | データ記憶制御装置およびデータ記憶方法 |
US10003792B2 (en) * | 2013-05-27 | 2018-06-19 | Microsoft Technology Licensing, Llc | Video encoder for images |
EP3120556B1 (en) | 2014-03-17 | 2021-01-13 | Microsoft Technology Licensing, LLC | Encoder-side decisions for screen content encoding |
JP2016066922A (ja) * | 2014-09-25 | 2016-04-28 | ソニー株式会社 | 信号処理装置、撮像装置、および、それらにおける信号処理方法。 |
US10924743B2 (en) | 2015-02-06 | 2021-02-16 | Microsoft Technology Licensing, Llc | Skipping evaluation stages during media encoding |
US10136132B2 (en) | 2015-07-21 | 2018-11-20 | Microsoft Technology Licensing, Llc | Adaptive skip or zero block detection combined with transform size decision |
CN116883522A (zh) * | 2015-12-14 | 2023-10-13 | 松下电器(美国)知识产权公司 | 三维数据编码方法、解码方法、编码装置、解码装置 |
US10674172B2 (en) * | 2016-04-19 | 2020-06-02 | Mitsubishi Electric Corporation | Image processing apparatus, image processing method, and computer-readable recording medium |
WO2018008678A1 (ja) * | 2016-07-08 | 2018-01-11 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ | 復号装置、符号化装置、復号方法及び符号化方法 |
US10694202B2 (en) * | 2016-12-01 | 2020-06-23 | Qualcomm Incorporated | Indication of bilateral filter usage in video coding |
KR20230143626A (ko) | 2017-03-22 | 2023-10-12 | 한양대학교 산학협력단 | 영상을 구성하는 화소값 범위를 이용한 영상 부호화/복호화 방법 |
CN116527883A (zh) * | 2019-01-16 | 2023-08-01 | Oppo广东移动通信有限公司 | 信息处理方法及装置、设备、存储介质 |
US11190774B1 (en) * | 2020-05-29 | 2021-11-30 | Microsoft Technology Licensing, Llc | Screen content encoding mode evaluation including intra-block evaluation of multiple potential encoding modes |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0591338A (ja) * | 1991-09-30 | 1993-04-09 | Victor Co Of Japan Ltd | 映像情報量圧縮及び/又は情報量伸長装置 |
JP2006519565A (ja) * | 2003-03-03 | 2006-08-24 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | ビデオ符号化 |
JP2007060164A (ja) * | 2005-08-23 | 2007-03-08 | Matsushita Electric Ind Co Ltd | 動きベクトル検出装置および動きベクトル検出方法 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH07284105A (ja) | 1994-04-13 | 1995-10-27 | Matsushita Electric Ind Co Ltd | 動きベクトル検出装置 |
KR0139164B1 (ko) | 1994-12-19 | 1998-06-01 | 김광호 | 적응적 직교변환부호화 장치 |
JP4447197B2 (ja) | 2002-01-07 | 2010-04-07 | 三菱電機株式会社 | 動画像符号化装置および動画像復号装置 |
US20040125204A1 (en) | 2002-12-27 | 2004-07-01 | Yoshihisa Yamada | Moving picture coding apparatus and moving picture decoding apparatus |
JP2006519564A (ja) | 2003-03-03 | 2006-08-24 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | ビデオ符号化 |
US20060002474A1 (en) * | 2004-06-26 | 2006-01-05 | Oscar Chi-Lim Au | Efficient multi-block motion estimation for video compression |
JP2007110568A (ja) | 2005-10-14 | 2007-04-26 | Matsushita Electric Ind Co Ltd | 画像符号化装置 |
KR100728031B1 (ko) * | 2006-01-23 | 2007-06-14 | 삼성전자주식회사 | 가변 블록 크기 움직임 예측을 위한 부호화 모드 결정 방법및 장치 |
JP4775132B2 (ja) | 2006-06-20 | 2011-09-21 | ソニー株式会社 | 画像理装置および方法、プログラム、並びに記録媒体 |
-
2010
- 2010-06-16 CN CN201080002356.9A patent/CN102124741B/zh active Active
- 2010-06-16 JP JP2011519558A patent/JP5588438B2/ja active Active
- 2010-06-16 WO PCT/JP2010/003985 patent/WO2010150486A1/ja active Application Filing
- 2010-06-16 US US13/059,804 patent/US8902985B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0591338A (ja) * | 1991-09-30 | 1993-04-09 | Victor Co Of Japan Ltd | 映像情報量圧縮及び/又は情報量伸長装置 |
JP2006519565A (ja) * | 2003-03-03 | 2006-08-24 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | ビデオ符号化 |
JP2007060164A (ja) * | 2005-08-23 | 2007-03-08 | Matsushita Electric Ind Co Ltd | 動きベクトル検出装置および動きベクトル検出方法 |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
RU2551473C2 (ru) * | 2011-03-09 | 2015-05-27 | Ниппон Телеграф Энд Телефон Корпорейшн | Устройство кодирования видео, способ кодирования видео и программа кодирования видео |
US9445089B2 (en) | 2011-03-09 | 2016-09-13 | Nippon Telegraph And Telephone Corporation | Video encoding device, video encoding method and video encoding program |
WO2015012253A1 (ja) * | 2013-07-25 | 2015-01-29 | 日本電信電話株式会社 | 画像符号化装置及び方法、及び画像符号化プログラム |
JP2015026964A (ja) * | 2013-07-25 | 2015-02-05 | 日本電信電話株式会社 | 画像符号化方法、画像符号化装置及び画像符号化プログラム |
US20160156911A1 (en) * | 2013-07-25 | 2016-06-02 | Nippon Telegraph And Telephone Corporation | Image encoding apparatus and method, and image encoding program |
US10375402B2 (en) | 2013-07-25 | 2019-08-06 | Nippon Telegraph And Telephone Corporation | Image encoding apparatus and method, and image encoding program |
JP2015115903A (ja) * | 2013-12-13 | 2015-06-22 | キヤノン株式会社 | 撮像装置、撮像装置の制御方法、コンピュータプログラム |
JP2017515374A (ja) * | 2014-04-15 | 2017-06-08 | クアルコム,インコーポレイテッド | ディスプレイストリーム圧縮(dsc)のための平坦度検出のためのシステムおよび方法 |
JP2015211269A (ja) * | 2014-04-24 | 2015-11-24 | 富士通株式会社 | 動画像符号化装置、動画像符号化方法及び動画像符号化用コンピュータプログラム |
JP2016213615A (ja) * | 2015-05-01 | 2016-12-15 | 富士通株式会社 | 動画像符号化装置、動画像符号化方法及び動画像符号化用コンピュータプログラム |
KR101739822B1 (ko) | 2015-12-07 | 2017-05-25 | (주)넥스트칩 | 아날로그 영상 신호 압축 방법 및 장치 |
Also Published As
Publication number | Publication date |
---|---|
US20110142134A1 (en) | 2011-06-16 |
JPWO2010150486A1 (ja) | 2012-12-06 |
CN102124741B (zh) | 2014-09-24 |
JP5588438B2 (ja) | 2014-09-10 |
CN102124741A (zh) | 2011-07-13 |
US8902985B2 (en) | 2014-12-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5588438B2 (ja) | 画像符号化方法及び画像符号化装置 | |
JP5574345B2 (ja) | 符号化方法、エラー検出方法、復号方法、符号化装置、エラー検出装置及び復号装置 | |
JP5479470B2 (ja) | 動画像符号化方法、動画像符号化装置、プログラム、および集積回路 | |
WO2010087157A1 (ja) | 画像符号化方法及び画像復号方法 | |
WO2010050156A1 (ja) | 画像符号化方法、画像復号方法、画像符号化装置、画像復号装置、集積回路およびプログラム | |
WO2010026770A1 (ja) | 画像符号化方法、画像復号方法、画像符号化装置、画像復号装置、システム、プログラム、及び集積回路 | |
WO2011016247A1 (ja) | 符号化方法、復号方法、符号化装置及び復号装置 | |
WO2010113524A1 (ja) | 動画像符号化方法、動画像復号方法、動画像符号化装置および動画像復号装置 | |
WO2010061607A1 (ja) | 動画像復号方法、動画像符号化方法、動画像復号装置、動画像符号化方法、プログラム、及び集積回路 | |
WO2011052215A1 (ja) | 復号方法、復号装置、符号化方法、および符号化装置 | |
WO2011016251A1 (ja) | 符号化方法、復号方法、符号化装置及び復号装置 | |
WO2011052217A1 (ja) | 画像復号方法、画像符号化方法、画像復号装置、画像符号化装置、プログラム、及び集積回路 | |
WO2011016250A1 (ja) | 符号化方法、復号方法、符号化装置及び復号装置 | |
WO2011111341A1 (ja) | 動画像復号装置、動画像符号化装置、動画像復号回路及び動画像復号方法 | |
JP5679980B2 (ja) | 画像符号化方法、画像復号方法、画像符号化装置及び画像復号装置 | |
JP5499035B2 (ja) | 画像符号化方法、画像符号化装置、プログラムおよび集積回路 | |
JP5546044B2 (ja) | 画像復号装置、画像符号化装置、画像復号回路及び画像復号方法 | |
WO2011052216A1 (ja) | 画像符号化方法、画像復号方法、画像符号化装置および画像復号装置 | |
JP5468604B2 (ja) | 画像復号装置、集積回路、画像復号方法及び画像復号システム | |
WO2011089865A1 (ja) | 画像符号化方法、画像復号方法、それらの装置、プログラムおよび集積回路 | |
JP2011182132A (ja) | 画像符号化装置、画像復号装置、画像符号化方法、画像復号方法、集積回路およびプログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 201080002356.9 Country of ref document: CN |
|
ENP | Entry into the national phase |
Ref document number: 2011519558 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13059804 Country of ref document: US |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10791810 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 10791810 Country of ref document: EP Kind code of ref document: A1 |