WO2011037049A1 - 画像処理装置および方法 - Google Patents
画像処理装置および方法 Download PDFInfo
- Publication number
- WO2011037049A1 WO2011037049A1 PCT/JP2010/065862 JP2010065862W WO2011037049A1 WO 2011037049 A1 WO2011037049 A1 WO 2011037049A1 JP 2010065862 W JP2010065862 W JP 2010065862W WO 2011037049 A1 WO2011037049 A1 WO 2011037049A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- line
- coefficient
- unit
- order
- data
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/63—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using sub-band based transform, e.g. wavelets
- H04N19/64—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using sub-band based transform, e.g. wavelets characterised by ordering of coefficients or of bits for transmission
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/129—Scanning of coding units, e.g. zig-zag scan of transform coefficients or flexible macroblock ordering [FMO]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/15—Data rate or code amount at the encoder output by monitoring actual compressed data size at the memory before deciding storage at the transmission buffer
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/18—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/1883—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit relating to sub-band structure, e.g. hierarchical level, directional tree, e.g. low-high [LH], high-low [HL], high-high [HH]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/63—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using sub-band based transform, e.g. wavelets
- H04N19/635—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using sub-band based transform, e.g. wavelets characterised by filter definition or implementation details
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/65—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/91—Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
Definitions
- the present invention relates to an image processing apparatus and method, and more particularly, to an image processing apparatus and method capable of improving the processing performance per unit time of image encoding and decoding.
- Typical conventional image compression methods include JPEG (Joint Photographic Experts Group) and JPEG2000 standardized by ISO (International Standards Organization).
- JPEG2000 whose international standardization was completed in January 2001, uses a method that combines this wavelet transform with highly efficient entropy coding (bit modeling and arithmetic coding for each bit plane), compared to JPEG. Significant improvement in coding efficiency.
- This JPEG2000 has been selected as a standard codec for the digital cinema standard (DCI (Digital Cinema Initiative) standard), and is beginning to be used for compression of moving images such as movies.
- DCI Digital Cinema Initiative
- manufacturers have begun to produce products that apply JPEG2000 to surveillance cameras, coverage cameras for broadcast stations, security recorders, and so on.
- JPEG2000 basically encodes and decodes in units of pictures. Therefore, when low delay is to be realized for use in real-time transmission / reception, at least one picture in encoding (encoding) and at least in decoding (decoding) A delay of one picture occurs.
- the present invention has been proposed in view of such a situation, and an object thereof is to improve the processing performance per unit time of image encoding and decoding.
- a line block including image data corresponding to the number of lines necessary to generate coefficient data for one line of at least the subband of the lowest frequency component by hierarchically performing analysis filter processing, Analysis filter means for converting into coefficient data decomposed for each band, encoding means for encoding the coefficient data generated by the analysis filter means, and the coefficient data encoded by the encoding means
- the encoded data is arranged in a predetermined order different from the order of the synthesis filter processing from the output order of the analysis filter processing by the analysis filter means, and in the order of transmission that improves the resistance to instability of the transmission path.
- An image processing apparatus including encoded data rearranging means for rearranging.
- the encoded data rearranging means can rearrange the encoded data of each division level in the order from the low frequency component to the high frequency component as the transmission order.
- the arrangement of the coefficient data generated by the analysis filter means is in a predetermined order different from the order of the synthesis filter processing and the transmission order from the output order of the analysis filter processing by the analysis filter means.
- Coefficient data rearranging means for rearranging in order of encoding that facilitates rate control of the encoded data;
- Quantization means for quantizing the coefficient data rearranged by the coefficient data rearranging means;
- Rate control means for controlling the rate of the encoded data by controlling the quantization step size of the quantization by the quantization means based on the encoding result by the encoding means; and
- the encoding means can encode the coefficient data quantized by the quantization means.
- the coefficient data rearranging means can rearrange the coefficient data of each division level in the order from the high frequency component to the low frequency component as the transmission order.
- the analysis filter means can perform the analysis filter processing using a lifting operation.
- the analysis filter means performs the lifting calculation after performing the necessary symmetric expansion of the pixel block in the initial state, and uses the previous lifting calculation result for the line block in the steady state. Thus, the lifting operation can be performed.
- the analysis filter means can perform the lifting operation in the vertical direction after performing the lifting operation in the horizontal direction on the coefficient data.
- the number of lines necessary for the analysis filter unit of the image processing apparatus to perform analysis filter processing hierarchically and generate coefficient data for at least one line of the subband of the lowest frequency component.
- a line block including image data for each minute is converted into coefficient data decomposed for each frequency band, and the encoding means of the image processing apparatus encodes the coefficient data generated by the analysis filter processing,
- the encoded data rearranging means of the processing device changes the sequence of the encoded data obtained by encoding the coefficient data in a predetermined order different from the order of the synthesis filter processing from the output order of the analysis filter processing.
- image data of a predetermined number of lines is decomposed for each frequency band by hierarchical analysis filter processing, and includes at least one line of coefficient data of at least the lowest frequency component subband.
- an image processing apparatus including a synthesis filter means for converting the image data into image data.
- the synthesizing filter means can preferentially execute the synthesizing filter process on the subband coefficient data in the lower hierarchy, while being executable.
- the encoded data is divided and extracted for each encoded data corresponding to one line of the coefficient data of each layer Extraction means, and the decoding means can decode the encoded data extracted by the extraction means.
- the coefficient data rearranging means can determine the order of the coefficient data based on the result of decoding by the decoding means, and can change the determined order of the order of synthesis filter processing.
- the synthesis filter means can perform the synthesis filter process using a lifting operation.
- the synthesizing filter means performs the lifting calculation after symmetrically extending necessary coefficient data for the initial state line block, and the previous lifting calculation result for the steady state line block. Utilizing this, the lifting operation can be performed.
- the synthesis filter means can perform the lifting operation in the vertical direction after performing the lifting operation in the horizontal direction on the coefficient data.
- the decoding means of the image processing apparatus further comprises: coefficient data of subbands of at least the lowest frequency component, in which a predetermined number of lines of image data are decomposed for each frequency band by hierarchical analysis filter processing Is encoded and generated by encoding a line block including a coefficient data group of each subband including at least one line, and the coefficient data rearranging means of the image processing device is obtained by decoding.
- the arrangement of the coefficient data is rearranged from the decoded order to the order of the synthesis filter processing, and the synthesis filter means of the image processing device performs the synthesis filter processing in a hierarchical manner, An image processing method for converting the coefficient data of a band into the image data.
- a line block including image data corresponding to the number of lines necessary to generate analysis data for at least one line of subbands of at least the lowest frequency component is subjected to hierarchical analysis filtering.
- the coefficient data generated by the analysis filter process is encoded, and the sequence of encoded data obtained by encoding the coefficient data is the output order of the analysis filter process.
- they are rearranged in a predetermined order that is different from the order in which the synthesis filter processing is performed, and in the order of transmission that improves the resistance to instability of the transmission path.
- each subband includes at least one line of coefficient data of at least the lowest-frequency component subband, in which image data of a predetermined number of lines is decomposed for each frequency band by hierarchical analysis filtering.
- the encoded data generated by encoding the line block consisting of the coefficient data group is decoded, and the sequence of the coefficient data obtained by the decoding is changed from the decoded order to the synthesis filter processing order. Rearrangement and hierarchical synthesis filter processing are performed, and the rearranged coefficient data of each subband is converted into image data.
- an image can be encoded or decoded.
- it is possible to further improve the processing capacity per unit time of processing for encoding and decoding an image.
- FIG. 1 shows a configuration of an embodiment of an image encoding apparatus as an image processing apparatus to which the present invention is applied.
- an image encoding apparatus 100 is an apparatus that encodes image data.
- the image encoding device 100 includes an image line input unit 101, a line buffer unit 102, a wavelet transform unit 103, a coefficient line rearrangement unit 104, a quantization unit 105, an entropy encoding unit 106, an addition unit 107, and a code line rearrangement unit. 108.
- the image encoding device 100 further includes a rate control unit 109.
- the image line input unit 101 supplies the input image data (arrow D10) to the line buffer unit 102 for each line (arrow D11) and accumulates it in the line buffer unit 102.
- the line buffer unit 102 holds the image data supplied from the image line input unit 101 and the coefficient data supplied from the wavelet transform unit 103, and supplies the image data and coefficient data to the wavelet transform unit 103 at a predetermined timing. (Arrow D12).
- the wavelet transform unit 103 performs wavelet transform on the image data and coefficient data supplied from the line buffer unit 102, and generates coefficient data of a low frequency component and a high frequency component of the next layer. Details of the wavelet transform will be described later.
- the wavelet transform unit 103 supplies the low-frequency component of the generated coefficient data in the vertical direction and the horizontal direction to the line buffer unit 102 (arrow D13), and supplies the other components to the coefficient line rearrangement unit 104. (Arrow D14). When the generated coefficient data is the highest layer, the wavelet transform unit 103 also supplies the coefficient line rearrangement unit 104 with a low-frequency component in the vertical direction and the horizontal direction.
- Coefficient data (coefficient line) is supplied from the wavelet transform unit 103 to the coefficient line rearrangement unit 104 (arrow D14).
- the coefficient line rearrangement unit 104 rearranges the order of the coefficient data (coefficient lines) into an order for encoding processing that is a predetermined order different from the order of wavelet inverse transform processing and the order at the time of transmission.
- the coefficient line rearranging unit 104 includes a coefficient line rearranging buffer 111 and a coefficient line reading unit 112.
- the coefficient line rearrangement buffer 111 holds the coefficient lines supplied from the wavelet transform unit 103.
- the coefficient line reading unit 112 performs rearrangement by reading the coefficient lines held in the coefficient line rearrangement buffer 111 in the order for encoding processing (arrow D15). Details of the rearrangement will be described later.
- the coefficient line rearranging unit 104 (coefficient line reading unit 112) supplies the coefficient data rearranged in order to the quantization unit 105 (arrow D16).
- the quantization unit 105 quantizes the coefficient data supplied from the coefficient line rearrangement unit 104. Any quantization method may be used. For example, general means, that is, coefficient data W as shown in the following equation (1) is divided by the quantization step size Q. A technique may be used.
- the quantization step size Q is designated by the rate control unit 109.
- the quantization unit 105 supplies the quantized coefficient data to the entropy encoding unit 106 (arrow D17).
- the entropy encoding unit 106 encodes the coefficient data supplied from the quantization unit 105 by a predetermined entropy encoding method such as Huffman encoding or arithmetic encoding.
- a predetermined entropy encoding method such as Huffman encoding or arithmetic encoding.
- the entropy encoding unit 106 supplies one code line, which is encoded data generated from the one coefficient line, to the adding unit 107 (arrow D18).
- the entropy encoding unit 106 further supplies the code amount of the one code line to the adding unit 107 (dotted arrow D24).
- the addition unit 107 adds the code amount of the one code line supplied from the entropy encoding unit 106 to the one code line supplied from the entropy encoding unit 106 as header information. Details of adding the header information will be described later. When the header information is added, the adding unit 107 supplies the encoded data (code line) to the code line rearranging unit 108 (arrow D19).
- the code line rearrangement unit 108 rearranges the order of the code lines into a transmission order that is a predetermined order different from the order of the wavelet inverse transform processing.
- the code line rearrangement unit 108 includes a code line rearrangement buffer 121 and a code line reading unit 122.
- the code line rearrangement buffer 121 holds a code line (including header information) supplied from the adding unit 107.
- the code line reading unit 122 performs rearrangement by reading the code lines held in the code line rearrangement buffer 121 in the order for transmission (arrow D20). Details of the rearrangement will be described later.
- the code line rearrangement unit 108 (code line reading unit 122) outputs the encoded data whose order has been rearranged to the outside of the image encoding device 100 (arrow D21).
- the encoded data output to the outside of the image encoding device 100 is supplied to an image decoding device, which will be described later, via a network or the like, for example.
- the encoded data is rearranged by the code line rearrangement unit 108 in the order suitable for transmission. Thereby, the tolerance with respect to the instability of a transmittable rate can be improved, for example.
- the entropy encoding unit 106 also supplies the code amount of each code line to the rate control unit 109 (dotted arrow D22).
- the rate control unit 109 estimates the encoding difficulty level of the image based on the code amount for each code line supplied from the entropy encoding unit 106, and is used by the quantization unit 105 according to the encoding difficulty level.
- the quantization step size Q is designated (dotted line arrow D23). That is, the rate control unit 109 controls the rate of encoded data by specifying the quantization step size Q.
- the coefficient lines are rearranged by the coefficient line rearrangement unit 104 in such an order that the rate control unit 109 can estimate the encoding difficulty earlier. That is, the coefficient lines are rearranged in the order that the entropy encoding unit 106 encodes the coefficient lines that have more features of the image.
- the code amount that more strongly reflects the feature of the image is supplied to the rate control unit 109 earlier, so the rate control unit 109 sets the quantization step size Q earlier.
- the value can be set according to the degree of difficulty in encoding the image (appropriate rate control is performed earlier).
- wavelet transform image data is hierarchically constructed by recursively repeating analysis filtering that divides image data into high spatial frequency components (high frequency components) and low frequency components (low frequency components). This is a process of converting into coefficient data for each frequency component.
- the division level is lower in the higher-frequency component hierarchy and higher in the lower-frequency component hierarchy.
- analysis filtering is performed both in the horizontal direction and in the vertical direction.
- horizontal analysis filtering is performed, and then vertical analysis filtering is performed. Therefore, coefficient data (image data) of one layer is divided into four subbands (LL, LH, HL, and HH) by analysis filtering for one layer.
- the analysis filtering of the next layer is performed on the low frequency component (LL) in both the horizontal direction and the vertical direction among the generated four subbands.
- FIG. 2 is a diagram for explaining the configuration of coefficient data generated by repeating analysis filtering four times.
- the image data is converted into four subbands (1LL, 1LH, 1HL, and 1HH) at division level 1.
- the division level 2 analysis filtering is performed on the subband 1LL of the low-frequency component in both the horizontal direction and the vertical direction at the division level 1, and four subbands (2LL, 2LH,. 2HL, and 2HH).
- the division level 3 analysis filtering is performed on the subband 2LL of the low frequency component in both the horizontal direction and the vertical direction at the division level 2, and the four subbands (3LL, 3LH, 3HL and 3HH).
- the division level 4 analysis filtering is performed on the subband 3LL of the low frequency component in both the horizontal direction and the vertical direction at the division level 3, and the four subbands (4LL, 4LH,. 4HL, and 4HH).
- FIG. 2 shows the configuration of coefficient data divided into 13 subbands in this way.
- coefficient data of four subbands on one layer is generated line by line from the image data or coefficient data of two lines to be processed. Therefore, as indicated by the hatched portion in FIG. 2, subline 3LL requires 2 lines and subband 2LL requires 4 lines in order to generate the division level 4 subband coefficient data line by line. Yes, subband 1LL requires 8 lines. That is, 16 lines of image data are required.
- the image data of the number of lines necessary to generate the coefficient data for one line of the subband of the lowest frequency component is called a line block (or precinct).
- the line block also indicates a set of coefficient data of each subband obtained by wavelet transforming the image data of the one line block.
- 16 lines of image data form one line block.
- the line block is generated from the 16 lines of image data, the coefficient data of 8 lines of each subband of division level 1, the coefficient data of 4 lines of each subband of division level 2, and each of the data of division level 3
- coefficient data of two lines of subbands and coefficient data of one line of each subband of division level 4 may be indicated.
- the wavelet transform unit 103 performs the wavelet transform for each line block.
- a line indicates one line formed in a picture or field corresponding to image data before wavelet transform, in each division level, or in each subband.
- This coefficient data (image data) for one line is also called a coefficient line.
- a certain line of a certain subband is referred to as a “coefficient line of a certain subband”, and all subbands (LH, HL) of a certain hierarchy (division level) generated from the same two coefficient lines of the hierarchy one level lower.
- And HH are called “coefficient lines of a certain division level (or hierarchy)”.
- “division level 4 (highest layer) coefficient line” is generated from subbands 4LL, 4LH, 4HL, and 4HH that correspond to each other (the same coefficient line at the lower division level). One line).
- the “division level 3 coefficient line” indicates one line corresponding to each other of the subbands 3LH, 3HL, and 3HH. Further, the “coefficient line of subband 2HH” indicates one line with subband 2HH.
- encoded data for one line obtained by encoding one coefficient line is also referred to as a code line.
- division level 4 wavelet transform has been described.
- wavelet transformation is basically performed up to division level 4, but in practice, the number of wavelet transformation layers (division level) is arbitrary.
- the wavelet transform unit 103 normally performs the above-described processing using a filter bank composed of a low-pass filter and a high-pass filter. Since a digital filter usually has an impulse response having a plurality of taps, that is, a filter coefficient, it is necessary to buffer in advance input image data or coefficient data that can be filtered. Similarly, when wavelet transform is performed in multiple stages, it is necessary to buffer the wavelet transform coefficients generated in the previous stage as many times as can be filtered.
- a method using a 5 ⁇ 3 filter will be described as a specific example of this wavelet transform.
- the method using this 5 ⁇ 3 filter is also adopted in the JPEG (Joint Photographic Experts2000Group) 2000 standard already described in the prior art, and is an excellent method in that wavelet transform can be performed with a small number of filter taps. is there.
- the impulse response (Z conversion expression) of the 5 ⁇ 3 filter is obtained from the low-pass filter H 0 (z) and the high-pass filter H 1 (z) as shown in the following equations (2) and (3). Composed. From the equations (2) and (3), it can be seen that the low-pass filter H 0 (z) is 5 taps and the high-pass filter H 1 (z) is 3 taps.
- the coefficients of the low frequency component and the high frequency component can be directly calculated.
- the calculation of filter processing can be reduced by using a lifting technique.
- FIG. 3 is a lifting representation of a 5 ⁇ 3 filter.
- the uppermost row is an input signal row.
- Data processing flows downward from the top of the screen, and the coefficient of the high frequency component (high frequency coefficient) and the coefficient of the low frequency component (low frequency coefficient) are output by the following equations (4) and (5).
- the equations (4) and (5) are output by the following equations (4) and (5).
- FIG. 4 is a diagram when filtering is performed on a vertical line using a 5 ⁇ 3 analysis filter.
- the horizontal direction shows the calculation process and the low-frequency and high-frequency coefficients generated thereby.
- FIG. 3 it can be seen that only the horizontal has changed to vertical, and the calculation method is exactly the same.
- the top line is symmetrically expanded from Line-1 like a dotted line, and one line is filled.
- a lifting operation is performed using this, a total of three lines of Line-0 and Line-1, and a coefficient a is generated by the operation of Step-1. This is a high frequency coefficient (H0).
- the next high frequency coefficient a is calculated using these three lines.
- This is a high frequency coefficient (H1).
- the coefficient b is generated by calculating according to Equation 2.
- the This is a low frequency coefficient (L1). That is, as indicated by a frame 153, the low-frequency coefficient (L1) and the high-frequency coefficient (H1) using the three lines Line-1, Line-2, and Line-3 and the high-frequency coefficient (H0). Is generated.
- FIG. 4 shows an example in which the filtering is performed on the vertical line, but it is obvious that the same can be considered in the case of the horizontal filtering.
- each subband (2LL, 2LH, 2HL, 2HH) at the division level 2 is generated line by line.
- a lifting operation is performed every time two 1LL coefficient lines are generated, and one coefficient line of each subband at the division level 2 is generated.
- the baseband image data is from 11 lines, and as shown on the right of FIG. Five coefficient lines are generated for each of the 1LH, 1HL, and 1HH subbands.
- the analysis filtering requires three lines of image data or coefficient data as input in the initial state at the upper end of the image. However, in the steady state of the other portions, analysis filtering is performed every time two lines of image data or coefficient data are input.
- the lifting operation proceeds in the above procedure.
- FIG. 9 is a diagram illustrating data output from the wavelet transform unit 103 in time series in the initial state.
- the data output from the wavelet transform unit 103 are arranged in time series in the direction from the top to the bottom in the figure.
- the wavelet transform unit 103 performs analysis filtering according to the procedure described above, in the initial state, the first coefficient line (line 1) from the top of the division level 1 (subbands 1HH, 1HL, and 1LH) is the wavelet transform.
- the data is output from the unit 103 and supplied to the coefficient line rearrangement unit 104.
- the line 1 of the subband 1LL is supplied to the line buffer unit 102 and held.
- line 2 (second coefficient line from the top) and line 3 (third coefficient line from the top) at division level 1 are sequentially supplied to the coefficient line rearrangement unit 104 as soon as they are generated.
- Lines 2 and 3 of the subband 1LL are supplied to and held in the line buffer unit 102.
- the wavelet transform unit 103 performs division level 1 analysis filtering on the coefficient lines. .
- line 1 at division level 2 (subbands 2HH, 2HL, and 2LH) is output from wavelet transform section 103 and supplied to coefficient line rearrangement section 104.
- the line 1 of the subband 2LL is supplied to the line buffer unit 102 and held.
- line 4 (fourth coefficient line from the top) and line 5 (fifth coefficient line from the top) at the division level 1 are generated in this order, and are sequentially supplied to the coefficient line rearrangement unit 104.
- the lines 4 and 5 of the subband 1LL are supplied to the line buffer unit 102 and held.
- the line buffer unit 102 holds two coefficient lines of the subband 1LL, division level 1 analysis filtering is performed on them, and the division level 2 line 2 is output from the wavelet transform unit 103 to obtain coefficient lines.
- the data is supplied to the sorting unit 104.
- the line 2 of the subband 2LL is supplied to the line buffer unit 102 and held.
- line 6 (sixth coefficient line from the top) and line 7 (seventh coefficient line from the top) at division level 1 are generated in this order, and are sequentially supplied to the coefficient line rearrangement unit 104.
- the lines 6 and 7 of the subband 1LL are supplied to the line buffer unit 102 and held.
- the division level 1 line 8 (the eighth coefficient line from the top) and the line 9 (the ninth coefficient line from the top) are generated in this order, and are sequentially supplied to the coefficient line rearrangement unit 104.
- the lines 8 and 9 of the subband 1LL are supplied to the line buffer unit 102 and held.
- the division level 1 line 10 (the tenth coefficient line from the top) and the line 11 (the eleventh coefficient line from the top) are generated in this order, and are sequentially supplied to the coefficient line rearrangement unit 104.
- the lines 10 and 11 of the subband 1LL are supplied to and held in the line buffer unit 102.
- the division level 1 line 12 (the 12th coefficient line from the top) and the line 13 (the 13th coefficient line from the top) are generated in this order, and are sequentially supplied to the coefficient line rearrangement unit 104.
- the line 12 and the line 13 of the subband 1LL are supplied to and held in the line buffer unit 102.
- the division level 1 line 14 (the 14th coefficient line from the top) and the line 15 (the 15th coefficient line from the top) are generated in this order, and are sequentially supplied to the coefficient line rearrangement unit 104.
- the line 14 and the line 15 of the subband 1LL are supplied to and held in the line buffer unit 102.
- the above is the coefficient line group for one line block output from the wavelet transform unit 103 in the initial state.
- the process shifts to a steady state where two lines are processed.
- FIG. 10 is a diagram showing data output from the wavelet transform unit 103 in time series in a steady state.
- the data output from the wavelet transform unit 103 are arranged in time series in the direction from top to bottom in the figure.
- the wavelet transform unit 103 Since the wavelet transform unit 103 performs analysis filtering in the above-described procedure, at a certain timing in the steady state, the division level 1 line L (Lth coefficient line from the top) and line (L + 1) (from the top (L + 1) As soon as the (th) coefficient line) is generated, it is sequentially output from the wavelet transform unit 103 and supplied to the coefficient line rearrangement unit 104. The line L and the line (L + 1) of the subband 1LL are supplied to the line buffer unit 102 and held.
- the division level 1 analysis filtering is performed on them, and the division level 2 line M (the Mth coefficient line from the top) is the wavelet. It is output from the conversion unit 103 and supplied to the coefficient line rearrangement unit 104.
- the line M of the subband 2LL is supplied to the line buffer unit 102 and held.
- a division level 1 line (L + 2) (the (L + 2) th coefficient line from the top) and a line (L + 3) (the (L + 3) th coefficient line from the top) are generated in this order, and the coefficient lines are sequentially arranged. It is supplied to the replacement unit 104.
- the line (L + 2) and the line (L + 3) of the subband 1LL are supplied to and held in the line buffer unit 102.
- the division level 3 line N (the Nth coefficient line from the top) is the wavelet. It is output from the conversion unit 103 and supplied to the coefficient line rearrangement unit 104. Further, the line N of the subband 3LL is supplied to the line buffer unit 102 and held.
- a division level 1 line (L + 4) ((L + 4) th coefficient line from the top) and a line (L + 5) ((L + 5) th coefficient line from the top) are generated in this order, and the coefficient lines are sequentially arranged. It is supplied to the replacement unit 104.
- the line (L + 4) and the line (L + 5) of the subband 1LL are supplied to and held in the line buffer unit 102.
- the division level 1 line (L + 6) (the (L + 6) th coefficient line from the top) and the line (L + 7) (the (L + 7) th coefficient line from the top) are generated in this order, and the coefficient lines are sequentially arranged. It is supplied to the replacement unit 104.
- the line (L + 6) and the line (L + 7) of the subband 1LL are supplied to and held in the line buffer unit 102.
- line P at division level 4 (Pth coefficient line from the top) is wavelet. It is output from the conversion unit 103 and supplied to the coefficient line rearrangement unit 104.
- the processing order of the coefficient lines of the wavelet transform unit 103 that is, the output order of the coefficient lines from the wavelet transform unit 103 is arbitrary, and may be an order other than those described above. However, by performing analysis filtering according to the procedure described above, the wavelet transform unit 103 can efficiently generate each coefficient line, and can perform transform processing with low delay.
- the coefficient lines of each division level output from the wavelet transform unit 103 in the order described with reference to FIGS. 9 and 10 are held in the coefficient line rearrangement buffer 111 of the coefficient line rearrangement unit 104.
- the coefficient line reading unit 112 sets the coefficient lines in the order for encoding processing, which is a predetermined order different from the order of wavelet inverse transform processing and the order at the time of transmission. By reading, the coefficient lines are rearranged.
- the order of wavelet transform processing and the order at the time of transmission will be described later.
- the order for encoding processing is such an order that the rate control unit 109 can estimate the encoding difficulty level earlier. .
- the coefficient line reading unit 112 preferentially reads coefficient lines that include more image features.
- the high-frequency component reflects an edge or noise component that is difficult to detect due to human visual characteristics.
- the high-frequency component originally has few coefficients, by making the high-frequency component coefficient encoded earlier, the degree of difficulty in encoding the image can be determined earlier. .
- the coefficient line reading unit 112 reads the coefficient lines in order from the high frequency component to the low frequency component as shown in FIG.
- each coefficient line shown in FIG. 11 is processed in order from the top in the figure.
- each coefficient line output from the wavelet transform unit 103 in the order shown on the left side of FIG. 11 (wavelet transform output order) is converted to a high level as shown on the right side of FIG. They are rearranged in the order from the band component to the low band component (coding order).
- the coefficient line reading unit 112 reads each coefficient line of the division level 1, reads each coefficient line of the division level 2, reads each coefficient line of the division level 3, and finally, the coefficient of the division level 4
- the lines are read, and the read coefficient lines are supplied to the quantization unit 105 in the order of reading.
- the quantization unit 105 processes each coefficient line in the supplied order and supplies it to the entropy coding unit 106, the entropy coding unit 106 also processes each coefficient line in the order shown on the left in FIG. Will do. That is, the order read by the coefficient line reading unit 112 is the encoding order.
- the rate control unit 109 sets the quantization step size to be small if the coefficient value is small to promote code amount generation, and sets the step size to be large if the coefficient value is large to generate the code amount. Control such as suppression. When such control is performed based on the code amount of the high frequency component, the rate control unit 109 can estimate the degree of difficulty in encoding the image more accurately.
- the rate control unit 109 sets the number of quantization steps to an appropriate value earlier. Can do. That is, rate control becomes easy.
- the image In the case of the image encoding device 100, the image must be encoded with low delay, and the rate control unit 109 performs parallel to the entropy encoding process based on very limited information (code amount for each coefficient line). Thus, rate control must be performed in a very limited time.
- this rate control cannot be performed properly, unnecessary image quality may be deteriorated, and the image encoding device 100 may not be realized as a realistic device. Therefore, it is very important to perform appropriate rate control in a short time based on limited information. And the difficulty of this rate control becomes high, so that the throughput of the image coding apparatus 100 improves.
- the coefficient line rearranging unit 104 rearranges the coefficient lines, so that the number of quantization steps can be set to an appropriate value earlier and the rate control becomes easy. 109 can cope with further improvement in throughput. That is, the image encoding device 100 can further improve the throughput of image encoding.
- the coding order of the coefficient lines rearranged by the coefficient line rearrangement unit 104 may be other than the order described above. However, for the reasons described above, it is generally desirable that the order from the high frequency component to the low frequency component is the encoding order.
- the rearrangement of the coefficient lines may be performed before entropy encoding, or may be performed after the quantization process. That is, the coefficient line rearrangement unit 104 may be provided between the quantization unit 105 and the entropy encoding unit 106. In this case, the coefficient lines output from the wavelet transform unit 103 are quantized by the quantization unit 105 and then supplied to the coefficient line rearrangement buffer 111 to be held.
- the adding unit 107 adds the code amount of the code line as header information to each code line.
- FIG. 12 shows an example of such a situation.
- the adding unit 107 adds the code amount as header information (Code_info) to the code line (code word) of each division level. For example, when the code amount of the code line (line L) at the division level 1 is 100 bytes, information indicating “100 bytes” at the head of the code line (line L) is header information (Code_info (L)). Added as.
- each unit of the image encoding device 100 handles coefficient data for each coefficient line. That is, each unit can grasp the boundary of the coefficient line. However, since each code line is continuously supplied to the image decoding apparatus that decodes the encoded data generated by the image encoding apparatus 100, the boundary between the code lines cannot be grasped.
- the adding unit 107 adds the code amount of each code line to the encoded data, so that the image decoding apparatus divides the encoded data (stream) for each code line based on the code amount and processes it. Will be able to.
- the order of wavelet inverse transform processing is an order suitable for transmission of encoded data (code stream), for example, to improve tolerance to instability of the transmittable rate.
- the order is such that
- the transmission rate is unstable, such as the Internet or wireless communication, and data loss or transmission may occur.
- encoded data is transmitted via a transmission line with a high occurrence frequency of errors such as errors, it is necessary to consider measures for such problems in order to construct a practical system.
- the code line reading unit 122 reads the code lines in order from the low frequency component to the high frequency component as shown in FIG.
- each code line shown in FIG. 13 is processed in order from the top in the figure.
- the code line reading unit 122 reads the code line at the division level 4, reads each code line at the division level 3, reads each code line at the division level 2, and finally reads each code at the division level 1.
- the lines are read out, and the read code lines are output to the outside of the image coding apparatus 100 in the order of reading. That is, each code line is transmitted in this transmission order (in order from the low frequency component to the high frequency component).
- the code line of the lower frequency component has a greater influence on the image quality. That is, since the code line of the low frequency component that is important for the image quality is more important, it is transmitted earlier in time.
- retransmission processing of the lost data is performed.
- the process of encoding and transmitting an image and decoding the transmitted encoded data is performed with a lower delay (substantially in real time)
- the time allowed for the retransmission process is finite and very short. The shorter the processing time is, the shorter the time is.
- the code lines are not rearranged and are transmitted in the coding order, that is, in the order from the high frequency component to the low frequency component, the time allowed for the retransmission processing of the low frequency component code line is very large. Since the success rate of the retransmission process is reduced, the risk that the image quality of the decoded image is reduced increases as a result. In particular, when an unstable network is used as a transmission path, it is presumed that such an error occurs to some extent, and it can be said that the image quality of the decoded image is degraded accordingly.
- the code line buffer amount during transmission increases and overflow occurs.
- Such control not only can avoid buffer overflow, but also can maintain low-delay transmission.
- the code line of the low-frequency component must be discarded, and the image quality of the decoded image may be greatly degraded.
- the transmission rate control is performed, the effect on the image quality of the decoded image is affected. Can be further reduced.
- the code line rearrangement unit 108 rearranges the order of the code lines in the order for transmission (for example, the order from the low frequency component toward the high frequency component), so that the image coding apparatus 100 transmits
- the tolerance to road instability can be increased. If this tolerance is weak and the image quality of the decoded image greatly deteriorates due to an error or the like during transmission, it cannot be said to be a realistic system.
- the image encoding device 100 can cope with further improvement in throughput. That is, the image encoding device 100 can further improve the throughput of image encoding.
- the code line rearrangement unit 108 rearranges the code lines in the order in which the wavelet inverse transform processing is performed in order to reduce delay. .
- the coefficient line rearrangement unit 104 sets the coefficient lines in the transmission order or wavelet inverse. It is desirable to rearrange in the order of conversion processing.
- the image encoding apparatus 100 can realize the improvement of the throughput realistically by appropriately rearranging the coefficient lines and the code lines.
- step S101 the image line input unit 101 receives the image data input for each line (while holding it in the line buffer unit 102), and the wavelet transform unit 103 executes the line buffer. Using the coefficient lines held in the unit 102, wavelet transformation is performed for one line block.
- step S102 the wavelet transform unit 103 determines whether or not processing for one line block has been performed. If it is determined that the processing for one line block has not been performed, the process returns to step S101 to continue the wavelet transform process.
- step S103 If it is determined that the wavelet transform process has been performed for one line block, the process proceeds to step S103.
- step S103 the coefficient line rearranging unit 104 rearranges the wavelet transformed coefficient data in the order of encoding processing.
- step S ⁇ b> 104 the quantization unit 105 quantizes the coefficient data with a quantization step size specified by the rate control unit 109.
- step S105 the entropy encoding unit 106 performs entropy encoding on the coefficient data.
- the adding unit 107 adds the code amount to each code line as header information.
- the code line rearrangement unit 108 rearranges the encoded data (code lines) in the order of transmission.
- step S108 the code line rearrangement unit 108 outputs the encoded data rearranged in the transmission order.
- step S109 the rate control unit 109 performs rate control based on the entropy encoding information in the entropy encoding unit 106.
- step S110 the wavelet transform unit 103 determines whether or not processing has been performed up to the last line block (for example, the bottom line block) of the processing target picture. If it is determined that processing is not performed, the process returns to step S101, and the subsequent processing is repeated for the next line block. If it is determined in step S110 that the process has been completed up to the last line block, the encoding process for the processing target picture is ended.
- the last line block for example, the bottom line block
- the image encoding device 100 can improve the throughput of image encoding.
- header information including the code amount is added to the code line of each division level so that the image decoding apparatus can identify the code line break of each division level in the code stream.
- a dedicated marker may be added as shown in FIG. By detecting this marker, the image decoding apparatus can specify the boundary of the code line at each division level.
- the image decoding apparatus can distinguish the code line of each division level based on the marker, but cannot determine the code amount of the code line. That is, the arrangement order of the code lines at each division level cannot be specified immediately from the marker. Therefore, the image decoding apparatus needs to grasp the arrangement order by some other means.
- FIG. 16 is a block diagram illustrating a configuration example of an embodiment of an image decoding device as an image processing device to which the present invention has been applied.
- the image decoding apparatus 200 decodes the encoded data output from the image encoding apparatus 100 and generates a decoded image.
- the image decoding apparatus 200 includes a codeword decoding unit 201, a line extraction unit 202, an entropy decoding unit 203, an inverse quantization unit 204, a coefficient line rearrangement unit 205, a wavelet inverse transformation unit 206, and a buffer unit 207.
- the codeword decoding unit 201 decodes the input encoded data (codeword) (arrow D51) and extracts related information relating to the data and the encoding process. Any information may be included in this related information. For example, image resolution (horizontal / vertical size), quantization step size, number of wavelet transform decomposition, and coefficient line (code line) arrangement order Information about
- the information regarding the arrangement order of the coefficient lines may be any information indicating the arrangement order of the code lines of each division level or information necessary for obtaining the arrangement order. Good. For example, header information including the code amount of the code line of each division level as shown in FIG. 12, a marker detection result as shown in FIG.
- the codeword decoding unit 201 supplies the input encoded data (code stream) to the line extraction unit 202 (arrow D52). Further, the codeword decoding unit 201 supplies information necessary for distinguishing the code lines of the respective division levels in the code stream to the line extraction unit 202 (dotted arrow D61). For example, the codeword decoding unit 201 supplies the code amount of the code line at each division level, the marker detection result, and the like to the line extraction unit 202.
- the codeword decoding unit 201 supplies information indicating the quantization step size to the inverse quantization unit 204 (dotted line arrow D62).
- the codeword decoding unit 201 supplies information necessary for rearranging the coefficient lines at each division level to the coefficient line reading unit 212 (dotted arrow D63).
- the coefficient line reading unit 212 grasps in advance the wavelet inverse transformation processing order by the wavelet inverse transformation unit 206, which is the arrangement order after the rearrangement.
- the coefficient line reading unit 212 needs to grasp the arrangement order of the code lines at the time of transmission, which is the arrangement order before the rearrangement, in order to rearrange the coefficient lines. Therefore, the codeword decoding unit 201 provides the coefficient line reading unit 212 with information indicating the arrangement order of the code lines at the time of transmission or information necessary for obtaining the arrangement order.
- the codeword decoding unit 201 may specify the order of arrangement of the code lines at the time of transmission by decoding the codeword and provide the coefficient line reading unit 212 with information indicating the arrangement order.
- the codeword decoding unit 201 may sequentially provide information indicating the code amount of the code line of each division level extracted from the code stream to the coefficient line reading unit 212.
- the coefficient line reading unit 212 grasps the arrangement order of the coefficient lines based on the order of the code amounts supplied from the codeword decoding unit 201.
- the coefficient line rearrangement buffer 211 stores the coefficient lines at each division level in a distinguishable state. Therefore, the coefficient line reading unit 212 obtains the data amount of the coefficient line at each division level held in the coefficient line rearrangement buffer 211, and grasps the arrangement order of the coefficient lines from the arrangement order of the data amount. Also good. In this case, provision of information from the codeword decoding unit 201 can be omitted.
- the codeword decoding unit 201 further supplies the wavelet inverse transform unit 206 with information necessary for the wavelet inverse transform process, such as the resolution of the image and the number of wavelet transform decompositions (dotted arrow D64).
- the line extraction unit 202 generates code for each division level from the code stream supplied from the codeword decoding unit 201 based on information necessary for distinguishing the code lines of each division level supplied from the codeword decoding unit 201. Extract lines. That is, the line extraction unit 202 divides the encoded data (code stream) for each code line at the division level. The line extraction unit 202 supplies the extracted code lines at each division level to the entropy decoding unit 203 (arrow D53).
- the entropy decoding unit 203 entropy-decodes the code lines of each division level by a method corresponding to the entropy encoding of the entropy encoding unit 106 (FIG. 1), and generates coefficient data (quantized coefficients). That is, the entropy decoding unit 203 decodes the code lines of the respective division levels divided by the line extraction unit 202 in the order of supply, and converts them into coefficient lines of the division level. The entropy decoding unit 203 supplies the coefficient line (quantization coefficient) at the division level to the inverse quantization unit 204 (arrow D54).
- the inverse quantization unit 204 converts the coefficient lines (quantization coefficients) supplied from the entropy decoding unit 203 according to the quantization step size determined based on the information supplied from the codeword decoding unit 201. Inverse quantization.
- the inverse quantization unit 204 supplies the inversely quantized coefficient lines (wavelet transform coefficients) at each division level to the coefficient line rearrangement unit 205 (arrow D55).
- the coefficient line rearrangement unit 205 rearranges the order of the coefficient data (coefficient lines) (the order at the time of transmission) to the order of the wavelet inverse transform processing.
- the coefficient line rearranging unit 205 includes a coefficient line rearranging buffer 211 and a coefficient line reading unit 212.
- the coefficient line rearrangement buffer 211 holds coefficient lines for each division level supplied from the inverse quantization unit 204.
- the coefficient line reading unit 212 performs the rearrangement by reading the coefficient lines of the respective division levels held in the coefficient line rearrangement buffer 211 in the order for the wavelet inverse transform processing (arrow D56). Details of the rearrangement will be described later.
- the coefficient line rearranging unit 205 (coefficient line reading unit 212) supplies the coefficient data rearranged in order to the wavelet inverse transform unit 206 (arrow D57).
- the wavelet inverse transform unit 206 performs the inverse process of the wavelet transform performed in the wavelet transform unit 103 (FIG. 1) based on the information supplied from the codeword decrypting unit 201 to generate a decoded image. Details of the inverse wavelet transform will be described later.
- the wavelet inverse transformation unit 206 performs wavelet inverse transformation by repeating synthesis filtering for synthesizing the low frequency component and the high frequency component of the coefficient data. At this time, the wavelet inverse transformation unit 206 supplies the coefficient data of the next lower layer generated by performing the synthesis filtering to the buffer unit 207 (arrow D58), and uses it for the next synthesis filtering. That is, the wavelet inverse transform unit 206 uses the coefficient data (arrow D59) read from the buffer unit 207 as necessary in addition to the coefficient data (arrow D57) supplied from the coefficient line reading unit 212. Perform synthetic filtering.
- the wavelet inverse transform unit 206 When the synthesis filtering is repeatedly performed as described above to restore the decoded image, the wavelet inverse transform unit 206 outputs the image data to the outside of the image decoding device 200 (arrow D60).
- the code lines of the respective division levels are transmitted in a predetermined transmission order (for example, the order from the low frequency component to the high frequency component).
- the coefficient line rearrangement unit 205 rearranges the coefficient lines of each division level from this transmission order to the wavelet inverse transform processing order.
- the coefficient lines at each division level are held in the coefficient line rearrangement buffer 211 of the coefficient line rearrangement unit 205.
- the coefficient line reading unit 212 rearranges the coefficient lines by reading each coefficient line in the order of wavelet transform processing.
- two lines of low-frequency sub-band coefficient lines in the horizontal and vertical directions in the hierarchy one level lower than the coefficient line of each sub-band of a certain hierarchy are generated by one synthesis filtering.
- such synthesis filtering is repeated in a predetermined order.
- the order is arbitrary, but in order to perform with lower delay, it is desirable to preferentially execute the synthesis filtering for the lower layer among the feasible ones.
- the coefficient line rearrangement unit 205 can perform the wavelet inverse transformation processing on the coefficient lines of the respective division levels in such an order without requiring an unnecessary waiting time or the like. Rearrange in a predetermined order.
- FIG. 17 shows an example of the rearrangement.
- the coefficient lines are arranged in the processing order.
- a time series is shown from the top to the bottom in the figure. That is, each coefficient line shown in FIG. 17 is processed in order from the top in the figure.
- the coefficient lines of the respective division levels output from the inverse quantization unit 204 in the order shown in the left side of FIG. 17 are converted into the order shown in the right side of FIG. Rearranged in the order of wavelet inverse transform.
- the coefficient line rearrangement buffer 211 stores the coefficient lines of each division level in the order shown on the left side of FIG.
- the coefficient line reading unit 212 reads the coefficient lines of each division level from the coefficient line rearrangement buffer 211 in the order shown on the right side of FIG.
- the coefficient line reading unit 212 reads the coefficient line of the division level 4 line P, reads the coefficient line of the division level 3 line N, reads the coefficient line of the division level 2 line M, and divides it.
- the coefficient line of the level 1 line L is read, and the coefficient line of the division level 1 line (L + 1) is read.
- the coefficient line reading unit 212 reads the division level 2 line (M + 1) coefficient line, reads the division level 1 line (L + 2) coefficient line, and reads the division level 1 line (L + 3) coefficient line. read out.
- the coefficient line reading unit 212 reads the division level 3 line (N + 1) coefficient line, reads the division level 2 line (M + 2) coefficient line, and reads the division level 1 line (L + 4) coefficient line. Read, read coefficient line of division level 1 line (L + 5).
- the coefficient line reading unit 212 reads the coefficient line of the division level 2 line (M + 3), reads the coefficient line of the division level 1 line (L + 6), and reads the coefficient line of the division level 1 line (L + 7). read out.
- the wavelet inverse transform unit 206 performs synthesis filtering using the coefficient lines supplied in this order in the order of supply.
- the coefficient line rearrangement unit 205 rearranges the coefficient lines of the respective division levels supplied in the transmission order different from the wavelet inverse transformation processing order, in the order of the wavelet inverse transformation processing for each line block. Management of coefficient data in the inverse transform process becomes easy, and the wavelet inverse transform unit 206 can perform the wavelet inverse transform process with a lower delay.
- the image decoding apparatus 200 can decode the encoded data with lower delay even when the code lines of the respective division levels are transmitted in a transmission order different from the order of the wavelet transform processing. it can. That is, the image decoding apparatus 200 can improve the decoding throughput of encoded data.
- FIG. 17 rearrangement of one line block in a steady state is shown.
- the rearrangement method of the initial state corresponding to the example of FIG. 17 is easily based on the example of FIG. Since it can identify, the description is abbreviate
- the wavelet inverse transformation unit 206 performs wavelet inverse transformation by a method corresponding to the wavelet transformation processing by the wavelet transformation unit 103. For example, as described above, when the wavelet transform unit 103 performs analysis filtering using a 5 ⁇ 3 filter, the wavelet inverse transform unit 206 also performs synthesis filtering using a 5 ⁇ 3 filter.
- the process is basically the same as in the case of analysis filtering. That is, also in the case of synthesis filtering, the calculation of the filter processing can be reduced by using a lifting technique as shown in FIG.
- FIG. 18 is a diagram in the case where filtering is performed on vertical lines using a 5 ⁇ 3 synthesis filter.
- the horizontal direction illustrates the calculation process and the low-order coefficients generated thereby.
- horizontal processing is performed in the same manner as vertical processing.
- the division level 2 coefficient line is N / 4.
- N / 2 lines of the division level 1 subband 1LL are generated.
- FIG. 20 is a diagram showing data processed by the wavelet inverse transform unit 206 in time series in a steady state.
- the data processed by the wavelet inverse transform unit 206 are arranged in time series in the direction from top to bottom in the figure.
- the wavelet inverse transform unit 206 When wavelet transforming one line block in a steady state, the wavelet inverse transform unit 206 first converts one coefficient line (line P) of each subband (subbands 4HH, 4HL, 4LH, and 4LL) of division level 4 into one. Then, synthesis filtering is performed to generate two coefficient lines (line N and line (N + 1)) of subband 3LL at division level 3. Among these, the coefficient line of the line (N + 1) is supplied to the buffer unit 207 and held.
- the wavelet inverse transform unit 206 performs synthesis filtering on the division level 3 subband 3LL (line N) and one coefficient line (line N) of each of the other subbands (subbands 3HH, 3HL, and 3LH).
- Two coefficient lines (line M and line (M + 1)) are generated from the subband 2LL at the division level 2. Among these, the coefficient line of the line (M + 1) is supplied to the buffer unit 207 and held.
- the wavelet inverse transform unit 206 synthesizes and filters the division level 2 subband 2LL (line M) and one coefficient line (line M) of each of the other subbands (subbands 2HH, 2HL, and 2LH), Two coefficient lines (line L and line (L + 1)) are generated from the division level 1 subband 1LL. Among these, the coefficient line of the line (L + 1) is supplied to the buffer unit 207 and held.
- the wavelet inverse transformation unit 206 performs synthesis filtering on the subband 1LL (line L) at the division level 1 and one coefficient line (line L) of each of the other subbands (subbands 1HH, 1HL, and 1LH), Two lines (line K and line (K + 1)) of the baseband image data are generated. These two lines are output to the outside of the image decoding apparatus 200.
- the wavelet inverse transform unit 206 reads the division level 1 subband 1LL (line (L + 1)) from the buffer unit 207, and 1 coefficient line (subbands 1HH, 1HL, and 1LH) of each subband (subband 1HH, 1HL, and 1LH).
- Line (L + 1)) is combined and filtered to generate two lines (line (K + 2) and line (K + 3)) of the baseband image data. These two lines are output to the outside of the image decoding apparatus 200.
- the wavelet inverse transformation unit 206 reads the division level 2 subband 2LL (line (M + 1)) from the buffer unit 207, and 1 coefficient line (subbands 2HH, 2HL, and 2LH) of one coefficient line (subband 2HH, 2HL, and 2LH).
- Line (M + 1)) is combined and filtered to generate two coefficient lines (line (L + 2) and line (L + 3)) from subband 1LL at division level 1.
- the coefficient line of the line (L + 3) is supplied to the buffer unit 207 and held.
- the wavelet inverse transform unit 206 performs division level 1 subband 1LL (line (L + 2)) and one coefficient line (line (L + 2)) of each of the other subbands (subbands 1HH, 1HL, and 1LH). Synthetic filtering is performed to generate two lines (line (K + 4) and line (K + 5)) of the baseband image data. These two lines are output to the outside of the image decoding apparatus 200.
- the wavelet inverse transform unit 206 reads the division level 1 subband 1LL (line (L + 3)) from the buffer unit 207, and 1 coefficient line (subbands 1HH, 1HL, and 1LH) of one coefficient line (subband 1HH, 1HL, and 1LH).
- Line (L + 3)) is combined and filtered to generate two lines (line (K + 6) and line (K + 7)) of the baseband image data. These two lines are output to the outside of the image decoding apparatus 200.
- the wavelet inverse transformation unit 206 reads the subband 3LL (line (N + 1)) at the division level 3 from the buffer unit 207, and 1 coefficient line (subbands 3HH, 3HL, and 3LH) of each subband (subband 3HH, 3HL, and 3LH).
- Line (N + 1)) is combined and filtered to generate 2 coefficient lines (line (M + 2) and line (M + 3)) from subband 2LL at division level 2.
- the coefficient line of the line (M + 3) is supplied to the buffer unit 207 and held.
- the wavelet inverse transformation unit 206 performs division level 2 subband 2LL (line (M + 2)) and one coefficient line (line (M + 2)) of each of the other subbands (subbands 2HH, 2HL, and 2LH). Synthetic filtering is performed to generate two coefficient lines (line (L + 4) and line (L + 5)) from the subband 1LL at the division level 1. Among these, the coefficient line of the line (L + 5) is supplied to the buffer unit 207 and held.
- the wavelet inverse transform unit 206 performs division level 1 subband 1LL (line (L + 4)) and one coefficient line (line (L + 4)) of each of the other subbands (subbands 1HH, 1HL, and 1LH). Synthetic filtering is performed to generate two lines (line (K + 8) and line (K + 9)) of the baseband image data. These two lines are output to the outside of the image decoding apparatus 200.
- the wavelet inverse transform unit 206 reads the division level 1 subband 1LL (line (L + 5)) from the buffer unit 207, and 1 coefficient line (subband 1HH, 1HL, and 1LH) of one coefficient line (subband 1HH, 1HL, and 1LH).
- Line (L + 5)) is combined and filtered to generate two lines (line (K + 10) and line (K + 11)) of the baseband image data. These two lines are output to the outside of the image decoding apparatus 200.
- the wavelet inverse transformation unit 206 reads the sub-band 2LL (line (M + 3)) at the division level 2 from the buffer unit 207, and 1 coefficient line (sub-band 2HH, 2HL, and 2LH) of one coefficient line ( Line (M + 3)) is combined and filtered to generate 2 coefficient lines (line (L + 6) and line (L + 7)) from subband 1LL of division level 1. Among these, the coefficient line of the line (L + 7) is supplied to the buffer unit 207 and held.
- the wavelet inverse transformation unit 206 performs division level 1 subband 1LL (line (L + 6)) and one coefficient line (line (L + 6)) of each of the other subbands (subbands 1HH, 1HL, and 1LH). Synthetic filtering is performed to generate two lines (line (K + 12) and line (K + 13)) of the baseband image data. These two lines are output to the outside of the image decoding apparatus 200.
- the wavelet inverse transform unit 206 reads the division level 1 subband 1LL (line (L + 7)) from the buffer unit 207, and 1 coefficient line (subbands 1HH, 1HL, and 1LH) of one coefficient line (subband 1HH, 1HL, and 1LH).
- Line (L + 7)) and synthetic filtering are performed to generate two lines (line (K + 14) and line (K + 15)) of the baseband image data. These two lines are output to the outside of the image decoding apparatus 200.
- the wavelet inverse transformation unit 206 performs synthesis filtering in the above-described procedure, inversely wavelet transforms the coefficient lines for one line block, and generates 16 lines of baseband image data.
- the coefficient line rearrangement unit 205 performs rearrangement as shown in FIG. 17 in accordance with the order of such wavelet inverse transformation processing.
- each coefficient line is supplied to the wavelet inverse transform unit 206 in the order required for repeated synthesis filtering. Therefore, the wavelet inverse transform unit 206 does not need unnecessary waiting time and the like, and the wavelet has a low delay. Inverse conversion processing can be performed.
- the order after the coefficient lines are rearranged by the coefficient line rearrangement unit 205 can be easily obtained by reflecting the difference shown in FIGS. 9 and 10 in the example of FIG.
- image data (and encoded data) is subjected to wavelet transform / wavelet inverse transform processing (encoding / decoding processing) in units of line blocks by the image encoding device 100 and the image decoding device 200.
- one line block composed of 16 lines of baseband image data is wavelet transformed by the image encoding device 100, and as shown in FIG. Division level 1 coefficient lines, 4 division level 2 coefficient lines, 2 division level 3 coefficient lines, and 1 division level 4 coefficient line.
- each coefficient line is inversely wavelet transformed by the image decoding apparatus 200, and converted to 16 lines of baseband image data (decoded image) as shown in C of FIG.
- step S201 the codeword decoding unit 201 receives input of encoded data for one line block.
- step S202 the codeword decoding unit 201 decodes the codeword of the input encoded data and extracts related information.
- the codeword decoding unit 201 provides necessary information to each processing unit based on the extracted related information.
- step S203 the line extraction unit 202 extracts a processing target line based on information (for example, a code amount) provided from the codeword decoding unit 201.
- step S204 the entropy decoding unit 203 performs entropy decoding on the processing target line supplied from the line extraction unit 202.
- step S205 the inverse quantization unit 204 inversely quantizes the processing target line.
- step S206 the coefficient line rearrangement buffer 211 holds the processing target line.
- step S207 the line extraction unit 202 determines whether one line block has been processed. If it is determined that there is an unprocessed coefficient line, the process returns to step S203, and the subsequent processing is repeated. If it is determined in step S207 that processing for one line block has been performed, the process proceeds to step S208.
- step S208 the coefficient line reading unit 212 reads the coefficient data from the coefficient line rearrangement buffer 211, thereby rearranging the coefficient data in the wavelet inverse transformation order.
- step S209 the wavelet inverse transformation unit 206 inversely transforms the coefficient data.
- step S210 the wavelet inverse transform unit 206 determines whether one line block has been processed. If it is determined that there is an unprocessed coefficient line, the process returns to step S209, and the subsequent processing is repeated. If it is determined in step S210 that one line block has been processed, the process proceeds to step S211.
- step S211 the codeword decoding unit 201 determines whether or not processing has been performed up to the last line block (for example, the bottom line block) of the processing target picture. If it is determined that processing is not performed, the process returns to step S201, and the subsequent processing is repeated for the next line block. If it is determined in step S211 that the processing has been completed up to the last line block, the decoding processing for the processing target picture is ended.
- the last line block for example, the bottom line block
- the image decoding apparatus 200 can improve the decoding throughput of the encoded data.
- the coefficient line rearrangement unit 205 has been described so that the coefficient lines arranged in the transmission order are rearranged in the wavelet inverse transformation order.
- This transmission order means the order in which encoded data is transmitted. .
- the coefficient line rearrangement unit 205 rearranges the coefficient lines arranged in the coding order in the wavelet inverse transformation order.
- the coefficient line rearrangement unit 205 rearranges the coefficient lines in the wavelet transform order in the wavelet inverse transform order.
- the coefficient line rearrangement unit 205 outputs the coefficient lines in the wavelet inverse transformation order.
- the coefficient line rearrangement unit 205 can determine the arrangement order of such input coefficient lines based on the information supplied from the codeword decoding unit 201, so that the arrangement according to the arrangement order can be easily performed. It can be performed. That is, regardless of the order of the input coefficient lines, the coefficient line rearrangement unit 205 can easily arrange them in the wavelet inverse transformation order based on the information supplied from the codeword decoding unit 201. Can be replaced.
- FIG. 23 is a diagram illustrating a configuration example of an image transmission system that encodes and transmits an input image, decodes the encoded data at a transmission destination, and outputs the obtained decoded image.
- the image transmission system 300 is a system that transmits an image with lower delay.
- the image transmission system 300 includes a transmission device 301 and a reception device 303 that are connected via a network 302.
- the transmission device 301 is a device that transmits an input image to the reception device 303 via the network 302.
- the transmission device 301 encodes image data and transmits the encoded data to the reception device 303 in order to efficiently transmit an image.
- the transmission device 301 includes an encoding unit 311, a packetizing processing unit 312, and a transmission unit 313.
- the encoding unit 311 encodes an input image and outputs encoded data.
- the image encoding device 100 described in the first embodiment is applied to the encoding unit 311. That is, the encoding unit 311 has the same configuration as that of the image encoding device 100 and performs the same processing.
- the packetization processing unit 312 packetizes the encoded data (code stream) output from the encoding unit 311.
- the transmission unit 313 transmits the packet generated by the packetization processing unit 312 to the reception unit 321 via the network 302.
- the network 302 is an unstable transmission path in which the communicable bandwidth is not constant and packet loss and transmission / reception errors may occur, such as the Internet and wireless LAN.
- the configuration of the network 302 is arbitrary as long as it is a transmission line having unstable characteristics as a whole.
- the reception device 303 is a device that receives a packet supplied from the transmission device 301 via the network 302, decodes encoded data included in the packet, generates a decoded image, and outputs the decoded image.
- the reception device 303 includes a reception unit 321, a depacketization processing unit 322, and a decoding unit 323.
- the reception unit 321 performs processing corresponding to the transmission unit 313 of the transmission device 301 and performs processing of receiving a packet supplied from the transmission unit 313 via the network.
- the depacketizing processing unit 322 depacketizes the packet received by the receiving unit 321 and extracts encoded data.
- the decoding unit 323 decodes the encoded data extracted by the depacketization processing unit 322 and outputs a decoded image.
- the image decoding apparatus 200 described in the second embodiment is applied to the decoding unit 323. That is, the decoding unit 323 has the same configuration as the image decoding device 200 and performs the same processing.
- the transmission device 301 can easily perform rate control and improve resistance to instability of the transmission path. Further, by applying the image decoding apparatus 200 as the decoding unit 323, the receiving apparatus 303 can facilitate data management at the time of wavelet inverse transformation, and the transmitting apparatus 301 can easily perform rate control, Even when control is performed to improve the tolerance to instability, the decoding can be performed with low delay.
- the reception unit 321 requests the transmission unit 313 to retransmit the lost packet.
- the encoding unit 311 outputs the code lines of the respective division levels in order from the low range to the high range.
- the transmission part 313 will transmit the low frequency component which has a big influence on an image quality previously. That is, when an error occurs and the receiving unit 321 performs a retransmission request, a longer time for the retransmission process can be secured for a low frequency component that greatly affects image quality.
- the transmission unit 313 can cope with the rapid rate change by interrupting and omitting packet transmission in the middle. At this time, it is possible to reduce the influence of such transmission control on the image quality as much as possible by transmitting the low-frequency component that greatly affects the image quality first.
- the image transmission system 300 can improve the throughput of the entire image data transmission process.
- the CPU 401 of the personal computer 400 executes various processes in accordance with a program stored in a ROM (Read Only Memory) 402 or a program loaded from a storage unit 413 to a RAM (Random Access Memory) 403.
- the RAM 403 also appropriately stores data necessary for the CPU 401 to execute various processes.
- the CPU 401, the ROM 402, and the RAM 403 are connected to each other via a bus 404.
- An input / output interface 410 is also connected to the bus 404.
- the input / output interface 410 includes an input unit 411 including a keyboard and a mouse, a display including a CRT (Cathode Ray Tube) and an LCD (Liquid Crystal Display), an output unit 412 including a speaker, a hard disk, and the like.
- a communication unit 414 including a storage unit 413 and a modem is connected. The communication unit 414 performs communication processing via a network including the Internet.
- a drive 415 is connected to the input / output interface 410 as necessary, and a removable medium 421 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory is appropriately mounted, and a computer program read from these is loaded. It is installed in the storage unit 413 as necessary.
- a program constituting the software is installed from a network or a recording medium.
- the recording medium is distributed to distribute the program to the user separately from the apparatus main body, and includes a magnetic disk (including a flexible disk) on which the program is recorded, an optical disk ( It only consists of removable media 421 consisting of CD-ROM (compact disc-read only memory), DVD (including digital versatile disc), magneto-optical disc (including MD (mini disc)), or semiconductor memory. Rather, it is composed of a ROM 402 on which a program is recorded and a hard disk included in the storage unit 413, which is distributed to the user in a state of being incorporated in the apparatus main body in advance.
- a magnetic disk including a flexible disk
- an optical disk It only consists of removable media 421 consisting of CD-ROM (compact disc-read only memory), DVD (including digital versatile disc), magneto-optical disc (including MD (mini disc)), or semiconductor memory. Rather, it is composed of a ROM 402 on which a program is recorded and a hard disk included in the storage unit 413, which is distributed to the user in a
- the program executed by the computer may be a program that is processed in time series in the order described in this specification, or in parallel or at a necessary timing such as when a call is made. It may be a program for processing.
- the step of describing the program recorded on the recording medium is not limited to the processing performed in chronological order according to the described order, but may be performed in parallel or It also includes processes that are executed individually.
- system represents the entire apparatus composed of a plurality of devices (apparatuses).
- the configuration described as one device (or processing unit) may be divided and configured as a plurality of devices (or processing units).
- the configurations described above as a plurality of devices (or processing units) may be combined into a single device (or processing unit).
- a configuration other than that described above may be added to the configuration of each device (or each processing unit).
- a part of the configuration of a certain device (or processing unit) may be included in the configuration of another device (or other processing unit). . That is, the embodiment of the present invention is not limited to the above-described embodiment, and various modifications can be made without departing from the gist of the present invention.
- 100 image encoding device 101 image line input unit, 102 line buffer unit, 103 wavelet transform unit, 104 coefficient line rearrangement unit, 105 quantization unit, 106 entropy encoding unit, 107 addition unit, 108 code line rearrangement unit , 109 rate control unit, 111 coefficient line rearrangement buffer, 112 coefficient line readout unit, 121 code line rearrangement buffer, 122 code line readout unit, 200 image decoding device, 201 codeword decoding unit, 202 line extraction unit, 203 entropy Decoding unit, 204 Inverse quantization unit, 205 Coefficient line rearrangement unit, 206 Wavelet inverse transformation unit, 207 Buffer unit, 211 Coefficient line rearrangement buffer, 212 Coefficient line reading block
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Of Band Width Or Redundancy In Fax (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
前記係数データ並び替え手段により並び替えられた前記係数データを量子化する量子化手段と、
前記符号化手段による符号化結果に基づいて、前記量子化手段による量子化の量子化ステップサイズを制御することにより前記符号化データのレートを制御するレート制御手段と
をさらに備え、
前記符号化手段は、前記量子化手段により量子化された前記係数データを符号化することができる。
1.第1の実施の形態(画像符号化装置)
2.第2の実施の形態(画像復号装置)
3.第3の実施の形態(伝送システム)
4.第4の実施の形態(パーソナルコンピュータ)
[デバイスの構成]
図1は、本発明を適用した画像処理装置としての画像符号化装置の一実施の形態の構成を表している。
次に、ウェーブレット変換について説明する。ウェーブレット変換は、画像データを空間周波数の高い成分(高域成分)と低い成分(低域成分)とに分割する分析フィルタリングを、再帰的に繰り返すことにより、画像データを、階層的に構成される周波数成分毎の係数データに変換する処理である。なお、以下において、分割レベルは、高域成分の階層ほど下位とし、低域成分の階層ほど上位とする。
ウェーブレット変換部103は、通常、低域フィルタと高域フィルタとから構成されるフィルタバンクを用いて、上述のような処理を行う。なお、デジタルフィルタは、通常、複数タップ長のインパルス応答すなわちフィルタ係数を持っているため、フィルタ処理を行えるだけの入力画像データまたは係数データを予めバッファリングしておく必要がある。また、ウェーブレット変換を多段にわたって行う場合も同様に、前段で生成したウェーブレット変換係数を、フィルタ処理が行える数だけバッファリングしておく必要がある。
H1(z)=(-1+2z-1-z-2)/2 ・・・(3)
si 1=si 0+1/4(di-1 1+di 1) ・・・(5)
以上のような分析フィルタリングは、図5乃至図8のように進められる。
次に、以上のような手順で分析フィルタリングを行うウェーブレット変換部103から出力されるデータについて説明する。図9は、初期状態において、ウェーブレット変換部103から出力されるデータを時系列順に示す図である。図9においては、ウェーブレット変換部103から出力されるデータが、図中、上から下に向かう方向に時系列順に並べられている。
図9および図10を参照して説明したような順でウェーブレット変換部103から出力された各分割レベルの係数ラインは、係数ライン並び替え部104の係数ライン並び替えバッファ111に保持される。係数ラインが1ラインブロック分蓄積されると、係数ライン読み出し部112は、各係数ラインを、ウェーブレット逆変換処理の順序とも、伝送時の順序とも異なる所定の順序である符号化処理用の順序で読み出すことにより、係数ラインの並び替えを行う。
上述したように、付加部107は、各符号ラインに対して、その符号ラインの符号量を、ヘッダ情報として付加する。図12にその様子の例を示す。
図11を参照して説明したような順で符号化された(付加部107から出力された)各分割レベルの符号ラインは、符号ライン並び替え部108の符号ライン並び替えバッファ121に保持される。符号ラインが1ラインブロック分蓄積されると、符号ライン読み出し部122は、各符号ラインを、符号化処理用の順序から、ウェーブレット逆変換処理の順序とは異なる所定の順序である伝送用の順序で読み出すことにより、符号ラインの並び替えを行う。
以上のような画像符号化装置100の各部により実行される符号化処理の流れの例を、図14のフローチャートを参照して説明する。なお、この符号化処理は、入力画像のピクチャ毎に実行される。
なお、図12においては、画像復号装置がコードストリームにおいて各分割レベルの符号ラインの区切りを識別することができるように、各分割レベルの符号ラインにその符号量を含むヘッダ情報を付加するように説明した。しかしながら、この区切りを示す方法としては、これ以外にも、例えば、図15に示されるように専用のマーカを付加するようにしてもよい。画像復号装置は、このマーカを検出することにより、各分割レベルの符号ラインの境界を特定することができる。ただし、この場合、画像復号装置は、このマーカに基づいて、各分割レベルの符号ラインを区別することはできるが、その符号ラインの符号量を判別することはできない。つまり、マーカからすぐに各分割レベルの符号ラインの並び順を特定することはできない。したがって、画像復号装置は、何らかの別の手段により、その並び順を把握する必要がある。
[デバイスの構成]
次に、第1の実施の形態において説明した画像符号化装置100に対応する画像復号装置について説明する。図16は、本発明を適用した画像処理装置としての画像復号装置の一実施の形態の構成例を示すブロック図である。
図13を参照して上述したように、各分割レベルの符号ラインは、所定の伝送順(例えば、低域成分から高域成分に向かう順)で伝送される。
ウェーブレット逆変換部206は、ウェーブレット変換部103によるウェーブレット変換処理に対応する方法でウェーブレット逆変換を行う。例えば、上述したように、ウェーブレット変換部103が5×3フィルタを用いて分析フィルタリングを行った場合、ウェーブレット逆変換部206も、5×3フィルタを用いて合成フィルタリングを行う。
以上の合成フィルタリング(リフティング)は、各階層について再帰的に行われる。したがって、階層が1つ下位になる毎にライン数は2倍になる。
以上のようなウェーブレット逆変換部206による合成フィルタリングの処理手順の例について、より具体的に説明する。
以上のようにして、画像符号化装置100および画像復号装置200により画像データ(および符号化データ)は、ラインブロック単位でウェーブレット変換・ウェーブレット逆変換処理(符号化・復号処理)される。
以上のような画像復号装置200の各部により実行される復号処理の流れの例を、図22のフローチャートを参照して説明する。なお、この復号処理は、1ピクチャ分の画像に対応する符号化データ毎に実行される。
[システムの構成]
第1の実施の形態で説明した画像符号化装置100および第2の実施の形態で説明した画像復号装置200の適用例について説明する。図23は、入力画像を符号化して伝送し、その符号化データを伝送先で復号し、得られた復号画像を出力する画像伝送システムの構成例を示す図である。
[パーソナルコンピュータ]
上述した一連の処理は、ハードウェアにより実行させることもできるし、ソフトウェアにより実行させることもできる。この場合、例えば、図24に示されるようなパーソナルコンピュータとして構成されるようにしてもよい。
Claims (16)
- 階層的に分析フィルタ処理を行い、少なくとも最低域成分のサブバンドの1ライン分の係数データを生成するのに必要なライン数分の画像データを含むラインブロックを、周波数帯域毎に分解された係数データに変換する分析フィルタ手段と、
前記分析フィルタ手段により生成された前記係数データを符号化する符号化手段と、
前記符号化手段により前記係数データが符号化されて得られる符号化データの並びを、前記分析フィルタ手段による前記分析フィルタ処理の出力順から、合成フィルタ処理される順とは異なる所定の順であり、伝送路の不安定さに対する耐性を向上させる伝送順に並び替える符号化データ並び替え手段と
を備える画像処理装置。 - 前記符号化データ並び替え手段は、各分割レベルの前記符号化データを、前記伝送順として、低域成分から高域成分に向かう順に並び替える
請求項1に記載の画像処理装置。 - 前記分析フィルタ手段により生成された前記係数データの並びを、前記分析フィルタ手段による前記分析フィルタ処理の出力順から、前記合成フィルタ処理される順および前記伝送順と異なる所定の順であり、前記符号化データのレート制御を容易にする符号化順に並び替える係数データ並び替え手段と、
前記係数データ並び替え手段により並び替えられた前記係数データを量子化する量子化手段と、
前記符号化手段による符号化結果に基づいて、前記量子化手段による量子化の量子化ステップサイズを制御することにより前記符号化データのレートを制御するレート制御手段と
をさらに備え、
前記符号化手段は、前記量子化手段により量子化された前記係数データを符号化する
請求項1に記載の画像処理装置。 - 前記係数データ並び替え手段は、各分割レベルの前記係数データを、前記伝送順として、高域成分から低域成分に向かう順に並び替える
請求項3に記載の画像処理装置。 - 前記分析フィルタ手段は、前記分析フィルタ処理を、リフティング演算を用いて行う
請求項1に記載の画像処理装置。 - 前記分析フィルタ手段は、初期状態のラインブロックに対しては、必要な画素の対称拡張を行ってから前記リフティング演算を行い、定常状態のラインブロックに対しては、前回の前記リフティング演算結果を利用して、前記リフティング演算を行う
請求項5に記載の画像処理装置。 - 前記分析フィルタ手段は、前記係数データに対して、水平方向に前記リフティング演算を行ってから、垂直方向に前記リフティング演算を行う
請求項5に記載の画像処理装置。 - 画像処理装置の分析フィルタ手段が、階層的に分析フィルタ処理を行い、少なくとも最低域成分のサブバンドの1ライン分の係数データを生成するのに必要なライン数分の画像データを含むラインブロックを、周波数帯域毎に分解された係数データに変換し、
前記画像処理装置の符号化手段が、前記分析フィルタ処理により生成された前記係数データを符号化し、
前記画像処理装置の符号化データ並び替え手段が、前記係数データが符号化されて得られる符号化データの並びを、前記分析フィルタ処理の出力順から、合成フィルタ処理される順とは異なる所定の順であり、伝送路の不安定さに対する耐性を向上させる伝送順に並び替える
画像処理方法。 - 階層的な分析フィルタ処理により所定ライン数の画像データが周波数帯域毎に分解された、少なくとも最低域成分のサブバンドの係数データを1ライン以上含む、各サブバンドの係数データ群からなるラインブロックが符号化されて生成された符号化データを、復号する復号手段と、
前記復号手段により復号されて得られた前記係数データの並びを、前記復号手段により復号された順から、合成フィルタ処理される順に並び替える係数データ並び替え手段と、 階層的に前記合成フィルタ処理を行い、前記係数データ並び替え手段により並び替えられた各サブバンドの前記係数データを前記画像データに変換する合成フィルタ手段と
を備える画像処理装置。 - 前記合成フィルタ手段は、実行可能な中で、より下位の階層のサブバンドの係数データに対する前記合成フィルタ処理を優先的に実行する
請求項9に記載の画像処理装置。 - 前記符号化データを解読する解読手段と、
前記解読手段による解読結果に基づいて、前記符号化データを、各階層の前記係数データの1ライン分に対応する符号化データ毎に分割し、抽出する抽出手段と
をさらに備え、
前記復号手段は、前記抽出手段により抽出された前記符号化データを復号する
請求項9に記載の画像処理装置。 - 前記係数データ並び替え手段は、前記解読手段による解読結果に基づいて、前記係数データの並び順を判別し、判別された並び順を、合成フィルタ処理される順に替える
請求項11に記載の画像処理装置。 - 前記合成フィルタ手段は、前記合成フィルタ処理を、リフティング演算を用いて行う
請求項9に記載の画像処理装置。 - 前記合成フィルタ手段は、初期状態のラインブロックに対しては、必要な係数データの対称拡張を行ってから前記リフティング演算を行い、定常状態のラインブロックに対しては、前回の前記リフティング演算結果を利用して、前記リフティング演算を行う
請求項13に記載の画像処理装置。 - 前記合成フィルタ手段は、前記係数データに対して、水平方向に前記リフティング演算を行ってから、垂直方向に前記リフティング演算を行う
請求項13に記載の画像処理装置。 - 画像処理装置の復号手段は、階層的な分析フィルタ処理により所定ライン数の画像データが周波数帯域毎に分解された、少なくとも最低域成分のサブバンドの係数データを1ライン以上含む、各サブバンドの係数データ群からなるラインブロックが符号化されて生成された符号化データを、復号し、
前記画像処理装置の係数データ並び替え手段は、復号されて得られた前記係数データの並びを、その復号された順から、合成フィルタ処理される順に並び替え、
前記画像処理装置の合成フィルタ手段は、階層的に前記合成フィルタ処理を行い、並び替えられた各サブバンドの前記係数データを前記画像データに変換する
画像処理方法。
Priority Applications (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
IN2393DEN2012 IN2012DN02393A (ja) | 2009-09-24 | 2010-09-14 | |
RU2012111256/08A RU2012111256A (ru) | 2009-09-24 | 2010-09-14 | Устройство обработки изображений и способ обработки изображений |
US13/496,748 US8634665B2 (en) | 2009-09-24 | 2010-09-14 | Image processing apparatus and image processing method |
EP10818724.6A EP2482540A4 (en) | 2009-09-24 | 2010-09-14 | IMAGE PROCESSOR AND IMAGE PROCESSING METHOD |
CN2010800412266A CN102511161A (zh) | 2009-09-24 | 2010-09-14 | 图像处理设备和图像处理方法 |
BR112012006071A BR112012006071A2 (pt) | 2009-09-24 | 2010-09-14 | aparelho de processamento de imagem, e, método de processamento de imagem para uso com um aparelho de processamento de imagem |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2009-219627 | 2009-09-24 | ||
JP2009219627A JP5263621B2 (ja) | 2009-09-24 | 2009-09-24 | 画像処理装置および方法 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2011037049A1 true WO2011037049A1 (ja) | 2011-03-31 |
Family
ID=43795799
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2010/065862 WO2011037049A1 (ja) | 2009-09-24 | 2010-09-14 | 画像処理装置および方法 |
Country Status (8)
Country | Link |
---|---|
US (1) | US8634665B2 (ja) |
EP (1) | EP2482540A4 (ja) |
JP (1) | JP5263621B2 (ja) |
CN (1) | CN102511161A (ja) |
BR (1) | BR112012006071A2 (ja) |
IN (1) | IN2012DN02393A (ja) |
RU (1) | RU2012111256A (ja) |
WO (1) | WO2011037049A1 (ja) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE112012003260T5 (de) | 2011-08-05 | 2014-05-15 | Sumitomo Electric Industries, Ltd. | Substrat, Halbleitervorrichtung und Verfahren zur Herstellung derselben |
US20150078674A1 (en) * | 2012-02-20 | 2015-03-19 | Wenfei Jiang | Component sorting based encoding for 3d mesh compression |
EP2866439B1 (en) | 2012-06-26 | 2020-11-04 | LG Electronics Inc. | Video decoding method and video encoding method |
US20140072027A1 (en) * | 2012-09-12 | 2014-03-13 | Ati Technologies Ulc | System for video compression |
CN105491392B (zh) * | 2015-11-24 | 2019-03-29 | 北京优素科技有限公司 | 多级idwt并行处理方法及系统 |
CN110397578B (zh) * | 2019-04-11 | 2020-08-14 | 岳阳县辉通物联网科技有限公司 | 智能化动作执行装置 |
CN111567042B (zh) * | 2019-04-29 | 2022-05-17 | 深圳市大疆创新科技有限公司 | 小波逆变换电路和方法 |
GB202019160D0 (en) | 2020-02-05 | 2021-01-20 | Iconx Ltd | Key scanning |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005117156A (ja) * | 2003-10-03 | 2005-04-28 | Sanyo Electric Co Ltd | データ処理装置 |
JP2005184086A (ja) * | 2003-12-16 | 2005-07-07 | Ricoh Co Ltd | 画像処理装置、画像処理方法、プログラム及び情報記録媒体 |
JP2007311924A (ja) | 2006-05-16 | 2007-11-29 | Sony Corp | 帯域分析装置及び方法、帯域合成装置及び方法、画像符号化装置及び方法、画像復号装置及び方法、並びにプログラム及び記録媒体 |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3213582B2 (ja) * | 1997-05-29 | 2001-10-02 | シャープ株式会社 | 画像符号化装置及び画像復号装置 |
JP2000184204A (ja) * | 1998-12-11 | 2000-06-30 | Canon Inc | 画像処理装置及び方法及び記憶媒体 |
US6898323B2 (en) * | 2001-02-15 | 2005-05-24 | Ricoh Company, Ltd. | Memory usage scheme for performing wavelet processing |
JP2002315000A (ja) * | 2001-04-12 | 2002-10-25 | Megafusion Corp | データ伝送装置 |
JP4665898B2 (ja) * | 2004-02-23 | 2011-04-06 | 日本電気株式会社 | 2次元信号の符号化/復号方法および装置 |
TW200746655A (en) * | 2005-11-18 | 2007-12-16 | Sony Corp | Encoding device and method, decoding device and method, and transmission system |
JP4254866B2 (ja) * | 2007-01-31 | 2009-04-15 | ソニー株式会社 | 情報処理装置および方法、プログラム、並びに記録媒体 |
CN101543077B (zh) * | 2007-01-31 | 2011-01-19 | 索尼株式会社 | 信息处理装置和方法 |
JP5162939B2 (ja) * | 2007-03-30 | 2013-03-13 | ソニー株式会社 | 情報処理装置および方法、並びにプログラム |
JP4356028B2 (ja) * | 2007-05-17 | 2009-11-04 | ソニー株式会社 | 情報処理装置および方法 |
JP4356033B2 (ja) * | 2007-05-17 | 2009-11-04 | ソニー株式会社 | 画像データ処理装置および方法 |
CN101569170B (zh) * | 2007-05-17 | 2011-11-16 | 索尼株式会社 | 编码设备和编码方法以及解码设备和解码方法 |
JP4488027B2 (ja) * | 2007-05-17 | 2010-06-23 | ソニー株式会社 | 情報処理装置および方法、並びに、情報処理システム |
KR101442273B1 (ko) * | 2007-05-17 | 2014-09-23 | 소니 주식회사 | 정보 처리 장치 및 방법 |
JP4793320B2 (ja) * | 2007-05-17 | 2011-10-12 | ソニー株式会社 | 情報処理装置および方法 |
-
2009
- 2009-09-24 JP JP2009219627A patent/JP5263621B2/ja not_active Expired - Fee Related
-
2010
- 2010-09-14 CN CN2010800412266A patent/CN102511161A/zh active Pending
- 2010-09-14 WO PCT/JP2010/065862 patent/WO2011037049A1/ja active Application Filing
- 2010-09-14 EP EP10818724.6A patent/EP2482540A4/en not_active Ceased
- 2010-09-14 BR BR112012006071A patent/BR112012006071A2/pt not_active IP Right Cessation
- 2010-09-14 US US13/496,748 patent/US8634665B2/en not_active Expired - Fee Related
- 2010-09-14 IN IN2393DEN2012 patent/IN2012DN02393A/en unknown
- 2010-09-14 RU RU2012111256/08A patent/RU2012111256A/ru not_active Application Discontinuation
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005117156A (ja) * | 2003-10-03 | 2005-04-28 | Sanyo Electric Co Ltd | データ処理装置 |
JP2005184086A (ja) * | 2003-12-16 | 2005-07-07 | Ricoh Co Ltd | 画像処理装置、画像処理方法、プログラム及び情報記録媒体 |
JP2007311924A (ja) | 2006-05-16 | 2007-11-29 | Sony Corp | 帯域分析装置及び方法、帯域合成装置及び方法、画像符号化装置及び方法、画像復号装置及び方法、並びにプログラム及び記録媒体 |
Non-Patent Citations (1)
Title |
---|
See also references of EP2482540A4 |
Also Published As
Publication number | Publication date |
---|---|
EP2482540A1 (en) | 2012-08-01 |
US20120230598A1 (en) | 2012-09-13 |
RU2012111256A (ru) | 2013-09-27 |
BR112012006071A2 (pt) | 2016-05-31 |
IN2012DN02393A (ja) | 2015-08-21 |
EP2482540A4 (en) | 2014-07-02 |
CN102511161A (zh) | 2012-06-20 |
JP5263621B2 (ja) | 2013-08-14 |
US8634665B2 (en) | 2014-01-21 |
JP2011071648A (ja) | 2011-04-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5263621B2 (ja) | 画像処理装置および方法 | |
JP4883076B2 (ja) | 情報処理装置および方法 | |
EP2337306B1 (en) | Transmitting apparatus and method, and receiving apparatus and method | |
CN101309417B (zh) | 用于处理图像数据的方法和设备 | |
TWI379593B (en) | Image processing apparatus and image processing method | |
US8391616B2 (en) | Information processor and method therefor | |
CN101854556B (zh) | 信息处理设备和方法 | |
JP2012060261A (ja) | 画像処理装置および方法 | |
JP5469127B2 (ja) | 画像データ符号化装置ならびにその動作制御方法およびそのプログラム | |
JP2009302638A (ja) | 情報処理装置および方法 | |
JP4086196B2 (ja) | 画像符号化装置、プログラム及び記憶媒体 | |
JP2011071649A (ja) | 画像処理装置および方法 | |
EP1993041A2 (en) | Apparatus and method for processing information | |
JP2007142614A (ja) | 画像処理装置、画像処理方法、プログラム及び情報記録媒体 | |
JP2004166254A (ja) | 画像符号化装置及び方法、並びにプログラム及び記録媒体 | |
JP2011147050A (ja) | 画像処理装置および方法 | |
JP2011160075A (ja) | 画像処理装置および方法 | |
WO2010064569A1 (ja) | 動画像データの圧縮方法 | |
JP2011239066A (ja) | 画像処理装置および方法 | |
JP4114144B2 (ja) | 画像符号化装置、画像符号化プログラム、及び画像復号化プログラム | |
JP2004166132A (ja) | 画像送信装置、ネットワークシステム、プログラム及び記憶媒体 | |
JP2004166156A (ja) | 画像送信装置、ネットワークシステム、プログラム及び記憶媒体 | |
JP2005086723A (ja) | 画像送信装置、画像転送方法、画像処理システム、プログラム、及び、情報記録媒体 | |
JP2006303669A (ja) | 符号化データ処理装置及び符号化データ処理プログラム | |
JP2011114592A (ja) | 情報処理装置および方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 201080041226.6 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10818724 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010818724 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2393/DELNP/2012 Country of ref document: IN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2012111256 Country of ref document: RU |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13496748 Country of ref document: US |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112012006071 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: 112012006071 Country of ref document: BR Kind code of ref document: A2 Effective date: 20120316 |