WO2015166733A1 - 符号化装置、復号装置、及びその方法、プログラム - Google Patents
符号化装置、復号装置、及びその方法、プログラム Download PDFInfo
- Publication number
- WO2015166733A1 WO2015166733A1 PCT/JP2015/057727 JP2015057727W WO2015166733A1 WO 2015166733 A1 WO2015166733 A1 WO 2015166733A1 JP 2015057727 W JP2015057727 W JP 2015057727W WO 2015166733 A1 WO2015166733 A1 WO 2015166733A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- vector
- decoding
- encoding
- prediction
- decoded
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 95
- 239000013598 vector Substances 0.000 claims abstract description 773
- 238000012937 correction Methods 0.000 claims abstract description 320
- 238000001228 spectrum Methods 0.000 claims description 90
- 238000013139 quantization Methods 0.000 claims description 52
- 238000004364 calculation method Methods 0.000 description 77
- 238000007792 addition Methods 0.000 description 40
- 230000005540 biological transmission Effects 0.000 description 19
- 238000010586 diagram Methods 0.000 description 14
- 230000004048 modification Effects 0.000 description 13
- 238000012986 modification Methods 0.000 description 13
- 230000003595 spectral effect Effects 0.000 description 12
- 241000209094 Oryza Species 0.000 description 9
- 235000007164 Oryza sativa Nutrition 0.000 description 9
- 235000009566 rice Nutrition 0.000 description 9
- 230000000694 effects Effects 0.000 description 7
- 230000006870 function Effects 0.000 description 6
- 230000005236 sound signal Effects 0.000 description 6
- 230000007423 decrease Effects 0.000 description 5
- 230000003111 delayed effect Effects 0.000 description 4
- 238000010606 normalization Methods 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000009499 grossing Methods 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
- G10L19/07—Line spectrum pair [LSP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0016—Codebook for LPC parameters
Definitions
- the present invention relates to a linear prediction coefficient and a technique for encoding and decoding a coefficient that can be converted to the linear prediction coefficient.
- the encoding apparatus encodes the linear prediction coefficient and sends a code corresponding to the linear prediction coefficient to the decoding apparatus so that the information on the linear prediction coefficient used in the encoding process can be decoded on the decoding apparatus side.
- an encoding device converts a linear prediction coefficient into an LSP (Line Spectrum Spectrum) parameter sequence, which is a frequency domain parameter equivalent to the linear prediction coefficient, and encodes the LSP parameter sequence. Send the LSP code to the decoder.
- LSP Line Spectrum Spectrum
- Non-Patent Document 1 vector encoding and decoding techniques using moving average prediction (MA prediction) are used to reduce the code amount of LSP codes.
- MA prediction moving average prediction
- FIG. 1 shows a configuration of a conventional linear prediction coefficient encoding device 80.
- the linear prediction coefficient encoding apparatus 80 receives LSP (Line Spectrum Pairs) parameters ⁇ f [1], ⁇ f [2],..., ⁇ f [p] for each frame, and the linear prediction coefficient encoding apparatus 80 , following the prediction corresponding subtraction unit 83 for each frame, vector coding unit 84 performs a process of delaying the input unit 87, and outputs to obtain LSP code C f.
- LSP Line Spectrum Pairs
- the linear prediction coefficient encoding device 80 When the input acoustic signal X f is input to the linear prediction coefficient encoding device 80, the linear prediction coefficient encoding device 80 also includes a linear prediction analysis unit 81 and an LSP calculation unit 82, and the input acoustic signal X f in units of frames. Are input continuously, and the following processing is performed for each frame.
- Linear prediction analysis unit 81 receives an input audio signal X f, the input audio signal X f by linear predictive analysis of the linear prediction coefficients a f [1], a f [2], ..., a a f [p] Find and output.
- a f [i] represents an i-th order linear prediction coefficient obtained by linear prediction analysis of the input acoustic signal X f of the f-th frame.
- the LSP calculator 82 receives the linear prediction coefficients a f [1], a f [2],..., A f [p], and receives the linear prediction coefficients a f [1], a f [2] ,.
- Lp parameters ⁇ f [1], ⁇ f [2], ..., ⁇ f [p] are obtained from [p]
- the LSP parameter vector ⁇ f ( ⁇ f [1 ], ⁇ f [2], ..., ⁇ f [p]) Output T.
- ⁇ f [i] is an i-th order LSP parameter corresponding to the input acoustic signal X f of the f-th frame.
- the prediction correspondence subtraction unit 83 includes, for example, a storage unit 83c that stores a predetermined coefficient ⁇ , a storage unit 83d that stores a prediction correspondence average vector V, a multiplication unit 88, and subtraction units 83a and 83b.
- the prediction correspondence subtraction unit 83 receives the LSP parameter vector ⁇ f and the previous frame quantization difference vector ⁇ S f ⁇ 1 .
- the multiplier 88 multiplies the predetermined coefficient ⁇ stored in the storage unit 83c by the decoded difference vector ⁇ S f-1 of the previous frame to obtain a vector ⁇ ⁇ ⁇ S f-1 .
- the subtraction unit 83a subtracts the prediction corresponding average vector V stored in the storage unit 83d from the LSP parameter vector ⁇ f in the subtraction unit 83a using the two subtraction units 83a and 83b.
- the vector ⁇ ⁇ ⁇ S f ⁇ 1 is subtracted, but this order may be reversed.
- the difference vector S f may be generated by subtracting the vector V + ⁇ ⁇ ⁇ S f ⁇ 1 obtained by adding the prediction-corresponding average vector V and the vector ⁇ ⁇ ⁇ S f ⁇ 1 from the LSP parameter vector ⁇ f .
- the difference vector S f of the current frame is obtained by subtracting at least a vector including a prediction from a past frame from a vector (LSP parameter vector ⁇ f ) by a coefficient that can be converted into a multi-order linear prediction coefficient of the current frame. It may be said that the obtained vector.
- the encoding of the difference vector S f, a method of vector quantizing the difference vector S f, a method of vector quantization of each sub-vector by dividing the difference vector S f into a plurality of sub-vectors, the difference vector S f or sub-vectors Any known encoding method such as a method of performing multi-stage vector quantization, a method of performing scalar quantization on vector elements, or a combination of these may be used.
- the vector encoding unit 84 searches for a candidate difference vector closest to the difference vector S f from among a plurality of candidate difference vectors stored in the vector codebook 86, and outputs it as a quantized difference vector ⁇ S f. Then, the difference vector code corresponding to the quantized difference vector ⁇ S f is output as the LSP code C f . Note that the quantized difference vector ⁇ S f corresponds to a decoded difference vector described later.
- each candidate difference vector and a difference vector code corresponding to each candidate difference vector are stored in advance.
- Delayed input unit 87 receives the quantized difference vector ⁇ S f, holds the quantized difference vector ⁇ S f, is delayed one frame, and outputs before as a frame quantized differential vector ⁇ S f-1. That is, when the prediction corresponding subtraction unit 83 is processing the quantization difference vector ⁇ S f of the f-th frame, the quantization difference vector ⁇ S f-1 for the f-1-th frame is output. .
- FIG. 2 shows a configuration of a conventional linear prediction coefficient decoding device 90.
- Vector decoding unit 91 receives the LSP code C f, decodes the LSP code C f, and outputs to obtain a decoded differential vector ⁇ S f corresponding to the LSP code C f.
- a decoding method corresponding to the encoding method of the vector encoding unit 84 of the encoding device is used.
- the vector decoding unit 91 searches a plurality of difference vector codes corresponding to the LSP code C f from the difference vector codes stored in the vector codebook 92, and decodes candidate difference vectors corresponding to the difference vector codes. Output as difference vector ⁇ S f .
- the decoded difference vector ⁇ S f corresponds to the above-described quantized difference vector ⁇ S f, and corresponding elements have the same value if there is no error in the process of transmission error, encoding, and decoding.
- each candidate difference vector and a difference vector code corresponding to each candidate difference vector are stored in advance.
- the vector codebook 92 includes information common to the vector codebook 86 of the linear prediction coefficient encoding device 80 described above.
- Delayed input unit 93 receives the decoded differential vector ⁇ S f, holds the decoded differential vector ⁇ S f, is delayed one frame, and outputs it as the previous frame decoded differential vector ⁇ S f-1. That is, when the prediction corresponding adder section 95 is processing for the decoded differential vector ⁇ S f of the f-th frame, and outputs the decoded difference vector ⁇ S f-1 of f-1-th frame.
- the prediction corresponding addition unit 95 includes, for example, a storage unit 95c that stores a predetermined coefficient ⁇ , a storage unit 95d that stores a prediction corresponding average vector V, a multiplication unit 94, and addition units 95a and 95b.
- the prediction corresponding addition unit 95 receives the decoded difference vector ⁇ S f and the previous frame decoded difference vector ⁇ S f ⁇ 1 of the current frame.
- the LSP parameter vector ⁇ ⁇ f ( ⁇ S f + V + ⁇ ⁇ S f-1 ) corresponding to the decoded prediction, which is a vector obtained by adding and is output.
- the multiplying unit 94 multiplies the predetermined coefficient ⁇ stored in the storage unit 95c by the previous frame decoding difference vector ⁇ S f-1 to obtain a vector ⁇ ⁇ ⁇ S f-1 .
- the adder 95a first adds the vector ⁇ ⁇ ⁇ S f-1 to the decoded difference vector ⁇ S f of the current frame, and then adds the adder 95b.
- the prediction-corresponding average vector V is added at, this order may be reversed.
- the decoded prediction-compatible LSP parameter vector ⁇ ⁇ f may be generated by adding a vector obtained by adding the vector ⁇ ⁇ ⁇ S f-1 and the prediction-corresponding average vector V to the decoded difference vector ⁇ S f .
- prediction-corresponding average vector V used here is the same as the prediction-corresponding average vector V used in the prediction-corresponding subtraction unit 83 of the linear prediction coefficient encoding device 80 described above.
- the linear prediction coefficient decoding device 90 may include a decoded prediction-compatible linear prediction coefficient calculation unit 96.
- the decoded prediction-supported linear prediction coefficient calculation unit 96 receives the decoded prediction-supported LSP parameter vector ⁇ ⁇ f and converts the decoded prediction-supported LSP parameter vector ⁇ ⁇ f into the decoded prediction-supported linear prediction coefficient ⁇ a f [1], Convert to ⁇ a f [2],..., ⁇ a f [p] and output.
- the linear prediction coefficient decoding apparatus of Non-Patent Document 1 since the LSP parameter obtained by decoding is used only for linear prediction synthesis, even if the LSP parameter cannot be correctly decoded, the decoded acoustic signal is decoded in a plurality of consecutive frames. The problem is that the sound quality deteriorates. That is, the linear prediction coefficient encoding device and the linear prediction coefficient decoding device of Non-Patent Document 1 have a configuration in which priority is given to expressing the LSP parameters with a small code amount, rather than the problem when the LSP parameters cannot be correctly decoded. It can be said.
- the linear prediction coefficient encoding device and the linear prediction coefficient decoding device not only use the LSP parameters for linear prediction analysis and synthesis, but also variable length encoding that depends on each amplitude value constituting the spectral envelope obtained from the LSP parameters. It is also used for an encoding device and a decoding device that are also used for decoding. In this case, when the LSP parameter cannot be correctly decoded in one frame, there is a problem in that variable-length decoding cannot be correctly performed in a plurality of consecutive frames including the frame and a decoded acoustic signal cannot be obtained. Arise.
- a prediction-compatible code that is a coding method and a decoding method that can accurately represent a coefficient that can be converted into a linear prediction coefficient with a small code amount, such as that used for linear prediction analysis and synthesis.
- Encoding method and decoding method for example, codes corresponding to coefficients that can be converted to linear prediction coefficients of the previous frame, such as those used for variable length encoding / decoding depending on each amplitude value constituting the spectral envelope determined from the LSP parameter
- a linear prediction coefficient code for example, LSP code
- Coding of coefficients that can be converted into linear prediction coefficients that can be used in combination with an encoding method and a decoding method that can correctly decode coefficients that can be converted into linear prediction coefficients And to provide a method and a decoding method.
- an encoding device predicts a vector based on a coefficient that can be converted into a plurality of linear prediction coefficients of a current frame from at least a past frame.
- a prediction correspondence encoding unit that obtains a first code by encoding a difference vector consisting of a difference from a prediction vector including a first code and obtains a quantized difference vector corresponding to the first code, and multi-order linear prediction of the current frame
- a non-predictive encoding unit that generates a second code by encoding a vector based on a coefficient that can be converted into a coefficient and a correction vector that is a difference between the quantization difference vector or a part of the difference element.
- an encoding apparatus includes a vector based on a coefficient that can be converted into a plurality of linear prediction coefficients of a current frame, and a prediction from at least a past frame.
- a prediction corresponding encoding unit that encodes a difference vector consisting of a difference between a prediction vector consisting of a predetermined vector and a predetermined vector to obtain a first code, and obtains a quantized difference vector corresponding to the first code; and a current frame Encoding a correction vector consisting of a difference or a part of a difference element obtained by subtracting a quantized difference vector and a predetermined vector from a vector of coefficients that can be converted into a plurality of linear prediction coefficients And a non-predictive corresponding encoding unit.
- a decoding apparatus obtains a decoded difference vector by decoding a first code, and performs prediction from a decoded difference vector and at least a past frame.
- a prediction-corresponding decoding unit that generates a first decoded vector including a decoded value of a coefficient that can be converted into a plurality of linear prediction coefficients of the current frame by adding the prediction vector included therein, and decoding and decoding the second code
- a correction vector is obtained, and the decoded correction vector and at least the corresponding order elements of the decoding difference vector are added to each other, and the second value is formed of a decoded value of a coefficient that can be converted into a plurality of linear prediction coefficients of the current frame.
- a non-predictive decoding unit that generates a decoding vector.
- a decoding apparatus obtains a decoded difference vector by decoding a first code, and obtains a decoded difference vector and at least a prediction from a past frame.
- a prediction corresponding decoding unit that generates a first decoded vector including a decoded value of a coefficient that can be converted into a plurality of linear prediction coefficients of the current frame by adding a prediction vector including a predetermined vector;
- the code is decoded to obtain a decoding correction vector, and at least the decoding difference vector and a predetermined vector are added to the decoding correction vector for each corresponding degree element to obtain a multiple-order linear prediction coefficient of the current frame.
- a non-predictive decoding unit that generates a second decoded vector including a decoded value of a transformable coefficient.
- an encoding method includes at least a vector from a previous frame for a vector of coefficients that can be converted to a plurality of linear prediction coefficients of a current frame.
- an encoding method includes a vector of coefficients that can be converted into a plurality of linear prediction coefficients of a current frame, and a prediction from at least a past frame.
- a predictive encoding step for obtaining a first code by encoding a difference vector consisting of a difference between a prediction vector consisting of a predetermined vector and a predetermined vector and obtaining a quantized difference vector corresponding to the first code; and a current frame Encoding a correction vector consisting of a difference or a part of a difference element obtained by subtracting a quantized difference vector and a predetermined vector from a vector of coefficients that can be converted into a plurality of linear prediction coefficients And a non-predictive corresponding encoding step.
- a decoding method includes decoding a first code to obtain a decoded difference vector, a decoded difference vector, and prediction from at least a past frame.
- the code is decoded to obtain a decoding correction vector, and at least the decoding difference vector and a predetermined vector are added to the decoding correction vector for each corresponding degree element to obtain a multiple-order linear prediction coefficient of the current frame.
- a non-predictive decoding step that generates a second decoding vector composed of decoded values of transformable coefficients.
- an encoding method and a decoding method which are encoding methods and decoding methods that can accurately represent a coefficient that can be converted into a linear prediction coefficient with a small amount of code, and a linear prediction coefficient code of a previous frame are provided. Even if it is not correctly input to the linear prediction coefficient decoding device, if the linear prediction coefficient code of the current frame is correctly input to the linear prediction coefficient decoding device, the coefficient that can be converted into the linear prediction coefficient of the current frame can be correctly decoded.
- An effective encoding method and decoding method can be used.
- the figure which shows the structure of the conventional linear prediction coefficient encoding apparatus The figure which shows the structure of the conventional linear prediction coefficient decoding apparatus.
- the functional block diagram of the linear prediction coefficient encoding apparatus which concerns on 1st embodiment The figure which shows the example of the processing flow of the linear prediction coefficient encoding apparatus which concerns on 1st embodiment.
- the functional block diagram of the linear prediction coefficient decoding apparatus which concerns on 1st embodiment The figure which shows the example of the processing flow of the linear prediction coefficient decoding apparatus which concerns on 1st embodiment.
- the functional block diagram of the linear prediction coefficient encoding apparatus which concerns on 2nd embodiment The figure which shows the example of the processing flow of the linear prediction coefficient encoding apparatus which concerns on 2nd, 3rd embodiment.
- the functional block diagram of the linear prediction coefficient decoding apparatus which concerns on 2nd embodiment The figure which shows the example of the processing flow of the linear prediction coefficient decoding apparatus which concerns on 2nd, 3rd embodiment.
- the functional block diagram of the linear prediction coefficient encoding apparatus which concerns on 3rd embodiment The functional block diagram of the linear prediction coefficient decoding apparatus which concerns on 3rd embodiment.
- the functional block diagram of the encoding apparatus which concerns on 4th embodiment The figure which shows the example of the processing flow of the encoding apparatus which concerns on 4th embodiment.
- FIG. 3 is a functional block diagram of the linear prediction coefficient encoding apparatus according to the first embodiment, and FIG. 4 shows an example of the processing flow.
- the linear prediction coefficient encoding apparatus 100 includes a linear prediction analysis unit 81, an LSP calculation unit 82, a prediction corresponding encoding unit 120, and a non-prediction corresponding encoding unit 110.
- the processes in the linear prediction analysis unit 81 and the LSP calculation unit 82 are the same as those described in the prior art, and correspond to s81 to s82 in FIG.
- Linear prediction coefficient coding unit 100 receives the audio signal X f, and outputs to obtain LSP code C f and correction LSP code D f.
- the code output from the linear prediction coefficient encoding apparatus 100 is input to the linear prediction coefficient decoding apparatus 200.
- the linear prediction coefficient encoding device 100 may not include the linear prediction analysis unit 81 and the LSP calculation unit 82.
- the prediction correspondence encoding unit 120 includes a prediction correspondence subtraction unit 83, a vector encoding unit 84, a vector codebook 86, and a delay input unit 87, and the processing in each unit is the same as that described in the related art.
- the processes in the prediction correspondence subtraction unit 83, the vector encoding unit 84, and the delay input unit 87 respectively correspond to s83 to s87 in FIG.
- the vector encoding unit 84 outputs the quantized difference vector ⁇ S f not only to the delay input unit 87 but also to the non-predictive corresponding encoding unit 110.
- Predictive corresponding coding unit 120 receives the LSP parameter vector theta f, the LSP parameter vector theta f, the differential vector S f made from a difference between the prediction vector containing the predicted from at least a past frame by encoding, LSP code to obtain a quantized difference vector ⁇ S f corresponding to C f and LSP code C f (s120) outputs.
- the quantized difference vector ⁇ S f corresponding to the LSP code C f is a vector composed of quantized values corresponding to each element value of the difference vector S f .
- the prediction vector including the prediction from at least the past frame is, for example, a predetermined prediction-corresponding average vector V and the quantization difference vector of the previous frame (previous frame quantization difference vector) ⁇ S f A vector V + ⁇ ⁇ ⁇ S f-1 obtained by adding a vector obtained by multiplying each element of ⁇ 1 by a predetermined ⁇ .
- the vector representing the prediction from the past frame included in the prediction vector is ⁇ ⁇ ⁇ S f-1 which is ⁇ times the previous frame quantization difference vector ⁇ S f-1 .
- predictive corresponding coding unit 120 does not require input from the outside in addition to LSP parameter vector theta f, it may be said to have gotten the LSP code C f encodes the LSP parameter vector theta f.
- the non-predictive correspondence encoding unit 110 includes a non-predictive correspondence subtraction unit 111, a correction vector encoding unit 112, and a correction vector codebook 113.
- Non-predictive corresponding coding unit 110 receives the LSP parameter vector theta f and the quantized difference vector ⁇ S f, encodes the correction vector which is the difference between the LSP parameter vector theta f and the quantized difference vector ⁇ S f to obtain a correction LSP code D f (s110) outputs.
- the correction vector is ⁇ f ⁇ ⁇ S f
- the correction vector is the previous frame quantization difference vector ⁇ ⁇ ⁇ S f ⁇ obtained by multiplying the quantization error vector ⁇ f ⁇ ⁇ ⁇ f of the prediction correspondence encoding unit 120 by the prediction correspondence average vector V and ⁇ times. It is the sum of 1 .
- the non-predictive encoding unit 110 encodes the sum of the quantization error vector ⁇ f ⁇ ⁇ ⁇ f and the prediction vector V + ⁇ ⁇ ⁇ S f ⁇ 1 to obtain a corrected LSP code D f. It can be said that there is.
- Correction vector theta f - ⁇ is the encoding of S f
- the correction vector theta f - obtained by subtracting the non-predictive corresponding mean vector Y from ⁇ S f A method for vector quantization of the object will be described.
- the non-predictive correspondence subtraction unit 111 includes, for example, a storage unit 111c that stores a non-predictive correspondence average vector Y, and addition units 111a and 111b.
- the correction vector U f ⁇ f ⁇ Y ⁇ ⁇ S f which is a vector obtained by subtracting, is generated (s111) and output.
- the two subtraction units 111 a and 111 b are used to first subtract the non-prediction-corresponding average vector Y stored in the storage unit 111 c from the LSP parameter vector ⁇ f in the subtraction unit 111 a, and then the subtraction unit 111 b
- the quantization difference vector ⁇ S f is subtracted in FIG. 4, the order of these subtractions may be reversed.
- the correction vector U f may be generated by subtracting a vector obtained by adding the non-prediction-corresponding average vector Y and the quantized difference vector ⁇ S f from the LSP parameter vector ⁇ f .
- the non-prediction-corresponding average vector Y is a predetermined vector, and may be obtained from a learning acoustic signal in advance, for example.
- an acoustic signal that is collected in the same environment (for example, a speaker, a sound collection device, a place) as an acoustic signal to be encoded is used as an input acoustic signal for learning.
- the average of the differences is defined as a non-predicted average vector.
- the correction vector code book 113 stores each candidate correction vector and a correction vector code corresponding to each candidate correction vector.
- Correction vector encoding unit 112 receives the correction vector U f, to give a correction vector U f a is coded correction LSP code D f (s112) outputs. For example, the correction vector encoding unit 112 searches for a candidate correction vector closest to the correction vector U f from among a plurality of candidate correction vectors stored in the correction vector codebook 113, and corresponds to the candidate correction vector. and it outputs the correction vector code as the correction LSP code D f. It may not be generated in the actual the correction vector encoding unit 112, but the following description the closest candidate correction vector in the correction vector U f as quantized correction vector ⁇ U f.
- the correction vector includes at least the previous frame quantization difference vector ⁇ S f-1 , which is a predicted amount from the previous frame of the prediction corresponding encoding unit 120, so that the correction vector encoding unit 112 at least predicts It can also be said that the prediction from the previous frame of the corresponding encoding unit 120 is encoded.
- the non-prediction-compatible encoding unit 110 may not generate the non-prediction-compatible encoding LSP parameter vector ⁇ ⁇ f obtained by quantizing each element of the LSP parameter vector ⁇ f in the non-prediction-compatible encoding unit 110.
- FIG. 5 is a functional block diagram of the linear prediction coefficient decoding apparatus according to the first embodiment, and FIG. 6 shows an example of its processing flow.
- the linear prediction coefficient decoding apparatus 200 includes a prediction corresponding decoding unit 220 and a non-prediction corresponding decoding unit 210.
- the prediction corresponding decoding unit 220 has the same configuration as that of the linear prediction coefficient decoding device 90 of the prior art, and includes a vector codebook 92, a vector decoding unit 91, a delay input unit 93, and a prediction corresponding addition unit 95.
- the decoding prediction-compatible linear prediction coefficient calculation unit 96 is also included. The processes in the vector decoding unit 91, the delay input unit 93, the prediction corresponding addition unit 95, and the decoded prediction corresponding linear prediction coefficient calculation unit 96 correspond to s91 to 96 in FIG.
- the prediction-compatible decoding unit 220 further converts the decoded prediction-compatible LSP parameter vector ⁇ ⁇ f into a decoded prediction-compatible linear prediction coefficient ⁇ a f [1], ⁇ a f [2], ..., ⁇ a f [ p] (s220) and output.
- the prediction vector is a vector (V + ⁇ ⁇ ⁇ S f-1) obtained by adding a predetermined prediction-corresponding average vector V and ⁇ times the decoded difference vector ⁇ S f-1 of the past frame. ).
- the vector decoding unit 91 outputs the decoded difference vector ⁇ S f to the non-predictive corresponding adding unit 213 of the non-predictive corresponding decoding unit 210 in addition to the delay input unit 93 and the predictive corresponding adding unit 95.
- the non-prediction correspondence decoding unit 210 includes a correction vector codebook 212, a correction vector decoding unit 211, and a non-prediction correspondence addition unit 213, and also includes a decoded non-prediction correspondence linear prediction coefficient calculation unit 214 as necessary.
- the non-predictive correspondence decoding unit 210 receives the corrected LSP code D f and the decoded difference vector ⁇ S f .
- the non-predictive decoding unit 210 further decodes the decoded non-predictive LSP parameter vector ⁇ ⁇ f as necessary, by decoding the non-predictive linear prediction coefficients ⁇ b f [1], ⁇ b f [2],. Convert to b f [p] (s210) and output.
- the decoding non-predictive LSP parameter vector ⁇ ⁇ f is a decoded correction vector ⁇ U f obtained by decoding the corrected LSP code D f , and a decoded difference vector ⁇ obtained by decoding the LSP code C f This is a vector obtained by adding S f and a predetermined non-predicted correspondence average vector Y. That is, the non-predictive decoding unit 210 obtains the decoding vector ⁇ ⁇ f of the LSP parameter vector of the current frame only from the code input in the current frame.
- the correction vector codebook 212 stores information having the same contents as the correction vector codebook 113 in the linear prediction coefficient encoding device 100. That is, the correction vector codebook 212 stores each candidate correction vector and a correction vector code corresponding to each candidate correction vector.
- Correction vector decoding unit 211 corrects receives LSP code D f, the correction LSP code D f decoded to obtain a decoded correction vector ⁇ U f and a (s211) outputs. For example, the correction vector decoding unit 211 selects a correction vector code corresponding to the correction LSP code D f input to the linear prediction coefficient decoding device 200 from among a plurality of correction vector codes stored in the correction vector codebook 212. The candidate correction vector corresponding to the searched correction vector code is output as a decoded correction vector ⁇ U f .
- the non-predictive correspondence adding unit 213 includes, for example, a storage unit 213c that stores the non-predictive correspondence average vector Y, and addition units 213a and 213b.
- the non-predictive addition unit 213 receives the decoded correction vector ⁇ U f and the decoded difference vector ⁇ S f .
- the adder 213a adds the decoded difference vector ⁇ S f to the decoded correction vector ⁇ U f , and then the adder 213b stores it in the storage unit 213c.
- the non-predicted corresponding average vectors Y are added, but the order of these additions may be reversed.
- the decoded non-predicted LSP parameter vector ⁇ ⁇ f may be generated by adding a vector obtained by adding the non-predicted corresponding average vector Y and the decoded difference vector ⁇ S f to the decoded correction vector ⁇ U f .
- non-predictive correspondence average vector Y used here is the same as the non-predictive correspondence average vector Y used in the non-predictive correspondence subtraction unit 111 of the linear prediction coefficient encoding apparatus 100 described above.
- the decoded non-predictive correspondence linear prediction coefficient calculation unit 214 receives the decoded non-predictive correspondence LSP parameter vector ⁇ ⁇ f .
- the decoded non-predictive linear prediction coefficient calculation unit 214 converts the decoded non-predictive LSP parameter vector ⁇ ⁇ f into a decoded non-predictive linear predictive coefficient ⁇ b f [1], ⁇ b f [2], ..., ⁇ b f [ p] (s214) and output.
- the non-predictive correspondence decoding unit 210 can obtain a decoded non-predictive correspondence LSP parameter vector ⁇ ⁇ f that is a decoded value of the LSP parameter vector that does not depend on the decoded difference vector ⁇ S f-1 , so that the LSP code of the f-1th frame
- the transmission error of C f ⁇ 1 can be prevented from affecting the decoding non-predictive LSP parameter vector ⁇ ⁇ f of the f-th frame.
- a non-predictive quantization LSP parameter vector / decoded non-predictive LSP parameter vector ⁇ ⁇ f is used as an LSP parameter vector used for variable length coding / decoding depending on each amplitude value constituting a spectrum envelope obtained from an LSP parameter vector. If it is used, correct decoding non-predictive LSP parameter vector ⁇ ⁇ f is not obtained in the f-1 frame and correct decoding is not possible in the f frame even if variable length decoding cannot be performed correctly. A non-predictive LSP parameter vector ⁇ ⁇ f is obtained, and variable length decoding can be performed correctly.
- the correction vector does not need to be quantized as accurately as the LSP parameter vector (so that the quantization error is reduced), and therefore the type of candidate correction vector prepared in the correction vector codebook 113 is at least good.
- the bit length of the correction vector code is 2 bits
- the correction vector codebook 113 includes four types of candidate corrections corresponding to four types of correction vector codes (“00”, “01”, “10”, “11”). Contains vectors.
- the types of candidate correction vectors prepared in the correction vector codebook can be reduced, and a code with a small code amount can be assigned. Therefore, encoding and decoding with less distortion than conventional ones can be realized with a small increase in code amount.
- LSP parameters are described, but other coefficients may be used as long as they are coefficients that can be converted into a multi-order linear prediction coefficient.
- a PARCOR coefficient, a coefficient obtained by modifying an LSP parameter or a PARCOR coefficient, or a linear prediction coefficient itself may be used. All these coefficients can be converted into each other in the technical field of speech coding, and the effect of the first embodiment can be obtained by using any coefficient.
- the LSP code C f or the code corresponding to the LSP code C f is also referred to as a first code
- the prediction corresponding encoding unit is also referred to as a first encoding unit.
- the correction LSP code or a code corresponding to the correction LSP code is also referred to as a second code
- the non-predictive encoding unit is also referred to as a second encoding unit.
- the vector corresponding to the decoded prediction compatible LSP parameter vector ⁇ ⁇ f or the decoded prediction compatible LSP parameter vector ⁇ ⁇ f is also referred to as a first decoded vector
- the prediction corresponding decoding unit is also referred to as a first decoding unit.
- a vector corresponding to the decoded non-predictive LSP parameter vector ⁇ ⁇ f or the decoded non-predictive LSP parameter vector ⁇ ⁇ f is also referred to as a second decoded vector, and the non-predictive corresponding decoding unit is also referred to as a second decoding unit.
- whether or not the correction vector is encoded and whether or not the correction LSP code is decoded are largely determined by the change in the amplitude unevenness of the spectrum envelope, in other words, the magnitude of the peaks and valleys of the spectrum envelope. Use to determine.
- FIG. 7 is a functional block diagram of the linear prediction coefficient encoding apparatus 300 according to the second embodiment, and FIG. 8 shows an example of the processing flow.
- the linear prediction coefficient encoding apparatus 300 includes a non-predictive encoding unit 310 instead of the non-predictive encoding unit 110. Similar to the linear prediction coefficient encoding apparatus 100 of the first embodiment, the LSP parameter ⁇ derived from the acoustic signal X f is generated by another apparatus, and the input of the linear prediction coefficient encoding apparatus 300 is the LSP parameter ⁇ f. In the case of [1], ⁇ f [2],..., ⁇ f [p], the linear prediction coefficient encoding device 300 may not include the linear prediction analysis unit 81 and the LSP calculation unit 82.
- the non-predictive correspondence encoding unit 310 includes a non-predictive correspondence subtraction unit 311, a correction vector encoding unit 312, a correction vector codebook 113, a prediction correspondence addition unit 314, and an index calculation unit 315. According to the calculation result of the index calculation unit 315, it is determined whether or not the non-predictive correspondence subtraction unit 311 executes the subtraction process and whether or not the correction vector encoding unit 312 executes the encoding process. Is different.
- the predictive encoding unit 120 outputs a vector ⁇ ⁇ ⁇ S f ⁇ 1 that is an output value of the multiplication unit 88 in addition to the quantized difference vector ⁇ S f .
- the prediction correspondence adding unit 314 includes, for example, a storage unit 314c that stores the prediction correspondence average vector V, and addition units 314a and 314b.
- the predictive addition unit 314 receives the quantization difference vector ⁇ S f of the current frame and the vector ⁇ ⁇ ⁇ S f-1 obtained by multiplying the previous frame quantization difference vector ⁇ S f-1 by a predetermined coefficient ⁇ .
- the adder 314b first adds the vector ⁇ ⁇ ⁇ S f-1 to the quantized difference vector ⁇ S f of the current frame, and then adds the adder In 314a, the prediction-corresponding average vector V is added, but this order may be reversed. Alternatively, also generate a vector ⁇ ⁇ ⁇ S f-1 and the predicted corresponding mean was vector sum of the vector V, quantized differential vector ⁇ S predicted by adding the f corresponding quantized LSP parameter vector ⁇ theta f Good.
- the current frame quantization difference vector ⁇ S f and the previous frame quantization difference vector ⁇ S f-1 multiplied by a predetermined coefficient ⁇ are input to the prediction correspondence adder 314 and a vector ⁇ ⁇ ⁇ S f-1.
- the prediction correspondence average vector V stored in the storage unit 314 c in the prediction correspondence addition unit 314 is stored in the storage unit 83 d in the prediction correspondence encoding unit 120.
- the prediction-corresponding encoding unit 120 performs the processing performed by the prediction-corresponding addition unit 314 to generate the prediction-corresponding quantized LSP parameter vector ⁇ ⁇ f and thus the non-predictive-corresponding code.
- the non-predictive correspondence encoding unit 310 may be configured not to include the prediction correspondence addition unit 314.
- the index calculation unit 315 receives the prediction corresponding quantized LSP parameter vector ⁇ ⁇ f .
- the index calculation unit 315 uses the prediction-corresponding quantization LSP parameter vector ⁇ ⁇ f and uses the prediction-corresponding quantization LSP parameter vector ⁇ ⁇ f to indicate the index Q corresponding to the magnitude of the peak and valley of the spectrum envelope, that is, the spectrum envelope.
- An index Q that increases as the peak and valley of the spectrum envelop and / or an index Q ′ that corresponds to the smallness of the peak and valley of the spectrum envelope, that is, an index Q ′ that decreases as the peak and valley of the spectrum envelope increase, are calculated (s315).
- the index calculation unit 315 performs the encoding process on the correction vector encoding unit 312 according to the size of the index Q and / or Q ′, or executes the encoding process with a predetermined number of bits. Output a control signal C. In addition, the index calculation unit 315 outputs a control signal C so as to execute a subtraction process to the non-prediction correspondence subtraction unit 311 according to the magnitude of the index Q and / or Q ′. A method for generating the control signal C will be described below.
- the LSP parameter is a frequency domain parameter sequence that correlates with the power spectrum envelope of the input acoustic signal, and each value of the LSP parameter correlates with the frequency position of the extreme value of the power spectrum envelope of the input acoustic signal.
- the LSP parameters are ⁇ [1], ⁇ [2], ..., ⁇ [p]
- the steep slope of the tangent around this extreme value is the smaller the interval between ⁇ [i] and ⁇ [i + 1] (that is, the value of ( ⁇ [i + 1] - ⁇ [i])) .
- the steepness of the amplitude of the power spectrum envelope becomes steeper, and for each i, the interval between ⁇ [i] and ⁇ [i + 1] becomes non-uniform, that is, the variance of the LSP parameter interval increases.
- the interval between ⁇ [i] and ⁇ [i + 1] is close to an equal interval, that is, the variance of the LSP parameter interval is small.
- a large index corresponding to the dispersion of the interval of the LSP parameters means that the change in the unevenness of the amplitude of the power spectrum envelope is large.
- a small index corresponding to the minimum value of the LSP parameter interval means that the change in the amplitude unevenness of the power spectrum envelope is large.
- Predictive quantization LSP parameters ⁇ ⁇ f [1], ⁇ ⁇ f [2],..., ⁇ ⁇ f [p] are LSP parameters ⁇ f [1], ⁇ f [2],..., ⁇ f [p] LSP parameters ⁇ ⁇ f [1], ⁇ ⁇ f [2],..., ⁇ ⁇ f [p] are obtained by quantizing the LSP code C f from the linear prediction encoder.
- Predictive quantization LSP parameters ⁇ ⁇ f [1], ⁇ ⁇ f [2],..., ⁇ ⁇ f [p] are the same as long as they are input to the linear predictive decoder without error.
- LSP parameters ⁇ ⁇ f [1], ⁇ ⁇ f [2],..., ⁇ ⁇ f [p] and decoding prediction compatible LSP parameters ⁇ ⁇ f [1], ⁇ ⁇ f [2],..., ⁇ ⁇ f [ p] has the same properties as the LSP parameters ⁇ f [1], ⁇ f [2],..., ⁇ f [p].
- the index Q which increases the value corresponding to the variance of the interval of the predictive quantization LSP parameters ⁇ ⁇ f [1], ⁇ ⁇ f [2],..., ⁇ ⁇ f [p] as the peak of the spectral envelope increases.
- Quantization LSP parameter vector for prediction corresponding to the quantization LSP parameter of the order corresponding to the order in ⁇ ⁇ f ( ⁇ ⁇ f [1], ⁇ ⁇ f [2],..., ⁇ ⁇ f [p])
- the minimum value of the difference ( ⁇ ⁇ f [i + 1] ⁇ ⁇ ⁇ f [i]) can be used as an index Q ′ that decreases as the peak and valley of the spectrum envelope increase.
- the index Q which increases as the peak of the spectral envelope increases, is, for example, the variance of the interval of the prediction-corresponding quantization LSP parameter that is an element of the prediction-corresponding quantization LSP parameter vector ⁇ ⁇ f of a predetermined order T (T ⁇ p) or less Index Q representing Calculate with
- the index Q ′ which decreases as the peak of the spectral envelope increases, is, for example, a prediction-corresponding quantization LSP parameter in which the order of a prediction-corresponding quantization LSP parameter vector ⁇ ⁇ f of a predetermined order T (T ⁇ p) or less is adjacent.
- Index Q ′ representing the minimum value of the interval of
- an index Q representing the minimum value among the intervals of the prediction-corresponding quantized LSP parameters adjacent to the order of the prediction-corresponding quantized LSP parameter vector ⁇ ⁇ f and the value of the lowest-order predictive-corresponding quantized LSP parameter '
- the least-predictive quantization LSP parameter ⁇ ⁇ f [1] in this equation is the interval between ⁇ ⁇ f [1] and 0 ( ⁇ ⁇ f [1] -0).
- the index calculation unit 315 determines that the peak or valley of the spectrum envelope is larger than a predetermined reference, that is, (A-1) in the above example, the index Q is equal to or greater than a predetermined threshold Th1, and / or (B-1)
- a predetermined reference that is, (A-1) in the above example, the index Q is equal to or greater than a predetermined threshold Th1, and / or (B-1)
- the control signal C indicating that the correction encoding process is executed is output to the non-predictive correspondence subtraction unit 311 and the correction vector encoding unit 312, and otherwise Then, the control signal C indicating that the correction encoding process is not executed is output to the non-prediction correspondence subtraction unit 311 and the correction vector encoding unit 312.
- the index calculation unit 315 outputs a positive integer (or a sign representing a positive integer) representing a predetermined number of bits as the control signal C in the case of (A-1) and / or (B-1). In other cases, 0 may be output as the control signal C.
- the non-predictive subtracting unit 311 performs a subtraction process when the control signal C is received
- the correction vector encoding unit 312 executes the encoding process when the control signal C is received.
- the index calculation unit 315 may be configured not to output the control signal C in cases other than (A-1) and / or (B-1).
- the correction vector encoding unit 312 receives the control signal C and the correction vector U f .
- control signal C indicating that correction encoding processing is executed or a positive integer (or a sign representing a positive integer) is received as control signal C
- the main point is that the peak and valley of the spectrum envelope is larger than a predetermined reference , i.e. in the above example in the case of (a-1) and / or (B-1)
- the correction vector encoding unit 312 obtains the correction LSP code D f by encoding the correction vector U f (s312) Output.
- the encoding process itself for encoding the correction vector U f is the same as that of the correction vector encoding unit 112.
- control signal C indicating that correction encoding processing is not executed or 0 is received as control signal C
- the point is that the peak or valley of the spectrum envelope is not larger than a predetermined reference, that is, in the above example (A-1 ) And / or (B-1)
- the correction vector encoding unit 312 does not encode the correction vector U f and does not obtain and output the correction LSP code D f .
- FIG. 9 is a functional block diagram of the linear prediction coefficient decoding apparatus 400 according to the second embodiment, and FIG. 10 shows an example of its processing flow.
- the linear prediction coefficient decoding apparatus 400 includes a non-predictive correspondence decoding unit 410 instead of the non-predictive correspondence decoding unit 210.
- the non-predictive correspondence decoding unit 410 includes a correction vector codebook 212, a correction vector decoding unit 411, a non-predictive correspondence addition unit 413, a decoded non-predictive correspondence linear prediction coefficient calculation unit 214, and an index calculation unit 415, as necessary.
- a decoding non-predictive linear prediction coefficient calculation unit 214 is also included.
- the index calculation unit 415 sends a control signal C indicating whether or not to perform correction decoding processing to the correction vector decoding unit 411 and the non-predictive addition unit 413 according to the size of the index Q and / or Q ′, or The control signal C indicating that the correction decoding process is executed with a predetermined number of bits is output.
- the indexes Q and Q ′ are the same as those described in the index calculation unit 315.
- the index calculation unit 315 uses the decoded prediction compatible LSP parameter vector ⁇ ⁇ f instead of the prediction corresponding quantization LSP parameter vector ⁇ ⁇ f. The calculation may be performed in the same manner as described above.
- the index calculation unit 415 determines that the peak or valley of the spectrum envelope is larger than a predetermined reference, that is, (A-1) in the above example, the index Q is equal to or greater than a predetermined threshold Th1, and / or (B-1)
- a predetermined reference that is, (A-1) in the above example, the index Q is equal to or greater than a predetermined threshold Th1, and / or (B-1)
- the control signal C indicating that the correction decoding process is executed is output to the non-predictive corresponding addition unit 413 and the correction vector decoding unit 411.
- a control signal C indicating that the correction decoding process is not executed is output to the prediction corresponding addition unit 413 and the correction vector decoding unit 411.
- the index calculation unit 415 outputs a positive integer (or a sign representing a positive integer) representing a predetermined number of bits as the control signal C in the case of (A-1) and / or (B-1). In other cases, 0 may be output as the control signal C.
- the index calculation unit 415 may be configured not to output the control signal C.
- the correction vector decoding unit 411 receives the correction LSP code D f and the control signal C.
- the control signal C indicating that the correction decoding process is performed or a positive integer (or a sign representing a positive integer) is received as the control signal C
- the correction LSP code D f is decoded by referring to the correction vector codebook 212 to obtain the decoded correction vector ⁇ U f (S411) Output.
- the decoding process itself for decoding the corrected LSP code D f is the same as that of the correction vector decoding unit 211.
- the correction vector decoding unit 411 When the correction vector decoding unit 411 receives the control signal C indicating that the correction decoding process is not executed or 0 as the control signal C, the correction vector decoding unit 411 is, in short, the case where the peak or valley of the spectrum envelope is not larger than a predetermined reference, that is, In the example, in cases other than (A-1) and / or (B-1), the corrected LSP code D f is not decoded, and the decoded correction vector ⁇ U f is not obtained and output.
- the non-predictive correspondence adding unit 413 includes, for example, a storage unit 413c that stores a non-predictive correspondence average vector Y, and addition units 413a and 413b.
- the non-predictive addition unit 413 receives the control signal C and the decoded difference vector ⁇ S f .
- the control signal C indicating that the correction decoding process is performed or a positive integer (or a sign representing a positive integer) is received as the control signal C, in short, when the peak and valley of the spectrum envelope is larger than a predetermined reference, In the case of (A-1) and / or (B-1), the decoding correction vector ⁇ U f is also received.
- the non-predictive correspondence adding unit 413 adds the decoded difference vector ⁇ S f to the decoded correction vector ⁇ U f and the non-predictive corresponding average vector Y stored in the storage unit 413c, thereby obtaining the decoded non-predictive correspondence.
- An LSP parameter vector ⁇ ⁇ f ⁇ U f + Y + ⁇ S f is generated (s413) and output.
- the adder 413a adds the decoded difference vector ⁇ S f to the decoded correction vector ⁇ U f , and then the adder 413b stores it in the storage unit 413c.
- the decoded non-predicted LSP parameter vector ⁇ ⁇ f may be generated by adding a vector obtained by adding the non-predicted corresponding average vector Y and the decoded difference vector ⁇ S f to the decoded correction vector ⁇ U f .
- non-predictive correspondence average vector Y used here is the same as the non-predictive correspondence average vector Y used in the non-predictive correspondence subtraction unit 311 of the linear prediction coefficient encoding apparatus 300 described above.
- the code amount for the corrected LSP code D f can be reduced. That is, it is possible to perform encoding and decoding processing with less sound quality deterioration due to transmission error of the code of the previous frame than that of the prior art, while reducing the code amount as a whole as compared with the encoding and decoding of the first embodiment.
- ⁇ Modification> As described in the modification of the first embodiment, other coefficients may be used as long as they can be converted into linear prediction coefficients instead of the LSP parameters.
- a PARCOR coefficient, a coefficient obtained by modifying any one of the LSP parameter and the PARCOR coefficient, and the linear prediction coefficient itself may be targeted.
- the PARCOR coefficients k f [1], k f [2],..., K f [p] are used will be described.
- the index calculation unit 315 receives the quantized PARCOR coefficient ⁇ k f [1], ⁇ k f [2], ..., ⁇ k f [p], and calculates the spectral envelope.
- Index Q ' correspond to the smallness of the mountain valley (S315).
- the index calculation unit 315 controls the correction vector encoding unit 312 and the non-predictive corresponding subtraction unit 311 according to the magnitude of the index Q ′, or a control signal C indicating whether or not to execute the correction encoding process, or a predetermined signal A control signal C which is a positive integer representing the number of bits or 0 is output.
- the index calculation unit 415 controls the correction vector decoding unit 411 and the non-predictive corresponding addition unit 413 according to the magnitude of the index Q ′, or a control signal C indicating whether or not to execute the correction decoding process, or a predetermined value A control signal C which is a positive integer representing the number of bits or 0 is output.
- the index calculation unit 315 and the index calculation unit 415 may be configured to output the index Q and / or the index Q ′ instead of the control signal C. In that case, it is only necessary to determine whether the correction vector encoding unit 312 and the correction vector decoding unit 411 execute the encoding process and the decoding process, respectively, according to the size of the index Q and / or the index Q ′. Similarly, depending on the size of the index Q and / or the index Q ′, whether or not to perform the subtraction processing in the non-predictive corresponding subtracting unit 311 and the non-predictive corresponding adding unit 413, and what kind of addition processing is performed. What is necessary is just to judge whether it performs.
- the determinations in the correction vector encoding unit 312, the correction vector decoding unit 411, the non-predictive correspondence subtracting unit 311, and the non-predictive correspondence adding unit 413 are the same as described in the index calculation unit 315 and the index calculation unit 415. .
- the correction vector encoding unit and the correction vector decoding unit are executed using a correction vector codebook with higher accuracy as the influence of a decrease in decoding accuracy due to transmission error of the LSP code is larger.
- FIG. 11 is a functional block diagram of the linear prediction coefficient encoding apparatus 500 of the third embodiment, and FIG. 8 shows an example of the processing flow.
- the linear prediction coefficient coding apparatus 500 includes a non-prediction-compatible encoding unit 510 instead of the non-prediction-compatible encoding unit 310.
- the non-predictive correspondence encoding unit 510 includes a non-predictive correspondence subtraction unit 311, a correction vector encoding unit 512, correction vector codebooks 513A and 513B, a prediction correspondence addition unit 314, and an index calculation unit 315.
- the LSP parameter ⁇ derived from the acoustic signal X f is generated by another apparatus, and the input of the linear prediction coefficient encoding apparatus 500 is In the case of the LSP parameters ⁇ f [1], ⁇ f [2],..., ⁇ f [p], the linear prediction coefficient encoding apparatus 500 does not include the linear prediction analysis unit 81 and the LSP calculation unit 82. It's okay.
- the linear prediction coefficient encoding apparatus 500 includes a plurality of correction vector codebooks, and the correction vector encoding unit 512 can select either one according to the index Q and / or Q ′ calculated by the index calculation unit 315. This is different from the second embodiment in that encoding is performed by selecting one correction vector codebook.
- Correction vector codebooks 513A and 513B differ in the total number of stored candidate correction vectors.
- a large total number of candidate correction vectors means that the number of bits of the corresponding correction vector code is large.
- more candidate correction vectors can be prepared by increasing the number of bits of the correction vector code. For example, assuming that the number of bits of the correction vector code is A, a maximum of 2 A candidate correction vectors can be prepared.
- correction vector codebook 513A has a larger total number of candidate correction vectors stored than correction vector codebook 513B.
- the code length (average code length) of the code stored in the correction vector codebook 513A is larger than the code length (average code length) of the code stored in the correction vector codebook 513B.
- the correction vector codebook 513A stores 2 A sets of correction vector codes and candidate correction vectors having a code length of A bits
- the correction vector codebook 513B has a code length of B bits (B ⁇ A )
- Correction vector codes and candidate correction vectors 2 B (2 B ⁇ 2 A ) are stored.
- the index calculation unit outputs the index Q and / or the index Q ′ instead of the control signal C, and the index Q and / or In accordance with the magnitude of the index Q ′, the correction vector encoding unit and the correction vector decoding unit determine what encoding and decoding are to be performed, respectively.
- the index calculation unit may determine what type of encoding and decoding is performed and output the control signal C.
- the non-predictive correspondence subtracting unit 311 and the non-predictive correspondence adding unit 413 perform subtraction processing according to the size of the index Q and / or the index Q ′, respectively, as described in the modification example of the second embodiment. It is determined whether or not and what kind of addition processing is performed.
- the correction vector encoding unit 512 receives the index Q and / or the index Q ′ and the correction vector U f .
- the correction vector encoding unit 512 increases the (A-2) index Q and / or (B-2) the smaller the index Q ′, the larger the number of bits (the longer the code length) the corrected LSP code D f. (S512) and output.
- encoding is performed as follows using a predetermined threshold Th2 and / or a predetermined threshold Th2 ′.
- the correction vector encoding unit 512 executes the encoding process when the index Q is equal to or greater than the predetermined threshold Th1 and / or when the index Q ′ is equal to or less than the predetermined threshold Th1 ′.
- Th2 is a larger value than Th1, and Th2 'is a smaller value than Th1'.
- the index Q ′ is larger than the predetermined threshold Th2 ′.
- B which is a positive integer less than the number of bits A, is set as the number of bits of the correction LSP code D f , and the correction vector encoding unit 512
- the correction vector U f is encoded with reference to the correction vector code book 513B storing 2 B sets of correction vector codes and candidate correction vectors of bit number (code length) B, and the correction LSP code D f (S512) and output.
- the correction vector encoding unit 512 of the third embodiment when the index Q calculated by the index calculation unit 315 is larger than the predetermined threshold Th1, and / or when the index Q ′ is smaller than the predetermined threshold Th1 ′. To be executed.
- FIG. 12 is a functional block diagram of the linear prediction coefficient decoding apparatus 600 according to the third embodiment, and FIG. 10 shows an example of the processing flow.
- the linear prediction coefficient decoding apparatus 600 includes a non-prediction support decoding unit 610 instead of the non-prediction support decoding unit 410.
- the non-predictive correspondence decoding unit 610 includes a non-predictive correspondence addition unit 413, a correction vector decoding unit 611, correction vector codebooks 612A and 612B, and an index calculation unit 415, and a decoding non-prediction correspondence linear prediction coefficient calculation unit as necessary 214 is also included.
- the linear prediction coefficient decoding apparatus 600 of the third embodiment includes a plurality of correction vector codebooks, and the correction vector decoding unit 611 is any one according to the index Q and / or Q ′ calculated by the index calculation unit 415. It differs from the linear prediction coefficient decoding apparatus 400 of 2nd embodiment in the point which selects and corrects one correction vector codebook.
- Correction vector codebooks 612A and 612B store the same contents as correction vector codebooks 513A and 513B of linear prediction coefficient encoding apparatus 500, respectively. That is, the correction vector codebooks 612A and 612B store each candidate correction vector and the correction vector code corresponding to each candidate correction vector, and the code length of the code stored in the correction vector codebook 612A ( (Average code length) is larger than the code length (average code length) of codes stored in the correction vector codebook 612B. For example, 2 A sets of correction vector codes and candidate correction vectors having a code length of A bits are stored in the correction vector codebook 612A, and the code length is B bits (B ⁇ A ) Correction vector codes and candidate correction vectors 2 B (2 B ⁇ 2 A ) are stored.
- the correction vector decoding unit 611 receives the index Q and / or the index Q ′ and the correction LSP code D f .
- the correction vector decoding unit 611 decodes the correction LSP code D f having a larger number of bits as (A-2) the index Q is larger and / or (B-2) the index Q ′ is smaller,
- a decoding correction vector ⁇ U f is obtained from many candidate correction vectors (s611). For example, decoding is performed as follows using a predetermined threshold value Th2 and / or Th2 ′.
- the correction vector decoding unit 611 executes the decoding process when the index Q is equal to or greater than the predetermined threshold Th1 and / or when the index Q ′ is equal to or smaller than the predetermined threshold Th1 ′.
- the value is larger than Th1, and Th2 'is smaller than Th1'.
- the correction vector decoding unit 611 corrects the correction vector codebook 612A that stores 2 A pairs of correction vector codes having a bit number (code length) A and candidate correction vectors. , A candidate correction vector corresponding to the correction vector code matching the correction LSP code D f is obtained as a decoded correction vector ⁇ U f (s611) and output.
- A-6 When the index Q is smaller than the predetermined threshold Th2 and the index Q is equal to or larger than the predetermined threshold Th1, and / or (B-6) The index Q ′ is larger than the predetermined threshold Th2 ′.
- B which is a positive integer less than the number of bits A, is set as the number of bits of the correction LSP code D f , and the correction vector decoding unit 611
- a correction vector code corresponding to the correction LSP code D f is referred to by referring to the correction vector code book 612B storing 2 B sets of correction vector codes and candidate correction vectors having a bit number (code length) B.
- a candidate correction vector is obtained as a decoding correction vector ⁇ U f (s611) and output.
- the correction vector decoding unit 611 of the third embodiment when the index Q calculated by the index calculation unit 415 is larger than the predetermined threshold Th1, and / or when the index Q ′ is smaller than the predetermined threshold Th1 ′, To be executed.
- the number of correction vector codebooks is not necessarily two, and may be three or more.
- a correction vector code having a different number of bits (bit length) is stored for each correction vector codebook, and a correction vector corresponding to the correction vector code is stored.
- a threshold value may be set according to the number of correction vector codebooks.
- the threshold value for the index Q may be set such that the larger the threshold value, the larger the number of bits of the correction vector code stored in the correction vector codebook used when the threshold value is greater than or equal to the threshold value.
- the threshold value for the index Q ′ may be set such that the smaller the threshold value, the larger the number of bits of the correction vector code stored in the correction vector codebook used when the threshold value is less than or equal to the threshold value.
- the encoding apparatus 700 includes a TCX (transform coded excitation) encoding method that is an encoding method in the frequency domain of the linear prediction coefficient encoding apparatus 100 and the linear prediction coefficient decoding apparatus 200 of the first embodiment. It is applied to.
- TCX transform coded excitation
- FIG. 13 is a functional block diagram of the encoding apparatus 700 of the fourth embodiment, and FIG. 14 shows an example of the processing flow.
- the encoding device 700 of the fourth embodiment includes a linear prediction coefficient encoding device 100, a linear prediction coefficient decoding device 200, a power spectrum envelope sequence calculation unit 710, a first smoothed power spectrum envelope sequence calculation unit 720A, and a second smoothing.
- a power spectrum envelope sequence calculation unit 720B, a frequency domain conversion unit 730, an envelope normalization unit 740, a variable length coding parameter calculation unit 750, and a variable length coding unit 760 are included.
- the linear prediction coefficient encoding apparatuses 300 and 500 and the linear prediction coefficient decoding apparatuses 400 and 600 of the second and third embodiments may be used. Good.
- the encoding device 700 receives the input acoustic signal Xf and outputs a frequency domain signal code.
- Linear prediction coefficient coding unit 100 receives the audio signal X f, to give a LSP code C f and correction LSP code D f (s100) outputs.
- the linear prediction coefficient decoding apparatus 200 receives the LSP code C f and the corrected LSP code D f, and performs prediction-compliant quantized linear prediction coefficients ⁇ a f [1], ⁇ a f [2], ..., ⁇ a f [p ] And non-predictive quantization linear prediction coefficients ⁇ b f [1], ⁇ b f [2], ..., ⁇ b f [p] are obtained and output (s200).
- the linear prediction coefficient coding unit 100 of the encoding apparatus 700 when obtaining the LSP code C f and the correction LSP code D f, the LSP code C f corresponding to the prediction corresponding quantized linear prediction coefficient ⁇ a f [1 ], ⁇ a f [2], ..., ⁇ a f [p], LSP code C f, and non-predictive quantized linear prediction coefficient corresponding to corrected LSP code D f ⁇ b f [1], ⁇ b f [ 2], ..., ⁇ b f [p] may be obtained.
- the encoding apparatus 700 may not include the linear prediction coefficient decoding apparatus 200.
- the power spectrum envelope sequence calculation unit 710 receives non-predictive quantization-compatible linear prediction coefficients ⁇ b f [1], ⁇ b f [2],..., ⁇ B f [p].
- the power spectrum envelope sequence calculation unit 710 uses the non-prediction-compatible quantized linear prediction coefficients ⁇ b f [1], ⁇ b f [2], ..., ⁇ b f [p] to calculate the input acoustic signal at N points.
- the power spectrum envelope sequence Z [1],..., Z [N] is calculated (s710) and output.
- each value Z [n] of the power spectrum envelope sequence can be obtained by the following equation.
- n is an integer of 1 ⁇ n ⁇ N
- exp ( ⁇ ) is an exponential function with the Napier number as the base
- j is an imaginary unit
- ⁇ 2 is a predicted residual energy.
- the first smoothed power spectrum envelope sequence calculation unit 720A receives the prediction-corresponding quantized linear prediction coefficients ⁇ a f [1], ⁇ a f [2], ..., ⁇ a f [p].
- the first smoothed power spectrum envelope sequence calculation unit 720A has a prediction-corresponding quantized linear prediction coefficient ⁇ a f [1], ⁇ a f [2], ..., ⁇ a f [p] and 1 or less given in advance.
- a correction coefficient ⁇ i that is a positive constant
- the first smoothed power spectrum envelope sequence ⁇ W [1], ⁇ W [2], ..., ⁇ W [N] is the predictive quantization linear prediction coefficient ⁇ a f [1], ⁇ a f [2] ,..., ⁇ A f [p] corresponds to a power spectrum envelope sequence W [1], W [2],..., W [N] whose amplitude irregularities are blunted (smoothed).
- ⁇ i is a positive constant that determines the degree of smoothing.
- the second smoothed power spectrum envelope sequence calculation unit 720B receives the non-prediction-corresponding quantized linear prediction coefficients ⁇ b f [1], ⁇ b f [2], ..., ⁇ b f [p].
- the second smoothed power spectrum envelope sequence calculation unit 720B has a non-prediction-compatible quantized linear prediction coefficient ⁇ b f [1], ⁇ b f [2], ..., ⁇ b f [p] and 1 or less given in advance.
- ⁇ i which is a positive constant of To calculate the second smoothed power spectrum envelope sequence ⁇ Z [1], ⁇ Z [2], ..., ⁇ Z [N] (s720B).
- the second smoothed power spectrum envelope sequence ⁇ Z [1], ⁇ Z [2], ..., ⁇ Z [N] is a non-predicted quantized linear prediction coefficient ⁇ b f [1], ⁇ b f [2 ],..., ⁇ B f [p] corresponds to a power spectrum envelope sequence Z [1], Z [2],..., Z [N] whose amplitude irregularities are blunted (smoothed).
- ⁇ i is a positive constant that determines the degree of smoothing.
- the frequency domain transform unit 730 converts the input time domain input acoustic signal X f into N points of MDCT coefficient sequences X [1],..., X [N] in the frequency domain in units of frames that are predetermined time segments. Convert (s730) and output. N is a positive integer.
- variable length coding parameter calculation unit 750 includes the power spectrum envelope sequence Z [1],..., Z [N] and the second smoothed power spectrum envelope sequence ⁇ Z [1],. column X [1], ..., X [N] to the normal haze MDCT coefficients X N [1], ..., receive and X N [N]. Using these values, a variable-length encoding parameter r i that is a parameter for variable-length encoding the normalized MDCT coefficient sequence X N [1],..., X N [N] is calculated (s750). )Output.
- the variable length encoding parameter r i is a parameter that specifies a range that the amplitude of the normalized MDCT coefficient sequence X N [1],..., X N [N] to be encoded can take.
- the Rice parameter corresponds to a variable length coding parameter
- the range that the amplitude to be encoded can take corresponds to the variable length coding parameter.
- variable length coding parameters are calculated for each coefficient X N [i] of the normalized MDCT coefficient sequence.
- variable length coding is performed collectively for each sample group consisting of a plurality of samples (for example, two samples)
- a variable length coding parameter is calculated for each sample group. That is, the variable length coding parameter calculation unit 750 calculates a variable length coding parameter for each normalized partial coefficient sequence that is a part of the normalized MDCT coefficient sequence.
- the normalized partial coefficient sequences include the coefficients of the normalized MDCT coefficient sequence without overlapping.
- variable length coding parameter calculation method will be described by taking as an example the case of performing rice coding for each sample.
- Step1 e.g., by the following equation, normalized haze MDCT coefficients X N [1], X N [2], ..., X N [N]
- Rice parameter sb as a reference the average of the logarithm of the amplitudes of the coefficients of Calculate as The sb is encoded only once for each frame, and transmitted to the decoding device as a code corresponding to the reference Rice parameter.
- sb is approximately determined from the estimated value of the amplitude of X [i] in common between the encoding device 700 and the decoding device. You may decide how. In this case, it is not necessary to encode sb and output the code corresponding to the reference rice parameter to the decoding device.
- Step 2 The threshold value ⁇ is calculated by the following equation.
- Step 3 sqrt (Z [i]) / sqrt ( ⁇ Z [i])
- Variable length coding unit 760 receives the variable length coding parameters r i, using this value normalized haze coefficient sequence X N (1), ..., X N (N) is variable-length coding, variable length code C X is output (s760).
- the fourth embodiment is a normalized MDCT coefficient sequence X N [1], obtained by normalizing the MDCT coefficient sequence X [1], X [2],..., X [N] with a smoothed power spectrum envelope sequence. ..., X N [N] is encoded using variable-length encoding parameters.
- the envelope normalization unit 740 uses the power spectrum envelope obtained from the smoothed linear prediction coefficient.
- the first smoothed power spectrum envelope sequence ⁇ W [ which is obtained by predictive quantization linear prediction coefficients ⁇ a f [1], ⁇ a f [2],..., ⁇ a f [p] Normalized MDCT coefficient sequences are generated using 1], ⁇ W [2], ..., ⁇ W [N].
- the variable length coding parameter calculation unit 750 uses a power spectrum envelope sequence or a smoothed power spectrum envelope sequence to obtain a variable length coding parameter. Accordingly, the power spectrum envelope sequence and the smoothed power spectrum envelope sequence used in the variable length coding parameter calculation unit 750 are also different from the power spectrum envelope sequence obtained from the linear prediction coefficient and the power spectrum envelope sequence obtained from the smoothed linear prediction coefficient. The smaller is desirable. However, predictive quantized linear prediction coefficients ⁇ a f [1], ⁇ a f [2],..., ⁇ a f [p] are not only used when the transmission error occurs in the LSP code of the current frame. Even when a transmission error occurs in the LSP code of the frame, a correct value cannot be obtained on the decoding side.
- variable length coding parameters from the power spectrum envelope sequence and smoothed power spectrum envelope sequence obtained from predictive quantization linear prediction coefficients ⁇ a f [1], ⁇ a f [2], ..., ⁇ a f [p] If a transmission error occurs in the LSP code of the current frame as well as a transmission error occurs in the LSP code of the previous frame, variable length decoding cannot be performed correctly.
- a variable length coding parameter is obtained using the sequence.
- the current frame does not have the same non-predictive quantization linear prediction coefficient ⁇ b f [1], ⁇ b f [2], ..., ⁇ b f [p], power spectrum envelope sequence Z [1], Z [2], ..., Z [N] and second smoothed power spectrum envelope Since the sequence ⁇ Z [1], ⁇ Z [2], ..., ⁇ Z [N] can be obtained, the same variable length coding parameters as the coding side can be obtained in the current frame, and the LSP code Improved resistance to transmission errors.
- the normalized MDCT coefficient sequence X N [1] obtained using the first smoothed power spectrum envelope sequence ⁇ W [1], ⁇ W [2], ..., ⁇ W [N] ], ..., X N [N] are the targets of variable length coding. Therefore, not only when a transmission error occurs in the LSP code of the current frame but also when a transmission error occurs in the LSP code of the previous frame, the normalized MDCT coefficient sequence X N [1],. , X N [N] is multiplied by the square root of each value of the smoothed power spectrum envelope sequence, and the MDCT coefficient sequence obtained by decoding is distorted.
- this problem is less than a problem that makes variable length decoding itself inaccurate, such as an error in variable length coding parameters.
- each unit of the non-predictive encoding units 110, 310, and 510 will be described.
- non-prediction corresponding mean vector Y (y [1], y [2], ..., y [p]) is a vector consisting of T L following the following elements of the T.
- outputs the low-order LSP parameter vector theta 'f from LSP computation unit 82 consists of T L following the following LSP parameters of the LSP parameter vector theta f, be input to the non-prediction corresponding subtraction unit 111,311 Good.
- the outputs T L consists following following elements lower order quantized differential vector ⁇ S 'f of the quantized difference vector ⁇ S f from the vector encoding section 84, the non-predictive corresponding subtraction unit 111,311 You may enter.
- correction vector encoding unit 112, 312, 512 ⁇ Correction vector encoding unit 112, 312, 512>
- Each candidate correction vector stored in the correction vector codebook 113, 513A, 513B may be a TL- order vector.
- Non-predictive decoding processing Processing performed by the non-predictive correspondence decoding unit 210 of the linear prediction coefficient decoding device 200 according to the first modification, the non-prediction correspondence decoding unit 410 of the linear prediction coefficient decoding device 400, and the non-prediction correspondence decoding unit 610 of the linear prediction coefficient decoding device 600 ( Non-predictive decoding processing) will be described.
- Correction vector decoding unit 211,411,611 receives the correction LSP code D f, the correction vector codebook 212,612A, with reference to 612B, correction LSP code D f decodes decodes low-order correction vector ⁇ U ' Get f and output.
- Each candidate correction vector stored in the correction vector codebooks 212, 612A, and 612B may be a TL- order vector as in the correction vector codebooks 113, 513A, and 513B.
- the non-predictive addition unit 213 adds elements of the decoded low-order correction vector ⁇ U ' f , the decoded difference vector ⁇ S f, and the non-predictive average vector Y for each order below the TL order, and the p order and below. For each order exceeding the TL order, a decoded non-predicted LSP parameter vector ⁇ ⁇ f obtained by adding the elements of the decoded difference vector ⁇ S f and the non-predicted corresponding average vector Y is generated and output.
- the non-predictive correspondence adding unit 413 receives the control signal C indicating that the correction decoding process is to be executed or a positive integer (or a sign representing a positive integer) as the control signal C, the main point is that the spectral envelope Yamaya Is greater than a predetermined criterion, and in the case of (A-1) and / or (B-1), the decoded low-order correction vector ⁇ U ' f and the decoded difference vector ⁇ S for each order below the TL order Add the elements of f and the non-predicted mean vector Y, and for each order that exceeds the p th and TL orders, add the decoded difference vector ⁇ S f and the elements of the non-predictive mean vector Y Generate and output a prediction compatible LSP parameter vector ⁇ ⁇ f .
- encoding distortion is reduced by giving priority to low-order LSP parameters that may have a large influence on the efficiency of signal processing, which will be described later, with high approximation accuracy, while suppressing an increase in distortion.
- the code amount can be reduced as compared with the method of the third embodiment.
- the input to the LSP calculation unit is the linear prediction coefficient a f [1], a f [2],..., A f [p].
- the encoding target of the linear prediction coefficient encoding device and the decoding target of the linear prediction coefficient decoding device are LSP parameters, but they are converted into linear prediction coefficients such as the linear prediction coefficients themselves and ISP parameters. Any coefficient may be used as an encoding or decoding target as long as it is possible.
- the program describing the processing contents can be recorded on a computer-readable recording medium.
- a computer-readable recording medium any recording medium such as a magnetic recording device, an optical disk, a magneto-optical recording medium, and a semiconductor memory may be used.
- this program is distributed by selling, transferring, or lending a portable recording medium such as a DVD or CD-ROM in which the program is recorded. Further, the program may be distributed by storing the program in a storage device of the server computer and transferring the program from the server computer to another computer via a network.
- a computer that executes such a program first stores a program recorded on a portable recording medium or a program transferred from a server computer in its storage unit. When executing the process, this computer reads the program stored in its own storage unit and executes the process according to the read program.
- a computer may read a program directly from a portable recording medium and execute processing according to the program. Further, each time a program is transferred from the server computer to the computer, processing according to the received program may be executed sequentially.
- the program is not transferred from the server computer to the computer, and the above-described processing is executed by a so-called ASP (Application Service Provider) type service that realizes a processing function only by an execution instruction and result acquisition. It is good.
- the program includes information provided for processing by the electronic computer and equivalent to the program (data that is not a direct command to the computer but has a property that defines the processing of the computer).
- each device is configured by executing a predetermined program on a computer, at least a part of these processing contents may be realized by hardware.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
図1は、従来の線形予測係数符号化装置80の構成を示す。
線形予測分析部81は、入力音響信号Xfを受け取り、入力音響信号Xfを線形予測分析して、線形予測係数af[1],af[2],…,af[p]を求めて出力する。ここで、af[i]はf番目のフレームの入力音響信号Xfを線形予測分析して得られるi次の線形予測係数を表す。
LSP計算部82は、線形予測係数af[1],af[2],…,af[p]を受け取り、線形予測係数af[1],af[2],…,af[p]からLSPパラメータθf[1],θf[2],…,θf[p]を求め、求めたLSPパラメータを要素とするベクトルであるLSPパラメータベクトルΘf=(θf[1],θf[2],…,θf[p])Tを出力する。ここで、θf[i]はf番目のフレームの入力音響信号Xfに対応するi次のLSPパラメータである。
予測対応減算部83は、例えば、所定の係数αを記憶した記憶部83c、予測対応平均ベクトルVを記憶した記憶部83d、乗算部88、減算部83a及び83bを含んで構成される。
ベクトル符号化部84は、差分ベクトルSfを受け取り、差分ベクトルSfを符号化して、LSP符号CfとLSP符号Cfに対応する量子化差分ベクトル^Sf=(^sf[1], ^sf [2],…, ^sf[p])Tとを得て出力する。差分ベクトルSfの符号化には、差分ベクトルSfをベクトル量子化する方法、差分ベクトルSfを複数のサブベクトルに分けてサブベクトルそれぞれをベクトル量子化する方法、差分ベクトルSfまたはサブベクトルを多段ベクトル量子化する方法、ベクトルの要素をスカラ量子化する方法、これらを組み合わせた方法、などの周知の何れの符号化方法を用いてもよい。
ベクトル符号帳86には、各候補差分ベクトルとその各候補差分ベクトルに対応する差分ベクトル符号とが予め記憶されている。
遅延入力部87は、量子化差分ベクトル^Sfを受け取り、量子化差分ベクトル^Sfを保持し、1フレーム分遅らせて、前フレーム量子化差分ベクトル^Sf-1として出力する。つまり、f番目のフレームの量子化差分ベクトル^Sfに対して予測対応減算部83が処理を行っているときには、f-1番目のフレームについての量子化差分ベクトル^Sf-1を出力する。
図2は、従来の線形予測係数復号装置90の構成を示す。線形予測係数復号装置90には、フレーム単位のLSP符号Cfが連続して入力され、フレーム単位でLSP符号Cfを復号して復号予測対応LSPパラメータベクトル^Θf=(^θf[1],^θf[2],…,^θf[p])を得る。
ベクトル復号部91は、LSP符号Cfを受け取り、LSP符号Cfを復号して、LSP符号Cfに対応する復号差分ベクトル^Sfを得て出力する。LSP符号Cfの復号には、符号化装置のベクトル符号化部84の符号化方法に対応する復号方法を用いる。
ベクトル符号帳92には、各候補差分ベクトルとその各候補差分ベクトルに対応する差分ベクトル符号とが予め記憶されている。なお、ベクトル符号帳92は、前述の線形予測係数符号化装置80のベクトル符号帳86と共通の情報を含む。
遅延入力部93は、復号差分ベクトル^Sfを受け取り、復号差分ベクトル^Sfを保持し、1フレーム分遅らせて、前フレーム復号差分ベクトル^Sf-1として出力する。つまり、f番目のフレームの復号差分ベクトル^Sfに対して予測対応加算部95が処理を行っているときには、f-1番目のフレームの復号差分ベクトル^Sf-1を出力する。
予測対応加算部95は、例えば、所定の係数αを記憶した記憶部95c、予測対応平均ベクトルVを記憶した記憶部95d、乗算部94、加算部95a及び95bを含んで構成される。
線形予測係数が必要な場合には、線形予測係数復号装置90に復号予測対応線形予測係数計算部96を備えてもよい。この場合は、復号予測対応線形予測係数計算部96は、復号予測対応LSPパラメータベクトル^Θfを受け取り、復号予測対応LSPパラメータベクトル^Θfを復号予測対応線形予測係数^af[1],^af[2],…,^af[p]に変換して出力する。
以下、従来の線形予測係数符号化装置及び線形予測係数復号装置と異なる点を中心に説明する。
図3は第一実施形態に係る線形予測係数符号化装置の機能ブロック図を、図4はその処理フローの例を示す。
予測対応符号化部120は、予測対応減算部83とベクトル符号化部84とベクトル符号帳86と遅延入力部87とを含み、各部における処理は、従来技術で説明した内容と同じである。予測対応減算部83とベクトル符号化部84と遅延入力部87とにおける処理はそれぞれ、図4のs83~s87に対応する。ただし、ベクトル符号化部84は、量子化差分ベクトル^Sfを遅延入力部87だけでなく、非予測対応符号化部110にも出力する。
非予測対応符号化部110は、非予測対応減算部111と補正ベクトル符号化部112と補正ベクトル符号帳113とを含む。
非予測対応減算部111は、例えば、非予測対応平均ベクトルYを記憶した記憶部111c、加算部111a及び111bを含んで構成される。
補正ベクトル符号帳113には、各候補補正ベクトルとその各候補補正ベクトルに対応する補正ベクトル符号とが記憶されている。
補正ベクトル符号化部112は、補正ベクトルUfを受け取り、補正ベクトルUfを符号化して補正LSP符号Dfを得て(s112)出力する。例えば、補正ベクトル符号化部112は、補正ベクトル符号帳113に記憶されている複数の候補補正ベクトルの中から、補正ベクトルUfに最も近い候補補正ベクトルを探索し、その候補補正ベクトルに対応する補正ベクトル符号を補正LSP符号Dfとして出力する。なお、補正ベクトル符号化部112では実際には生成しないでよいが、補正ベクトルUfに最も近い候補補正ベクトルを量子化済補正ベクトル^Ufとして以下の説明を行う。
以下、従来と異なる点を中心に説明する。
予測対応復号部220は、従来技術の線形予測係数復号装置90と同様の構成であり、ベクトル符号帳92とベクトル復号部91と遅延入力部93と予測対応加算部95とを含み、必要に応じて復号予測対応線形予測係数計算部96も含む。ベクトル復号部91と遅延入力部93と予測対応加算部95と復号予測対応線形予測係数計算部96における処理は、それぞれ図6のs91~96に対応する。
非予測対応復号部210は、補正ベクトル符号帳212と補正ベクトル復号部211と非予測対応加算部213とを含み、必要に応じて復号非予測対応線形予測係数計算部214も含む。
補正ベクトル符号帳212は、線形予測係数符号化装置100内の補正ベクトル符号帳113と同じ内容の情報を記憶している。つまり、補正ベクトル符号帳212には、各候補補正ベクトルとその各候補補正ベクトルに対応する補正ベクトル符号とが記憶されている。
補正ベクトル復号部211は、補正LSP符号Dfを受け取り、補正LSP符号Dfを復号して復号補正ベクトル^Ufを得て(s211)出力する。例えば、補正ベクトル復号部211は、補正ベクトル符号帳212に記憶されている複数の補正ベクトル符号の中から、線形予測係数復号装置200に入力された補正LSP符号Dfに対応する補正ベクトル符号を探索し、探索された補正ベクトル符号に対応する候補補正ベクトルを復号補正ベクトル^Ufとして出力する。
非予測対応加算部213は、例えば、非予測対応平均ベクトルYを記憶した記憶部213c、加算部213a及び213bを含んで構成される。
復号非予測対応線形予測係数計算部214は、復号非予測対応LSPパラメータベクトル^Φfを受け取る。復号非予測対応線形予測係数計算部214は、復号非予測対応LSPパラメータベクトル^Φfを復号非予測対応線形予測係数^bf[1],^bf[2],…,^bf[p]に変換して(s214)出力する。
第一実施形態の線形予測係数復号装置によれば、f-1番目のフレームのLSP符号Cf-1に伝送誤りが生じて復号差分ベクトル^Sf-1が正しく復号できなかったとしても、非予測対応復号部210で復号差分ベクトル^Sf-1に依存しないLSPパラメータベクトルの復号値である復号非予測対応LSPパラメータベクトル^Φfを得られるので、f-1番目のフレームのLSP符号Cf-1の伝送誤りをf番目のフレームの復号非予測対応LSPパラメータベクトル^Φfに影響させなくすることができる。例えば、LSPパラメータベクトルから求まるスペクトル包絡を構成する各振幅値に依存した可変長符号化/復号に用いるLSPパラメータベクトルとして非予測対応量子化LSPパラメータベクトル/復号非予測対応LSPパラメータベクトル^Φfを用いる場合であれば、f-1番目のフレームで正しい復号非予測対応LSPパラメータベクトル^Φfが得られずに可変長復号を正しく行うことができなかったとしても、f番目のフレームでは正しい復号非予測対応LSPパラメータベクトル^Φfが得られ、可変長復号を正しく行うことができる。
本実施形態では、LSPパラメータについて記載しているが、複数次の線形予測係数に変換可能な係数であれば、他の係数を用いてもよい。PARCOR係数や、LSPパラメータまたはPARCOR係数を変形した係数、さらには、線形予測係数自体を対象としてもよい。これら全ての係数は、音声符号化の技術分野では、互いに変換可能なものであり、何れの係数を用いても第一実施形態の効果を得ることができる。なお、LSP符号CfまたはLSP符号Cfに対応する符号を第一符号ともいい、予測対応符号化部を第一符号化部ともいう。同様に、補正LSP符号または補正LSP符号に対応する符号を第二符号ともいい、非予測対応符号化部を第二符号化部ともいう。また、復号予測対応LSPパラメータベクトル^Θfまたは復号予測対応LSPパラメータベクトル^Θfに対応するベクトルを第一復号ベクトルともいい、予測対応復号部を第一復号部ともいう。また、復号非予測対応LSPパラメータベクトル^Φfまたは復号非予測対応LSPパラメータベクトル^Φfに対応するベクトルを第二復号ベクトルともいい、非予測対応復号部を第二復号部ともいう。
第一実施形態と異なる部分を中心に説明する。
<第二実施形態に係る線形予測係数符号化装置300>
図7は第二実施形態に係る線形予測係数符号化装置300の機能ブロック図を、図8はその処理フローの例を示す。
予測対応加算部314は、例えば、予測対応平均ベクトルVを記憶した記憶部314c、加算部314a及び314bを含んで構成される。
指標計算部315は、予測対応量子化LSPパラメータベクトル^Θfを受け取る。指標計算部315は、予測対応量子化LSPパラメータベクトル^Θfを用いて、予測対応量子化LSPパラメータベクトル^Θfに対応するスペクトル包絡の山谷の大きさに対応する指標Q、すなわち、スペクトル包絡の山谷が大きいほど大きくなる指標Q、及び/または、スペクトル包絡の山谷の小ささに対応する指標Q’、すなわち、スペクトル包絡の山谷が大きいほど小さくなる指標Q’、を計算する(s315)。指標計算部315は、指標Q及び/またはQ’の大きさに応じて、補正ベクトル符号化部312に符号化処理を実行するように、または、所定のビット数で符号化処理を実行するように制御信号Cを出力する。また、指標計算部315は、指標Q及び/またはQ’の大きさに応じて、非予測対応減算部311に減算処理を実行するように制御信号Cを出力する。以下に、制御信号Cの生成方法を説明する。
非予測対応減算部311は、制御信号CとLSPパラメータベクトルΘf=(θf[1],θf[2],…,θf[p])Tと量子化差分ベクトル^Sfとを受け取る。
補正ベクトル符号化部312は、制御信号Cと補正ベクトルUfを受け取る。補正符号化処理を実行することを示す制御信号Cや、正の整数(または正の整数を表す符号)を制御信号Cとして受け取った場合、要は、スペクトル包絡の山谷が所定の基準より大きい場合、すなわち上記の例では(A-1)および/または(B-1)の場合に、補正ベクトル符号化部312は、補正ベクトルUfを符号化して補正LSP符号Dfを得て(s312)出力する。補正ベクトルUfを符号化する符号化処理自体は、補正ベクトル符号化部112と同様である。
図9は第二実施形態に係る線形予測係数復号装置400の機能ブロック図を、図10はその処理フローの例を示す。
指標計算部415は、復号予測対応LSPパラメータベクトル^Θfを受け取り、復号予測対応LSPパラメータベクトル^Θf=(^θf[1],^θf[2],…,^θf[p])Tに対応するスペクトル包絡の山谷の大きさに対応する指標Q、すなわち、スペクトル包絡の山谷が大きいほど大きくなる指標Q、及び/または、スペクトル包絡の山谷の小ささに対応する指標Q’、すなわち、スペクトル包絡の山谷が大きいほど小さくなる指標Q’、を計算する(s415)。指標計算部415は、指標Q及び/またはQ’の大きさに応じて、補正ベクトル復号部411および非予測対応加算部413に補正復号処理を実行する/しないことを示す制御信号Cを、または、所定のビット数で補正復号処理を実行することを示す制御信号Cを出力する。指標Q及びQ’は、指標計算部315で説明したものと同様であり、予測対応量子化LSPパラメータベクトル^Θfの代わりに復号予測対応LSPパラメータベクトル^Θfを用いて、指標計算部315と同様の方法で計算すればよい。
補正ベクトル復号部411は、補正LSP符号Dfと制御信号Cとを受け取る。補正復号処理を実行することを示す制御信号Cや、正の整数(または正の整数を表す符号)を制御信号Cとして受け取った場合、要は、スペクトル包絡の山谷が所定の基準より大きい場合、すなわち上記の例では(A-1)および/または(B-1)の場合に、補正ベクトル符号帳212を参照して、補正LSP符号Dfを復号して復号補正ベクトル^Ufを得て(s411)出力する。補正LSP符号Dfを復号する復号処理自体は、補正ベクトル復号部211と同様である。
非予測対応加算部413は、例えば、非予測対応平均ベクトルYを記憶した記憶部413c、加算部413a及び413bを含んで構成される。
このような構成により、f-1番目のフレームのLSP符号Cf-1の伝送誤りをf番目のフレームの復号非予測対応LSPパラメータベクトル^Φfに影響させなくすることに加えて、スペクトル包絡の山谷が大きい場合には、非予測対応平均ベクトルYと復号差分ベクトル^Sfに補正LSP符号Dfを復号して得られる復号補正ベクトル^Ufを加えることで量子化誤差の少ない復号非予測対応LSPパラメータベクトル^Φfを得るとともに、スペクトル包絡の山谷が大きくない場合には、補正LSP符号Dfが不要である非予測対応平均ベクトルYと復号差分ベクトル^Sfを加えたものを復号非予測対応LSPパラメータベクトル^Φfとすることで、補正LSP符号Df分の符号量を少なくすることができる。すなわち、第一実施形態の符号化及び復号よりは全体として符号量を削減しつつ、従来技術よりは前のフレームの符号の伝送誤りによる音質劣化が少ない符号化及び復号処理を行うことができる。
第一実施形態の変形例で説明したように、LSPパラメータに代えて、線形予測係数に変換可能な係数であれば、他の係数を用いてもよい。PARCOR係数や、LSPパラメータやPARCOR係数の何れかを変形した係数、さらには、線形予測係数自体を対象としてもよい。以下、PARCOR係数kf[1],kf[2],…,kf[p]を用いた場合について説明する。
第二実施形態と異なる部分を中心に説明する。
図11は第三実施形態の線形予測係数符号化装置500の機能ブロック図を、図8はその処理フローの例を示す。
補正ベクトル符号化部512は、指標Qおよび/または指標Q’と補正ベクトルUfとを受け取る。補正ベクトル符号化部512は、(A-2)指標Qが大きいほど、および/または、(B-2)指標Q’が小さいほど、ビット数の多い(符号長が大きい)補正LSP符号Dfを得て(s512)出力する。例えば、所定の閾値Th2、および/または、所定の閾値Th2'を用いて、以下のように符号化を行う。なお、補正ベクトル符号化部512が符号化処理を実行するのは指標Qが所定の閾値Th1以上である場合、および/または、指標Q’が所定の閾値Th1'以下である場合であるので、Th2はTh1よりも大きな値であり、Th2'はTh1'よりも小さな値である。
図12は第三実施形態に係る線形予測係数復号装置600の機能ブロック図を、図10はその処理フローの例を示す。
補正ベクトル復号部611は、指標Qおよび/または指標Q’と補正LSP符号Dfとを受け取る。補正ベクトル復号部611は、(A-2)指標Qが大きいほど、および/または、(B-2)指標Q’が小さいほど、多くのビット数を有する補正LSP符号Dfを復号して、多くの候補補正ベクトルから復号補正ベクトル^Ufを得る(s611)。例えば、所定の閾値Th2、および/または、Th2'を用いて、以下のように復号を行う。なお、補正ベクトル復号部611が復号処理を実行するのは指標Qが所定の閾値Th1以上である場合、および/または、指標Q’が所定の閾値Th1'以下である場合であるので、Th2はTh1よりも大きな値であり、Th2'はTh1'よりも小さな値である。
このような構成により、第二実施形態と同様の効果を得ることができる。さらに、LSP符号の伝送誤りに起因する復号精度の低下の影響が大きさに応じて、近似精度を変更することで、第一実施形態の符号化及ぶ復号よりは全体として符号量を抑えつつ、第二実施形態符号化及ぶ復号よりも音質の良い符号化及び復号処理を行うことができる。
補正ベクトル符号帳の個数は、必ずしも2個でなくてもよく、3個以上であってもよい。補正ベクトル符号帳毎に異なるビット数(ビット長)の補正ベクトル符号が記憶されており、その補正ベクトル符号に対応する補正ベクトルが記憶されている。補正ベクトル符号帳の個数に応じて、閾値を設定すればよい。指標Qに対する閾値は、閾値の値が大きくなるほど、その閾値以上の場合に用いられる補正ベクトル符号帳に記憶される補正ベクトル符号のビット数が大きくなるように設定すればよい。同様に、指標Q’に対する閾値は、閾値の値が小さくなるほど、その閾値以下の場合に用いられる補正ベクトル符号帳に記憶される補正ベクトル符号のビット数が大きくなるように設定すればよい。このような構成とすることで、全体として符号量を抑えつつ、より精度の高い符号化及び復号処理を行うことができる。
第四実施形態にかかる符号化装置700は、第一実施形態の線形予測係数符号化装置100及び線形予測係数復号装置200を周波数領域での符号化方法であるTCX(transform coded excitation)符号化方法に応用したものである。
線形予測係数符号化装置100は、音響信号Xfを受け取り、LSP符号Cf及び補正LSP符号Dfを得て(s100)出力する。
線形予測係数復号装置200は、LSP符号Cfと補正LSP符号Dfとを受け取り、予測対応量子化線形予測係数^af[1],^af[2],…,^af[p]と非予測対応量子化線形予測係数^bf[1],^bf[2],…,^bf[p]とを得て(s200)出力する。
パワースペクトル包絡系列計算部710は、非予測量子化対応線形予測係数^bf[1],^bf[2],…,^bf[p]を受け取る。パワースペクトル包絡系列計算部710は、非予測対応量子化線形予測係数^bf[1],^bf[2],…,^bf[p]を用いて、N点の入力音響信号のパワースペクトル包絡系列Z[1],…,Z[N]を計算して(s710)出力する。例えば、パワースペクトル包絡系列の各値Z[n]は、次式で求めることができる。
第一平滑化パワースペクトル包絡系列計算部720Aは、予測対応量子化線形予測係数^af[1],^af[2],…,^af[p]を受け取る。第一平滑化パワースペクトル包絡系列計算部720Aは、予測対応量子化線形予測係数^af[1],^af[2],…,^af[p]と予め与えられた1以下の正の定数である補正係数γiを用いて、
第二平滑化パワースペクトル包絡系列計算部720Bは、非予測対応量子化線形予測係数^bf[1],^bf[2],…,^bf[p]を受け取る。第二平滑化パワースペクトル包絡系列計算部720Bは、非予測対応量子化線形予測係数^bf[1],^bf[2],…,^bf[p]と予め与えられた1以下の正の定数である補正係数γiを用いて、
周波数領域変換部730は、所定の時間区間であるフレーム単位で、入力された時間領域の入力音響信号Xfを周波数領域のN点のMDCT係数列X[1],…,X[N]に変換して(s730)出力する。ただし、Nは正整数である。
包絡正規化部740は、MDCT係数列X[1],…,X[N]と第一平滑化済パワースペクトル包絡系列~W[1],~W[2],…,~W[N]とを受け取り、MDCT係数列X[1],…,X[N]の各係数X[i]を第一平滑化済パワースペクトル包絡系列~W[1],~W[2],…,~W[N]の各値~W[i]の平方根で正規化した系列である、正規化済MDCT係数列XN[1],…,XN[N]を求めて(s740)出力する。つまり、
XN[i]=X[i]/sqrt(~W[i])
である。ただし、sqrt(・)は1/2乗を示す記号である。
可変長符号化パラメータ計算部750は、パワースペクトル包絡系列Z[1],…,Z[N]と第二平滑化パワースペクトル包絡系列~Z[1],…,~Z[N]とMDCT係数列X[1],…,X[N]と正規化済MDCT係数列XN[1],…,XN[N]とを受け取る。これらの値を用いて、正規化済MDCT係数列XN[1],…,XN[N]を可変長符号化するためのパラメータである可変長符号化パラメータriを計算して(s750)出力する。可変長符号化パラメータriは、符号化対象の正規化済MDCT係数列XN[1],…,XN[N]の振幅が取り得る範囲を特定するパラメータである。ライス符号化の場合にはライスパラメータが可変長符号化パラメータに相当し、算術符号化の場合は符号化対象の振幅の取りうる範囲が可変長符号化パラメータに相当する。
可変長符号化部760は、可変長符号化パラメータriを受け取り、この値を用いて正規化済係数列XN(1),…,XN(N)を可変長符号化し、可変長符号CXを出力する(s760)。
第四実施形態は、MDCT係数列X[1],X[2],…,X[N]を平滑化パワースペクトル包絡系列で正規化して得られる正規化済MDCT係数列XN[1],…,XN[N]を、可変長符号化パラメータを用いて符号化する構成である。
以上の第一~第四実施形態において、図3の線形予測係数符号化装置100の非予測対応符号化部110、図7の線形予測係数符号化装置300の非予測対応符号化部310、図11の線形予測係数符号化装置500の非予測対応符号化部510で行われる処理(非予測対応符号化処理)を実行する対象を、予測次数p未満の所定の次数TL以下のLSPパラメータ(低次のLSPパラメータ)のみとしてもよく、復号側でもこれらに対応する処理を行ってもよい。
非予測対応減算部111、311は、入力されたLSPパラメータベクトルΘf=(θf[1],θf[2],…,θf[p])TのうちのTL次以下のLSPパラメータからなる低次LSPパラメータベクトルΘ’f=(θf[1],θf[2],…,θf[TL])Tから、記憶部111cに記憶された非予測対応低次平均ベクトルY’=(y[1],y[2],…,y[TL])Tと、入力された量子化差分ベクトル^Sf=(^sf[1],^sf[2],…,^sf[p])TのうちのTL次以下の要素からなる低次量子化差分ベクトル^S’f=(^sf[1],^sf[2],…,^sf[TL])Tと、を減算したベクトルである低次補正ベクトルU’f=Θ’f-Y’-^S’fを生成して出力する。すなわち、非予測対応減算部111、311は、補正ベクトルUfの要素の一部からなるベクトルである低次補正ベクトルU’fを生成して出力する。
補正ベクトル符号化部112、312及び512は、補正ベクトルUfの要素の一部からなるベクトルである低次補正ベクトルU’fを補正ベクトル符号帳113、513A、513Bを参照して符号化する。補正ベクトル符号帳113、513A、513Bに記憶しておく各候補補正ベクトルはTL次のベクトルとしておけばよい。
補正ベクトル復号部211,411、611は、補正LSP符号Dfを受け取り、補正ベクトル符号帳212、612A、612Bを参照して、補正LSP符号Dfを復号して復号低次補正ベクトル^U’fを得て出力する。復号低次補正ベクトル^U’f=(uf[1],uf[2],…,uf[TL])TはTL次のベクトルである。補正ベクトル符号帳212、612A、612Bに記憶しておく各候補補正ベクトルは、補正ベクトル符号帳113、513A、513Bと同様に、TL次のベクトルとしておけばよい。
非予測対応加算部213は、復号低次補正ベクトル^U’f=(uf[1],uf[2],…,uf[TL])Tと非予測対応平均ベクトルY=(y[1],y[2],…,y[p])Tと復号差分ベクトル^Sf=(^sf[1],^sf[2],…,^sf[p])Tとを受け取る。
非予測対応加算部413は、復号低次補正ベクトル^U’f=(uf[1],uf[2],…,uf[TL])Tと非予測対応平均ベクトルY=(y[1],y[2],…,y[p])Tと復号差分ベクトル^Sf=(^sf[1],^sf[2],…,^sf[p])Tとを受け取る。
第一~第四実施形態では、LSP計算部の入力を線形予測係数af[1],af[2],…,af[p]としていたが、例えば、線形予測係数の各係数af[i]にγのi乗を乗じた係数の系列af[1]×γ,af[2]×γ2,…,af[p]×γpをLSP計算部の入力としてもよい。
本発明は上記の実施形態及び変形例に限定されるものではない。例えば、上述の各種の処理は、記載に従って時系列に実行されるのみならず、処理を実行する装置の処理能力あるいは必要に応じて並列的にあるいは個別に実行されてもよい。その他、本発明の趣旨を逸脱しない範囲で適宜変更が可能である。
また、上記の実施形態及び変形例で説明した各装置における各種の処理機能をコンピュータによって実現してもよい。その場合、各装置が有すべき機能の処理内容はプログラムによって記述される。そして、このプログラムをコンピュータで実行することにより、上記各装置における各種の処理機能がコンピュータ上で実現される。
Claims (20)
- 現在のフレームの複数次の線形予測係数に変換可能な係数によるベクトルについての、少なくとも過去のフレームからの予測を含む予測ベクトルとの差分からなる差分ベクトルを符号化して第一符号を得て、前記第一符号に対応する量子化差分ベクトルを得る予測対応符号化部と、
現在のフレームの複数次の線形予測係数に変換可能な係数によるベクトルと、前記量子化差分ベクトルとの差分または差分の要素の一部からなる補正ベクトルを符号化して第二符号を生成する非予測対応符号化部とを含む、
符号化装置。 - 現在のフレームの複数次の線形予測係数に変換可能な係数によるベクトルと、少なくとも過去のフレームからの予測と予め定めたベクトルとからなる予測ベクトルとの差分からなる差分ベクトルを符号化して第一符号を得て、前記第一符号に対応する量子化差分ベクトルを得る予測対応符号化部と、
現在のフレームの複数次の線形予測係数に変換可能な係数によるベクトルから前記量子化差分ベクトルと予め定めたベクトルとを減算して得られる差分または差分の要素の一部からなる補正ベクトルを符号化して第二符号を生成する非予測対応符号化部とを含む、
符号化装置。 - 請求項1または2の符号化装置であって、
αを正の定数とし、前記予測ベクトルは、予め定めた予測対応平均ベクトルと、過去のフレームの量子化差分ベクトルのα倍とが加算されたベクトルであり、
前記補正ベクトルは、現在のフレームの複数次の線形予測係数に変換可能な係数によるベクトルから、前記量子化差分ベクトルと予め定めた非予測対応平均ベクトルと、を減算して得られるベクトルまたはその要素の一部からなるベクトルである、
符号化装置。 - 請求項1から3の何れかの符号化装置であって、
前記非予測対応符号化部は、(A-1)前記線形予測係数に変換可能な係数の列に対応するスペクトル包絡の山谷の大きさに対応する指標Qが所定の閾値Th1以上である場合、および/または、(B-1)前記スペクトル包絡の山谷の小ささに対応する指標Q’が所定の閾値Th1’以下である場合に、前記補正ベクトルを符号化して前記第二符号を得る、
符号化装置。 - 請求項4の符号化装置であって、
前記複数次の線形予測係数に変換可能な係数によるベクトルは、線スペクトル対のパラメータ列からなるベクトルであり、
前記指標Q’は、前記第一符号に対応する全次または低次の量子化済みの線スペクトル対のパラメータ列の隣り合うパラメータ間の差分と、最低次の量子化済みの線スペクトル対のパラメータと、のうちの最小値である、
符号化装置。 - 請求項4の符号化装置であって、
前記複数次の線形予測係数に変換可能な係数によるベクトルは、線スペクトル対のパラメータ列からなるベクトルであり、
前記指標Q’は、前記第一符号に対応する全次または低次の量子化済みの線スペクトル対のパラメータ列の隣り合うパラメータ間の差分の最小値である、
符号化装置。 - 第一符号を復号して復号差分ベクトルを得て、前記復号差分ベクトルと、少なくとも過去のフレームからの予測を含む予測ベクトルとを加算して、現在のフレームの複数次の線形予測係数に変換可能な係数の復号値からなる第一復号ベクトルを生成する予測対応復号部と、
第二符号を復号して復号補正ベクトルを得て、前記復号補正ベクトルと、少なくとも前記復号差分ベクトルとの対応する次数の要素同士を加算して、現在のフレームの複数次の線形予測係数に変換可能な係数の復号値からなる第二復号ベクトルを生成する非予測対応復号部とを含む、
復号装置。 - 第一符号を復号して復号差分ベクトルを得て、前記復号差分ベクトルと、少なくとも過去のフレームからの予測と予め定めたベクトルとからなる予測ベクトルとを加算して、現在のフレームの複数次の線形予測係数に変換可能な係数の復号値からなる第一復号ベクトルを生成する予測対応復号部と、
第二符号を復号して復号補正ベクトルを得て、前記復号補正ベクトルに、少なくとも前記復号差分ベクトルと予め定めたベクトルとを対応する次数の要素毎に加算して、現在のフレームの複数次の線形予測係数に変換可能な係数の復号値からなる第二復号ベクトルを生成する非予測対応復号部とを含む、
復号装置。 - 請求項7または8の復号装置であって、
αを正の定数とし、前記予測ベクトルは、予め定めた予測対応平均ベクトルと、過去のフレームの復号差分ベクトルのα倍とを加算して得られるベクトルであり、
前記第二復号ベクトルは、前記復号補正ベクトルと、前記復号差分ベクトルと、予め定めた非予測対応平均ベクトルと、の対応する次数の要素同士を加算して得られるベクトルである、
復号装置。 - 請求項7から9の何れかの復号装置であって、
前記第二符号を復号して得られる復号補正ベクトルの要素数TLは、前記予測対応復号部の前記各ベクトルの要素数p未満であり、
前記非予測対応復号部が生成する第二復号ベクトルの要素数はpである
復号装置。 - 請求項7から10の何れかの復号装置であって、
前記非予測対応復号部は、(A)前記線形予測係数に変換可能な係数の列に対応するスペクトル包絡の山谷の大きさに対応する指標Qが所定の閾値Th1以上である場合、および/または、(B)前記スペクトル包絡の山谷の小ささに対応する指標Q’が所定の閾値Th1’以下である場合、前記第二復号ベクトルを生成する、
復号装置。 - 請求項11の復号装置であって、
前記複数次の線形予測係数に変換可能な係数は、線スペクトル対のパラメータであり、前記指標Q’は、前記第一復号ベクトルである復号された線スペクトル対のパラメータの列の隣り合うパラメータ間の差分と、最低次の復号された線スペクトル対のパラメータと、のうちの最小値である、
復号装置。 - 請求項11の復号装置であって、
前記複数次の線形予測係数に変換可能な係数は、線スペクトル対のパラメータであり、前記指標Q’は、前記第一復号ベクトルである復号された線スペクトル対のパラメータの列の隣り合うパラメータ間の差分の最小値である、
復号装置。 - 現在のフレームの複数次の線形予測係数に変換可能な係数によるベクトルについての、少なくとも過去のフレームからの予測を含む予測ベクトルとの差分からなる差分ベクトルを符号化して第一符号を得て、前記第一符号に対応する量子化差分ベクトルを得る予測対応符号化ステップと、
現在のフレームの複数次の線形予測係数に変換可能な係数によるベクトルと、前記量子化差分ベクトルとの差分または差分の要素の一部からなる補正ベクトルを符号化して第二符号を生成する非予測対応符号化ステップとを含む、
符号化方法。 - 現在のフレームの複数次の線形予測係数に変換可能な係数によるベクトルと、少なくとも過去のフレームからの予測と予め定めたベクトルとからなる予測ベクトルとの差分からなる差分ベクトルを符号化して第一符号を得て、前記第一符号に対応する量子化差分ベクトルを得る予測対応符号化ステップと、
現在のフレームの複数次の線形予測係数に変換可能な係数によるベクトルから前記量子化差分ベクトルと予め定めたベクトルとを減算して得られる差分または差分の要素の一部からなる補正ベクトルを符号化して第二符号を生成する非予測対応符号化ステップとを含む、
符号化方法。 - 第一符号を復号して復号差分ベクトルを得て、前記復号差分ベクトルと、少なくとも過去のフレームからの予測を含む予測ベクトルとを加算して、現在のフレームの複数次の線形予測係数に変換可能な係数の復号値からなる第一復号ベクトルを生成する予測対応復号ステップと、
第二符号を復号して復号補正ベクトルを得て、前記復号補正ベクトルと、少なくとも前記復号差分ベクトルとの対応する次数の要素同士を加算して、現在のフレームの複数次の線形予測係数に変換可能な係数の復号値からなる第二復号ベクトルを生成する非予測対応復号ステップとを含む、
復号方法。 - 第一符号を復号して復号差分ベクトルを得て、前記復号差分ベクトルと、少なくとも過去のフレームからの予測と予め定めたベクトルとからなる予測ベクトルとを加算して、現在のフレームの複数次の線形予測係数に変換可能な係数の復号値からなる第一復号ベクトルを生成する予測対応復号ステップと、
第二符号を復号して復号補正ベクトルを得て、前記復号補正ベクトルに、少なくとも前記復号差分ベクトルと予め定めたベクトルとを対応する次数の要素毎に加算して、現在のフレームの複数次の線形予測係数に変換可能な係数の復号値からなる第二復号ベクトルを生成する非予測対応復号ステップとを含む、
復号方法。 - 請求項16または17の復号方法であって、
前記第二符号を復号して得られる復号補正ベクトルの要素数TLは、前記予測対応復号ステップの前記各ベクトルの要素数p未満であり、
前記非予測対応復号ステップで生成される第二復号ベクトルの要素数はpである
復号方法。 - 請求項1から請求項6の何れかの符号化装置としてコンピュータを機能させるためのプログラム。
- 請求項7から請求項13の何れかの復号装置としてコンピュータを機能させるためのプログラム。
Priority Applications (24)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP15786812.6A EP3139382B1 (en) | 2014-05-01 | 2015-03-16 | Sound signal coding device, sound signal coding method, program and recording medium |
CN201580022683.3A CN106415715B (zh) | 2014-05-01 | 2015-03-16 | 编码装置、编码方法、记录介质 |
EP20167742.4A EP3706121B1 (en) | 2014-05-01 | 2015-03-16 | Sound signal coding device, sound signal coding method, program and recording medium |
ES15786812T ES2744904T3 (es) | 2014-05-01 | 2015-03-16 | Dispositivo de codificación de señal de sonido, método de codificación de señal de sonido, programa y medio de grabación |
PL20167742T PL3706121T3 (pl) | 2014-05-01 | 2015-03-16 | Urządzenie kodujące sygnał dźwiękowy, sposób kodowania sygnału dźwiękowego, program i nośnik rejestrujący |
KR1020187012383A KR101870947B1 (ko) | 2014-05-01 | 2015-03-16 | 부호화 장치, 복호 장치 및 그 방법, 프로그램, 기록 매체 |
EP19174056.2A EP3544004B1 (en) | 2014-05-01 | 2015-03-16 | Sound signal decoding device, sound signal decoding method, program and recording medium |
PL21158838T PL3859734T3 (pl) | 2014-05-01 | 2015-03-16 | Urządzenie dekodujące sygnał dźwiękowy, sposób dekodowania sygnału dźwiękowego, program i nośnik rejestrujący |
EP21158838.9A EP3859734B1 (en) | 2014-05-01 | 2015-03-16 | Sound signal decoding device, sound signal decoding method, program and recording medium |
JP2016515896A JP6270993B2 (ja) | 2014-05-01 | 2015-03-16 | 符号化装置、及びその方法、プログラム、記録媒体 |
KR1020167030130A KR101855945B1 (ko) | 2014-05-01 | 2015-03-16 | 부호화 장치, 복호 장치 및 그 방법, 프로그램, 기록 매체 |
CN201910644499.2A CN110444217B (zh) | 2014-05-01 | 2015-03-16 | 解码装置、解码方法、记录介质 |
CN201910644410.2A CN110444216B (zh) | 2014-05-01 | 2015-03-16 | 解码装置、解码方法、记录介质 |
KR1020187012387A KR101870962B1 (ko) | 2014-05-01 | 2015-03-16 | 부호화 장치, 복호 장치 및 그 방법, 프로그램, 기록 매체 |
KR1020187012384A KR101870957B1 (ko) | 2014-05-01 | 2015-03-16 | 부호화 장치, 복호 장치 및 그 방법, 프로그램, 기록 매체 |
PL15786812T PL3139382T3 (pl) | 2014-05-01 | 2015-03-16 | Urządzenie kodujące sygnał dźwiękowy, sposób kodowania sygnału dźwiękowego, program i nośnik rejestrujący |
US15/307,059 US10418042B2 (en) | 2014-05-01 | 2015-03-16 | Coding device, decoding device, method, program and recording medium thereof |
PL19174056T PL3544004T3 (pl) | 2014-05-01 | 2015-03-16 | Urządzenie dekodujące sygnał dźwiękowy, sposób dekodowania sygnału dźwiękowego, program i nośnik rejestrujący |
CN201910644404.7A CN110444215B (zh) | 2014-05-01 | 2015-03-16 | 编码装置、编码方法、记录介质 |
US16/527,160 US11120809B2 (en) | 2014-05-01 | 2019-07-31 | Coding device, decoding device, and method and program thereof |
US17/369,056 US11670313B2 (en) | 2014-05-01 | 2021-07-07 | Coding device, decoding device, and method and program thereof |
US17/370,060 US11694702B2 (en) | 2014-05-01 | 2021-07-08 | Coding device, decoding device, and method and program thereof |
US18/195,015 US12051430B2 (en) | 2014-05-01 | 2023-05-09 | Coding device, decoding device, and method and program thereof |
US18/743,662 US20240339119A1 (en) | 2014-05-01 | 2024-06-14 | Coding device, decoding device, and method and program thereof |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2014094758 | 2014-05-01 | ||
JP2014-094758 | 2014-05-01 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/307,059 A-371-Of-International US10418042B2 (en) | 2014-05-01 | 2015-03-16 | Coding device, decoding device, method, program and recording medium thereof |
US16/527,160 Continuation US11120809B2 (en) | 2014-05-01 | 2019-07-31 | Coding device, decoding device, and method and program thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2015166733A1 true WO2015166733A1 (ja) | 2015-11-05 |
Family
ID=54358473
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2015/057727 WO2015166733A1 (ja) | 2014-05-01 | 2015-03-16 | 符号化装置、復号装置、及びその方法、プログラム |
Country Status (8)
Country | Link |
---|---|
US (6) | US10418042B2 (ja) |
EP (4) | EP3859734B1 (ja) |
JP (4) | JP6270993B2 (ja) |
KR (4) | KR101855945B1 (ja) |
CN (4) | CN110444217B (ja) |
ES (4) | ES2876184T3 (ja) |
PL (4) | PL3139382T3 (ja) |
WO (1) | WO2015166733A1 (ja) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110444217B (zh) | 2014-05-01 | 2022-10-21 | 日本电信电话株式会社 | 解码装置、解码方法、记录介质 |
US11789729B2 (en) | 2017-12-29 | 2023-10-17 | Intel Corporation | Systems and methods for computing dot products of nibbles in two tile operands |
US11669326B2 (en) | 2017-12-29 | 2023-06-06 | Intel Corporation | Systems, methods, and apparatuses for dot product operations |
US11809869B2 (en) | 2017-12-29 | 2023-11-07 | Intel Corporation | Systems and methods to store a tile register pair to memory |
US11816483B2 (en) | 2017-12-29 | 2023-11-14 | Intel Corporation | Systems, methods, and apparatuses for matrix operations |
US11093247B2 (en) | 2017-12-29 | 2021-08-17 | Intel Corporation | Systems and methods to load a tile register pair |
US11023235B2 (en) * | 2017-12-29 | 2021-06-01 | Intel Corporation | Systems and methods to zero a tile register pair |
CN109688409B (zh) * | 2018-12-28 | 2021-03-02 | 北京奇艺世纪科技有限公司 | 一种视频编码方法及装置 |
US11281470B2 (en) * | 2019-12-19 | 2022-03-22 | Advanced Micro Devices, Inc. | Argmax use for machine learning |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002123298A (ja) * | 2000-10-18 | 2002-04-26 | Nippon Telegr & Teleph Corp <Ntt> | 信号符号化方法、装置及び信号符号化プログラムを記録した記録媒体 |
JP2002366195A (ja) * | 2001-06-04 | 2002-12-20 | Yrp Kokino Idotai Tsushin Kenkyusho:Kk | 音声符号化パラメータ符号化方法及び装置 |
JP2009139505A (ja) * | 2007-12-04 | 2009-06-25 | Nippon Telegr & Teleph Corp <Ntt> | 符号化方法、復号化方法、これらの方法を用いた装置、プログラム、記録媒体 |
JP2009210645A (ja) * | 2008-02-29 | 2009-09-17 | Nippon Telegr & Teleph Corp <Ntt> | 符号化装置、復号化装置、符号化方法、復号化方法、プログラム、記録媒体 |
JP2010145593A (ja) * | 2008-12-17 | 2010-07-01 | Sony Corp | 情報符号化装置 |
Family Cites Families (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5396576A (en) * | 1991-05-22 | 1995-03-07 | Nippon Telegraph And Telephone Corporation | Speech coding and decoding methods using adaptive and random code books |
JP3255189B2 (ja) * | 1992-12-01 | 2002-02-12 | 日本電信電話株式会社 | 音声パラメータの符号化方法および復号方法 |
CA2154911C (en) * | 1994-08-02 | 2001-01-02 | Kazunori Ozawa | Speech coding device |
TW408298B (en) * | 1997-08-28 | 2000-10-11 | Texas Instruments Inc | Improved method for switched-predictive quantization |
CA2722196C (en) * | 1997-12-24 | 2014-10-21 | Mitsubishi Denki Kabushiki Kaisha | A method for speech coding, method for speech decoding and their apparatuses |
JP3478209B2 (ja) * | 1999-11-01 | 2003-12-15 | 日本電気株式会社 | 音声信号復号方法及び装置と音声信号符号化復号方法及び装置と記録媒体 |
US7167828B2 (en) * | 2000-01-11 | 2007-01-23 | Matsushita Electric Industrial Co., Ltd. | Multimode speech coding apparatus and decoding apparatus |
US6757654B1 (en) * | 2000-05-11 | 2004-06-29 | Telefonaktiebolaget Lm Ericsson | Forward error correction in speech coding |
JP2002202799A (ja) * | 2000-10-30 | 2002-07-19 | Fujitsu Ltd | 音声符号変換装置 |
KR100487719B1 (ko) * | 2003-03-05 | 2005-05-04 | 한국전자통신연구원 | 광대역 음성 부호화를 위한 엘에스에프 계수 벡터 양자화기 |
CN101494460B (zh) * | 2003-09-02 | 2012-07-11 | 日本电信电话株式会社 | 浮点信号的编码方法、解码方法及其编码器、解码器 |
EP1755109B1 (en) * | 2004-04-27 | 2012-08-15 | Panasonic Corporation | Scalable encoding and decoding apparatuses and methods |
ATE394774T1 (de) * | 2004-05-19 | 2008-05-15 | Matsushita Electric Ind Co Ltd | Kodierungs-, dekodierungsvorrichtung und methode dafür |
CN101794579A (zh) * | 2005-01-12 | 2010-08-04 | 日本电信电话株式会社 | 长期预测编码和长期预测解码的方法和装置 |
RU2008112137A (ru) * | 2005-09-30 | 2009-11-10 | Панасоник Корпорэйшн (Jp) | Устройство кодирования речи и способ кодирования речи |
US20090248404A1 (en) * | 2006-07-12 | 2009-10-01 | Panasonic Corporation | Lost frame compensating method, audio encoding apparatus and audio decoding apparatus |
RU2462769C2 (ru) * | 2006-10-24 | 2012-09-27 | Войсэйдж Корпорейшн | Способ и устройство кодирования кадров перехода в речевых сигналах |
US7813922B2 (en) * | 2007-01-30 | 2010-10-12 | Nokia Corporation | Audio quantization |
US8719012B2 (en) * | 2007-06-15 | 2014-05-06 | Orange | Methods and apparatus for coding digital audio signals using a filtered quantizing noise |
WO2009075326A1 (ja) * | 2007-12-11 | 2009-06-18 | Nippon Telegraph And Telephone Corporation | 符号化方法、復号化方法、これらの方法を用いた装置、プログラム、記録媒体 |
ES2409843T3 (es) * | 2008-01-24 | 2013-06-28 | Nippon Telegraph And Telephone Corporation | Método de codificación, método de decodificación, y dispositivo y programa para los mismos, y medio o soporte de registro o grabación |
WO2010041680A1 (ja) * | 2008-10-10 | 2010-04-15 | 日本電信電話株式会社 | 符号化方法、符号化装置、復号方法、復号装置、プログラム及び記録媒体 |
JP4848049B2 (ja) * | 2008-12-09 | 2011-12-28 | 日本電信電話株式会社 | 符号化方法、復号方法、それらの装置、プログラム及び記録媒体 |
JP5253518B2 (ja) * | 2008-12-22 | 2013-07-31 | 日本電信電話株式会社 | 符号化方法、復号方法、それらの装置、プログラム及び記録媒体 |
CN101521013B (zh) * | 2009-04-08 | 2011-08-17 | 武汉大学 | 空间音频参数双向帧间预测编解码装置 |
WO2010140546A1 (ja) * | 2009-06-03 | 2010-12-09 | 日本電信電話株式会社 | 符号化方法、復号化方法、符号化装置、復号化装置、符号化プログラム、復号化プログラム及びこれらの記録媒体 |
GB0917417D0 (en) * | 2009-10-05 | 2009-11-18 | Mitsubishi Elec R&D Ct Europe | Multimedia signature coding and decoding |
KR101397058B1 (ko) * | 2009-11-12 | 2014-05-20 | 엘지전자 주식회사 | 신호 처리 방법 및 이의 장치 |
CN102714040A (zh) * | 2010-01-14 | 2012-10-03 | 松下电器产业株式会社 | 编码装置、解码装置、频谱变动量计算方法和频谱振幅调整方法 |
JP5813094B2 (ja) * | 2010-04-09 | 2015-11-17 | ドルビー・インターナショナル・アーベー | Mdctベース複素予測ステレオ符号化 |
ES2704742T3 (es) * | 2011-04-05 | 2019-03-19 | Nippon Telegraph & Telephone | Descodificación de una señal acústica |
JP6160072B2 (ja) * | 2012-12-06 | 2017-07-12 | 富士通株式会社 | オーディオ信号符号化装置および方法、オーディオ信号伝送システムおよび方法、オーディオ信号復号装置 |
US9842598B2 (en) * | 2013-02-21 | 2017-12-12 | Qualcomm Incorporated | Systems and methods for mitigating potential frame instability |
JP6366706B2 (ja) * | 2013-10-18 | 2018-08-01 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | スピーチ関連のスペクトル整形情報を使用したオーディオ信号符号化と復号化の概念 |
FR3013496A1 (fr) * | 2013-11-15 | 2015-05-22 | Orange | Transition d'un codage/decodage par transformee vers un codage/decodage predictif |
AU2014391078B2 (en) * | 2014-04-17 | 2020-03-26 | Voiceage Evs Llc | Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates |
CN110444217B (zh) * | 2014-05-01 | 2022-10-21 | 日本电信电话株式会社 | 解码装置、解码方法、记录介质 |
US9747910B2 (en) * | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
-
2015
- 2015-03-16 CN CN201910644499.2A patent/CN110444217B/zh active Active
- 2015-03-16 KR KR1020167030130A patent/KR101855945B1/ko active IP Right Grant
- 2015-03-16 PL PL15786812T patent/PL3139382T3/pl unknown
- 2015-03-16 CN CN201910644404.7A patent/CN110444215B/zh active Active
- 2015-03-16 PL PL21158838T patent/PL3859734T3/pl unknown
- 2015-03-16 WO PCT/JP2015/057727 patent/WO2015166733A1/ja active Application Filing
- 2015-03-16 PL PL19174056T patent/PL3544004T3/pl unknown
- 2015-03-16 EP EP21158838.9A patent/EP3859734B1/en active Active
- 2015-03-16 PL PL20167742T patent/PL3706121T3/pl unknown
- 2015-03-16 ES ES20167742T patent/ES2876184T3/es active Active
- 2015-03-16 CN CN201910644410.2A patent/CN110444216B/zh active Active
- 2015-03-16 JP JP2016515896A patent/JP6270993B2/ja active Active
- 2015-03-16 ES ES21158838T patent/ES2911527T3/es active Active
- 2015-03-16 CN CN201580022683.3A patent/CN106415715B/zh active Active
- 2015-03-16 KR KR1020187012384A patent/KR101870957B1/ko active IP Right Grant
- 2015-03-16 EP EP20167742.4A patent/EP3706121B1/en active Active
- 2015-03-16 EP EP19174056.2A patent/EP3544004B1/en active Active
- 2015-03-16 ES ES19174056T patent/ES2822127T3/es active Active
- 2015-03-16 KR KR1020187012387A patent/KR101870962B1/ko active IP Right Grant
- 2015-03-16 US US15/307,059 patent/US10418042B2/en active Active
- 2015-03-16 EP EP15786812.6A patent/EP3139382B1/en active Active
- 2015-03-16 KR KR1020187012383A patent/KR101870947B1/ko active IP Right Grant
- 2015-03-16 ES ES15786812T patent/ES2744904T3/es active Active
-
2017
- 2017-12-25 JP JP2017247954A patent/JP6462104B2/ja active Active
-
2018
- 2018-01-26 JP JP2018011829A patent/JP6490846B2/ja active Active
- 2018-01-26 JP JP2018011828A patent/JP6484358B2/ja active Active
-
2019
- 2019-07-31 US US16/527,160 patent/US11120809B2/en active Active
-
2021
- 2021-07-07 US US17/369,056 patent/US11670313B2/en active Active
- 2021-07-08 US US17/370,060 patent/US11694702B2/en active Active
-
2023
- 2023-05-09 US US18/195,015 patent/US12051430B2/en active Active
-
2024
- 2024-06-14 US US18/743,662 patent/US20240339119A1/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002123298A (ja) * | 2000-10-18 | 2002-04-26 | Nippon Telegr & Teleph Corp <Ntt> | 信号符号化方法、装置及び信号符号化プログラムを記録した記録媒体 |
JP2002366195A (ja) * | 2001-06-04 | 2002-12-20 | Yrp Kokino Idotai Tsushin Kenkyusho:Kk | 音声符号化パラメータ符号化方法及び装置 |
JP2009139505A (ja) * | 2007-12-04 | 2009-06-25 | Nippon Telegr & Teleph Corp <Ntt> | 符号化方法、復号化方法、これらの方法を用いた装置、プログラム、記録媒体 |
JP2009210645A (ja) * | 2008-02-29 | 2009-09-17 | Nippon Telegr & Teleph Corp <Ntt> | 符号化装置、復号化装置、符号化方法、復号化方法、プログラム、記録媒体 |
JP2010145593A (ja) * | 2008-12-17 | 2010-07-01 | Sony Corp | 情報符号化装置 |
Non-Patent Citations (1)
Title |
---|
"Recommendation ITU-T G.729", June 2012 (2012-06-01), pages 1 - 8, XP055234042, Retrieved from the Internet <URL:https://www.itu.int/rec/T-REC-G.729/en> * |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6484358B2 (ja) | 符号化装置、及びその方法、プログラム、記録媒体 | |
JP6495492B2 (ja) | 復号装置、及びその方法、プログラム、記録媒体 | |
WO2013129439A1 (ja) | 符号化装置、この方法、プログラム及び記録媒体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15786812 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2016515896 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 20167030130 Country of ref document: KR Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 15307059 Country of ref document: US |
|
REEP | Request for entry into the european phase |
Ref document number: 2015786812 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2015786812 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |