EP1697931A1 - Vorrichtung und verfahren zum ermitteln eines sch tzwerts - Google Patents
Vorrichtung und verfahren zum ermitteln eines sch tzwertsInfo
- Publication number
- EP1697931A1 EP1697931A1 EP05707481A EP05707481A EP1697931A1 EP 1697931 A1 EP1697931 A1 EP 1697931A1 EP 05707481 A EP05707481 A EP 05707481A EP 05707481 A EP05707481 A EP 05707481A EP 1697931 A1 EP1697931 A1 EP 1697931A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- energy
- measure
- band
- signal
- distribution
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 20
- 238000009826 distribution Methods 0.000 claims abstract description 36
- 230000003595 spectral effect Effects 0.000 claims description 54
- 238000004590 computer program Methods 0.000 claims description 6
- 238000009827 uniform distribution Methods 0.000 claims description 4
- 239000000654 additive Substances 0.000 claims description 3
- 230000000996 additive effect Effects 0.000 claims description 3
- 230000005236 sound signal Effects 0.000 abstract description 9
- 238000004364 calculation method Methods 0.000 description 23
- 238000013139 quantization Methods 0.000 description 20
- 238000010586 diagram Methods 0.000 description 7
- 238000012545 processing Methods 0.000 description 7
- 238000007493 shaping process Methods 0.000 description 5
- 230000001052 transient effect Effects 0.000 description 5
- 230000008569 process Effects 0.000 description 4
- 238000001914 filtration Methods 0.000 description 3
- 230000000873 masking effect Effects 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000010183 spectrum analysis Methods 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
- G10L19/025—Detection of transients or attacks for time/frequency resolution switching
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
Definitions
- the present invention relates to encoders for encoding a signal comprising audio and / or video information, and more particularly to the estimation of a need for information units to encode that signal.
- an audio signal to be coded is fed. This is first supplied to a scaling stage 1002 in which a so-called AAC gain control is performed to set the level of the audio signal. Scaling page information is provided to a bitstream formatter 1004, as indicated by the arrow between block 1002 and block 1004. The scaled audio signal is then fed to an MDCT filter bank 1006.
- the filter bank implements a modified discrete cosine transform with 50% overlapping windows, the window length being determined by a block 1008.
- block 1008 is for windowing transient signals with shorter windows, and for windowing stationary signals with longer windows. This serves to achieve a higher time resolution (at the expense of frequency resolution) due to the shorter windows for transient signals, while for more stationary signals a higher frequency resolution (at the expense of time resolution) is achieved by longer windows is achieved, with longer windows tend to be preferred because they promise a larger Codier stand.
- temporally successive blocks of spectral values are present, which, depending on the embodiment of the filter bank, may be MDCT coefficients, Fourier coefficients or even subband signals, each subband signal having a certain limited bandwidth passing through the corresponding subband channel in the filter bank 1006, and wherein each subband signal has a certain number of subband samples.
- the filter bank outputs temporally successive blocks of MDCT spectral coefficients, which generally represent successive short-term spectra of the audio signal to be encoded at input 1000.
- a block of MDCT spectral values is then fed into a TNS processing block 1010 where temporal noise shaping (TNS) takes place.
- TNS temporal noise shaping
- the TNS technique is used to shape the temporal shape of the quantization noise within each window of the transform. This is achieved by applying a filtering process to parts of the spectral data of each channel.
- the coding is performed on a window basis. In particular, the following steps are performed to apply the TNS tool to a window of spectral data, that is, to a block of spectral values.
- a frequency range for the TNS tool is selected.
- a suitable choice is to cover a frequency range of 1.5 kHz to the highest possible scale factor band with a filter. It should be noted that this frequency range of the sampling rate depends as specified in the AAC standard (ISO / IEC 14496-3: 2001 (E)).
- LPC Linear Predictive Coding
- the expected prediction gain PG is obtained. Further, the reflection coefficients or Parcor coefficients are obtained.
- the TNS tool is not applied. In this case, control information is written in the bit stream for a decoder to know that no TNS processing has been performed.
- TNS processing is applied.
- the reflection coefficients are quantized.
- the order of the noise shaping filter used is determined by removing all the reflection coefficients having an absolute value less than a threshold from the "tail" of the reflection coefficient array. The number of remaining reflection coefficients is on the order of the noise shaping filter.
- a suitable threshold is 0.1.
- the remaining reflection coefficients are typically converted into linear prediction coefficients, which technique is also known as the N step-up- w- procedure.
- the calculated LPC coefficients are then used as coder noise shaping filter coefficients, ie as prediction filter coefficients.
- This FIR filter is routed over the specified target frequency range.
- an autoregressive filter is used, while the coding uses a so-called moving average filter.
- the page information for the TNS tool is also supplied to the bit stream formatter as shown by the arrow shown between the block TNS processing 1010 and the bitstream formatter 1004 in FIG.
- the center / side encoder 1012 is active when the audio signal to be encoded is a multi-channel signal, that is, a stereo signal having a left channel and a right channel. So far, that is, in the processing direction before the block 1012 in Fig. 3, the left and right stereo channels have been separately processed, that is, scaled, transformed by the filter bank, subjected to TNS processing or not, etc.
- middle / side encoder In the middle / side encoder is then first checked whether a middle / side encoding makes sense, that brings a coding gain at all. A middle / side encoding will then bring a coding gain if the left and the right channel are more similar, because then the center channel, that is the sum of the left and the right channel is almost equal to the left or the right channel, apart from the scaling by the factor 1/2, while the side channel has only very small values, since it is equal to the difference between the left and the right channel.
- the difference is approximately zero, or includes only very small values that are hoped to be quantized to zero in a subsequent quantizer 1014 and thus can be transmitted very efficiently, since the quantizer 1014 is followed by an entropy coder 1016.
- the quantizer 1014 is given a allowed perturbation per scale factor band by a psycho-acoustic model 1020.
- the quantizer operates iteratively, ie it first calls an outer iteration loop, which then calls an inner iteration loop.
- a quantization of a block of values is performed at the input of the quantizer 1014.
- the inner loop quantizes the MDCT coefficients, consuming a certain number of bits.
- the outer loop calculates the distortion and modified energy of the coefficients using the scale factor to again invoke an inner loop. This process is iterated until a certain conditional set is satisfied.
- the signal is reconstructed to compute the perturbation introduced by the quantization and to compare it with the allowable perturbation provided by the psycho-acoustic model 1020. Furthermore, the scale factors are increased from iteration to iteration by one step, for each iteration of the outer iteration loop.
- the analysis-through synthesis process terminates and the resulting scale factors are encoded as set forth in block 1014 and supplied in encoded form to the bitstream formatter 1004 as indicated by the arrow between block 1014 and block Block 1004 is drawn.
- the quantized values are then fed to entropy coder 1016, which typically performs entropy coding using several Huffman code tables for different scale factor bands to transfer the quantized values to a binary format.
- entropy coding in the form of Huffman coding relies on code tables that are created on the basis of expected signal statistics and in which frequently occurring values get shorter code words than more rarely occurring values.
- the entropy-coded values are then also supplied as actual main information to the bitstream formatter 1004, which then outputs the coded audio signal on the output side according to a specific bit stream syntax.
- the data reduction of audio signals is now a known technique that is the subject of a number of international standards (e.g., ISO / MPEG-1, MPEG-2 AAC, MPEG-4).
- the input signal is brought into a compact, data-reduced representation by means of a so-called encoder using perception-related effects (psychoacoustics, psychooptics).
- a spectral analysis of the signal is usually carried out and the corresponding signal components are quantized taking into account a perceptual model and subsequently coded in a compact manner as so-called bitstream.
- PE perceptual entropy
- the perceptual entropy or demand estimate of information units for encoding a signal can be used to estimate whether the signal is transient or stationary, since transient signals also require more bits to encode than more stationary signals.
- the estimation of a transient property For example, a signal is used to make a window length decision, as indicated at block 1008 in FIG. 3.
- FIG. 6 shows the perceptual entropy calculated in accordance with ISO / IEC IS 13818-7 (MPEG-2 advanced audio coding (AAC)).
- AAC MPEG-2 advanced audio coding
- the bands may originate from the band division of the psychoacoustic model (block 1020 in Fig. 3), or are the so-called scale factor bands (scfb) used in quantization.
- the psychoacoustic masking threshold is the energy value that the quantization error should not exceed.
- FIG. 6 thus shows how well such a Perceptual Entropy works as an estimate of the number of bits needed for coding.
- the respective perceptual entropy was plotted as a function of the bits consumed for each individual block using the example of an AAC coder at different bit rates.
- the test piece used contains a typical mix of music, language and individual instruments.
- the points would gather along a straight line through the zero point.
- the extension of the point sequence with the deviations from the ideal line illustrates the inaccurate estimate.
- the value for the Perceptual Entropy is determined to be too small, then the quantizer is signaled that fewer bits than actually required are needed to encode the signal. This, in turn, causes the quantizer to be coarsely quantized, which would immediately result in an audible disturbance in the signal unless countermeasures are taken.
- the countermeasures may be that the quantizer still requires one or more further iteration loops, which increases the computation time of the encoder.
- FIG. 8 shows the case in which the perceptual entropy is calculated line by line.
- the disadvantage lies in the higher computational complexity of the line-by-line calculation.
- spectral coefficients X (k) are used, where kOffset (b) designates the first index of band b.
- the object of the present invention is to provide an efficient yet accurate concept for determining an estimate of a need for information units to encode a signal.
- the present invention is based on the finding that it must be noted in a frequency band-wise calculation of the estimate for a need for information units for computing time reasons, however, that in order to obtain an accurate determination of the estimated value, the distribution the energy in the frequency band, which has to be calculated band by band.
- the entropy coder following the quantizer is implicitly "involved" in determining the estimate of the demand for information units, because entropy coding allows a smaller number of bits to be used to transmit smaller spectral values than to transmit
- the entropy coder is particularly efficient when it is possible to transmit to-zero quantized spectral values, since these will typically occur most frequently, and the codeword for transmitting a zero-quantized spectral line is the shortest codeword
- even run-length coding can be resorted to, which in the event of a run of zero On average, not even a single bit is needed per per-zero quantized spectral value.
- the measure of the distribution of energy in the frequency band can be determined based on the actual amplitudes, or by estimating the frequency lines that are not quantized to zero by the quantizer.
- This measure which is also referred to as "nl", where nl stands for “number of active lines", ie for the number of active lines, is preferred for reasons of computing efficiency.
- the number of spectral lines quantized to zero or a finer subdivision can also be taken into account, and this estimate becomes more and more accurate as more information from the downstream entropy coder is taken into account.
- the entropy coder is constructed on the basis of Huffman code tables, properties of these codetables can be integrated particularly well, since the codetables are not calculated on-line on the basis of the signal statistics, but because the codetables are fixed independently of the actual signal anyway.
- the measure of the distribution of the energy in the frequency band is determined by determining the lines still surviving after the quantization, ie the number of active lines.
- the present invention is advantageous in that an estimate of a need for information content is determined which is more accurate and more efficient than the prior art.
- the present invention is scalable for various applications since, depending on the desired accuracy of the estimate, more and more characteristics of the entropy coder, but at the cost of increased computation time, can be included in the estimation of the bit demand.
- FIG. 1 shows a block diagram of the device according to the invention for determining an estimated value
- Fig. 2a shows a preferred embodiment of the means for calculating a measure of the distribution of energy in the frequency band
- Fig. 2b shows a preferred embodiment of the means for calculating the demand for bits
- Fig. 3 is a block diagram of a known audio encoder
- FIG. 4 is a schematic diagram for explaining the influence of the energy distribution within a band on the determination of the estimated value
- 5 is a diagram for estimation calculation according to the present invention
- 6 shows a diagram for estimation calculation according to I-SO / IEC IS 13818-7 (AAC);
- the device according to the invention for determining an estimate for a requirement of information units for coding a signal is illustrated below with reference to FIG.
- the signal which may be an audio and / or a video signal, is input via an input 100.
- the signal is already present as a spectral representation with spectral values. However, this is not absolutely necessary as it can be achieved by appropriate e.g. Bandpass filtering also some calculations can be done with a time signal.
- the signal is provided to a device 102 for providing a measure of allowable interference to a frequency band of the signal.
- the allowed disturbance can be determined, for example, by means of a psycho-acoustic model, as has been explained with reference to FIG. 3 (block 1020).
- the device 102 is also operative to also provide a measure of the energy of the signal in the frequency band.
- the prerequisite for a band-wise calculation is that a frequency band for which a permitted interference or a signal energy is specified contains at least two or more spectral lines of the spectral representation of the signal.
- the frequency band will preferably be a scale factor band, since the bit demand estimate is needed directly by the quantizer to determine if a done quantization satisfies a bit criterion or not.
- the device 102 is designed to supply both the allowed disturbance nb (b) and the signal energy e (b) of the signal in the band to a device 104 for calculating the demand for bits.
- the means 104 for calculating the demand for bits is designed to take into account, in addition to the allowed disturbance and the signal energy, a measure nl (b) for a distribution of the energy in the frequency band, the distribution of the energy in the frequency band of deviates from a completely uniform distribution.
- the measure of the energy distribution is computed in a device 106, wherein the device 106 requires at least one band, namely the considered frequency band of the audio or video signal, either as a bandpass signal or directly as a series of spectral lines, e.g. to perform a spectral analysis of the band to get the measure of the distribution of energies in the frequency band.
- the audio or video signal may be supplied to the device 106 as a time signal, the device 106 then performing band filtering as well as analysis in the band.
- the audio or video signal supplied to the device 106 may already be in the frequency domain, such as MDCT coefficients, or as a bandpass signal in the filter bank with a smaller bandpass compared to an MDCT filterbank -Filter.
- the means 106 for calculating is designed to take into account current amounts of spectral values in the frequency band for calculating the estimated value.
- the means for calculating the measure of the distribution of the energy can be designed to determine as a measure of the distribution of energy a number of spectral values whose magnitude is greater than or equal to a predetermined magnitude threshold, or whose magnitude is less than or equal to the magnitude threshold wherein the magnitude threshold is preferably an estimated quantizer level that causes a quantizer to quantize values less than or equal to the quantizer level to zero.
- the measure of the energy is the number of active lines, that is, the number of lines that survive after quantization or not equal to zero.
- Fig. 2a shows a preferred embodiment of means 106 for calculating the measure of the distribution of energy in the frequency band.
- the measure of the distribution of the energy in the frequency band is designated nl (b) in FIG. 2a.
- the form factor ffac (b) is already a measure of the distribution of the energy in the frequency band.
- the measure of the spectral distribution nl from the form factor ffac (b) is weighted by the 4th root of the signal energy e (b) divided by the bandwidth width (b) and number of lines, respectively determined in the scale factor band b.
- the form factor is also an example of a quantity which gives a measure of the distribution of the energies
- nl (b) is an example of is a quantity representing an estimate of the number of lines relevant to quantization.
- the form factor ffac (b) is calculated by absolute value formation of a spectral line and subsequent rooting of this spectral line and subsequent summation of the "rooted" amounts of the spectral lines in the band.
- FIG. 2b shows a preferred embodiment of the device 104 for calculating the estimated value pe, wherein a case distinction is introduced in FIG. 2b, namely when the base 2 logarithm of the ratio of the energy to the permitted interference is greater than a constant one Factor cl or equal to the constant factor.
- the alternative above in block 104 is taken, ie the measure of the spectral distribution n1 is multiplied by the logarithm expression.
- Fig. 4a shows a band in which four spectral lines are present, all of equal size. The energy in this band is thus distributed evenly across the band.
- Fig. 4b shows a situation in which the energy in the band resides in one spectral line while the other three spectral lines are equal are zero.
- the band shown in Figure 4b could be before quantization, or could be obtained after quantization, if the spectral lines zeroed in Figure 4b are smaller than the first quantizer before quantization and thus set to zero by the quantizer So do not "survive".
- nl in Fig. 4b is calculated to the square root of 2.
- n 1 that is to say the measure for the spectral distribution of the energy in FIGS. 4 a to 4 is calculated. This means that the spectral distribution of the energy is more uniform when the measure of the distribution of the spectral energy is greater.
- the invention thus takes into account how the energy is distributed within the band. This is done as it is by replacing the number of lines per band in the known equation ( Figure 6) by an estimate of the number of lines which are non-zero after quantization. This estimate is shown in FIG. 2a.
- the form factor shown in Fig. 2a is also needed elsewhere in the encoder, for example, within the quantization block 1014 to determine the quantization step size. Then, if the form factor is already computed elsewhere, it need not be recalculated for bit estimation, so that the inventive concept of improved estimation of the measure of the required bits requires a minimum of additional computational overhead.
- X (k) is the spectral coefficient to be quantized later, while the variable k ⁇ ffset (b) designates the first index in band b.
- a measure is thus available for the characterization of the spectral field structure within the band.
- the new formula for calculating an improved band-wise perceptual entropy is thus based on the multiplication of the measure of the spectral distribution of energy and of the logarithmic expression by the signal energy e (b) in the numerator and the allowed error in the denominator, each If required, enter a term within the logarithm. can be set, as it is already shown in Fig. 7. This term may for example also be 1.5, but may also be zero, as in the case shown in Fig. 2b, this z. B. can be determined empirically.
- the method according to the invention can be implemented in hardware or in software.
- the implementation may be on a digital storage medium, in particular a floppy disk or CD with electronically readable control signals, which may interact with a programmable computer system such that the method is performed.
- the invention thus also consists in a computer program product with a program code stored on a machine-readable carrier for carrying out the method according to the invention, when the computer program product runs on a computer.
- the invention can thus be realized as a computer program with a program code for carrying out the method when the computer program runs on a computer.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Electrical Discharge Machining, Electrochemical Machining, And Combined Machining (AREA)
- Control Of Ac Motors In General (AREA)
- Radar Systems Or Details Thereof (AREA)
- Branch Pipes, Bends, And The Like (AREA)
- Manufacture Or Reproduction Of Printing Formes (AREA)
- Diaphragms For Electromechanical Transducers (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
- Measurement Of Resistance Or Impedance (AREA)
- Analysing Materials By The Use Of Radiation (AREA)
- Measurement Of Current Or Voltage (AREA)
- Apparatus For Radiation Diagnosis (AREA)
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP19167397.9A EP3544003B1 (de) | 2004-03-01 | 2005-02-17 | Vorrichtung und verfahren zum ermitteln eines schätzwerts |
EP08021083.4A EP2034473B1 (de) | 2004-03-01 | 2005-02-17 | Vorrichtung und Verfahren zum Ermitteln eines Schaetzwerts |
PL19167397T PL3544003T3 (pl) | 2004-03-01 | 2005-02-17 | Urządzenie i sposób ustalania szacowanej wartości |
PL08021083T PL2034473T3 (pl) | 2004-03-01 | 2005-02-17 | Urządzenie i sposób ustalania szacowanej wartości |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE102004009949A DE102004009949B4 (de) | 2004-03-01 | 2004-03-01 | Vorrichtung und Verfahren zum Ermitteln eines Schätzwertes |
PCT/EP2005/001651 WO2005083680A1 (de) | 2004-03-01 | 2005-02-17 | Vorrichtung und verfahren zum ermitteln eines schätzwerts |
Related Child Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP08021083.4A Division EP2034473B1 (de) | 2004-03-01 | 2005-02-17 | Vorrichtung und Verfahren zum Ermitteln eines Schaetzwerts |
EP19167397.9A Division EP3544003B1 (de) | 2004-03-01 | 2005-02-17 | Vorrichtung und verfahren zum ermitteln eines schätzwerts |
EP08021083.4 Division-Into | 2008-12-04 |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1697931A1 true EP1697931A1 (de) | 2006-09-06 |
EP1697931B1 EP1697931B1 (de) | 2011-11-02 |
Family
ID=34894902
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP19167397.9A Active EP3544003B1 (de) | 2004-03-01 | 2005-02-17 | Vorrichtung und verfahren zum ermitteln eines schätzwerts |
EP08021083.4A Active EP2034473B1 (de) | 2004-03-01 | 2005-02-17 | Vorrichtung und Verfahren zum Ermitteln eines Schaetzwerts |
EP05707481A Active EP1697931B1 (de) | 2004-03-01 | 2005-02-17 | Vorrichtung und verfahren zum ermitteln eines schätzwerts |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP19167397.9A Active EP3544003B1 (de) | 2004-03-01 | 2005-02-17 | Vorrichtung und verfahren zum ermitteln eines schätzwerts |
EP08021083.4A Active EP2034473B1 (de) | 2004-03-01 | 2005-02-17 | Vorrichtung und Verfahren zum Ermitteln eines Schaetzwerts |
Country Status (19)
Country | Link |
---|---|
US (1) | US7318028B2 (de) |
EP (3) | EP3544003B1 (de) |
JP (1) | JP4673882B2 (de) |
KR (1) | KR100852482B1 (de) |
CN (1) | CN1938758B (de) |
AT (1) | ATE532173T1 (de) |
AU (1) | AU2005217507B2 (de) |
BR (1) | BRPI0507815B1 (de) |
CA (1) | CA2559354C (de) |
DE (1) | DE102004009949B4 (de) |
DK (1) | DK1697931T3 (de) |
ES (3) | ES2739544T3 (de) |
HK (1) | HK1093813A1 (de) |
IL (1) | IL176978A (de) |
NO (1) | NO338917B1 (de) |
PL (2) | PL2034473T3 (de) |
PT (2) | PT2034473T (de) |
RU (1) | RU2337414C2 (de) |
WO (1) | WO2005083680A1 (de) |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012152764A1 (en) | 2011-05-09 | 2012-11-15 | Dolby International Ab | Method and encoder for processing a digital stereo audio signal |
FR2977439A1 (fr) * | 2011-06-28 | 2013-01-04 | France Telecom | Fenetres de ponderation en codage/decodage par transformee avec recouvrement, optimisees en retard. |
JP7257975B2 (ja) * | 2017-07-03 | 2023-04-14 | ドルビー・インターナショナル・アーベー | 密集性の過渡事象の検出及び符号化の複雑さの低減 |
EP3483879A1 (de) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Analyse-/synthese-fensterfunktion für modulierte geläppte transformation |
WO2019091573A1 (en) | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding and decoding an audio signal using downsampling or interpolation of scale parameters |
EP3483883A1 (de) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audiokodierung und -dekodierung mit selektiver nachfilterung |
EP3483884A1 (de) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Signalfiltrierung |
WO2019091576A1 (en) | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits |
EP3483880A1 (de) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Zeitliche rauschformung |
EP3483882A1 (de) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Steuerung der bandbreite in codierern und/oder decodierern |
EP3483886A1 (de) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Auswahl einer grundfrequenz |
EP3483878A1 (de) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audiodecoder mit auswahlfunktion für unterschiedliche verlustmaskierungswerkzeuge |
CN111405419B (zh) * | 2020-03-26 | 2022-02-15 | 海信视像科技股份有限公司 | 音频信号处理方法、装置及可读存储介质 |
CN116707557B (zh) * | 2022-12-20 | 2024-05-03 | 荣耀终端有限公司 | 信道选择方法、接收机及存储介质 |
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69127842T2 (de) * | 1990-03-09 | 1998-01-29 | At & T Corp | Hybride wahrnehmungsgebundene Kodierung von Audiosignalen |
CA2090052C (en) * | 1992-03-02 | 1998-11-24 | Anibal Joao De Sousa Ferreira | Method and apparatus for the perceptual coding of audio signals |
US5285498A (en) * | 1992-03-02 | 1994-02-08 | At&T Bell Laboratories | Method and apparatus for coding audio signals based on perceptual model |
EP0559348A3 (de) | 1992-03-02 | 1993-11-03 | AT&T Corp. | Rateurregelschleifenprozessor für einen wahrnehmungsgebundenen Koder/Dekoder |
CA2277975C (en) * | 1992-06-24 | 2001-11-20 | British Telecommunications Public Limited Company | Method and apparatus for objective speech quality measurements of telecommunication equipment |
JP2927660B2 (ja) * | 1993-01-25 | 1999-07-28 | シャープ株式会社 | 樹脂封止型半導体装置の製造方法 |
US5623577A (en) * | 1993-07-16 | 1997-04-22 | Dolby Laboratories Licensing Corporation | Computationally efficient adaptive bit allocation for encoding method and apparatus with allowance for decoder spectral distortions |
US5632003A (en) * | 1993-07-16 | 1997-05-20 | Dolby Laboratories Licensing Corporation | Computationally efficient adaptive bit allocation for coding method and apparatus |
US5956674A (en) * | 1995-12-01 | 1999-09-21 | Digital Theater Systems, Inc. | Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels |
DE19736669C1 (de) * | 1997-08-22 | 1998-10-22 | Fraunhofer Ges Forschung | Verfahren und Vorrichtung zum Erfassen eines Anschlags in einem zeitdiskreten Audiosignal sowie Vorrichtung und Verfahren zum Codieren eines Audiosignals |
DE19747132C2 (de) * | 1997-10-24 | 2002-11-28 | Fraunhofer Ges Forschung | Verfahren und Vorrichtungen zum Codieren von Audiosignalen sowie Verfahren und Vorrichtungen zum Decodieren eines Bitstroms |
AU3372199A (en) * | 1998-03-30 | 1999-10-18 | Voxware, Inc. | Low-complexity, low-delay, scalable and embedded speech and audio coding with adaptive frame loss concealment |
AU4201100A (en) * | 1999-04-05 | 2000-10-23 | Hughes Electronics Corporation | Spectral phase modeling of the prototype waveform components for a frequency domain interpolative speech codec system |
JP3762579B2 (ja) * | 1999-08-05 | 2006-04-05 | 株式会社リコー | デジタル音響信号符号化装置、デジタル音響信号符号化方法及びデジタル音響信号符号化プログラムを記録した媒体 |
JP2001166797A (ja) * | 1999-12-07 | 2001-06-22 | Nippon Hoso Kyokai <Nhk> | オーディオ信号の符号化装置 |
US6937979B2 (en) * | 2000-09-15 | 2005-08-30 | Mindspeed Technologies, Inc. | Coding based on spectral content of a speech signal |
EP1199711A1 (de) * | 2000-10-20 | 2002-04-24 | Telefonaktiebolaget Lm Ericsson | Kodierung von Audiosignalen unter Verwendung von Vergrösserung der Bandbreite |
SE0004187D0 (sv) * | 2000-11-15 | 2000-11-15 | Coding Technologies Sweden Ab | Enhancing the performance of coding systems that use high frequency reconstruction methods |
US6636830B1 (en) * | 2000-11-22 | 2003-10-21 | Vialta Inc. | System and method for noise reduction using bi-orthogonal modified discrete cosine transform |
US6996523B1 (en) * | 2001-02-13 | 2006-02-07 | Hughes Electronics Corporation | Prototype waveform magnitude quantization for a frequency domain interpolative speech codec system |
US6871176B2 (en) * | 2001-07-26 | 2005-03-22 | Freescale Semiconductor, Inc. | Phase excited linear prediction encoder |
US6912495B2 (en) * | 2001-11-20 | 2005-06-28 | Digital Voice Systems, Inc. | Speech model and analysis, synthesis, and quantization methods |
-
2004
- 2004-03-01 DE DE102004009949A patent/DE102004009949B4/de not_active Expired - Fee Related
-
2005
- 2005-02-17 CN CN2005800067994A patent/CN1938758B/zh active Active
- 2005-02-17 KR KR1020067016835A patent/KR100852482B1/ko active IP Right Grant
- 2005-02-17 DK DK05707481.7T patent/DK1697931T3/da active
- 2005-02-17 JP JP2007501149A patent/JP4673882B2/ja active Active
- 2005-02-17 PT PT08021083T patent/PT2034473T/pt unknown
- 2005-02-17 EP EP19167397.9A patent/EP3544003B1/de active Active
- 2005-02-17 BR BRPI0507815A patent/BRPI0507815B1/pt active IP Right Grant
- 2005-02-17 WO PCT/EP2005/001651 patent/WO2005083680A1/de active Application Filing
- 2005-02-17 PT PT191673979T patent/PT3544003T/pt unknown
- 2005-02-17 ES ES08021083T patent/ES2739544T3/es active Active
- 2005-02-17 ES ES05707481T patent/ES2376887T3/es active Active
- 2005-02-17 PL PL08021083T patent/PL2034473T3/pl unknown
- 2005-02-17 EP EP08021083.4A patent/EP2034473B1/de active Active
- 2005-02-17 CA CA2559354A patent/CA2559354C/en active Active
- 2005-02-17 RU RU2006134638/09A patent/RU2337414C2/ru active
- 2005-02-17 PL PL19167397T patent/PL3544003T3/pl unknown
- 2005-02-17 AU AU2005217507A patent/AU2005217507B2/en active Active
- 2005-02-17 AT AT05707481T patent/ATE532173T1/de active
- 2005-02-17 EP EP05707481A patent/EP1697931B1/de active Active
- 2005-02-17 ES ES19167397T patent/ES2847237T3/es active Active
-
2006
- 2006-07-20 IL IL176978A patent/IL176978A/en active IP Right Grant
- 2006-08-31 US US11/469,418 patent/US7318028B2/en active Active
- 2006-09-29 NO NO20064432A patent/NO338917B1/no unknown
-
2007
- 2007-01-25 HK HK07100908.4A patent/HK1093813A1/xx unknown
Non-Patent Citations (1)
Title |
---|
See references of WO2005083680A1 * |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1697930B1 (de) | Vorrichtung und verfahren zum verarbeiten eines multikanalsignals | |
DE60014363T2 (de) | Verringerung der von der quantisierung verursachten datenblock-diskontinuitäten in einem audio-kodierer | |
DE19811039B4 (de) | Verfahren und Vorrichtungen zum Codieren und Decodieren von Audiosignalen | |
EP2022043B1 (de) | Informationssignalcodierung | |
DE69915400T2 (de) | Vorrichtung zur Kodierung und Dekodierung von Audiosignalen | |
DE69518452T2 (de) | Verfahren für die Transformationskodierung akustischer Signale | |
DE60004814T2 (de) | Quantisierung in perzeptuellen audiokodierern mit kompensation des durch den synthesefilter verschmierten rauschens | |
EP1495464B1 (de) | Vorrichtung und verfahren zum codieren eines zeitdiskreten audiosignals und vorrichtung und verfahren zum decodieren von codierten audiodaten | |
US7318028B2 (en) | Method and apparatus for determining an estimate | |
DE69810361T2 (de) | Verfahren und Vorrichtung zur mehrkanaligen akustischen Signalkodierung und -dekodierung | |
EP1687810B1 (de) | Vorrichtung und verfahren zum ermitteln einer quantisierer-schrittweite | |
EP1397799B1 (de) | Verfahren und vorrichtung zum verarbeiten von zeitdiskreten audio-abtastwerten | |
DE69932861T2 (de) | Verfahren zur kodierung eines audiosignals mit einem qualitätswert für bit-zuordnung | |
DE69801536T2 (de) | Vorrichtung und Verfahren zur Signalcodierung | |
DE10010849C1 (de) | Vorrichtung und Verfahren zum Analysieren eines Analyse-Zeitsignals | |
EP1247275B1 (de) | Vorrichtung und verfahren zum bestimmen eines codierungs-blockrasters eines decodierten signals | |
DE19742201C1 (de) | Verfahren und Vorrichtung zum Codieren von Audiosignalen | |
DE10065363B4 (de) | Vorrichtung und Verfahren zum Decodieren eines codierten Datensignals | |
MXPA06009934A (es) | Metodo y aparato para determinar un estimado |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20060712 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWAN |
|
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1093813 Country of ref document: HK |
|
DAX | Request for extension of the european patent (deleted) | ||
17Q | First examination report despatched |
Effective date: 20070717 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 502005012097 Country of ref document: DE Effective date: 20120105 |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: T3 |
|
REG | Reference to a national code |
Ref country code: SE Ref legal event code: TRGR |
|
REG | Reference to a national code |
Ref country code: DK Ref legal event code: T3 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2376887 Country of ref document: ES Kind code of ref document: T3 Effective date: 20120320 |
|
LTIE | Lt: invalidation of european patent or patent extension |
Effective date: 20111102 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111102 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120302 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111102 Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120302 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120203 Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111102 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111102 |
|
REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1093813 Country of ref document: HK |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111102 Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111102 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111102 Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120202 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111102 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20120803 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 502005012097 Country of ref document: DE Effective date: 20120803 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20111102 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20050217 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 12 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 13 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 14 |
|
P01 | Opt-out of the competence of the unified patent court (upc) registered |
Effective date: 20230512 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: LU Payment date: 20240220 Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: ES Payment date: 20240319 Year of fee payment: 20 Ref country code: IE Payment date: 20240216 Year of fee payment: 20 Ref country code: NL Payment date: 20240220 Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: AT Payment date: 20240216 Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: MC Payment date: 20240219 Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FI Payment date: 20240219 Year of fee payment: 20 Ref country code: DE Payment date: 20240216 Year of fee payment: 20 Ref country code: CH Payment date: 20240301 Year of fee payment: 20 Ref country code: GB Payment date: 20240222 Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: SE Payment date: 20240221 Year of fee payment: 20 Ref country code: IT Payment date: 20240229 Year of fee payment: 20 Ref country code: FR Payment date: 20240222 Year of fee payment: 20 Ref country code: DK Payment date: 20240221 Year of fee payment: 20 Ref country code: BE Payment date: 20240219 Year of fee payment: 20 |