[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2009029036A1 - Method and device for noise filling - Google Patents

Method and device for noise filling Download PDF

Info

Publication number
WO2009029036A1
WO2009029036A1 PCT/SE2008/050968 SE2008050968W WO2009029036A1 WO 2009029036 A1 WO2009029036 A1 WO 2009029036A1 SE 2008050968 W SE2008050968 W SE 2008050968W WO 2009029036 A1 WO2009029036 A1 WO 2009029036A1
Authority
WO
WIPO (PCT)
Prior art keywords
spectral
coefficients
spectral coefficients
codebook
decoded
Prior art date
Application number
PCT/SE2008/050968
Other languages
French (fr)
Inventor
Anisse Taleb
Manuel Briand
Gustaf Ullberg
Original Assignee
Telefonaktiebolaget Lm Ericsson (Publ)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=40387560&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=WO2009029036(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Priority to PL19194270T priority Critical patent/PL3591650T3/en
Priority to EP19194270.5A priority patent/EP3591650B1/en
Priority to EP18176984.5A priority patent/EP3401907B1/en
Priority to US12/675,290 priority patent/US8370133B2/en
Priority to CN2008801048087A priority patent/CN101809657B/en
Priority to EP08828426.0A priority patent/EP2186089B1/en
Priority to PL18176984T priority patent/PL3401907T3/en
Application filed by Telefonaktiebolaget Lm Ericsson (Publ) filed Critical Telefonaktiebolaget Lm Ericsson (Publ)
Priority to DK08828426.0T priority patent/DK2186089T3/en
Priority to JP2010522868A priority patent/JP5255638B2/en
Priority to ES08828426T priority patent/ES2704286T3/en
Priority to CA2698031A priority patent/CA2698031C/en
Priority to MX2010001504A priority patent/MX2010001504A/en
Publication of WO2009029036A1 publication Critical patent/WO2009029036A1/en
Priority to US13/755,672 priority patent/US9111532B2/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/028Noise substitution, i.e. substituting non-tonal spectral components by noisy source
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/035Scalar quantisation

Definitions

  • the present invention relates in general to methods and devices for coding and decoding of audio signals, and in particular to methods and devices for perceptual spectral decoding.
  • a time domain signal has typically to be divided into smaller parts in order to precisely encode the evolution of the signal's amplitude, i.e. describe with low amount of information.
  • State-of-the-art coding methods usually transform the time-domain signal into the frequency domain where a better coding gain can be reached by using perceptual coding i.e. lossy coding but ideally unnoticeable by the human auditory system. See e.g. J. D. Johnston, "Transform coding of audio signals using perceptual noise criteria", IEEE J. Select. Areas Commun., Vol. 6, pp. 314-323, 1988 [I].
  • the perceptual audio coding concept can not avoid the introduction of distortions, i.e.
  • TNS Temporal Noise Shaping
  • audio coding standards are continuously designed in order to deliver high or intermediate audio quality, from narrowband speech to fullband audio, at low data rates for a reasonable complexity according to the dedicated application.
  • SBR Spectral Band Replication
  • 3GPP TS 26.404 V6.0.0 (2004-09) " Enhanced aacPlus general audio codec - encoder SBR part (Release 6)", 2004 [3]
  • specific parameters are typically used at the decoder side to re-generate the missing high-frequencies that is not decoded by the core codec from the low-frequency decoded spectrum.
  • a general object of the present invention is thus to provide methods and devices for reducing coding artifacts, applicable also at low bit rates.
  • a further object of the present invention is also to provide methods and devices for reducing coding artifacts having a low complexity.
  • a method for perceptual spectral decoding comprises decoding of spectral coefficients recovered from a binary flux into decoded spectral coefficients of an initial set of spectral coefficients.
  • the initial set of spectral coefficients is spectrum filled into a set of reconstructed spectral coefficients.
  • the spectrum filling comprises noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients not being decoded from the binary flux equal to elements derived from the decoded spectral coefficients.
  • the set of reconstructed spectral coefficients of a frequency domain is converted into an audio signal of a time domain.
  • a method for signal handling in perceptual spectral decoding comprises obtaining of decoded spectral coefficients of an initial set of spectral coefficients.
  • the initial set of spectral coefficients is spectrum filled into a set of reconstructed spectral coefficients.
  • the spectrum filling comprises noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients having a zero magnitude or being non-coded equal to elements derived from the decoded spectral coefficients.
  • the set of reconstructed spectral coefficients is outputted.
  • a perceptual spectral decoder comprises an input for a binary flux and a spectral coefficient decoder arranged for decoding spectral coefficients recovered from the binary flux into decoded spectral coefficients of an initial set of spectral coefficients.
  • the perceptual spectral decoder further comprises a spectrum filler connected to the spectral coefficient decoder and arranged for spectrum filling of the set of spectral coefficients.
  • the spectrum filler comprises a noise filler for noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients not being decoded from the binary flux equal to elements derived from the decoded spectral coefficients.
  • the perceptual spectral decoder also comprises a converter connected to the spectrum filler and arranged for converting the set of reconstructed spectral coefficients of a frequency domain into an audio signal of a time domain and an output for the audio signal.
  • a signal handling device for use in a perceptual spectral decoder comprises an input for decoded spectral coefficients of an initial set of spectral coefficients and a spectrum filler connected to the input and arranged for spectrum filling of the initial set of spectral coefficients.
  • the spectrum filler comprises a noise filler for noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients having a zero magnitude or being non-decoded equal to elements derived from the decoded spectral coefficients.
  • the signal handling device also comprises an output for the set of reconstructed spectral coefficients.
  • One advantage with the present invention is that an original signal temporal envelope of an audio signal is better preserved since noise filling relies on the decoded spectral coefficients without injection of random noise as it occurs in conventional noise filling methods.
  • the present invention is also possible to implement in a low-complexity manner. Other advantages are further discussed in connection with the different embodiments described further below.
  • FIG. 1 is a schematic block scheme of a codec system
  • FIG. 2 is a schematic block scheme of an embodiment of an audio signal encoder
  • FIG. 3 is a schematic block scheme of an embodiment of an audio signal decoder
  • FIG. 4 is a schematic block scheme of an embodiment of a noise filler according to the present invention.
  • FIGS. 5A-B are illustrations of creation and utilization of spectral codebooks for noise filling purposes according to an embodiment of the present invention
  • FIG. 6 is a schematic block scheme of an embodiment of a decoder according to the present invention
  • FIG. 7 is a schematic block scheme of another embodiment of a noise filler according to the present invention.
  • FIGS. 8A-B are illustrations of embodiments of bandwidth expansion according to an embodiment of a spectrum fold approach according to the present invention.
  • FIG. 9 is a schematic block scheme of yet another embodiment of a noise filler according to the present invention.
  • FIG. 10 is a schematic block scheme of en encoder having an envelope coder according to an embodiment of the present invention.
  • FIG. 11 is a flow diagram of steps of an embodiment of a decoding method according to the present invention.
  • FIG. 12 is a flow diagram of steps of an embodiment of a signal handling method according to the present invention.
  • the present invention relies on a frequency domain processing at the decoding side of a coding-decoding system.
  • This frequency domain processing is called Noise Fill (NF), which is able to reduce the coding artifacts occurring particularly for low bit-rates and which also may be used to regenerate a full bandwidth audio signal even at low rates and with a low complexity scheme.
  • NF Noise Fill
  • FIG. 1 An embodiment of a general codec system for audio signals is schematically illustrated in Fig. 1.
  • An audio source 10 gives rise to an audio signal 15.
  • the audio signal 15 is handled in an encoder 20, which produces a binary flux 25 comprising data representing the audio signal 15.
  • the binary flux 25 may be transmitted, as e.g. in the case of multimedia communication, by a transmission and/ or storing arrangement 30.
  • the transmission and/ or storing arrangement 30 optionally also may comprise some storing capacity.
  • the binary flux 25 may also only be stored in the transmission and/ or storing arrangement 30, just introducing a time delay in the utilization of the binary flux.
  • the transmission and/or storing arrangement 30 is thus an arrangement introducing at least one of a spatial repositioning or time delay of the binary flux 25.
  • the binary flux 25 is handled in a decoder 40, which produces an audio output 35 from the data comprised in the binary flux.
  • the audio output 35 should approximate the original audio signal 15 as well as possible under certain constraints, e.g. data rate, delay or complexity.
  • Perceptual audio coding has therefore become an important part for many multimedia services today.
  • the basic principle is to convert the audio signal into spectral coefficient in a frequency domain and using a perceptual model to determine a frequency and time dependent masking of the spectral coefficients.
  • Fig. 2 illustrates an embodiment of a typical perceptual audio encoder 20.
  • the perceptual audio encoder 20 is a spectral encoder based on a time-to-frequency transformer or a filter bank.
  • An audio source 15 is received, comprising frames of audio signals.
  • the first step consists of a time-domain processing usually called windowing of the signal which results in a time segmentation of the input audio signal x[n].
  • a windowing section 21 receives the audio signals and provides time segmented audio signal x[n] 22.
  • the time segmented audio signal x[n] 22 is provided to a converter 23, arranged for converting the time domain audio signal 22 into a set of spectral coefficients of a frequency domain.
  • the converter 23 can be implemented according to any prior-art transformer or filter bank. The details are not of particular importance for the principles of the present invention to be functional, and the details are therefore omitted from the description.
  • the time to frequency domain transform used by the encoder could be, for example, the:
  • x[k] is the DFT of the windowed input signal x[n] .
  • N is the size of the window w[n]
  • n is the time index and k the frequency bin index.
  • DCT Discrete Cosine Transform
  • MDCT Modified Discrete Cosine Transform
  • the perceptual audio codec aims at decompose the spectrum, or its approximation, regarding to the critical bands of the auditory system e.g. the Bark scale.
  • This step can be achieved by a frequency grouping of the transform coefficients according to a perceptual scale established according to the critical bands.
  • N b the number of frequency or psychoacoustical bands and b the relative index.
  • the output from the converter 23 is a set of spectral coefficients being a frequency representation 24 of the input audio signal.
  • a perceptual model is used to determine a frequency and time dependent masking of the spectral coefficients.
  • the perceptual transform codec relies on an estimation of a Masking Threshold in order to derive a frequency shaping function, e.g. the Scale Factors iSFJ ⁇ ], applied to the transform coefficients X b [k] in the psychoacoustical subband domain.
  • a frequency shaping function e.g. the Scale Factors iSFJ ⁇
  • the scaled spectrum Xs b [k] can be defined as
  • X Sb [k] X b [k]xm[blk e [k b ,--,k b+1 -llb e [l,---,N b ] .
  • a psychoacoustic modeling section 26 is connected to the windowing section 21 for having access to the original acoustic signal 22 and to the converter 23 for having access to the frequency representation.
  • the psychoacoustic modeling section 26 is in the present embodiment arranged to utilize the above described estimation and outputs a masking threshold MT[k] 27.
  • the masking threshold MT[k] 27 and the frequency representation 24 of the input audio signal are provided to a quantizing and coding section 28.
  • the masking threshold Mr[A:] 27 is applied on the frequency representation 24 giving a set of spectral coefficients.
  • the set of spectral coefficients corresponds to the scaled spectrum coefficients Xs ⁇ [&] based on the frequency groupings X 4 [A;] .
  • the scaling can also be performed on the individual spectral coefficients directly.
  • the quantizing and coding section 28 is further arranged for quantizing the set of spectral coefficients in any appropriate manner giving an information compression.
  • the quantizing and coding section 28 is also arranged for coding the quantized set of spectral coefficients.
  • Such coding takes preferably advantage of the perceptual properties and operates for masking the quantization noise in a best possible manner.
  • the perceptual coder may thereby exploit the perceptually scaled spectrum for the coding purpose. The redundancy reduction can be thereby be performed by a quantization and coding process which will be able to focus on the most perceptually relevant coefficients of the original spectrum by using the scaled spectrum.
  • the coded spectral coefficients together with additional side information are packed into a bitstream according to the transmission or storage standard that is going to be used.
  • a binary flux 25 having data representing the set of spectral coefficients is thereby outputted from the quantizing and coding section 28.
  • FIG. 3 an embodiment of a typical perceptual audio decoder 40 is illustrated.
  • a binary flux 25 is received, which has the properties from the encoder described here above.
  • De-quantization and decoding of the received binary flux 25 e.g. a bitstream is performed in a spectral coefficient decoder 41.
  • the spectral coefficient decoder 41 is arranged for decoding spectral coefficients recovered from the binary flux into decoded spectral coefficients X Q [k] of an initial set of spectral coefficients 42, possible grouped in frequency groupings
  • the initial set of spectral coefficients 42 is typically incomplete in that sense that it typically comprises so-called "spectral holes", which corresponds to spectral coefficients that are not received in the binary flux or at least not decoded from the binary flux.
  • the spectral holes are non- decoded spectral coefficients X ⁇ [&] or spectral coefficients automatically set to a predetermined value, typically zero, by the spectral coefficient decoder 41.
  • the incomplete initial set of spectral coefficients 42 from the spectral coefficient decoder 41 is provided to a spectrum filler 43.
  • the spectrum filler 43 is arranged for spectrum filling the initial set of spectral coefficients 42.
  • the spectrum filler 43 in turn comprises a noise filler 50.
  • the noise filler 50 is arranged for providing a process for noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients 42 not being decoded from the binary flux 25 to a definite value.
  • the spectral coefficients of the spectral holes are set equal to elements derived from the decoded spectral coefficients.
  • the decoder 40 thus presents a specific module which allows a high-quality noise fill in the transform domain.
  • the result from the spectrum filler 43 is a complete set 44 of reconstructed spectral coefficients X b [k], having all spectral coefficients within a certain frequency range defined.
  • the complete set 44 of spectral coefficients is provided to a converter 45 connected to the spectrum filler 43.
  • the converter 45 is arranged for converting the complete set 44 of reconstructed spectral coefficients of a frequency domain into an audio signal 46 of a time domain.
  • the converter 45 is typically based on an inverse transformer or filter bank, corresponding to the transformation technique used in the encoder 20 (fig. 2).
  • the signal 46 is provided back into the time domain with an inverse transform, e.g. Inverse MDCT - IMDCT or Inverse DFT - IDFT, etc.
  • an inverse filter bank is utilized.
  • the technique of the converter 45 as such is known in prior art, and will not be further discussed.
  • the overlap-add method is used to generate the final perceptually reconstructed audio signal 34 x'[n] at an output 35 for said audio signal 34.
  • This is in the present exemplary embodiment provided by a windowing section 47 and an overlap adaptation section 49.
  • the above presented encoder and decoder embodiments could be provided for sub-band coding as well as for coding of entire the frequency band of interest.
  • a noise filler 50 In Fig. 4, an embodiment of a noise filler 50 according to the present invention is illustrated.
  • This particular high-quality noise filler 50 allows the preservation of the temporal structure with a spectrum filling based on a new concept called spectral noise codebook.
  • the spectral noise codebook is built on-the-fly based on the decoded spectrum, i.e. the decoded spectral coefficients.
  • the decoded spectrum contains the overall temporal envelope information which means that the generated, possibly random, noise from the noise codebook will also contain such information which will avoid a temporally flat noise fill, which would introduce noisy distortions.
  • the architecture of the noise filler of Fig. 4 relies on two consecutive sections, each one associated with a respective step.
  • the first step performed by a spectral codebook generator 51, consists in building a spectral codebook with elements that are provided by the decoded spectrum Xf [A:] , i.e. the decoded spectral coefficients of the initial set of spectral coefficients 42.
  • a filling spectrum section 52 the decoded spectrum subbands or spectral coefficients that are considered as spectral holes, are filled with the codebook elements in order to reduce the coding artifacts.
  • This spectrum filling should preferably be considered for the lowest frequencies up to a transition frequency which can be defined adaptively. However, filling can be performed in the entire frequency range if requested.
  • codebook elements which are associated with a certain temporal structure of a present audio signal, some temporal structure preservation will be introduced also into the filled spectral coefficients.
  • Fig. 4 can be seen as illustrating a signal handling device for use in a perceptual spectral decoder.
  • the signal handling device comprises an input for decoded spectral coefficients of an initial set of spectral coefficients.
  • the signal handling device further comprises a spectrum filler connected to the input and arranged for spectrum filling of the initial set of spectral coefficients into a set of reconstructed spectral coefficients.
  • the spectrum filler comprises a noise filler for noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients having a zero magnitude or being non-decoded equal to elements derived from the decoded spectral coefficients.
  • the signal handling device also comprises an output for the set of reconstructed spectral coefficients.
  • Figs. 5A-B The process is schematically illustrated in Figs. 5A-B.
  • the first step of the noise fill procedure relies on building of the spectral codebook from the spectral coefficients, e.g. the transform coefficients.
  • This step is achieved by concatenating the perceptually relevant spectral coefficients of the decoded spectrum Xf [&] .
  • the decoded spectrum is divided in groups of spectral coefficients.
  • the presented principles are, however, applicable to any such grouping.
  • a special case is then when each spectral coefficient X ⁇ [&] constitutes its own group, i.e. equivalent to a situation without any grouping at all.
  • the decoded spectrum of the Fig. 5A has several series of zero coefficients or undecoded coefficients, denoted by black rectangles, which are usually called spectral holes.
  • the groups of spectral coefficients Xf[A;] appear typically with a certain length L.
  • This length can be a fixed length or a value determined by the quantization and coding process.
  • the spectral codebook is in this embodiment made from the groups of spectral coefficients Xf[ ⁇ ] or equivalently spectral subbands, which have not only zeros.
  • a subband of length L with Z zeros (Z ⁇ L) will in this embodiment be part of the codebook since a part of the subband has been encoded, i.e. quantized.
  • the codebook size is defined adaptively to the perceptually relevant content of the input spectrum.
  • spectral codebook other selection criteria may be used when generating the spectral codebook.
  • One possible criterion to be included in the spectral codebook could be that none of the spectral coefficients of a certain group of spectral coefficients X ® [k] is allowed to be undefined or equal to zero. This reduces the selection possibilities within the spectral codebook, but at the same time it ensures that all elements of the spectral codebook carry some temporal structure information.
  • spectral filling is achieved with parts of the perceptually relevant spectrum itself and then, allows the preservation of the temporal structure of the original signal.
  • white noise injection proposed by the state-of-the-art noise fill schemes [1] does not meet the important requirement of preservation of the temporal structure, which means that pre-echo artefacts may be produced.
  • the spectral filling according to the present embodiment will not introduce pre- echo artefacts while still reducing the quantization and coding artefacts.
  • the transition frequency may be defined by the encoder and then transmitted to the decoder or determined adaptively by the decoder from the audio signal content. It is then assume that the transition frequency is defined at the decoder in the same way as it would have been done by the encoder, e.g. based on the number of coded coefficients per subband. Since the total length of all spectral holes can be larger than the length of the spectral codebook, the same codebook elements may have to be used for filling several spectral holes.
  • the choice of the elements from the spectral codebook used for filling can be done by following one or several criteria.
  • One criterion which corresponds to the embodiment illustrated in Fig. 5B, is to use the elements of the spectral codebook in index order, preferably starting at the low frequency end. If the indices of the set of spectral coefficients are denoted by i and the indices of the spectral codebook are denoted by j, couples (ij) can represent the filling strategy.
  • the index order approach can then be expressed as blindly fill the spectral holes by increasing the codebook index j as much as the index i. This is used to cover all the spectral holes.
  • the use of the spectral codebook elements may start from the beginning again, i.e. by a cyclic use of the spectral codebook, when all elements of the spectral codebook are utilized.
  • criterions could also be used to define the couples (i,j), for instance, the spectral distance e.g. frequency, between the spectral hole coefficients and the codebook elements. In this manner, it can be assured e.g. that the utilized temporal structure is based on spectral coefficients associated with a frequency not too far from the spectral hole to be filled. Typically, it is believed that it is more appropriate to fill spectral holes with elements associated with a frequency that is lower than the frequency of the spectral hole to be filled.
  • Another criterion is to consider the energy of the spectral hole neighbours so that the injected codebook elements smoothly will fit to the recovered encoded coefficients.
  • the noise filler is arranged to select the elements from the spectral codebook based on an energy of a decoded spectral coefficient adjacent to a spectral hole to be filled and an energy of the selected element.
  • a combination of such criteria could also be considered.
  • the spectral codebook comprises decoded spectral coefficients from a present frame of the audio signal. There are also temporal dependencies passing the frame boundaries. In alternative embodiment, in order to utilize such interframe temporal dependencies, it would be possible to e.g. save parts of a spectral codebook from one frame to another.
  • the spectral codebook may comprise decoded spectral coefficients from at least one of a past frame and a future frame.
  • the elements of the spectral codebook can, as indicated in the above embodiments, correspond directly to certain decoded spectral coefficients.
  • the noise filler it is also possible to arrange the noise filler to further comprise a postprocessor.
  • the postprocessor is arranged for postprocessing the elements of the spectral codebook. This leads to that the noise filler has to be arranged for selecting the elements from the postprocessed spectral codebook. In such a way, certain dependencies, in frequency and/ or temporal space, can be smoothed, reducing the influence of e.g. quantizing or coding noise.
  • spectral codebook is a practical implementation of the arranging of setting spectral holes equal to elements derived from the decoded spectral coefficients.
  • simple solutions may also be realized in alternative manners. Instead of explicitly collect the candidates for filling elements in a separate codebook, the selection and/ or derivation of elements to be used for filling spectral holes can be performed directly from the decoded spectral coefficients of the set.
  • the spectrum filler of the decoder is further arranged for providing bandwidth extension.
  • a decoder 40 is illustrated, in which the spectrum filler 43 additionally comprises a bandwidth extender 55.
  • the bandwidth extender 55 increases the frequency region in which spectral coefficients are available at the high frequency end.
  • the recovered spectral coefficients are provided mainly below a transition frequency. Any spectral holes are there filled by the above described noise filling.
  • frequencies above the transition frequency typically none or a few recovered spectral coefficients are available. This frequency region is thus typically unknown, and of rather low importance for the perception.
  • spectral coefficients suitable for e.g. inverse transforming can be provided.
  • noise filling is typically performed for frequencies below the transition frequency and the bandwidth extension is typically performed for frequencies above the transition frequency.
  • the bandwidth extender 55 is considered as a part of the noise filler 50.
  • the bandwidth extender 55 comprises a spectrum folding section 56, in which high-frequency spectral coefficients are generated by spectral folding in order to build a full-bandwidth audio signal.
  • the process synthesizes a high-frequencies spectrum from the filled spectrum in the present embodiment by spectral folding based on the value of the transition frequency.
  • Fig. 8A An embodiment of a full-bandwidth generation is described by Fig. 8A. It is based on a spectral folding of the spectrum below the transition frequency to the high-frequency spectrum, i.e. basically zeros above the transition frequency. To do so, the zeros at frequencies over the transition frequency are filled with the low- frequency filled spectrum.
  • a length of the low-frequency filled spectrum equal to half the length of the high-frequency spectrum to be filled is selected from frequencies just below the transition frequency. Then, a first spectral copy is achieved with respect to a point of symmetry defined by the transition frequency. Finally, the first half part of the high-frequency spectrum is then also used to generate the second half part of the high-frequency spectrum by an additional folding.
  • a section of the low frequency filled spectrum just below the transition frequency is also here used for spectrum folding. If the intended bandwidth extension Z is smaller than or equal to half the available low-frequency filled spectrum (N-Z) /2, a section of the low frequency filled spectrum corresponding to the length of the high-spectrum to be filled is selected and folded onto the high-frequency around the transition frequency. However, if the intended bandwidth extension Z is larger than half the available low-frequency filled spectrum (N- Z) /2, i.e. in case that N ⁇ 3*Z, only half the low frequency filled spectrum is selected and folded in the first place. Then, a spectrum range from the just folded spectrum is selected to cover the rest of the high-frequency range. If necessary, i.e. if N ⁇ 2*Z, this folding can be repeated with a third copy, a fourth copy, and so on, until the entire high-frequency range is covered to ensure spectral continuity and a full-bandwidth signal generation.
  • the spectral folding should preferably not replace, modify or even delete these coefficients, as indicated in Fig. 8B.
  • the noise filler 50 comprises a spectral fill envelope section 57.
  • the spectral fill envelope section 57 is arranged for applying the spectral fill envelope to the filled and folded spectrum over all subbands so that the final energy of the decoded spectrum will approximate the energy of the original spectrum X fr [&], i.e. in order to conserve an initial energy. This is also applicable when the noise filling is performed in a normalized domain.
  • this is done by using a subband gain correction which can be written as:
  • the energy levels of the original spectrum and/ or the noise floor e.g. the envelope G[b] should have been encoded and transmitted by the encoder to the decoder as side information.
  • the signal like estimated envelope, G[b] for the subbands above the transition frequency is able to adapt the energy of the filled spectrum after spectral folding to the initial energy of the original spectrum, as it is described by the equation further above.
  • a combination of a signal and noise floor like energy estimation, in a frequency dependant manner, is made in order to build an appropriate envelope to be used after the spectral fill and folding.
  • Fig. 10 illustrate a part of an encoder 20 used for such purposes.
  • Spectral coefficients 66 e.g. transform coefficients
  • Quantization errors 67 are introduced by the quantization of the spectral coefficients.
  • the envelope coding section 60 comprising two estimators; a signal like energy estimator 62 and a noise floor like energy estimator 62.
  • the estimators 62, 61 are connected to a quantizer 63 for quantization of the energy estimation outputs.
  • a noise floor like energy estimation for the subbands below the transition frequency.
  • the main difference with the signal like energy estimation, of the equations above, relies on the computation so that the quantization error will be flattened by- using a mean over the logarithmic values of its coefficients and not a logarithmic value of the averaged coefficients per subband.
  • the combination of signal and noise floor like energy estimation at the encoder is used to build an appropriate envelope, which is applied to the filled spectrum at the decoder side.
  • Fig. 11 illustrates a flow diagram of steps of an embodiment of a decoding method according to the present invention.
  • the method for perceptual spectral decoding starts in step 200.
  • step 210 spectral coefficients recovered from a binary flux are decoded into decoded spectral coefficients of an initial set of spectral coefficients.
  • step 212 spectrum filling of the initial set of spectral coefficients is performed, giving a set of reconstructed spectral coefficients.
  • the set of reconstructed spectral coefficients of a frequency domain is converted in step 216 into an audio signal of a time domain.
  • Step 212 in turn comprises a step 214, in which spectral holes are noise filled by setting spectral coefficients in the initial set of spectral coefficients not being decoded from the binary flux equal to elements derived from the decoded spectral coefficients.
  • the procedure is ended in step 249.
  • the spectrum fill part of the procedure of Fig. 11 can also be considered as a separate signal handling method that is generally used within perceptual spectral decoding.
  • Such a signal handling method involves the central noise fill step and steps for obtaining an initial set of spectral coefficients and for outputting a set of reconstructed spectral coefficients.
  • a flow diagram of steps of a preferred embodiment of such a noise fill method according to the present invention is illustrated. This method may thus be used as a part of the method illustrated in Fig. 11.
  • the method for signal handling starts in step 250.
  • step 260 an initial set of spectral coefficients is obtained.
  • Step 270 being a spectrum filling step comprises a noise filling step 272, which in turn comprises a number of substeps 262- 266.
  • a spectral codebook is created from decoded spectral coefficients.
  • step 264 which may be omitted, the spectral codebook is postprocessed, as described further above.
  • fill elements are selected from the codebook to fill spectral holes in the initial set of spectral coefficients.
  • step 268 a set of recovered spectral coefficients is outputted. The procedure ends in step 299.
  • the noise fill according to the present invention provides a high quality compared e.g. to typical noise fill with standard Gaussian white noise injection. It preserves the original signal temporal envelope.
  • the complexity of the implementation of the present invention is very low compared solutions according to state of the art.
  • the noise fill in the frequency domain can e.g. be adapted to the coding scheme under usage by defining an adaptive transition frequency at the encoder and/ or at the decoder side.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

A method for perceptual spectral decoding comprises decoding of spectral coefficients recovered from a binary flux into decoded spectral coefficients of an initial set of spectral coefficients. The initial set of spectral coefficients are spectrum filled. The spectrum filling comprises noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients not being decoded from the binary flux equal to elements derived from the decoded spectral coefficients. The set of reconstructed spectral coefficients of a frequency domain formed by the spectrum filling is converted into an audio signal of a time domain. A perceptual spectral decoder comprises a noise filler, operating according to the method for perceptual spectral decoding.

Description

METHOD AND DEVICE FOR NOISE FILLING
TECHNICAL FIELD
The present invention relates in general to methods and devices for coding and decoding of audio signals, and in particular to methods and devices for perceptual spectral decoding.
BACKGROUND
When audio signals are to be stored and/ or transmitted, a standard approach today is to code the audio signals into a digital representation according to different schemes. In order to save storage and/ or transmission capacity, it is a general wish to reduce the size of the digital representation needed to allow reconstruction of the audio signals with sufficient perceptual quality. The trade-off between size of the coded signal and signal quality depends on the actual application.
A time domain signal has typically to be divided into smaller parts in order to precisely encode the evolution of the signal's amplitude, i.e. describe with low amount of information. State-of-the-art coding methods usually transform the time-domain signal into the frequency domain where a better coding gain can be reached by using perceptual coding i.e. lossy coding but ideally unnoticeable by the human auditory system. See e.g. J. D. Johnston, "Transform coding of audio signals using perceptual noise criteria", IEEE J. Select. Areas Commun., Vol. 6, pp. 314-323, 1988 [I]. However, when the bit rate constraint is too strong, the perceptual audio coding concept can not avoid the introduction of distortions, i.e. coding noise over the masking threshold. The general issue of reducing distortions in perceptual audio coding has been addressed by the Temporal Noise Shaping (TNS) technology described in e.g. J. Herre, "Temporal Noise Shaping, Quantization and Coding Methods in Perceptual Audio Coding: A tutorial introduction", AES 17th Int. conf. on High Quality Audio Coding, 1997 [2]. Basically, the TNS i approach is based on two main considerations, namely the consideration of the time/ frequency duality and the shaping of quantization noise spectra by means of open-loop predictive coding.
In addition, audio coding standards are continuously designed in order to deliver high or intermediate audio quality, from narrowband speech to fullband audio, at low data rates for a reasonable complexity according to the dedicated application. The Spectral Band Replication (SBR) technology, described in 3GPP TS 26.404 V6.0.0 (2004-09), " Enhanced aacPlus general audio codec - encoder SBR part (Release 6)", 2004 [3], has been introduced to allow wideband or fullband audio coding at low data rate by associating specific parameters to the binary flux resulting from perceptual audio coding of the narrow band signal. Such specific parameters are typically used at the decoder side to re-generate the missing high-frequencies that is not decoded by the core codec from the low-frequency decoded spectrum.
The association of TNS and SBR technologies, described in [3], in a transform based audio codec has been successfully implemented for intermediate data rate applications, i.e. a typical bit rate of 32 kbps for intermediate audio quality. Nevertheless, these highly sophisticated coding methods are very complex since they involve predictive coding and adaptive- resolution filter bank requiring certain delays. They are indeed not well appropriated for low delay and low complexity applications.
SUMMARY
A general object of the present invention is thus to provide methods and devices for reducing coding artifacts, applicable also at low bit rates. A further object of the present invention is also to provide methods and devices for reducing coding artifacts having a low complexity.
The above mentioned objects are achieved by methods and devices according to the enclosed patent claims. In general words, in a first aspect, a method for perceptual spectral decoding comprises decoding of spectral coefficients recovered from a binary flux into decoded spectral coefficients of an initial set of spectral coefficients. The initial set of spectral coefficients is spectrum filled into a set of reconstructed spectral coefficients. The spectrum filling comprises noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients not being decoded from the binary flux equal to elements derived from the decoded spectral coefficients. The set of reconstructed spectral coefficients of a frequency domain is converted into an audio signal of a time domain.
In a second aspect, a method for signal handling in perceptual spectral decoding comprises obtaining of decoded spectral coefficients of an initial set of spectral coefficients. The initial set of spectral coefficients is spectrum filled into a set of reconstructed spectral coefficients. The spectrum filling comprises noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients having a zero magnitude or being non-coded equal to elements derived from the decoded spectral coefficients. The set of reconstructed spectral coefficients is outputted.
In a third aspect, a perceptual spectral decoder comprises an input for a binary flux and a spectral coefficient decoder arranged for decoding spectral coefficients recovered from the binary flux into decoded spectral coefficients of an initial set of spectral coefficients. The perceptual spectral decoder further comprises a spectrum filler connected to the spectral coefficient decoder and arranged for spectrum filling of the set of spectral coefficients. The spectrum filler comprises a noise filler for noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients not being decoded from the binary flux equal to elements derived from the decoded spectral coefficients. The perceptual spectral decoder also comprises a converter connected to the spectrum filler and arranged for converting the set of reconstructed spectral coefficients of a frequency domain into an audio signal of a time domain and an output for the audio signal. In a fourth aspect, a signal handling device for use in a perceptual spectral decoder comprises an input for decoded spectral coefficients of an initial set of spectral coefficients and a spectrum filler connected to the input and arranged for spectrum filling of the initial set of spectral coefficients. The spectrum filler comprises a noise filler for noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients having a zero magnitude or being non-decoded equal to elements derived from the decoded spectral coefficients. The signal handling device also comprises an output for the set of reconstructed spectral coefficients.
One advantage with the present invention is that an original signal temporal envelope of an audio signal is better preserved since noise filling relies on the decoded spectral coefficients without injection of random noise as it occurs in conventional noise filling methods. The present invention is also possible to implement in a low-complexity manner. Other advantages are further discussed in connection with the different embodiments described further below.
BRIEF DESCRIPTION OF THE DRAWINGS
The invention, together with further objects and advantages thereof, may best be understood by making reference to the following description taken together with the accompanying drawings, in which:
FIG. 1 is a schematic block scheme of a codec system;
FIG. 2 is a schematic block scheme of an embodiment of an audio signal encoder;
FIG. 3 is a schematic block scheme of an embodiment of an audio signal decoder;
FIG. 4 is a schematic block scheme of an embodiment of a noise filler according to the present invention;
FIGS. 5A-B are illustrations of creation and utilization of spectral codebooks for noise filling purposes according to an embodiment of the present invention; FIG. 6 is a schematic block scheme of an embodiment of a decoder according to the present invention;
FIG. 7 is a schematic block scheme of another embodiment of a noise filler according to the present invention;
FIGS. 8A-B are illustrations of embodiments of bandwidth expansion according to an embodiment of a spectrum fold approach according to the present invention;
FIG. 9 is a schematic block scheme of yet another embodiment of a noise filler according to the present invention;
FIG. 10 is a schematic block scheme of en encoder having an envelope coder according to an embodiment of the present invention;
FIG. 11 is a flow diagram of steps of an embodiment of a decoding method according to the present invention; and
FIG. 12 is a flow diagram of steps of an embodiment of a signal handling method according to the present invention.
DETAILED DESCRIPTION
Throughout the drawings, the same reference numbers are used for similar or corresponding elements.
The present invention relies on a frequency domain processing at the decoding side of a coding-decoding system. This frequency domain processing is called Noise Fill (NF), which is able to reduce the coding artifacts occurring particularly for low bit-rates and which also may be used to regenerate a full bandwidth audio signal even at low rates and with a low complexity scheme.
An embodiment of a general codec system for audio signals is schematically illustrated in Fig. 1. An audio source 10 gives rise to an audio signal 15. The audio signal 15 is handled in an encoder 20, which produces a binary flux 25 comprising data representing the audio signal 15. The binary flux 25 may be transmitted, as e.g. in the case of multimedia communication, by a transmission and/ or storing arrangement 30. The transmission and/ or storing arrangement 30 optionally also may comprise some storing capacity. The binary flux 25 may also only be stored in the transmission and/ or storing arrangement 30, just introducing a time delay in the utilization of the binary flux. The transmission and/or storing arrangement 30 is thus an arrangement introducing at least one of a spatial repositioning or time delay of the binary flux 25. When being used, the binary flux 25 is handled in a decoder 40, which produces an audio output 35 from the data comprised in the binary flux. Typically, the audio output 35 should approximate the original audio signal 15 as well as possible under certain constraints, e.g. data rate, delay or complexity.
In many real-time applications, the time delay between the production of the original audio signal 15 and the produced audio output 35 is typically not allowed to exceed a certain time. If the transmission resources at the same time are limited, the available bit-rate is also typically low. In order to utilize the available bit-rate in a best possible manner, perceptual audio coding has been developed. Perceptual audio coding has therefore become an important part for many multimedia services today. The basic principle is to convert the audio signal into spectral coefficient in a frequency domain and using a perceptual model to determine a frequency and time dependent masking of the spectral coefficients.
Fig. 2 illustrates an embodiment of a typical perceptual audio encoder 20. In this particular embodiment, the perceptual audio encoder 20 is a spectral encoder based on a time-to-frequency transformer or a filter bank. An audio source 15 is received, comprising frames of audio signals.
In a typical transform encoder, the first step consists of a time-domain processing usually called windowing of the signal which results in a time segmentation of the input audio signal x[n]. Thus, a windowing section 21 receives the audio signals and provides time segmented audio signal x[n] 22. The time segmented audio signal x[n] 22 is provided to a converter 23, arranged for converting the time domain audio signal 22 into a set of spectral coefficients of a frequency domain. The converter 23 can be implemented according to any prior-art transformer or filter bank. The details are not of particular importance for the principles of the present invention to be functional, and the details are therefore omitted from the description. The time to frequency domain transform used by the encoder could be, for example, the:
Discrete Fourier Transform (DFT),
N-I _ ■« "k
X[k] = ∑w[n]x x[n]x e~J *~» ,k e N
0,- -1
«=0 where x[k] is the DFT of the windowed input signal x[n] . N is the size of the window w[n], n is the time index and k the frequency bin index.
Discrete Cosine Transform (DCT),
Modified Discrete Cosine Transform (MDCT),
Figure imgf000008_0001
where X[k] is the MDCT of the windowed input signal x[n] . N is the size of the window w[n], n is the time index and k the frequency bin index, etc.
In the present embodiment, based on one of these frequency representations of the input audio signal, the perceptual audio codec aims at decompose the spectrum, or its approximation, regarding to the critical bands of the auditory system e.g. the Bark scale. This step can be achieved by a frequency grouping of the transform coefficients according to a perceptual scale established according to the critical bands.
Figure imgf000009_0001
with Nb the number of frequency or psychoacoustical bands and b the relative index.
The output from the converter 23 is a set of spectral coefficients being a frequency representation 24 of the input audio signal.
Typically, a perceptual model is used to determine a frequency and time dependent masking of the spectral coefficients. In the present embodiment, the perceptual transform codec relies on an estimation of a Masking Threshold
Figure imgf000009_0002
in order to derive a frequency shaping function, e.g. the Scale Factors iSFJδ], applied to the transform coefficients Xb[k] in the psychoacoustical subband domain. The scaled spectrum Xs b [k] can be defined as
XSb[k] = Xb[k]xm[blk e [kb,--,kb+1 -llb e [l,---,Nb] .
To this end, in the embodiment of Fig. 2, a psychoacoustic modeling section 26 is connected to the windowing section 21 for having access to the original acoustic signal 22 and to the converter 23 for having access to the frequency representation. The psychoacoustic modeling section 26 is in the present embodiment arranged to utilize the above described estimation and outputs a masking threshold MT[k] 27.
The masking threshold MT[k] 27 and the frequency representation 24 of the input audio signal are provided to a quantizing and coding section 28. First, the masking threshold Mr[A:] 27 is applied on the frequency representation 24 giving a set of spectral coefficients. In the present embodiment, the set of spectral coefficients corresponds to the scaled spectrum coefficients Xsέ[&] based on the frequency groupings X4[A;] . However, in a more general transform encoder, the scaling can also be performed on the individual spectral coefficients
Figure imgf000010_0001
directly.
The quantizing and coding section 28 is further arranged for quantizing the set of spectral coefficients in any appropriate manner giving an information compression. The quantizing and coding section 28 is also arranged for coding the quantized set of spectral coefficients. Such coding takes preferably advantage of the perceptual properties and operates for masking the quantization noise in a best possible manner. The perceptual coder may thereby exploit the perceptually scaled spectrum for the coding purpose. The redundancy reduction can be thereby be performed by a quantization and coding process which will be able to focus on the most perceptually relevant coefficients of the original spectrum by using the scaled spectrum. The coded spectral coefficients together with additional side information are packed into a bitstream according to the transmission or storage standard that is going to be used. A binary flux 25 having data representing the set of spectral coefficients is thereby outputted from the quantizing and coding section 28.
At the decoding stage, the inverse operation is basically achieved. In Fig. 3, an embodiment of a typical perceptual audio decoder 40 is illustrated. A binary flux 25 is received, which has the properties from the encoder described here above. De-quantization and decoding of the received binary flux 25 e.g. a bitstream is performed in a spectral coefficient decoder 41. The spectral coefficient decoder 41 is arranged for decoding spectral coefficients recovered from the binary flux into decoded spectral coefficients XQ [k] of an initial set of spectral coefficients 42, possible grouped in frequency groupings
The initial set of spectral coefficients 42 is typically incomplete in that sense that it typically comprises so-called "spectral holes", which corresponds to spectral coefficients that are not received in the binary flux or at least not decoded from the binary flux. In other words, the spectral holes are non- decoded spectral coefficients Xβ [&] or spectral coefficients automatically set to a predetermined value, typically zero, by the spectral coefficient decoder 41. The incomplete initial set of spectral coefficients 42 from the spectral coefficient decoder 41 is provided to a spectrum filler 43. The spectrum filler 43 is arranged for spectrum filling the initial set of spectral coefficients 42. The spectrum filler 43 in turn comprises a noise filler 50. The noise filler 50 is arranged for providing a process for noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients 42 not being decoded from the binary flux 25 to a definite value. As described in detail further below, according to the present invention, the spectral coefficients of the spectral holes are set equal to elements derived from the decoded spectral coefficients. The decoder 40 thus presents a specific module which allows a high-quality noise fill in the transform domain. The result from the spectrum filler 43 is a complete set 44 of reconstructed spectral coefficients Xb [k], having all spectral coefficients within a certain frequency range defined.
The complete set 44 of spectral coefficients is provided to a converter 45 connected to the spectrum filler 43. The converter 45 is arranged for converting the complete set 44 of reconstructed spectral coefficients of a frequency domain into an audio signal 46 of a time domain. The converter 45 is typically based on an inverse transformer or filter bank, corresponding to the transformation technique used in the encoder 20 (fig. 2). In a particular embodiment, the signal 46 is provided back into the time domain with an inverse transform, e.g. Inverse MDCT - IMDCT or Inverse DFT - IDFT, etc. In other embodiments an inverse filter bank is utilized. As at the encoder side, the technique of the converter 45 as such, is known in prior art, and will not be further discussed. Finally, the overlap-add method is used to generate the final perceptually reconstructed audio signal 34 x'[n] at an output 35 for said audio signal 34. This is in the present exemplary embodiment provided by a windowing section 47 and an overlap adaptation section 49. The above presented encoder and decoder embodiments could be provided for sub-band coding as well as for coding of entire the frequency band of interest.
In Fig. 4, an embodiment of a noise filler 50 according to the present invention is illustrated. This particular high-quality noise filler 50 allows the preservation of the temporal structure with a spectrum filling based on a new concept called spectral noise codebook. The spectral noise codebook is built on-the-fly based on the decoded spectrum, i.e. the decoded spectral coefficients. The decoded spectrum contains the overall temporal envelope information which means that the generated, possibly random, noise from the noise codebook will also contain such information which will avoid a temporally flat noise fill, which would introduce noisy distortions.
The architecture of the noise filler of Fig. 4 relies on two consecutive sections, each one associated with a respective step. The first step, performed by a spectral codebook generator 51, consists in building a spectral codebook with elements that are provided by the decoded spectrum Xf [A:] , i.e. the decoded spectral coefficients of the initial set of spectral coefficients 42.
Then, in a filling spectrum section 52, the decoded spectrum subbands or spectral coefficients that are considered as spectral holes, are filled with the codebook elements in order to reduce the coding artifacts. This spectrum filling should preferably be considered for the lowest frequencies up to a transition frequency which can be defined adaptively. However, filling can be performed in the entire frequency range if requested. By using codebook elements, which are associated with a certain temporal structure of a present audio signal, some temporal structure preservation will be introduced also into the filled spectral coefficients.
Fig. 4 can be seen as illustrating a signal handling device for use in a perceptual spectral decoder. The signal handling device comprises an input for decoded spectral coefficients of an initial set of spectral coefficients. The signal handling device further comprises a spectrum filler connected to the input and arranged for spectrum filling of the initial set of spectral coefficients into a set of reconstructed spectral coefficients. The spectrum filler comprises a noise filler for noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients having a zero magnitude or being non-decoded equal to elements derived from the decoded spectral coefficients. The signal handling device also comprises an output for the set of reconstructed spectral coefficients.
The process is schematically illustrated in Figs. 5A-B. Here it is shown that the first step of the noise fill procedure relies on building of the spectral codebook from the spectral coefficients, e.g. the transform coefficients. This step is achieved by concatenating the perceptually relevant spectral coefficients of the decoded spectrum Xf [&] . In the present embodiment, the decoded spectrum is divided in groups of spectral coefficients. The presented principles are, however, applicable to any such grouping. A special case is then when each spectral coefficient Xδ[&] constitutes its own group, i.e. equivalent to a situation without any grouping at all. The decoded spectrum of the Fig. 5A has several series of zero coefficients or undecoded coefficients, denoted by black rectangles, which are usually called spectral holes. The groups of spectral coefficients Xf[A;] appear typically with a certain length L.
This length can be a fixed length or a value determined by the quantization and coding process.
According to the fact that spectral holes resulting from the quantization and coding process are not perceptually relevant, the spectral codebook is in this embodiment made from the groups of spectral coefficients Xf[^] or equivalently spectral subbands, which have not only zeros. For example, a subband of length L with Z zeros (Z<L) will in this embodiment be part of the codebook since a part of the subband has been encoded, i.e. quantized. In this way the codebook size is defined adaptively to the perceptually relevant content of the input spectrum.
In other embodiments, other selection criteria may be used when generating the spectral codebook. One possible criterion to be included in the spectral codebook could be that none of the spectral coefficients of a certain group of spectral coefficients X® [k] is allowed to be undefined or equal to zero. This reduces the selection possibilities within the spectral codebook, but at the same time it ensures that all elements of the spectral codebook carry some temporal structure information. As anyone skilled in the art realizes, there are unlimited variations of possible criterions for selecting appropriate elements derived from the decoded spectral coefficients.
When a spectral hole is requested to be filled, it is in this embodiment proposed to fill the spectral holes by elements from the spectral codebook. This is performed in order to reduce typical quantization and coding artefacts. One improvement of the present invention compared to prior art relies on the fact that the spectral filling is achieved with parts of the perceptually relevant spectrum itself and then, allows the preservation of the temporal structure of the original signal. Typically, white noise injection proposed by the state-of-the-art noise fill schemes [1] does not meet the important requirement of preservation of the temporal structure, which means that pre-echo artefacts may be produced. At the contrary, the spectral filling according to the present embodiment will not introduce pre- echo artefacts while still reducing the quantization and coding artefacts.
As it is shown in Fig. 5B, the spectral codebook elements are used to fill the spectral holes, e.g. succession of Z=L zeros, preferably up to a transition frequency. The transition frequency may be defined by the encoder and then transmitted to the decoder or determined adaptively by the decoder from the audio signal content. It is then assume that the transition frequency is defined at the decoder in the same way as it would have been done by the encoder, e.g. based on the number of coded coefficients per subband. Since the total length of all spectral holes can be larger than the length of the spectral codebook, the same codebook elements may have to be used for filling several spectral holes.
The choice of the elements from the spectral codebook used for filling can be done by following one or several criteria. One criterion, which corresponds to the embodiment illustrated in Fig. 5B, is to use the elements of the spectral codebook in index order, preferably starting at the low frequency end. If the indices of the set of spectral coefficients are denoted by i and the indices of the spectral codebook are denoted by j, couples (ij) can represent the filling strategy. The index order approach can then be expressed as blindly fill the spectral holes by increasing the codebook index j as much as the index i. This is used to cover all the spectral holes. If there are more spectral holes than elements in the spectral codebook, the use of the spectral codebook elements may start from the beginning again, i.e. by a cyclic use of the spectral codebook, when all elements of the spectral codebook are utilized.
Other criterions could also be used to define the couples (i,j), for instance, the spectral distance e.g. frequency, between the spectral hole coefficients and the codebook elements. In this manner, it can be assured e.g. that the utilized temporal structure is based on spectral coefficients associated with a frequency not too far from the spectral hole to be filled. Typically, it is believed that it is more appropriate to fill spectral holes with elements associated with a frequency that is lower than the frequency of the spectral hole to be filled.
Another criterion is to consider the energy of the spectral hole neighbours so that the injected codebook elements smoothly will fit to the recovered encoded coefficients. In other words, the noise filler is arranged to select the elements from the spectral codebook based on an energy of a decoded spectral coefficient adjacent to a spectral hole to be filled and an energy of the selected element. A combination of such criteria could also be considered.
In the above embodiment, the spectral codebook comprises decoded spectral coefficients from a present frame of the audio signal. There are also temporal dependencies passing the frame boundaries. In alternative embodiment, in order to utilize such interframe temporal dependencies, it would be possible to e.g. save parts of a spectral codebook from one frame to another. In other words, the spectral codebook may comprise decoded spectral coefficients from at least one of a past frame and a future frame.
The elements of the spectral codebook can, as indicated in the above embodiments, correspond directly to certain decoded spectral coefficients. However, it is also possible to arrange the noise filler to further comprise a postprocessor. The postprocessor is arranged for postprocessing the elements of the spectral codebook. This leads to that the noise filler has to be arranged for selecting the elements from the postprocessed spectral codebook. In such a way, certain dependencies, in frequency and/ or temporal space, can be smoothed, reducing the influence of e.g. quantizing or coding noise.
The use of a spectral codebook is a practical implementation of the arranging of setting spectral holes equal to elements derived from the decoded spectral coefficients. However, simple solutions may also be realized in alternative manners. Instead of explicitly collect the candidates for filling elements in a separate codebook, the selection and/ or derivation of elements to be used for filling spectral holes can be performed directly from the decoded spectral coefficients of the set.
In preferred embodiments, the spectrum filler of the decoder is further arranged for providing bandwidth extension. In Fig. 6, an embodiment of a decoder 40 is illustrated, in which the spectrum filler 43 additionally comprises a bandwidth extender 55. The bandwidth extender 55, as such known in prior art, increases the frequency region in which spectral coefficients are available at the high frequency end. In a typical situation, the recovered spectral coefficients are provided mainly below a transition frequency. Any spectral holes are there filled by the above described noise filling. At frequencies above the transition frequency, typically none or a few recovered spectral coefficients are available. This frequency region is thus typically unknown, and of rather low importance for the perception. By extending the available spectral coefficients also within this region, a full set of spectral coefficients suitable for e.g. inverse transforming can be provided. As a summary, noise filling is typically performed for frequencies below the transition frequency and the bandwidth extension is typically performed for frequencies above the transition frequency.
In a particular embodiment, illustrated in Fig. 7, the bandwidth extender 55 is considered as a part of the noise filler 50. In this particular embodiment, the bandwidth extender 55 comprises a spectrum folding section 56, in which high-frequency spectral coefficients are generated by spectral folding in order to build a full-bandwidth audio signal. In other words, the process synthesizes a high-frequencies spectrum from the filled spectrum in the present embodiment by spectral folding based on the value of the transition frequency.
An embodiment of a full-bandwidth generation is described by Fig. 8A. It is based on a spectral folding of the spectrum below the transition frequency to the high-frequency spectrum, i.e. basically zeros above the transition frequency. To do so, the zeros at frequencies over the transition frequency are filled with the low- frequency filled spectrum. In the present embodiment, a length of the low-frequency filled spectrum equal to half the length of the high-frequency spectrum to be filled is selected from frequencies just below the transition frequency. Then, a first spectral copy is achieved with respect to a point of symmetry defined by the transition frequency. Finally, the first half part of the high-frequency spectrum is then also used to generate the second half part of the high-frequency spectrum by an additional folding. This procedure can be seen as a specific implementation of the general method which can be described as follows. The spectrum above the transition frequency (Z transform coefficients) is divided into U (U>2) spectral units or blocks depending on the signal harmonic structure (speech signal for instance) or any other suitable criterion. Indeed, if the original signal has a strong harmonic structure then it is appropriated to reduce the length of the spectrum part used for the folding (increase U) in order to avoid annoying artefacts.
In an alternative embodiment, described in Fig. 8B, a section of the low frequency filled spectrum just below the transition frequency is also here used for spectrum folding. If the intended bandwidth extension Z is smaller than or equal to half the available low-frequency filled spectrum (N-Z) /2, a section of the low frequency filled spectrum corresponding to the length of the high-spectrum to be filled is selected and folded onto the high-frequency around the transition frequency. However, if the intended bandwidth extension Z is larger than half the available low-frequency filled spectrum (N- Z) /2, i.e. in case that N < 3*Z, only half the low frequency filled spectrum is selected and folded in the first place. Then, a spectrum range from the just folded spectrum is selected to cover the rest of the high-frequency range. If necessary, i.e. if N < 2*Z, this folding can be repeated with a third copy, a fourth copy, and so on, until the entire high-frequency range is covered to ensure spectral continuity and a full-bandwidth signal generation.
In case the high-frequency spectrum, above the transition frequency, is not completely full of zero or undefined coefficients, which means that some transform coefficients indeed have been perceptually encoded or quantized, then, the spectral folding should preferably not replace, modify or even delete these coefficients, as indicated in Fig. 8B.
In Fig. 9, an embodiment of a decoder 40 also presenting application of the spectral fill envelope is illustrated. To this end, the noise filler 50 comprises a spectral fill envelope section 57. The spectral fill envelope section 57 is arranged for applying the spectral fill envelope to the filled and folded spectrum over all subbands so that the final energy of the decoded spectrum will approximate the energy of the original spectrum Xfr [&], i.e. in order to conserve an initial energy. This is also applicable when the noise filling is performed in a normalized domain.
In one embodiment, this is done by using a subband gain correction which can be written as:
Figure imgf000019_0001
k e [kb,-,kb+1
Figure imgf000019_0002
b 4l,--,Nb),
where the gains G[b] in dB are given by the logarithmic value of the average quantization error for each subband b
Figure imgf000019_0003
To do so, the energy levels of the original spectrum and/ or the noise floor e.g. the envelope G[b], should have been encoded and transmitted by the encoder to the decoder as side information.
This way, the signal like estimated envelope, G[b] for the subbands above the transition frequency, is able to adapt the energy of the filled spectrum after spectral folding to the initial energy of the original spectrum, as it is described by the equation further above.
In a particular embodiment, a combination of a signal and noise floor like energy estimation, in a frequency dependant manner, is made in order to build an appropriate envelope to be used after the spectral fill and folding. Fig. 10 illustrate a part of an encoder 20 used for such purposes. Spectral coefficients 66, e.g. transform coefficients, are input to an envelope coding section. Quantization errors 67 are introduced by the quantization of the spectral coefficients. The envelope coding section 60 comprising two estimators; a signal like energy estimator 62 and a noise floor like energy estimator 62. The estimators 62, 61 are connected to a quantizer 63 for quantization of the energy estimation outputs.
As can be seen in Fig. 10, rather than only using a signal like estimated envelope, it is in the present embodiment proposed to use a noise floor like energy estimation for the subbands below the transition frequency. The main difference with the signal like energy estimation, of the equations above, relies on the computation so that the quantization error will be flattened by- using a mean over the logarithmic values of its coefficients and not a logarithmic value of the averaged coefficients per subband. The combination of signal and noise floor like energy estimation at the encoder is used to build an appropriate envelope, which is applied to the filled spectrum at the decoder side.
Fig. 11 illustrates a flow diagram of steps of an embodiment of a decoding method according to the present invention. The method for perceptual spectral decoding starts in step 200. In step 210, spectral coefficients recovered from a binary flux are decoded into decoded spectral coefficients of an initial set of spectral coefficients. In step 212, spectrum filling of the initial set of spectral coefficients is performed, giving a set of reconstructed spectral coefficients. The set of reconstructed spectral coefficients of a frequency domain is converted in step 216 into an audio signal of a time domain. Step 212, in turn comprises a step 214, in which spectral holes are noise filled by setting spectral coefficients in the initial set of spectral coefficients not being decoded from the binary flux equal to elements derived from the decoded spectral coefficients. The procedure is ended in step 249.
Preferred embodiments of the method are to be found among the procedures described in connection with the devices further above. The spectrum fill part of the procedure of Fig. 11 can also be considered as a separate signal handling method that is generally used within perceptual spectral decoding. Such a signal handling method involves the central noise fill step and steps for obtaining an initial set of spectral coefficients and for outputting a set of reconstructed spectral coefficients.
In Fig. 12, a flow diagram of steps of a preferred embodiment of such a noise fill method according to the present invention is illustrated. This method may thus be used as a part of the method illustrated in Fig. 11. The method for signal handling starts in step 250. In step 260, an initial set of spectral coefficients is obtained. Step 270, being a spectrum filling step comprises a noise filling step 272, which in turn comprises a number of substeps 262- 266. In step 262, a spectral codebook is created from decoded spectral coefficients. In step 264, which may be omitted, the spectral codebook is postprocessed, as described further above. In step 266, fill elements are selected from the codebook to fill spectral holes in the initial set of spectral coefficients. In step 268, a set of recovered spectral coefficients is outputted. The procedure ends in step 299.
The invention described here above has many advantages, some of which will be mentioned here. The noise fill according to the present invention provides a high quality compared e.g. to typical noise fill with standard Gaussian white noise injection. It preserves the original signal temporal envelope. The complexity of the implementation of the present invention is very low compared solutions according to state of the art. The noise fill in the frequency domain can e.g. be adapted to the coding scheme under usage by defining an adaptive transition frequency at the encoder and/ or at the decoder side.
The embodiments described above are to be understood as a few illustrative examples of the present invention. It will be understood by those skilled in the art that various modifications, combinations and changes may be made to the embodiments without departing from the scope of the present invention. In particular, different part solutions in the different embodiments can be combined in other configurations, where technically possible. The scope of the present invention is, however, defined by the appended claims.
REFERENCES
[1] J. D. Johnston, "Transform coding of audio signals using perceptual noise criteria", IEEE J. Select. Areas Commun., Vol. 6, pp. 314-323,
1988. [2] J. Herre, "Temporal Noise Shaping, Quantization and Coding Methods in Perceptual Audio Coding: A tutorial introduction", AES 17th Int. conf. on High Quality Audio Coding, 1997. [3] 3GPP TS 26.404 V6.0.0 (2004-09), " Enhanced aacPlus general audio codec - encoder SBR part (Release 6)", 2004.

Claims

1. Method for perceptual spectral decoding, comprising the steps of: decoding (210) spectral coefficients recovered from a binary flux into decoded spectral coefficients of an initial set of spectral coefficients; spectrum filling (212) of said initial set of spectral coefficients into a set of reconstructed spectral coefficients; said spectrum filling (212) comprising noise filling (214) of spectral holes by setting spectral coefficients in said initial set of spectral coefficients not being decoded from said binary flux equal to elements derived from said decoded spectral coefficients; and converting (216) said set of reconstructed spectral coefficients of a frequency domain into an audio signal of a time domain.
2 Method according to claim 1, wherein said noise filling (214) in turn comprises creation (262) of a spectral codebook dependent on said decoded spectral coefficients, whereby said noise filling (214) of spectral holes comprises setting of spectral coefficients in said initial set of spectral coefficients equal to elements selected (266) from said spectral codebook.
3. Method according to claim 2, wherein said spectral codebook (51) comprises elements based on perceptually relevant decoded spectral coefficients from a present frame.
4. Method according to claim 2 or 3, wherein said spectral codebook comprises elements based on perceptually relevant decoded spectral coefficients from at least one of a past frame and a future frame.
5. Method according to any of the claims 2 to 4, wherein said elements are selected (266) from said spectral codebook according to at least one criterion.
6. Method according to claim 5, wherein said elements are selected (266) from said spectral codebook in index order as a circular buffer, starting from a low frequency end.
7. Method according to claim 5, wherein said elements are selected from said spectral codebook based on a spectral distance between a spectral hole to be filled and said selected element.
8. Method according to claim 5, wherein said elements are selected (266) from said spectral codebook based on an energy of a decoded spectral coefficient adjacent to a spectral hole to be filled and an energy of said selected element.
9. Method according to any of the claims 2 to 8, wherein said noise filling (214) further comprises postprocessing (264) of said spectral codebook, whereby said elements are selected (266) from said postprocessed spectral codebook.
10. Method according to any of the claims 1 to 9, wherein said spectrum filling (212) further comprises bandwidth extension.
11. Method according to claim 10, wherein said noise filling (214) is performed for frequencies below a transition frequency (ft) and said bandwidth extension is performed for frequencies above said transition frequency (ft).
12. Method according to claim 10 or 11, wherein said bandwidth extension comprises spectral folding.
13. Method according to any of the claims 1 to 12, wherein said noise filling (214) is performed in a normalized domain.
14. Method according to claim 13, further comprising the step of applying a spectral fill envelope on said set of spectral coefficients in order to conserve an initial energy.
15. Method according to any of the claims 1 to 14, wherein said converting (216) comprises inverse transformation using at least one of an inverse transform and an inverse filter bank.
16. Method for signal handling in perceptual spectral decoding, comprising the steps of: obtaining (260) decoded spectral coefficients of an initial set of spectral coefficients; spectrum filling (212) of said initial set of spectral coefficients into a set of reconstructed spectral coefficients; said spectrum filling (212) comprising noise filling (214) of spectral holes by setting spectral coefficients in said initial set of spectral coefficients having a zero magnitude or being non-decoded equal to elements derived from said decoded spectral coefficients; and outputting (268) said set of reconstructed spectral coefficients.
17. Perceptual spectral decoder (40), comprising: an input for a binary flux (25); a spectral coefficient decoder (41) arranged for decoding spectral coefficients recovered from said binary flux (25) into decoded spectral coefficients of an initial set of spectral coefficients (42); a spectrum filler (43) connected to said spectral coefficient decoder (41) and arranged for spectrum filling of said set of spectral coefficients (42); said spectrum filler (43) comprising a noise filler (50) for noise filling of spectral holes by setting spectral coefficients in said initial set of spectral coefficients (42) not being decoded from said binary flux (25) equal to elements derived from said decoded spectral coefficients; and a converter (45) connected to said spectrum filler (43) and arranged for converting said set of reconstructed spectral coefficients of a frequency domain into an audio signal (34) of a time domain; and an output (35) for said audio signal (34).
18. Perceptual spectral decoder according to claim 17, wherein said noise filler (50) in turn comprising a spectral codebook generator (51); said spectral codebook generator (51) being arranged for creating a spectral codebook from said decoded spectral coefficients, and whereby said noise filler (50) being arranged for filling said spectral holes with elements selected from said spectral codebook.
19. Perceptual spectral decoder according to claim 18, wherein said spectral codebook generator (51) is arranged for creating said spectral codebook to comprise elements based on perceptually relevant decoded spectral coefficients from a present frame.
20. Perceptual spectral decoder according to claim 18 or 19, wherein said spectral codebook generator (51) is arranged for creating said spectral codebook to comprise elements based on perceptually relevant decoded spectral coefficients from at least one of a past frame and a future frame.
21. Perceptual spectral decoder according to any of the claims 18 to 20, wherein said noise filler (50) being further arranged to select said elements from said spectral codebook according to at least one criterion.
22. Perceptual spectral decoder according to claim 21, wherein said noise filler (50) being further arranged to select said elements from said spectral codebook in index order as a circular buffer, starting from a low frequency end.
23. Perceptual spectral decoder according to claim 21, wherein said noise filler (50) being further arranged to select said elements from said spectral codebook based on a spectral distance between a spectral hole to be filled and said selected element.
24. Perceptual spectral decoder according to claim 21, wherein said noise filler (50) being further arranged to select said elements from said spectral codebook based on an energy of a recovered spectral coefficient adjacent to a spectral hole to be filled and an energy of said selected element.
25. Perceptual spectral decoder according to any of the claims 18 to 24, wherein said noise filler (50) further comprises a postprocessor arranged for postprocessing said spectral codebook, whereby said noise filler (50) being arranged for selecting said elements from said postprocessed spectral codebook.
26. Perceptual spectral decoder according to any of the claims 17 to 25, wherein said spectrum filler (43) further comprises a bandwidth extender (55).
27. Perceptual spectral decoder according to claim 26, wherein said noise filler (50) is arranged for performing noise filling for frequencies below a transition frequency (ft) and said bandwidth extender (55) being arranged for extending a bandwidth for frequencies above said transition frequency (ft) .
28. Perceptual spectral decoder according to claim 26 or 27, wherein said bandwidth extender (55) comprises a spectral folding section.
29. Perceptual spectral decoder according to any of the claims 17 to 28, wherein said noise filler (50) is arranged to operate in a normalized domain.
30. Perceptual spectral decoder according to claim 29, further comprising a spectral fill envelope applier (57) arranged for applying a spectral fill envelope on said set of spectral coefficients in order to conserve an initial energy.
31. Perceptual spectral decoder according to any of the claims 17 to 30, wherein said converter (45) comprises at least one of an inverse transform section and an inverse filter bank.
32. Signal handling device for use in a perceptual spectral decoder, comprising: an input for decoded spectral coefficients of an initial set of spectral coefficients; a spectrum filler (43) connected to said input and arranged for spectrum filling of said initial set of spectral coefficients into a set of reconstructed spectral coefficients; said spectrum filler (43) comprising a noise filler (50) for noise filling of spectral holes by setting spectral coefficients in said initial set of spectral coefficients having a zero magnitude or being non-decoded equal to elements derived from said decoded spectral coefficients; and an output for said set of reconstructed spectral coefficients.
PCT/SE2008/050968 2007-08-27 2008-08-26 Method and device for noise filling WO2009029036A1 (en)

Priority Applications (13)

Application Number Priority Date Filing Date Title
MX2010001504A MX2010001504A (en) 2007-08-27 2008-08-26 Method and device for noise filling.
PL18176984T PL3401907T3 (en) 2007-08-27 2008-08-26 Method and device for perceptual spectral decoding of an audio signal including filling of spectral holes
EP18176984.5A EP3401907B1 (en) 2007-08-27 2008-08-26 Method and device for perceptual spectral decoding of an audio signal including filling of spectral holes
US12/675,290 US8370133B2 (en) 2007-08-27 2008-08-26 Method and device for noise filling
CN2008801048087A CN101809657B (en) 2007-08-27 2008-08-26 Method and device for noise filling
EP08828426.0A EP2186089B1 (en) 2007-08-27 2008-08-26 Method and device for perceptual spectral decoding of an audio signal including filling of spectral holes
DK08828426.0T DK2186089T3 (en) 2007-08-27 2008-08-26 Method and apparatus for perceptual spectral decoding of an audio signal including filling in spectral holes
PL19194270T PL3591650T3 (en) 2007-08-27 2008-08-26 Method and device for filling of spectral holes
EP19194270.5A EP3591650B1 (en) 2007-08-27 2008-08-26 Method and device for filling of spectral holes
JP2010522868A JP5255638B2 (en) 2007-08-27 2008-08-26 Noise replenishment method and apparatus
ES08828426T ES2704286T3 (en) 2007-08-27 2008-08-26 Method and device for the perceptual spectral decoding of an audio signal, including the filling of spectral holes
CA2698031A CA2698031C (en) 2007-08-27 2008-08-26 Method and device for noise filling
US13/755,672 US9111532B2 (en) 2007-08-27 2013-01-31 Methods and systems for perceptual spectral decoding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US96823007P 2007-08-27 2007-08-27
US60/968,230 2007-08-27

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US12/675,290 A-371-Of-International US8370133B2 (en) 2007-08-27 2008-08-26 Method and device for noise filling
US13/755,672 Continuation US9111532B2 (en) 2007-08-27 2013-01-31 Methods and systems for perceptual spectral decoding

Publications (1)

Publication Number Publication Date
WO2009029036A1 true WO2009029036A1 (en) 2009-03-05

Family

ID=40387560

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SE2008/050968 WO2009029036A1 (en) 2007-08-27 2008-08-26 Method and device for noise filling

Country Status (12)

Country Link
US (2) US8370133B2 (en)
EP (3) EP3401907B1 (en)
JP (1) JP5255638B2 (en)
CN (1) CN101809657B (en)
CA (1) CA2698031C (en)
DK (3) DK3401907T3 (en)
ES (3) ES2704286T3 (en)
HU (2) HUE047607T2 (en)
MX (1) MX2010001504A (en)
PL (2) PL3401907T3 (en)
PT (1) PT2186089T (en)
WO (1) WO2009029036A1 (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010003565A1 (en) * 2008-07-11 2010-01-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Noise filler, noise filling parameter calculator, method for providing a noise filling parameter, method for providing a noise-filled spectral representation of an audio signal, corresponding computer program and encoded audio signal
CN102027537A (en) * 2009-04-02 2011-04-20 弗劳恩霍夫应用研究促进协会 Apparatus, method and computer program for generating a representation of a bandwidth-extended signal on the basis of an input signal representation using a combination of a harmonic bandwidth-extension and a non-harmonic bandwidth-extension
US20120046955A1 (en) * 2010-08-17 2012-02-23 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for noise injection
WO2012053150A1 (en) * 2010-10-18 2012-04-26 パナソニック株式会社 Audio encoding device and audio decoding device
WO2012139668A1 (en) * 2011-04-15 2012-10-18 Telefonaktiebolaget L M Ericsson (Publ) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
KR20120127335A (en) * 2011-05-13 2012-11-21 삼성전자주식회사 Noise filling Method, audio decoding method and apparatus, recoding medium and multimedia device employing the same
CN103366750A (en) * 2012-03-28 2013-10-23 北京天籁传音数字技术有限公司 Sound coding and decoding apparatus and sound coding and decoding method
RU2509380C2 (en) * 2009-11-27 2014-03-10 ЗетТиИ Корпорейшн Method and apparatus for hierarchical encoding and decoding audio
CN103843062A (en) * 2011-06-30 2014-06-04 三星电子株式会社 Apparatus and method for generating bandwidth extension signal
US8831933B2 (en) 2010-07-30 2014-09-09 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for multi-stage shape vector quantization
US9626972B2 (en) 2012-12-06 2017-04-18 Huawei Technologies Co., Ltd. Method and device for decoding signal
US11508394B2 (en) 2019-01-04 2022-11-22 Samsung Electronics Co., Ltd. Device and method for wirelessly communicating on basis of neural network model

Families Citing this family (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB0704622D0 (en) * 2007-03-09 2007-04-18 Skype Ltd Speech coding system and method
US8370133B2 (en) * 2007-08-27 2013-02-05 Telefonaktiebolaget L M Ericsson (Publ) Method and device for noise filling
CN101939782B (en) 2007-08-27 2012-12-05 爱立信电话股份有限公司 Adaptive transition frequency between noise fill and bandwidth extension
US8190440B2 (en) * 2008-02-29 2012-05-29 Broadcom Corporation Sub-band codec with native voice activity detection
KR101390433B1 (en) * 2009-03-31 2014-04-29 후아웨이 테크놀러지 컴퍼니 리미티드 Signal de-noising method, signal de-noising apparatus, and audio decoding system
JP5754899B2 (en) 2009-10-07 2015-07-29 ソニー株式会社 Decoding apparatus and method, and program
JP5609737B2 (en) 2010-04-13 2014-10-22 ソニー株式会社 Signal processing apparatus and method, encoding apparatus and method, decoding apparatus and method, and program
JP5850216B2 (en) 2010-04-13 2016-02-03 ソニー株式会社 Signal processing apparatus and method, encoding apparatus and method, decoding apparatus and method, and program
JP6075743B2 (en) * 2010-08-03 2017-02-08 ソニー株式会社 Signal processing apparatus and method, and program
WO2012037515A1 (en) 2010-09-17 2012-03-22 Xiph. Org. Methods and systems for adaptive time-frequency resolution in digital data coding
JP5707842B2 (en) 2010-10-15 2015-04-30 ソニー株式会社 Encoding apparatus and method, decoding apparatus and method, and program
US9009036B2 (en) 2011-03-07 2015-04-14 Xiph.org Foundation Methods and systems for bit allocation and partitioning in gain-shape vector quantization for audio coding
US8838442B2 (en) 2011-03-07 2014-09-16 Xiph.org Foundation Method and system for two-step spreading for tonal artifact avoidance in audio coding
US9015042B2 (en) * 2011-03-07 2015-04-21 Xiph.org Foundation Methods and systems for avoiding partial collapse in multi-block audio coding
CN105448298B (en) * 2011-03-10 2019-05-14 瑞典爱立信有限公司 Fill the non-coding subvector in transform encoded audio signal
EP2684190B1 (en) * 2011-03-10 2015-11-18 Telefonaktiebolaget L M Ericsson (PUBL) Filling of non-coded sub-vectors in transform coded audio signals
JP2013015598A (en) * 2011-06-30 2013-01-24 Zte Corp Audio coding/decoding method, system and noise level estimation method
JP5416173B2 (en) * 2011-07-07 2014-02-12 中興通訊股▲ふん▼有限公司 Frequency band copy method, apparatus, audio decoding method, and system
KR101926651B1 (en) 2013-01-29 2019-03-07 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에.베. Noise Filling Concept
EP2830064A1 (en) * 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for decoding and encoding an audio signal using adaptive spectral tile selection
WO2015041070A1 (en) 2013-09-19 2015-03-26 ソニー株式会社 Encoding device and method, decoding device and method, and program
RU2666468C2 (en) * 2013-10-31 2018-09-07 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Audio bandwidth extension by insertion of temporal pre-shaped noise in frequency domain
JP6593173B2 (en) 2013-12-27 2019-10-23 ソニー株式会社 Decoding apparatus and method, and program
EP3115991A4 (en) * 2014-03-03 2017-08-02 Samsung Electronics Co., Ltd. Method and apparatus for high frequency decoding for bandwidth extension
US10468035B2 (en) 2014-03-24 2019-11-05 Samsung Electronics Co., Ltd. High-band encoding method and device, and high-band decoding method and device
JP6432180B2 (en) * 2014-06-26 2018-12-05 ソニー株式会社 Decoding apparatus and method, and program
EP2980792A1 (en) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for generating an enhanced signal using independent noise-filling
KR102482162B1 (en) * 2014-10-01 2022-12-29 돌비 인터네셔널 에이비 Audio encoder and decoder
WO2016142002A1 (en) 2015-03-09 2016-09-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal
EP3182411A1 (en) 2015-12-14 2017-06-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for processing an encoded audio signal
JP7123134B2 (en) * 2017-10-27 2022-08-22 フラウンホファー ゲセルシャフト ツール フェールデルンク ダー アンゲヴァンテン フォルシュンク エー.ファオ. Noise attenuation in decoder
WO2019172811A1 (en) * 2018-03-08 2019-09-12 Telefonaktiebolaget Lm Ericsson (Publ) Method and apparatus for handling antenna signals for transmission between a base unit and a remote unit of a base station system
US11495237B2 (en) 2018-04-05 2022-11-08 Telefonaktiebolaget Lm Ericsson (Publ) Support for generation of comfort noise, and generation of comfort noise

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1991016769A1 (en) * 1990-04-12 1991-10-31 Dolby Laboratories Licensing Corporation Adaptive-block-length, adaptive-transform, and adaptive-window transform coder, decoder, and encoder/decoder for high-quality audio
US20030233234A1 (en) 2002-06-17 2003-12-18 Truman Michael Mead Audio coding system using spectral hole filling
WO2003107329A1 (en) * 2002-06-01 2003-12-24 Dolby Laboratories Licensing Corporation Audio coding system using characteristics of a decoded signal to adapt synthesized spectral components
WO2005078706A1 (en) * 2004-02-18 2005-08-25 Voiceage Corporation Methods and devices for low-frequency emphasis during audio compression based on acelp/tcx
US20050267739A1 (en) * 2004-05-25 2005-12-01 Nokia Corporation Neuroevolution based artificial bandwidth expansion of telephone band speech
WO2006107840A1 (en) * 2005-04-01 2006-10-12 Qualcomm Incorporated Systems, methods, and apparatus for wideband speech coding
US20060265087A1 (en) * 2003-03-04 2006-11-23 France Telecom Sa Method and device for spectral reconstruction of an audio signal
US20070041324A1 (en) * 2005-06-10 2007-02-22 Kishan Shenoi Adaptive play-out buffers and adaptive clock operation in packet networks

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3276977B2 (en) * 1992-04-02 2002-04-22 シャープ株式会社 Audio coding device
US6157811A (en) * 1994-01-11 2000-12-05 Ericsson Inc. Cellular/satellite communications system with improved frequency re-use
US5619503A (en) * 1994-01-11 1997-04-08 Ericsson Inc. Cellular/satellite communications system with improved frequency re-use
JPH1091194A (en) * 1996-09-18 1998-04-10 Sony Corp Method of voice decoding and device therefor
DE60209888T2 (en) * 2001-05-08 2006-11-23 Koninklijke Philips Electronics N.V. CODING AN AUDIO SIGNAL
US20030187663A1 (en) * 2002-03-28 2003-10-02 Truman Michael Mead Broadband frequency translation for high frequency regeneration
CA2388358A1 (en) * 2002-05-31 2003-11-30 Voiceage Corporation A method and device for multi-rate lattice vector quantization
US7831421B2 (en) * 2005-05-31 2010-11-09 Microsoft Corporation Robust decoder
US7630882B2 (en) * 2005-07-15 2009-12-08 Microsoft Corporation Frequency segmentation to obtain bands for efficient coding of digital media
US7885819B2 (en) * 2007-06-29 2011-02-08 Microsoft Corporation Bitstream syntax for multi-process audio decoding
US8370133B2 (en) * 2007-08-27 2013-02-05 Telefonaktiebolaget L M Ericsson (Publ) Method and device for noise filling
CN101939782B (en) * 2007-08-27 2012-12-05 爱立信电话股份有限公司 Adaptive transition frequency between noise fill and bandwidth extension

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1991016769A1 (en) * 1990-04-12 1991-10-31 Dolby Laboratories Licensing Corporation Adaptive-block-length, adaptive-transform, and adaptive-window transform coder, decoder, and encoder/decoder for high-quality audio
WO2003107329A1 (en) * 2002-06-01 2003-12-24 Dolby Laboratories Licensing Corporation Audio coding system using characteristics of a decoded signal to adapt synthesized spectral components
US20030233234A1 (en) 2002-06-17 2003-12-18 Truman Michael Mead Audio coding system using spectral hole filling
US20060265087A1 (en) * 2003-03-04 2006-11-23 France Telecom Sa Method and device for spectral reconstruction of an audio signal
WO2005078706A1 (en) * 2004-02-18 2005-08-25 Voiceage Corporation Methods and devices for low-frequency emphasis during audio compression based on acelp/tcx
US20050267739A1 (en) * 2004-05-25 2005-12-01 Nokia Corporation Neuroevolution based artificial bandwidth expansion of telephone band speech
WO2006107840A1 (en) * 2005-04-01 2006-10-12 Qualcomm Incorporated Systems, methods, and apparatus for wideband speech coding
US20070041324A1 (en) * 2005-06-10 2007-02-22 Kishan Shenoi Adaptive play-out buffers and adaptive clock operation in packet networks

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Enhanced aacPlus general audio codec - encoder SBR part (Release 6", 3GPP TS 26.404 V6.0.0 (2004-09, 2004

Cited By (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8983851B2 (en) 2008-07-11 2015-03-17 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Noise filer, noise filling parameter calculator encoded audio signal representation, methods and computer program
US9449606B2 (en) 2008-07-11 2016-09-20 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder, methods for encoding and decoding an audio signal, and a computer program
US9711157B2 (en) 2008-07-11 2017-07-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder, methods for encoding and decoding an audio signal, and a computer program
WO2010003565A1 (en) * 2008-07-11 2010-01-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Noise filler, noise filling parameter calculator, method for providing a noise filling parameter, method for providing a noise-filled spectral representation of an audio signal, corresponding computer program and encoded audio signal
US12080306B2 (en) 2008-07-11 2024-09-03 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder, methods for encoding and decoding an audio signal, audio stream and a computer program
US12080305B2 (en) 2008-07-11 2024-09-03 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder, methods for encoding and decoding an audio signal, audio stream and a computer program
US10629215B2 (en) 2008-07-11 2020-04-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder, methods for encoding and decoding an audio signal, and a computer program
US11024323B2 (en) 2008-07-11 2021-06-01 Fraunhofer-Gesellschaft zur Fcerderung der angewandten Forschung e.V. Audio encoder, audio decoder, methods for encoding and decoding an audio signal, audio stream and a computer program
US9043203B2 (en) 2008-07-11 2015-05-26 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder, methods for encoding and decoding an audio signal, and a computer program
US11869521B2 (en) 2008-07-11 2024-01-09 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder, methods for encoding and decoding an audio signal, audio stream and a computer program
CN102027537A (en) * 2009-04-02 2011-04-20 弗劳恩霍夫应用研究促进协会 Apparatus, method and computer program for generating a representation of a bandwidth-extended signal on the basis of an input signal representation using a combination of a harmonic bandwidth-extension and a non-harmonic bandwidth-extension
RU2509380C2 (en) * 2009-11-27 2014-03-10 ЗетТиИ Корпорейшн Method and apparatus for hierarchical encoding and decoding audio
US9236063B2 (en) 2010-07-30 2016-01-12 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for dynamic bit allocation
US8924222B2 (en) 2010-07-30 2014-12-30 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for coding of harmonic signals
US8831933B2 (en) 2010-07-30 2014-09-09 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for multi-stage shape vector quantization
US9208792B2 (en) * 2010-08-17 2015-12-08 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for noise injection
US20120046955A1 (en) * 2010-08-17 2012-02-23 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for noise injection
JP5695074B2 (en) * 2010-10-18 2015-04-01 パナソニック インテレクチュアル プロパティ コーポレーション オブアメリカPanasonic Intellectual Property Corporation of America Speech coding apparatus and speech decoding apparatus
WO2012053150A1 (en) * 2010-10-18 2012-04-26 パナソニック株式会社 Audio encoding device and audio decoding device
JPWO2012053150A1 (en) * 2010-10-18 2014-02-24 パナソニック株式会社 Speech coding apparatus and speech decoding apparatus
US8706509B2 (en) 2011-04-15 2014-04-22 Telefonaktiebolaget L M Ericsson (Publ) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
US9595268B2 (en) 2011-04-15 2017-03-14 Telefonaktiebolaget Lm Ericsson (Publ) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
CN103503065A (en) * 2011-04-15 2014-01-08 瑞典爱立信有限公司 Method and a decoder for attenuation of signal regions reconstructed with low accuracy
US9349379B2 (en) 2011-04-15 2016-05-24 Telefonaktiebolaget L M Ericsson (Publ) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
WO2012139668A1 (en) * 2011-04-15 2012-10-18 Telefonaktiebolaget L M Ericsson (Publ) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
CN103503065B (en) * 2011-04-15 2015-08-05 瑞典爱立信有限公司 For method and the demoder of the signal area of the low accuracy reconstruct that decays
EP3067888A1 (en) * 2011-04-15 2016-09-14 Telefonaktiebolaget LM Ericsson (publ) Decoder for attenuation of signal regions reconstructed with low accuracy
EP2816556A1 (en) * 2011-04-15 2014-12-24 Telefonaktiebolaget L M Ericsson (PUBL) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
US9691398B2 (en) 2011-04-15 2017-06-27 Telefonaktiebolaget Lm Ericsson (Publ) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
KR102053900B1 (en) * 2011-05-13 2019-12-09 삼성전자주식회사 Noise filling Method, audio decoding method and apparatus, recoding medium and multimedia device employing the same
US10276171B2 (en) 2011-05-13 2019-04-30 Samsung Electronics Co., Ltd. Noise filling and audio decoding
KR20120127335A (en) * 2011-05-13 2012-11-21 삼성전자주식회사 Noise filling Method, audio decoding method and apparatus, recoding medium and multimedia device employing the same
KR102284106B1 (en) 2011-05-13 2021-07-30 삼성전자주식회사 Noise filling Method, audio decoding method and apparatus, recoding medium and multimedia device employing the same
KR20200143332A (en) * 2011-05-13 2020-12-23 삼성전자주식회사 Noise filling Method, audio decoding method and apparatus, recoding medium and multimedia device employing the same
KR102193621B1 (en) 2011-05-13 2020-12-21 삼성전자주식회사 Noise filling Method, audio decoding method and apparatus, recoding medium and multimedia device employing the same
KR20190138767A (en) * 2011-05-13 2019-12-16 삼성전자주식회사 Noise filling Method, audio decoding method and apparatus, recoding medium and multimedia device employing the same
CN106128473B (en) * 2011-06-30 2019-12-10 三星电子株式会社 Apparatus and method for generating bandwidth extended signal
AU2012276367B2 (en) * 2011-06-30 2016-02-04 Samsung Electronics Co., Ltd. Apparatus and method for generating bandwidth extension signal
US10037766B2 (en) 2011-06-30 2018-07-31 Samsung Electronics Co., Ltd. Apparatus and method for generating bandwith extension signal
AU2017202211C1 (en) * 2011-06-30 2018-08-02 Samsung Electronics Co., Ltd. Apparatus and method for generating bandwidth extended signal
CN106157968A (en) * 2011-06-30 2016-11-23 三星电子株式会社 For producing equipment and the method for bandwidth expansion signal
CN103843062A (en) * 2011-06-30 2014-06-04 三星电子株式会社 Apparatus and method for generating bandwidth extension signal
CN106157968B (en) * 2011-06-30 2019-11-29 三星电子株式会社 For generating the device and method of bandwidth expansion signal
US9349380B2 (en) 2011-06-30 2016-05-24 Samsung Electronics Co., Ltd. Apparatus and method for generating bandwidth extension signal
AU2016202120B2 (en) * 2011-06-30 2017-01-05 Samsung Electronics Co., Ltd. Apparatus and method for generating bandwidth extended signal
US9734843B2 (en) 2011-06-30 2017-08-15 Samsung Electronics Co., Ltd. Apparatus and method for generating bandwidth extension signal
CN106128473A (en) * 2011-06-30 2016-11-16 三星电子株式会社 For producing equipment and the method for bandwidth expansion signal
AU2017202211B2 (en) * 2011-06-30 2018-01-18 Samsung Electronics Co., Ltd. Apparatus and method for generating bandwidth extended signal
EP2728577A4 (en) * 2011-06-30 2016-07-27 Samsung Electronics Co Ltd Apparatus and method for generating bandwidth extension signal
CN103366750B (en) * 2012-03-28 2015-10-21 北京天籁传音数字技术有限公司 A kind of sound codec devices and methods therefor
CN103366750A (en) * 2012-03-28 2013-10-23 北京天籁传音数字技术有限公司 Sound coding and decoding apparatus and sound coding and decoding method
US10546589B2 (en) 2012-12-06 2020-01-28 Huawei Technologies Co., Ltd. Method and device for decoding signal
US10971162B2 (en) 2012-12-06 2021-04-06 Huawei Technologies Co., Ltd. Method and device for decoding signal
US9626972B2 (en) 2012-12-06 2017-04-18 Huawei Technologies Co., Ltd. Method and device for decoding signal
US11610592B2 (en) 2012-12-06 2023-03-21 Huawei Technologies Co., Ltd. Method and device for decoding signal
US9830914B2 (en) 2012-12-06 2017-11-28 Huawei Technologies Co., Ltd. Method and device for decoding signal
US10236002B2 (en) 2012-12-06 2019-03-19 Huawei Technologies Co., Ltd. Method and device for decoding signal
US11508394B2 (en) 2019-01-04 2022-11-22 Samsung Electronics Co., Ltd. Device and method for wirelessly communicating on basis of neural network model

Also Published As

Publication number Publication date
US8370133B2 (en) 2013-02-05
CN101809657A (en) 2010-08-18
HUE041323T2 (en) 2019-05-28
EP2186089A4 (en) 2011-12-28
EP2186089B1 (en) 2018-10-03
US20130218577A1 (en) 2013-08-22
EP3401907A1 (en) 2018-11-14
ES2774956T3 (en) 2020-07-23
HUE047607T2 (en) 2020-05-28
PL3401907T3 (en) 2020-05-18
EP3591650A1 (en) 2020-01-08
CA2698031C (en) 2016-10-18
DK3591650T3 (en) 2021-02-15
US20100241437A1 (en) 2010-09-23
US9111532B2 (en) 2015-08-18
CN101809657B (en) 2012-05-30
EP2186089A1 (en) 2010-05-19
JP5255638B2 (en) 2013-08-07
ES2858423T3 (en) 2021-09-30
EP3401907B1 (en) 2019-11-20
DK2186089T3 (en) 2019-01-07
PL3591650T3 (en) 2021-07-05
PT2186089T (en) 2019-01-10
CA2698031A1 (en) 2009-03-05
ES2704286T3 (en) 2019-03-15
JP2010538317A (en) 2010-12-09
DK3401907T3 (en) 2020-03-02
MX2010001504A (en) 2010-03-10
EP3591650B1 (en) 2020-12-23

Similar Documents

Publication Publication Date Title
US9111532B2 (en) Methods and systems for perceptual spectral decoding
US11990147B2 (en) Adaptive transition frequency between noise fill and bandwidth extension
CN1957398B (en) Methods and devices for low-frequency emphasis during audio compression based on acelp/tcx
KR101586317B1 (en) A method and an apparatus for processing a signal
US20070219785A1 (en) Speech post-processing using MDCT coefficients
MX2014000161A (en) Apparatus and method for generating bandwidth extension signal.
US6611798B2 (en) Perceptually improved encoding of acoustic signals
AU2001284606A1 (en) Perceptually improved encoding of acoustic signals
KR102390360B1 (en) Backward-compatible integration of harmonic transposer for high frequency reconstruction of audio signals

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880104808.7

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08828426

Country of ref document: EP

Kind code of ref document: A1

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2008828426

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: MX/A/2010/001504

Country of ref document: MX

ENP Entry into the national phase

Ref document number: 2010522868

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 12675290

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2698031

Country of ref document: CA

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 1654/DELNP/2010

Country of ref document: IN