KR20080035448A

KR20080035448A - Method and apparatus for encoding/decoding multi channel audio signal

Info

Publication number: KR20080035448A
Application number: KR1020070088315A
Authority: KR
Inventors: 김중회; 오은미
Original assignee: 삼성전자주식회사
Priority date: 2006-10-18
Filing date: 2007-08-31
Publication date: 2008-04-23
Also published as: KR101434834B1

Abstract

A method and an apparatus for encoding and decoding a multi-channel audio signal are provided to perform the up-mixing of decoded core audio data by using decoded residual coding data, thereby decoding core audio data and residual coding data in the same decoding type to reduce the complexity of a decoder. A method for encoding and decoding a multi-channel audio signal comprises the following steps of: detecting a type of space extension data included in the encoding result of the audio signal(700); detecting a core audio object type when the space extension data is data indicating the core audio object type in which core audio data is encoded(710); decoding the core audio data in a decoding type according to the detected core audio object type(720); decoding residual coding data in the decoding type according to the core audio object type when the space extension data is residual coding data(730); and up-mixing the decoded core audio data by using the decoded residual coding data(740).

Description

Method and apparatus for encoding / decoding multi-channel audio signal

본 발명은 다채널 오디오 신호의 부호화/복호화 방법 및 장치에 관한 것으로, 보다 상세하게는 오디오 신호를 업믹싱하는 데 사용되는 레지듀얼 신호의 부호화/복호화 방법 및 장치에 관한 것이다.The present invention relates to a method and apparatus for encoding / decoding multichannel audio signals, and more particularly, to a method and apparatus for encoding / decoding residual signals used for upmixing an audio signal.

MPEG(Moving Picture Experts Group) 서라운드(surround) 기술은 오디오 신호의부호화에 있어서 공간의 음원에 대한 오디오 데이터를 압축하는 기술로서, MP3(MPEG Audio Layer-3), MPEG-4 AAC(Advanced Audio Coding) 또는 MPEG-4 HE(High Efficiency)-AAC로 압축된 오디오 신호를 고품질 멀티 채널 서라운드 오디오로 바꿔준다. MPEG 서라운드는 기존 스테레오 장비에 대한 백워드(backward) 호환성을 유지하고, 기존 장비를 사용하면서도 고품질 멀티 채널 오디오 압축에 필요한 비트레이트(bitrate), 즉, 전송 속도를 줄일 수 있는 장점이 있다.Moving Picture Experts Group (MPEG) surround technology is a technology that compresses audio data for a sound source in a space in encoding an audio signal, and includes MP3 (MPEG Audio Layer-3) and MPEG-4 Advanced Audio Coding (AAC). Or, convert an audio signal compressed with MPEG-4 High Efficiency (AAC) -AAC into high quality multi-channel surround audio. MPEG surround has the advantage of maintaining backward compatibility with existing stereo equipment and reducing the bitrate, i.e., transmission speed, required for high quality multi-channel audio compression while using existing equipment.

MPEG 서라운드 표준에 따르면 코어 오디오 신호(core audio signal)는 BSAC(Bit Sliced Arithmetic Coding), AAC, MP3(MPEG Audio Layer-3) 등 가운데 어느 하나의 부호화 방식을 이용하여 부호화하지만, 레지듀얼 신호(residual signal) 는 AAC에 의하여만 부호화한다.According to the MPEG surround standard, the core audio signal is encoded by using any one of a coding method such as Bit Sliced Arithmetic Coding (BSAC), AAC, or MPEG Audio Layer-3 (MP3), but a residual signal is used. signal) is encoded only by AAC.

그러므로, MPEG 서라운드 표준에 따라 코어 오디오 신호를 AAC가 아닌 다른 부호화 방식으로 부호화할 경우, 부호화단에서 코어 오디오 신호와 레지듀얼 신호를 각기 다른 부호화 방식에 의하여 부호화하여야 한다. 마찬가지로, 복호화단에서도 코어 오디오 신호와 레지듀얼 신호를 각기 다른 복호화 방식에 의하여 복호화하여야 한다.Therefore, when the core audio signal is encoded using a coding scheme other than AAC according to the MPEG surround standard, the core audio signal and the residual signal must be encoded by different encoding schemes. Similarly, the decoding end should decode the core audio signal and the residual signal by different decoding methods.

본 발명이 해결하고자 하는 과제는 레지듀얼 신호를 복호화하는 경우 복호화단의 복잡도를 줄일 수 있는 다채널 오디오 신호의 복호화 방법 및 장치를 제공하는데 있다.An object of the present invention is to provide a method and apparatus for decoding a multi-channel audio signal that can reduce the complexity of the decoding stage when decoding the residual signal.

또한, 본 발명이 해결하고자 하는 다른 과제는 레지듀얼 신호를 부호화하는 경우 부호화단의 복잡도를 줄일 수 있는 다채널 오디오 신호의 부호화 방법 및 장치를 제공하는데 있다.In addition, another object of the present invention is to provide a method and apparatus for encoding a multi-channel audio signal that can reduce the complexity of the encoding stage when the residual signal is encoded.

상기 과제를 해결하기 위한 본 발명에 따른 다채널 오디오 신호의 복호화 방법은 오디오 신호의 부호화 결과에 포함된 공간 확장 데이터의 타입을 검출하는 단계; 상기 공간 확장 데이터가 코어 오디오 데이터가 부호화된 방식인 코어 오디오 오브젝트 타입을 나타내는 데이터인 경우, 상기 코어 오디오 오브젝트 타입을 검출하는 단계; 상기 검출된 코어 오디오 오브젝트 타입에 따른 복호화 방식으로 상기 코어 오디오 데이터를 복호화하는 단계; 상기 공간 확장 데이터가 레지듀얼 코딩 데이터인 경우, 상기 코어 오디오 오브젝트 타입에 따른 상기 복호화 방식으로 상기 레지듀얼 코딩 데이터를 복호화하는 단계; 및 상기 복호화된 코어 오디오 데이터를 상기 복호화된 레지듀얼 코딩 데이터를 이용하여 업믹싱하는 단계를 포함한다.According to an aspect of the present invention, there is provided a method of decoding a multichannel audio signal, the method including: detecting a type of spatial extension data included in an encoding result of an audio signal; Detecting the core audio object type when the spatial extension data is data representing a core audio object type in which core audio data is encoded; Decoding the core audio data by a decoding method according to the detected core audio object type; Decoding the residual coded data by the decoding method according to the core audio object type when the spatial extension data is residual coded data; And upmixing the decoded core audio data using the decoded residual coded data.

상기 과제는 오디오 신호의 부호화 결과에 포함된 공간 확장 데이터의 타입을 검출하는 단계; 상기 공간 확장 데이터가 코어 오디오 데이터가 부호화된 방식인 코어 오디오 오브젝트 타입을 나타내는 데이터인 경우, 상기 코어 오디오 오브젝트 타입을 검출하는 단계; 상기 검출된 코어 오디오 오브젝트 타입에 따른 복호화 방식으로 상기 코어 오디오 데이터를 복호화하는 단계; 상기 공간 확장 데이터가 레지듀얼 코딩 데이터인 경우, 상기 코어 오디오 오브젝트 타입에 따른 상기 복호화 방식으로 상기 레지듀얼 코딩 데이터를 복호화하는 단계; 및 상기 복호화된 코어 오디오 데이터를 상기 복호화된 레지듀얼 코딩 데이터를 이용하여 업믹싱하는 단계를 포함하는 다채널 오디오 신호의 복호화 방법를 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 의해 달성된다.The task may include detecting a type of spatial extension data included in an encoding result of an audio signal; Detecting the core audio object type when the spatial extension data is data representing a core audio object type in which core audio data is encoded; Decoding the core audio data by a decoding method according to the detected core audio object type; Decoding the residual coded data by the decoding method according to the core audio object type when the spatial extension data is residual coded data; And upmixing the decoded core audio data using the decoded residual coded data. The method may be achieved by a computer-readable recording medium having recorded thereon a program for executing a method of decoding a multichannel audio signal.

또한, 상기 다른 과제를 해결하기 위한 본 발명에 따른 다채널 오디오 신호의 복호화 장치는 오디오 신호의 부호화 결과에 포함된 공간 확장 데이터의 타입을 검출하는 공간 확장 데이터 타입 검출부; 상기 공간 확장 데이터가 코어 오디오 데이터가 부호화된 방식인 코어 오디오 오브젝트 타입을 나타내는 데이터인 경우, 상기 코어 오디오 오브젝트 타입을 검출하는 코어 오디오 오브젝트 타입 검출부; 상 기 검출된 코어 오디오 오브젝트 타입에 따른 복호화 방식으로 상기 코어 오디오 데이터를 복호화하는 코어 오디오 데이터 복호화부; 상기 공간 확장 데이터가 레지듀얼 코딩 데이터인 경우, 상기 코어 오디오 오브젝트 타입에 따른 상기 복호화 방식으로 상기 레지듀얼 코딩 데이터를 복호화하는 레지듀얼 코딩 데이터 복호화부; 및 상기 복호화된 코어 오디오 데이터를 상기 복호화된 레지듀얼 코딩 데이터를 이용하여 업믹싱하는 업믹싱부를 포함한다.In addition, the apparatus for decoding a multi-channel audio signal according to the present invention for solving the other problem is a spatial extension data type detection unit for detecting the type of spatial extension data included in the encoding result of the audio signal; A core audio object type detector for detecting the core audio object type when the spatial extension data is data indicating a core audio object type in which core audio data is encoded; A core audio data decoder which decodes the core audio data by a decoding method according to the detected core audio object type; A residual coded data decoder configured to decode the residual coded data by the decoding method according to the core audio object type when the spatial extension data is residual coded data; And an upmixing unit for upmixing the decoded core audio data by using the decoded residual coding data.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 다채널 오디오 신호의 부호화 방법은 입력된 오디오 신호를 다운믹싱하여 코어 오디오 데이터 및 레지듀얼 데이터를 생성하는 단계; 상기 코어 오디오 데이터를 소정의 부호화 방식에 따라 부호화하는 단계; 상기 코어 오디오 데이터가 부호화된 방식인 코어 오디오 오브젝트 타입에 따른 상기 소정의 부호화 방식에 따라 상기 레지듀얼 데이터를 부호화하는 단계; 및 상기 부호화된 코어 오디오 데이터 및 상기 부호화된 레지듀얼 데이터를 상기 오디오 신호에 대한 부호화 결과로써 출력하는 단계를 포함한다.In addition, the method for encoding a multi-channel audio signal according to the present invention for solving the another problem comprises the steps of: downmixing the input audio signal to generate core audio data and residual data; Encoding the core audio data according to a predetermined encoding method; Encoding the residual data according to the predetermined encoding method according to a core audio object type, which is a method of encoding the core audio data; And outputting the encoded core audio data and the encoded residual data as an encoding result of the audio signal.

또한, 상기 또 다른 과제는 입력된 오디오 신호를 다운믹싱하여 코어 오디오 데이터 및 레지듀얼 데이터를 생성하는 단계; 상기 코어 오디오 데이터를 소정의 부호화 방식에 따라 부호화하는 단계; 상기 코어 오디오 데이터가 부호화된 방식인 코어 오디오 오브젝트 타입에 따른 상기 소정의 부호화 방식에 따라 상기 레지듀얼 데이터를 부호화하는 단계; 및 상기 부호화된 코어 오디오 데이터 및 상기 부호화된 레지듀얼 데이터를 상기 오디오 신호에 대한 부호화 결과로써 출력하는 단계를 포함하는 다채널 오디오 신호의 부호화 방법을 실행하기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 의해 달성된다.In addition, another object of the present invention is to downmix the input audio signal to generate core audio data and residual data; Encoding the core audio data according to a predetermined encoding method; Encoding the residual data according to the predetermined encoding method according to a core audio object type, which is a method of encoding the core audio data; And outputting the encoded core audio data and the encoded residual data as an encoding result of the audio signal. A computer-readable recording medium having recorded thereon a program for executing a multi-channel audio signal encoding method. Is achieved by.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 다채널 오디오 신호의 부호화 장치는 입력된 오디오 신호를 다운믹싱하여 코어 오디오 데이터 및 레지듀얼 데이터를 생성하는 다운믹싱부; 상기 코어 오디오 데이터를 소정의 부호화 방식에 따라 부호화하는 코어 오디오 데이터 부호화부; 상기 코어 오디오 데이터가 부호화된 방식인 코어 오디오 오브젝트 타입에 따른 상기 소정의 부호화 방식에 따라 상기 레지듀얼 데이터를 부호화하는 레지듀얼 데이터 부호화부; 및 상기 부호화된 코어 오디오 데이터 및 상기 부호화된 레지듀얼 데이터를 상기 오디오 신호에 대한 부호화 결과로써 출력하는 다중화부를 포함한다.In addition, the apparatus for encoding a multi-channel audio signal according to the present invention for solving the another problem comprises a downmixing unit for downmixing the input audio signal to generate the core audio data and the residual data; A core audio data encoder which encodes the core audio data according to a predetermined encoding method; A residual data encoder which encodes the residual data according to the predetermined encoding method according to a core audio object type which is a method of encoding the core audio data; And a multiplexer for outputting the encoded core audio data and the encoded residual data as an encoding result of the audio signal.

또한, 상기 또 다른 과제를 해결하기 위한 본 발명에 따른 다채널 오디오 신호의 복호화 방법은 다운 믹싱된 오디오 코어 신호에 해당하는 비트스트림과 다채널 생성을 위한 부가 정보가 포함된 비트스트림을 수신하는 단계; 상기 다운 믹싱된 오디오 코어 신호에 해당하는 비트스트림으로부터 코어 오브젝트 타입을 검출하는 단계; 상기 검출된 코어 오브젝트 타입에 의해 결정된 복호화 방식에 의해 상기 다운 믹싱된 오디오 코어 신호를 복호화하는 단계; 상기 다채널 생성을 위한 부가 정보에 포함된 공간 확장 데이터가 레지듀얼 코딩 데이터인 경우, 상기 코어 오디오 오브젝트 타입에 따른 상기 복호화 방식으로 상기 레지듀얼 코딩 데이터를 복호화하는 단계; 및 상기 복호화된 코어 오디오 데이터를 상기 복호화된 레지듀얼 코딩 데이터를 이용하여 업 믹싱하는 단계를 포함한다.In addition, the decoding method of the multi-channel audio signal according to the present invention for solving the another problem is a step of receiving a bitstream corresponding to the downmixed audio core signal and a bitstream including the additional information for generating the multi-channel ; Detecting a core object type from a bitstream corresponding to the downmixed audio core signal; Decoding the down-mixed audio core signal by a decoding scheme determined by the detected core object type; If the spatial extension data included in the additional information for generating the multi-channel is residual coded data, decoding the residual coded data by the decoding method according to the core audio object type; And upmixing the decoded core audio data using the decoded residual coded data.

본 발명에 따르면, 오디오 신호의 부호화 결과에 포함된 공간 확장 데이터의 타입을 검출하고, 공간 확장 데이터가 코어 오디오 데이터가 부호화된 방식인 코어 오디오 오브젝트 타입을 나타내는 데이터인 경우, 코어 오디오 오브젝트 타입을 검출하며, 검출된 코어 오디오 오브젝트 타입에 따른 복호화 방식으로 코어 오디오 데이터를 복호화하고, 공간 확장 데이터가 레지듀얼 코딩 데이터인 경우, 코어 오디오 오브젝트 타입에 따른 복호화 방식으로 레지듀얼 코딩 데이터를 복호화하며, 복호화된 코어 오디오 데이터를 복호화된 레지듀얼 코딩 데이터를 이용하여 업믹싱함으로써, 코어 오디오 데이터와 레지듀얼 코딩 데이터를 동일한 복호화 방식으로 복호화하여 복호화단의 복잡도를 줄일 수 있다.According to the present invention, a type of spatial extension data included in an encoding result of an audio signal is detected, and when the spatial extension data is data representing a core audio object type in which core audio data is encoded, a core audio object type is detected. Decode the core audio data by a decoding method according to the detected core audio object type, and when the spatial extension data is residual coding data, decode the residual coding data by the decoding method according to the core audio object type, and decode By upmixing the core audio data using the decoded residual coding data, the complexity of the decoding stage can be reduced by decoding the core audio data and the residual coding data by the same decoding method.

또한, 본 발명에 따르면, 입력된 오디오 신호를 다운믹싱하여 코어 오디오 데이터 및 레지듀얼 데이터를 생성하고, 코어 오디오 데이터를 소정의 부호화 방식에 따라 부호화하며, 코어 오디오 데이터가 부호화된 방식인 코어 오디오 오브젝트 타입에 따른 소정의 부호화 방식에 따라 레지듀얼 데이터를 부호화하고, 부호화된 코어 오디오 데이터 및 부호화된 레지듀얼 데이터를 오디오 신호에 대한 부호화 결과로써 출력함으로써, 코어 오디오 데이터와 레지듀얼 데이터를 동일한 부호화 방식으로 부호화하여 부호화단의 복잡도를 줄일 수 있다.Further, according to the present invention, a core audio object is generated by downmixing an input audio signal to generate core audio data and residual data, encoding core audio data according to a predetermined encoding method, and encoding the core audio data. Residual data is encoded according to a predetermined encoding method according to the type, and the core audio data and the residual data are output as the encoding result of the audio signal by encoding the encoded core audio data and the encoded residual data as the same encoding scheme. By coding, the complexity of the coding stage can be reduced.

본문에 개시되어 있는 본 발명의 실시예들에 대해서, 특정한 구조적 내지 기능적 설명들은 단지 본 발명의 실시예를 설명하기 위한 목적으로 예시된 것으로, 본 발명의 실시예들은 다양한 형태로 실시될 수 있으며 본문에 설명된 실시예들에 한정되는 것으로 해석되어서는 아니 된다. With respect to the embodiments of the present invention disclosed in the text, specific structural to functional descriptions are merely illustrated for the purpose of describing embodiments of the present invention, embodiments of the present invention may be implemented in various forms and It should not be construed as limited to the embodiments described in.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 형태를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 본문에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 개시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 각 도면을 설명하면서 유사한 참조부호를 구성요소에 대해 사용하였다. As the inventive concept allows for various changes and numerous embodiments, particular embodiments will be illustrated in the drawings and described in detail in the text. However, this is not intended to limit the present invention to the specific disclosed form, it should be understood to include all modifications, equivalents, and substitutes included in the spirit and scope of the present invention. In describing the drawings, similar reference numerals are used for the components.

다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가지고 있다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥 상 가지는 의미와 일치하는 의미를 가지는 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다. Unless defined otherwise, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by one of ordinary skill in the art. Terms such as those defined in the commonly used dictionaries should be construed as having meanings consistent with the meanings in the context of the related art and shall not be construed in ideal or excessively formal meanings unless expressly defined in this application. Do not.

이하, 첨부한 도면들을 참조하여, 본 발명의 바람직한 실시예를 보다 상세하게 설명하고자 한다. 도면상의 동일한 구성요소에 대해서는 동일한 참조부호를 사용하고 동일한 구성요소에 대해서 중복된 설명은 생략한다. Hereinafter, with reference to the accompanying drawings, it will be described in detail a preferred embodiment of the present invention. The same reference numerals are used for the same elements in the drawings, and duplicate descriptions of the same elements are omitted.

도 1은 본 발명의 일 실시예에 따른 다채널 오디오 신호의 복호화 장치를 나타내는 블록도이다.1 is a block diagram illustrating an apparatus for decoding a multichannel audio signal according to an embodiment of the present invention.

도 1을 참조하면, 다채널 오디오 신호의 복호화 장치는 역다중화부(demultiplexing unit, 100), 공간 확장 데이터 타입 검출부(spatial extension data type detecting unit, 110), 코어 오디오 오브젝트 타입 검출부(core audio object type detecting unit, 120), 코어 오디오 데이터 복호화부(core audio data decoding unit, 130), 레지듀얼 코딩 데이터 복호화부(residual coding data decoding unit, 140), 아비트레리 다운 믹스 레지듀얼 코딩 데이터 복호화부(arbitrary down-mix residual coding data decoding unit, 150), 및 업믹싱부(up-mixing unit, 160)를 포함한다. Referring to FIG. 1, an apparatus for decoding a multichannel audio signal includes a demultiplexing unit 100, a spatial extension data type detecting unit 110, and a core audio object type detection unit. detecting unit 120, a core audio data decoding unit 130, a residual coding data decoding unit 140, an abiteriary downmix residual coding data decoder a mixed residual coding data decoding unit 150, and an up-mixing unit 160.

역다중화부(100)는 입력단자 IN을 통하여 부호화단으로부터 비트스트림을 입력받아 역다중화한다. The demultiplexer 100 demultiplexes a bitstream from an encoding terminal through the input terminal IN.

도 2는 본 발명의 일 실시예에 따른 공간 확장 데이터 타입을 검출하기 위한 신택스를 도시한 도면이다. 도 3은 도 2에 도시된 "bsSacExtType"에 대응하는 값들이 할당된 테이블의 일 실시예를 도시한 도면이다. 이하에서는, 도 1 내지 도 3를 참조하여, 공간 확장 데이터 타입 검출부(110)의 동작을 설명하기로 한다.2 is a diagram illustrating syntax for detecting a spatial extension data type according to an embodiment of the present invention. FIG. 3 is a diagram illustrating an embodiment of a table to which values corresponding to "bsSacExtType" shown in FIG. 2 are assigned. Hereinafter, an operation of the spatial extension data type detection unit 110 will be described with reference to FIGS. 1 to 3.

공간 확장 데이터 타입 검출부(110)는 역다중화부(100)에서 역다중화된 데이터의 헤더에서 공간 확장 데이터(spatial extension data)의 타입을 검출한다. 보다 상세하기는, 공간 확장 데이터 타입 검출부(110)는 도 2에 도시된 함수 SpatialExtensionConfig()에 의하여 역다중화된 데이터의 헤더에서 공간 확장 데이터의 타입을 검출할 수 있다. 함수 SpatialExtensionConfig()에서 "bsSacExtType"은 공간 확장 데이터의 타입을 나타낸다. The spatial extension data type detector 110 detects the type of spatial extension data from the header of the demultiplexed data by the demultiplexer 100. In more detail, the spatial extension data type detector 110 may detect the type of spatial extension data from the header of the demultiplexed data by the function SpatialExtensionConfig () shown in FIG. 2. In the function SpatialExtensionConfig (), "bsSacExtType" indicates the type of spatial extension data.

도 3을 참조하면, 본 발명의 일 실시예에서 "bsSacExtType"이 '0'이면, 공간 확장 데이터는 레지듀얼 코딩 데이터이고, "bsSacExtType"이 '1'이면, 공간 확장 데이터는 아비트레리 다운믹스 레지듀얼 코딩 데이터이며, "bsSacExtType"이 '12' 이면, 공간 확장 데이터는 MPEG-4 오디오의 코어 오디오 오브젝트 타입(core Audio Object Type)이다. 여기서, 코어 오디오 오브젝트 타입은 부호화단에서 다운 믹스된 신호를 부호화하는 오디오 오브젝트 타입을 말한다. 그러나, 이는 본 발명의 일 실시예에 불과하고, 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 다양한 변형이 가능하다는 것을 이해할 수 있다.Referring to FIG. 3, in one embodiment of the present invention, if "bsSacExtType" is '0', the spatial extension data is residual coding data, and if "bsSacExtType" is '1', the spatial extension data is an abiteriary downmix ledge. If the data is dual coded and "bsSacExtType" is '12', the spatial extension data is a core audio object type of MPEG-4 audio. Here, the core audio object type refers to an audio object type for encoding a downmixed signal at the encoding end. However, this is only one embodiment of the present invention, those skilled in the art can understand that various modifications are possible.

다시 말해, 공간 확장 데이터 타입 검출부(110)는 "bsSacExtType"에 '0'이 할당된 경우 공간 확장 데이터의 타입이 레지듀얼 코딩 데이터라고 판단하고, "bsSacExtType"에 '1'이 할당된 경우 공간 확장 데이터의 타입이 아비트레리 다운믹스 레지듀얼 코딩 데이터라고 판단하며, "bsSacExtType"에 '12'가 할당된 경우 공간 확장 데이터의 타입이 MPEG-4 오디오의 코어 오디오 오브젝트 타입을 나타내는 데이터라고 판단한다.In other words, the spatial extension data type detection unit 110 determines that the type of the spatial extension data is residual coding data when '0' is assigned to the "bsSacExtType", and the spatial extension when '1' is assigned to the "bsSacExtType". It is determined that the data type is Abiteri Downmix residual coding data, and when '12' is assigned to "bsSacExtType", it is determined that the type of spatial extension data is data representing a core audio object type of MPEG-4 audio.

이하에서는, 공간 확장 데이터 타입 검출부(110)에서 검출된 공간 확장 데이터 타입에 따른 오디오 신호의 복호화 장치의 동작에 대하여 설명하기로 한다.Hereinafter, an operation of an audio signal decoding apparatus according to the spatial extension data type detected by the spatial extension data type detection unit 110 will be described.

첫째, 공간 확장 데이터 타입 검출부(110)에서 검출된 공간 확장 데이터 타입이 MPEG-4 오디오의 코어 오디오 오브젝트 타입을 나타내는 데이터인 경우에 대하여 살펴보기로 한다. 다시 말해, 이 경우에는 "bsSacExtType"은 '12'이다.First, a case in which the spatial extension data type detected by the spatial extension data type detector 110 is data representing the core audio object type of MPEG-4 audio will be described. In other words, in this case "bsSacExtType" is '12'.

도 4는 본 발명의 일 실시예에 따른 코어 오디오 오브젝트 타입을 독출하기 위한 신택스를 도시한 도면이다. 이하에서는, 도 1 및 4를 참조하여, 코어 오디오 오브젝트 타입 검출부(120)의 동작을 설명하기로 한다.4 is a diagram illustrating syntax for reading a core audio object type according to an embodiment of the present invention. Hereinafter, the operation of the core audio object type detector 120 will be described with reference to FIGS. 1 and 4.

공간 확장 데이터 타입 검출부(110)에서 공간 확장 데이터의 타입을 검출한 결과, 공간 확장 데이터가 MPEG-4 오디오의 코어 오디오 오브젝트 타입을 나타내는 데이터라고 판단되면, 코어 오디오 오브젝트 타입 검출부(120)는 코어 오디오 오브젝트 타입을 검출한다. When the spatial extension data type detection unit 110 detects the type of the spatial extension data and determines that the spatial extension data is data representing the core audio object type of MPEG-4 audio, the core audio object type detection unit 120 determines the core audio. Detect the object type.

보다 상세하게는, 코어 오디오 오브젝트 타입 검출부(120)는 도 4에 도시된 함수 "SpatialExtensionConfigData(12)"에 의하여 코어 오디오 오브젝트 타입을 독출할 수 있다. 여기서, "coreAudioObjectType"은 MPEG-4 오디오의 코어 오디오 오브젝트 타입을 나타낸다. In more detail, the core audio object type detector 120 may read the core audio object type by the function “SpatialExtensionConfigData 12” illustrated in FIG. 4. Here, "coreAudioObjectType" represents a core audio object type of MPEG-4 audio.

다시 도 1을 참조하면, 코어 오디오 데이터 복호화부(130)는 역다중화부(100)에서 역다중화된 코어 오디오 데이터를 복호화한다. 보다 상세하게는, 코어 오디오 데이터 복호화부(130)는 코어 오디오 오브젝트 타입 검출부(120)에서 검출된 코어 오디오 오브젝트 타입에 따라 역다중화된 코어 오디오 데이터를 복호화한다. Referring back to FIG. 1, the core audio data decoder 130 decodes core audio data demultiplexed by the demultiplexer 100. In more detail, the core audio data decoder 130 decodes the demultiplexed core audio data according to the core audio object type detected by the core audio object type detector 120.

상술한 바와 같이, 코어 오디오 오브젝트 타입은 부호화단에서 다운 믹스된 신호를 부호화하는 오디오 오브젝트 타입을 말한다. 여기서, 코어 오디오 데이터는 부호화단에서 BSAC(Bit Sliced Arithmetic Coding), AAC(Advanced Audio Coding), MP3(MPEG Audio Layer-3) 등과 같은 다양한 부호화 방식 가운데 어느 하나에 의하여 부호화될 수 있다. 여기서, BSAC, AAC, MP3 등은 본 발명의 실시예에 불과하며, 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 다양한 부호화 방식에 의해 코어 오디오 데이터는 부호화될 수 있음을 이해할 수 있다.As described above, the core audio object type refers to an audio object type for encoding a downmixed signal at the encoding end. Here, the core audio data may be encoded by any one of various encoding schemes such as bit sliced arithmetic coding (BSAC), advanced audio coding (AAC), and MPEG audio layer-3 (MP3). Here, BSAC, AAC, MP3 and the like are only embodiments of the present invention, and those skilled in the art can understand that the core audio data can be encoded by various encoding schemes.

둘째, 공간 확장 데이터 타입 검출부(110)에서 검출된 공간 확장 데이터 타 입이 레지듀얼 코딩 데이터인 경우에 대하여 살펴보기로 한다. 다시 말해, 이 경우에는 "bsSacExtType"은 '0'이다.Second, a case in which the spatial extension data type detected by the spatial extension data type detection unit 110 is residual coded data will be described. In other words, in this case "bsSacExtType" is '0'.

도 5는 본 발명의 일 실시예에 따른 레지듀얼 코딩 데이터를 복호화하기 위한 신택스를 도시한 도면이다. 이하에서는, 도 1 및 5를 참조하여 레지듀얼 코딩 데이터 복호화부(140)의 동작에 대하여 설명하기로 한다. 5 is a diagram illustrating syntax for decoding residual coded data according to an embodiment of the present invention. Hereinafter, the operation of the residual coded data decoder 140 will be described with reference to FIGS. 1 and 5.

레지듀얼 코딩 데이터 복호화부(140)는 제1 코어 오디오 오브젝트 타입 판별부(141), 제1 BSAC 복호화부(142), 및 제1 AAC 복호화부(143)를 포함하여 레지듀얼 코딩 데이터를 복호화한다.The residual coded data decoder 140 includes a first core audio object type determiner 141, a first BSAC decoder 142, and a first AAC decoder 143 to decode the residual coded data. .

공간 확장 데이터 타입 검출부(110)에서 공간 확장 데이터의 타입을 검출한 결과, 공간 확장 데이터가 레지듀얼 코딩 데이터라고 판단되면, 제1 코어 오디오 오브젝트 타입 판별부(141)는 코어 오디오 오브젝트 타입이 'BSAC'인지 여부를 판단한다. When the spatial extension data type detection unit 110 detects the type of the spatial extension data and determines that the spatial extension data is residual coding data, the first core audio object type determination unit 141 determines that the core audio object type is 'BSAC'. Determine whether or not.

도 5를 참조하면, 'BSAC'의 코어 오디오 오브젝트 타입으로 '22'가 할당되었으므로, 제1 코어 오디오 오브젝트 타입 판별부(141)는 코어 오디오 오브젝트 타입 검출부(120)에서 검출된 "coreAudioObjectType"이 '22'에 해당하는지 여부를 판단한다.Referring to FIG. 5, since '22' is assigned as the core audio object type of 'BSAC', the first core audio object type determination unit 141 determines that the "coreAudioObjectType" detected by the core audio object type detection unit 120 is ''. It is determined whether or not it corresponds to 22 '.

제1 코어 오디오 오브젝트 타입 판별부(141)의 판단 결과, 코어 오디오 오브젝트 타입이 'BSAC'에 해당하면, 제1 BSAC 복호화부(142)는 'BSAC'에 의해서 레지듀얼 신호를 복호화한다. 예를 들어, 제1 BSAC 복호화부(142)는 도 5에 도시된 신택스의 식별번호 500 또는 520에 의하여 실시될 수 있다. 식별번호 500 또는 520에 서 제1 BSAC 복호화부(142)는 MPEG-4 ER BSAC에 정의되어 있는 함수 bsac_raw_data_block()에 의하여 레지듀얼 코딩 데이터를 복호화한다. 여기서, bsac_raw_data_block()의 "nch"는 항상 '1'로 설정되어 있어야 한다. 여기서, "nch"는 채널의 수(Number of Channels)를 나타낸다.As a result of the determination by the first core audio object type determiner 141, when the core audio object type corresponds to 'BSAC', the first BSAC decoder 142 decodes the residual signal by 'BSAC'. For example, the first BSAC decoder 142 may be implemented by the identification number 500 or 520 of the syntax illustrated in FIG. 5. At identification number 500 or 520, the first BSAC decoder 142 decodes the residual coded data by the function bsac_raw_data_block () defined in the MPEG-4 ER BSAC. Here, "nch" of bsac_raw_data_block () should always be set to '1'. Here, "nch" represents the number of channels.

제1 코어 오디오 오브젝트 타입 판별부(141)에서 코어 오디오 오브젝트 타입이 'BSAC'에 해당하지 않는다고 판단되면, 제1 AAC 복호화부(143)는 'AAC'에 의해서 레지듀얼 코딩 데이터를 복호화한다. 예를 들어, 제1 AAC 복호화부(143)는 도 5에 도시된 신택스의 식별번호 510 또는 530에 의하여 실시될 수 있다. 식별번호 510 또는 530에서 제1 AAC 복호화부(143)는 ISO/IEC 13818-7의 subclause 6.3에 기재된 "MPEG-2 AAC Low Complexity profile bitstream syntax"에 정의되어 있는 individual_channel_stream(0)에 의하여 레지듀얼 코딩 데이터를 복호화한다.If it is determined by the first core audio object type determiner 141 that the core audio object type does not correspond to 'BSAC', the first AAC decoder 143 decodes the residual coded data by 'AAC'. For example, the first AAC decoder 143 may be implemented by the identification number 510 or 530 of the syntax illustrated in FIG. 5. In identification number 510 or 530, the first AAC decoding unit 143 performs residual coding by individual_channel_stream (0) defined in "MPEG-2 AAC Low Complexity profile bitstream syntax" described in subclause 6.3 of ISO / IEC 13818-7. Decrypt the data.

그러나, 제1 AAC 복호화부(143)에서 'AAC'는 단순히 일 실시예에 불과하며, 제1 코어 오디오 오브젝트 타입 판별부(141)에서 코어 오디오 오브젝트 타입이 'BSAC'에 해당하지 않는다고 판단되는 경우, 제1 AAC 복호화부(143)에서는 제1 코어 오디오 오브젝트 타입 판별부(141)에서 검출된 코어 오디오 오브젝트 타입에 대응하는 디코딩 방식으로 레지듀얼 코딩 데이터를 복호화할 수 있다. 예를 들어, 제1 코어 오디오 오브젝트 타입 판별부(141)에서 검출된 코어 오디오 오브젝트 타입이 'MP3'일 경우 제1 AAC 복호화부(143)에서는 'MP3'에 의해서 레지듀얼 코딩 데이터를 복호화한다.However, in the first AAC decoder 143, 'AAC' is merely an example, and when the first core audio object type determiner 141 determines that the core audio object type does not correspond to the 'BSAC', The first AAC decoder 143 may decode the residual coded data by a decoding method corresponding to the core audio object type detected by the first core audio object type determiner 141. For example, when the core audio object type detected by the first core audio object type determination unit 141 is 'MP3', the first AAC decoder 143 decodes the residual coding data by 'MP3'.

이와 같이, 제1 BSAC 복호화부(142) 또는 제1 AAC 복호화부(143)에서 복호화 된 레지듀얼 코딩 데이터를 이용하여 코어 오디오 데이터 복호화부(130)에서 복호화된 코어 오디오 데이터를 멀티 채널 신호로 업믹싱할 수 있다.As described above, the core audio data decoded by the core audio data decoder 130 is upgraded to the multi-channel signal using the residual coding data decoded by the first BSAC decoder 142 or the first AAC decoder 143. You can mix.

셋째, 공간 확장 데이터 타입 검출부(110)에서 검출된 공간 확장 데이터 타입이 아비트레리 다운 믹스 레지듀얼 코딩 데이터인 경우에 대하여 살펴보기로 한다. 다시 말해, 이 경우에는 "bsSacExtType"은 '1'이다.Third, a case in which the spatial extension data type detected by the spatial extension data type detection unit 110 is the Abiteri down mix residual coding data will be described. In other words, in this case "bsSacExtType" is '1'.

도 6은 본 발명의 일 실시예에 따른 아비트레리 다운 믹스 레지듀얼 데이터를 복호화하기 위한 신택스를 도시한 도면이다. 이하에서는, 도 1 및 6을 참조하여 아비트레리 다운 믹스 레지듀얼 코딩 데이터 복호화부(150)의 동작에 대하여 설명하기로 한다. 6 is a diagram illustrating syntax for decoding Abiteri down mix residual data according to an embodiment of the present invention. Hereinafter, an operation of the abiteri down mix residual coded data decoder 150 will be described with reference to FIGS. 1 and 6.

아비트레리 다운 믹스 레지듀얼 코딩 데이터 복호화부(150)는 제2 코어 오디오 오브젝트 타입 판별부(151), 제2 BSAC 복호화부(152), 및 제2 AAC 복호화부(153)를 포함하여 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 복호화한다.The abiteri down mix residual coding data decoder 150 includes a second core audio object type discriminator 151, a second BSAC decoder 152, and a second AAC decoder 153. Decode the mix residual coded data.

제2 코어 오디오 오브젝트 타입 판별부(151)의 판단 결과, 코어 오디오 오브젝트 타입이 'BSAC'에 해당하면, 제2 BSAC 복호화부(152)는 'BSAC'에 의해서 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 복호화한다. 예를 들어, 제2 BSAC 복호화부(152)는 도 6에 도시된 신택스의 식별번호 600, 620, 640 및 660 중 적어도 어느 하나 이상에 의하여 실시될 수 있다. 식별번호 600, 620, 640 및 660 중 적어도 어느 하나 이상에서 제2 BSAC 복호화부(152)는 MPEG-4 ER BSAC에 정의되어 있는 함수 bsac_raw_data_block()에 의하여 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 복호화한다. 여기서, bsac_raw_data_block()의 "nch"는 항상 '1'로 설정되어 있어 야 한다. 여기서, "nch"는 채널의 수(Number of Channels)를 나타낸다.As a result of the determination by the second core audio object type determiner 151, when the core audio object type corresponds to 'BSAC', the second BSAC decoder 152 uses the BSAC to perform bitwise down mix residual coding data. Decrypt For example, the second BSAC decoder 152 may be implemented by at least one of the identification numbers 600, 620, 640, and 660 of the syntax illustrated in FIG. 6. In at least one of identification numbers 600, 620, 640, and 660, the second BSAC decoder 152 decodes the Abiteri downmix residual coded data by a function bsac_raw_data_block () defined in the MPEG-4 ER BSAC. . Here, "nch" of bsac_raw_data_block () should always be set to '1'. Here, "nch" represents the number of channels.

제1 코어 오디오 오브젝트 타입 판별부(151)에서 코어 오디오 오브젝트 타입이 'BSAC'에 해당하지 않는다고 판단되면, 제2 AAC 복호화부(152)는 'AAC'에 의해서 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 복호화한다. 예를 들어, 제2 AAC 복호화부(153)는 도 6에 도시된 신택스의 식별번호 610, 630, 650 및 670 중 적어도 어느 하나 이상에 의하여 실시될 수 있다. 식별번호 610 또는 650에서 제2 AAC 복호화부(153)는 ISO/IEC 13818-7의 subclause 6.3에 기재된 "MPEG-2 AAC Low Complexity profile bitstream syntax"에 정의되어 있는 individual_channel_stream(0)에 의하여 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 복호화한다. 또한, 식별번호 630 또는 670에서 제2 AAC 복호화부(153)는 ISO/IEC 13818-7의 subclause 6.3에 기재된 "MPEG-2 AAC Low Complexity profile bitstream syntax"에 정의되어 있는 channel_pair_element()에 의하여 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 복호화한다. 여기서, 파라미터 "common_window"는 '1'로 설정되어 있다.If it is determined by the first core audio object type determination unit 151 that the core audio object type does not correspond to 'BSAC', the second AAC decoding unit 152 uses the AAC to determine the bitwise down mix residual coding data. Decrypt For example, the second AAC decoder 153 may be implemented by at least one of the identification numbers 610, 630, 650, and 670 of the syntax illustrated in FIG. 6. At identification number 610 or 650, the second AAC decoding unit 153 is assigned to Abiteri down by the individual_channel_stream (0) defined in "MPEG-2 AAC Low Complexity profile bitstream syntax" described in subclause 6.3 of ISO / IEC 13818-7. Decode the mix residual coded data. In addition, at identification number 630 or 670, the second AAC decoding unit 153 uses abiteri by channel_pair_element () defined in "MPEG-2 AAC Low Complexity profile bitstream syntax" described in subclause 6.3 of ISO / IEC 13818-7. Decode the down mix residual coded data. Here, the parameter "common_window" is set to '1'.

그러나, 제2 AAC 복호화부(153)에서 'AAC'는 단순히 일 실시예에 불과하며, 제2 코어 오디오 오브젝트 타입 판별부(151)에서 코어 오디오 오브젝트 타입이 'BSAC'에 해당하지 않는다고 판단되는 경우, 제2 AAC 복호화부(153)에서는 제2 코어 오디오 오브젝트 타입 판별부(151)에서 검출된 코어 오디오 오브젝트 타입에 대응하는 디코딩 방식으로 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 복호화할 수 있다. 예를 들어, 제1 코어 오디오 오브젝트 타입 판별부(151)에서 검출된 코어 오디오 오브젝트 타입이 'MP3'일 경우 제2 AAC 복호화부(153)에서는 'MP3'에 의해서 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 복호화한다.However, in the second AAC decoder 153, 'AAC' is merely an example, and when the second core audio object type determiner 151 determines that the core audio object type does not correspond to 'BSAC', The second AAC decoder 153 may decode the Abiteri down mix residual coded data in a decoding manner corresponding to the core audio object type detected by the second core audio object type determiner 151. For example, when the core audio object type detected by the first core audio object type determination unit 151 is 'MP3', the second AAC decoder 153 uses the MP3 to abbreviate downmix residual coding data. Decode

이와 같이, 제2 BSAC 복호화부(152) 또는 제2 AAC 복호화부(153)에서 복호화된 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 이용하여 코어 오디오 데이터 복호화부(130)에서 복호화된 코어 오디오 데이터를 멀티 채널 신호로 업믹싱할 수 있다.As described above, the core audio data decoded by the core audio data decoder 130 may be multiplied by using the Abiteri downmix residual coding data decoded by the second BSAC decoder 152 or the second AAC decoder 153. Upmix to channel signal.

넷째, 공간 확장 데이터 타입 검출부(110)에서 검출된 공간 확장 데이터 타입이 MPEG-4 오디오의 코어 오디오 오브젝트 타입을 나타내는 데이터, 레지듀얼 코딩 데이터 또는 아비트레리 다운 믹스 레지듀얼 코딩 데이터가 아닌 경우에 대하여 살펴보기로 한다.Fourth, the case in which the spatial extension data type detected by the spatial extension data type detection unit 110 is not data representing the core audio object type of MPEG-4 audio, residual coding data, or Abiteri downmix residual coding data will be described. Let's look at it.

공간 확장 데이터 복호화부(160)는 공간 확장 데이터 타입 검출부(110)에서 검출된 공간 확장 데이터의 타입에 대응하는 방식으로 복호화를 수행한다. 이와 같이, 공간 확장 데이터 복호화부(160)에서 복호화된 데이터를 이용하여 코어 오디오 데이터 복호화부(130)에서 복호화된 코어 오디오 데이터를 멀티 채널 신호로 업믹싱할 수 있다.The spatial extension data decoder 160 performs decoding in a manner corresponding to the type of the spatial extension data detected by the spatial extension data type detector 110. As described above, the core audio data decoded by the core audio data decoder 130 may be upmixed into a multi-channel signal using the data decoded by the spatial extension data decoder 160.

업믹싱부(170)는 코어 오디오 데이터 복호화부(130)에서 복호화된 코어 오디오 데이터를 제1 및 제2 BSAC 복호화부(142, 152), 제1 및 제2 ACC 복호화부(143, 153), 또는 공간 확장 데이터 복호화부(160)에서 복호화된 결과를 이용하여 멀티 채널 신호로 업믹싱한다. 여기서, 업믹싱은 다운믹싱에 상반되는 개념으로, 모노 신호로부터 두 채널 이상의 스테레오 신호를 생성하는 것이다.The upmixer 170 may be configured to convert the core audio data decoded by the core audio data decoder 130 into the first and second BSAC decoders 142 and 152, the first and second ACC decoders 143 and 153, Alternatively, the result of the decoding by the spatial extension data decoder 160 is upmixed into a multi-channel signal. Here, upmixing is a concept opposite to downmixing, in which stereo signals are generated from two or more channels from a mono signal.

도 7은 본 발명의 일 실시예에 따른 다채널 오디오 신호의 복호화 방법을 나타내는 흐름도이다.7 is a flowchart illustrating a method of decoding a multichannel audio signal according to an embodiment of the present invention.

도 7을 참조하면, 본 실시예에 따른 다채널 오디오 신호의 복호화 방법은 도 1에 도시된 오디오 신호의 복호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 1에 도시된 오디오 신호의 복호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 복호화 방법에도 적용된다.Referring to FIG. 7, the method for decoding a multi-channel audio signal according to the present embodiment includes steps that are processed in time series in an apparatus for decoding an audio signal shown in FIG. 1. Therefore, even if omitted below, the above description of the audio signal decoding apparatus shown in FIG. 1 is also applied to the audio signal decoding method according to the present embodiment.

700 단계에서 공간 확장 데이터 타입 검출부(110)는 오디오 신호의 부호화 결과에 포함된 공간 확장 데이터의 타입을 검출한다.In operation 700, the spatial extension data type detector 110 detects the type of spatial extension data included in the encoding result of the audio signal.

710 단계에서 코어 오디오 오브젝트 타입 검출부(120)는 공간 확장 데이터가 코어 오디오 데이터가 부호화된 방식인 코어 오디오 오브젝트 타입을 나타내는 데이터인 경우, 코어 오디오 오브젝트 타입을 검출한다.In step 710, the core audio object type detector 120 detects the core audio object type when the spatial extension data is data representing the core audio object type, which is a method of encoding the core audio data.

720 단계에서 코어 오디오 데이터 복호화부(130)는 검출된 코어 오디오 오브젝트 타입에 따른 복호화 방식으로 코어 오디오 데이터를 복호화한다.In operation 720, the core audio data decoder 130 decodes the core audio data by a decoding method according to the detected core audio object type.

730 단계에서 레지듀얼 코딩 데이터 복호화부(140)는 공간 확장 데이터가 레지듀얼 코딩 데이터인 경우, 코어 오디오 오브젝트 타입에 따른 복호화 방식으로 레지듀얼 코딩 데이터를 복호화한다.In operation 730, when the spatial extension data is residual coding data, the residual coding data decoder 140 decodes the residual coding data by a decoding method according to the core audio object type.

740 단계에서 업믹싱부(170)는 복호화된 코어 오디오 데이터를 복호화된 레지듀얼 코딩 데이터를 이용하여 업믹싱한다.In operation 740, the upmixing unit 170 upmixes the decoded core audio data using the decoded residual coding data.

본 실시예에서 따른 오디오 신호의 복호화 방법은 공간 확장 데이터가 아비 트레리 다운 믹스 레지듀얼 코딩 데이터인 경우, 코어 오디오 오브젝트 타입에 따른 복호화 방식으로 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 복호화하는 단계를 더 포함할 수 있다. 이 경우, 업믹싱부(170)는 복호화된 코어 오디오 데이터를 복호화된 레지듀얼 코딩 데이터 및 복호화된 아비트레리 다운 믹스 레지듀얼 코딩 데이터를 이용하여 업믹싱할 수 있다.The decoding method of the audio signal according to the present embodiment further includes the step of decoding the Abiteri down mix residual coding data by a decoding method according to the core audio object type when the spatial extension data is Avi Treri down mix residual coding data. It may include. In this case, the upmixing unit 170 may upmix the decoded core audio data by using the decoded residual coding data and the decoded Abiteri downmix residual coding data.

또한, 본 실시예에 따른 오디오 신호의 복호화 방법은 공간 확장 데이터가 코어 오디오 오브젝트 타입을 나타내는 데이터, 레지듀얼 코딩 데이터 및 아비트레리 다운 믹스 코딩 데이터 외의 데이터인 경우, 공간 확장 데이터의 타입에 따른 복호화 방식으로 공간 확장 데이터를 복호화하는 단계를 더 포함할 수 있다. 이 경우, 업믹싱부(170)는 복호화된 코어 오디오 데이터를 복호화된 레지듀얼 코딩 데이터, 복호화된 아비트레리 다운 믹스 레지듀얼 코딩 데이터, 및 복호화된 공간 확장 데이터를 이용하여 업믹싱할 수 있다.In addition, the decoding method of the audio signal according to the present embodiment is a decoding method according to the type of the spatial extension data, when the spatial extension data is data other than the data indicating the core audio object type, residual coding data and Abiteri downmix coding data The method may further include decoding the spatial extension data. In this case, the upmixing unit 170 may upmix the decoded core audio data by using the decoded residual coding data, the decoded Abiteri down mix residual coding data, and the decoded spatial extension data.

도 8은 본 발명의 일 실시예에 따른 다채널 오디오 신호의 부호화 장치를 나타내는 블록도이다.8 is a block diagram illustrating an apparatus for encoding a multichannel audio signal according to an embodiment of the present invention.

도 8을 참조하면, 다채널 오디오 신호의 부호화 장치는 다운믹싱부(down-mixing unit, 800), 코어 오디오 데이터 부호화부(core audio data encoding unit, 810), 레지듀얼 데이터 부호화부(residual data encoding unit, 820), 아비트레리 다운 믹스 레지듀얼 데이터 부호화부(arbitrary down-mix residual data encoding unit, 830), 및 다중화부(multiplexing unit, 840)를 포함한다.Referring to FIG. 8, an apparatus for encoding a multi-channel audio signal includes a down-mixing unit 800, a core audio data encoding unit 810, and a residual data encoding unit. unit, 820, an arbitrary down-mix residual data encoding unit 830, and a multiplexing unit 840.

다운믹싱부(800)는 입력 신호(IN)를 다운믹싱한다. 여기서, 입력 신호(IN)는 아날로그의 음성 신호 또는 오디오 신호를 디지털 신호로 변조한 PCM(Pulse Code Modulation) 신호일 수 있다. 여기서, 다운믹싱은 두 채널 이상의 스테레오 신호로부터 한 채널의 모노 신호를 생성하는 것이며, 다운믹싱을 통하여 부호화 과정에 할당되는 비트량을 줄일 수 있다. The downmixing unit 800 downmixes the input signal IN. Here, the input signal IN may be a pulse code modulation (PCM) signal obtained by modulating an analog voice signal or an audio signal into a digital signal. Here, downmixing generates a mono signal of one channel from stereo signals of two or more channels, and the amount of bits allocated to the encoding process can be reduced through downmixing.

코어 오디오 데이터 부호화부(810)는 다운믹싱부(800)에서 출력된 코어 오디오 데이터를 소정의 부호화 방식에 따라 부호화한다. 여기서, 코어 오디오 데이터는 BSAC(Bit Sliced Arithmetic Coding), AAC(Advanced Audio Coding), MP3(MPEG Audio Layer-3) 등과 같은 다양한 부호화 방식 가운데 어느 하나에 의하여 부호화될 수 있다. 여기서, BSAC, AAC, MP3 등은 본 발명의 실시예에 불과하며, 본 실시예가 속하는 기술분야에서 통상의 지식을 가진 자는 다양한 부호화 방식에 의해 코어 오디오 데이터는 부호화될 수 있음을 이해할 수 있다.The core audio data encoder 810 encodes the core audio data output from the downmixer 800 according to a predetermined encoding method. Here, the core audio data may be encoded by any one of various coding schemes such as bit sliced arithmetic coding (BSAC), advanced audio coding (AAC), and MPEG audio layer-3 (MP3). Here, BSAC, AAC, MP3 and the like are only embodiments of the present invention, and those skilled in the art can understand that the core audio data can be encoded by various encoding schemes.

레지듀얼 데이터 부호화부(820)는 제1 코어 오디오 오브젝트 타입 판별부(core audio object type determining unit, 821), 제1 BSAC 부호화부(BSAC encoding unit, 822) 및 제1 AAC 부호화부(AAC encoding unit, 823)를 포함하여 레지듀얼 데이터를 부호화한다.The residual data encoder 820 may include a first core audio object type determining unit 821, a first BSAC encoding unit 822, and a first AAC encoding unit. 823) to encode the residual data.

제1 코어 오디오 오브젝트 타입 판별부(821)는 코어 오디오 데이터 부호화부(810)에서 코어 오디오 데이터를 부호화하는 방식인 코어 오디오 오브젝트 타입을 판별하여 레지듀얼 데이터의 부호화 방식을 결정한다. 예를 들어, 제1 코어 오디오 오브젝트 타입 판별부(821)는 코어 오디오 오브젝트 타입이 'BSAC'인 경우 레지듀얼 데이터의 부호화 방식을 'BSAC'으로 결정하고, 코어 오디오 오브젝트 타입 이 'AAC'인 경우 레지듀얼 데이터의 부호화 방식을 'AAC'로 결정한다.The first core audio object type determiner 821 determines a core audio object type, which is a method of encoding core audio data by the core audio data encoder 810, and determines a coding method of residual data. For example, when the core audio object type is 'BSAC', the first core audio object type determination unit 821 determines the encoding method of the residual data as 'BSAC', and when the core audio object type is 'AAC'. The encoding method of the residual data is determined as 'AAC'.

제1 BSAC 부호화부(822)는 제1 코어 오디오 오브젝트 타입 판별부(821)의 판단 결과 코어 오디오 오브젝트 타입이 'BSAC'인 경우에 레지듀얼 데이터를 'BSAC' 방식으로 부호화한다. 이로써, 코어 오디오 데이터와 레지듀얼 데이터를 동일한 부호화 방식으로 부호화하여 부호화단의 복잡도를 줄일 수 있다.The first BSAC encoder 822 encodes the residual data in the 'BSAC' method when the core audio object type is 'BSAC' as a result of the determination by the first core audio object type determiner 821. As a result, the complexity of the encoding end can be reduced by encoding the core audio data and the residual data by the same encoding scheme.

제1 AAC 부호화부(823)는 제1 코어 오디오 오브젝트 타입 판별부(821)의 판단 결과 코어 오디오 오브젝트 타입이 'AAC'인 경우에 레지듀얼 데이터를 'AAC' 방식으로 부호화한다. 이로써, 코어 오디오 데이터와 레지듀얼 데이터를 동일한 부호화 방식으로 부호화하여 부호화단의 복잡도를 줄일 수 있다.The first AAC encoder 823 encodes the residual data in the 'AAC' method when the core audio object type is 'AAC' as a result of the determination by the first core audio object type determiner 821. As a result, the complexity of the encoding end can be reduced by encoding the core audio data and the residual data by the same encoding scheme.

그러나, 제1 AAC 부호화부(823)에서 'AAC'는 단순히 일 실시예에 불과하며, 제1 코어 오디오 오브젝트 타입 판별부(821)에서 코어 오디오 오브젝트 타입이 'BSAC'에 해당하지 않는다고 판단되는 경우, 제1 AAC 복호화부(823)에서는 제1 코어 오디오 오브젝트 타입 판별부(821)에서 검출된 코어 오디오 오브젝트 타입에 대응하는 부호화 방식으로 레지듀얼 데이터를 부호화할 수 있다. 예를 들어, 제1 코어 오디오 오브젝트 타입 판별부(821)에서 검출된 코어 오디오 오브젝트 타입이 'MP3'일 경우 제1 AAC 부호화부(823)에서는 'MP3'에 의해서 레지듀얼 데이터를 부호화한다.However, in the first AAC encoder 823, 'AAC' is merely an embodiment, and when the first core audio object type determiner 821 determines that the core audio object type does not correspond to 'BSAC', The first AAC decoder 823 may encode the residual data by an encoding method corresponding to the core audio object type detected by the first core audio object type determiner 821. For example, when the core audio object type detected by the first core audio object type determination unit 821 is 'MP3', the first AAC encoder 823 encodes the residual data by 'MP3'.

아비트레리 다운 믹스 레지듀얼 데이터 부호화부(830)는 제2 코어 오디오 오브젝트 타입 판별부(831), 제2 BSAC 부호화부(832) 및 제2 AAC 부호화부(833)를 포함하여 레지듀얼 데이터를 부호화한다.The Abiteri down mix residual data encoder 830 includes a second core audio object type determiner 831, a second BSAC encoder 832, and a second AAC encoder 833 to encode residual data. do.

제2 코어 오디오 오브젝트 타입 판별부(831)는 코어 오디오 데이터 부호화부(810)에서 코어 오디오 데이터를 부호화하는 방식인 코어 오디오 오브젝트 타입을 판별하여 레지듀얼 데이터의 부호화 방식을 결정한다. 예를 들어, 제2 코어 오디오 오브젝트 타입 판별부(831)는 코어 오디오 오브젝트 타입이 'BSAC'인 경우 레지듀얼 데이터의 부호화 방식을 'BSAC'으로 결정하고, 코어 오디오 오브젝트 타입이 'AAC'인 경우 레지듀얼 데이터의 부호화 방식을 'AAC'로 결정한다.The second core audio object type determiner 831 determines a core audio object type, which is a method of encoding core audio data by the core audio data encoder 810, and determines a coding method of the residual data. For example, when the core audio object type is 'BSAC', the second core audio object type determination unit 831 determines the encoding method of the residual data as 'BSAC', and when the core audio object type is 'AAC'. The encoding method of the residual data is determined as 'AAC'.

제2 BSAC 부호화부(832)는 제2 코어 오디오 오브젝트 타입 판별부(831)의 판단 결과 코어 오디오 오브젝트 타입이 'BSAC'인 경우에 레지듀얼 데이터를 'BSAC' 방식으로 부호화한다. 이로써, 코어 오디오 데이터와 레지듀얼 데이터를 동일한 부호화 방식으로 부호화하여 부호화단의 복잡도를 줄일 수 있다.The second BSAC encoder 832 encodes the residual data in the 'BSAC' method when the core audio object type is 'BSAC' as a result of the determination by the second core audio object type determiner 831. As a result, the complexity of the encoding end can be reduced by encoding the core audio data and the residual data by the same encoding scheme.

제2 AAC 부호화부(833)는 제2 코어 오디오 오브젝트 타입 판별부(831)의 판단 결과 코어 오디오 오브젝트 타입이 'AAC'인 경우에 레지듀얼 데이터를 'AAC' 방식으로 부호화한다. 이로써, 코어 오디오 데이터와 레지듀얼 데이터를 동일한 부호화 방식으로 부호화하여 부호화단의 복잡도를 줄일 수 있다.The second AAC encoder 833 encodes the residual data in the 'AAC' method when the core audio object type is 'AAC' as a result of the determination by the second core audio object type determiner 831. As a result, the complexity of the encoding end can be reduced by encoding the core audio data and the residual data by the same encoding scheme.

그러나, 제2 AAC 부호화부(833)에서 'AAC'는 단순히 일 실시예에 불과하며, 제2 코어 오디오 오브젝트 타입 판별부(831)에서 코어 오디오 오브젝트 타입이 'BSAC'에 해당하지 않는다고 판단되는 경우, 제2 AAC 복호화부(833)에서는 제2 코어 오디오 오브젝트 타입 판별부(831)에서 검출된 코어 오디오 오브젝트 타입에 대응하는 부호화 방식으로 레지듀얼 데이터를 부호화할 수 있다. 예를 들어, 제2 코어 오디오 오브젝트 타입 판별부(831)에서 검출된 코어 오디오 오브젝트 타입이 'MP3'일 경우 제2 AAC 부호화부(833)에서는 'MP3'에 의해서 레지듀얼 데이터를 부호화한다.However, in the second AAC encoder 833, 'AAC' is merely an example, and when the second core audio object type determiner 831 determines that the core audio object type does not correspond to 'BSAC', The second AAC decoder 833 may encode the residual data by an encoding method corresponding to the core audio object type detected by the second core audio object type determiner 831. For example, when the core audio object type detected by the second core audio object type determination unit 831 is 'MP3', the second AAC encoder 833 encodes the residual data by 'MP3'.

다중화부(840)는 코어 오디오 데이터 부호화부(810)에서 부호화된 결과, 제1 및 제2 BSAC 부호화부(822, 832)에서 부호화된 결과, 및 제1 및 제2 AAC 부호화부(823, 833)에서 부호화된 결과를 다중화하여 비트스트림을 생성하여 출력 단자 OUT로 출력한다.The multiplexer 840 encodes a result of the core audio data encoder 810, a result of the first and second BSAC encoders 822 and 832, and a first and second AAC encoders 823 and 833. ), The encoded result is multiplexed, and a bitstream is generated and output to the output terminal OUT.

도 9는 본 발명의 일 실시예에 따른 다채널 오디오 신호의 부호화 방법을 나타내는 흐름도이다.9 is a flowchart illustrating a method of encoding a multichannel audio signal according to an embodiment of the present invention.

도 9를 참조하면, 본 실시예에 따른 다채널 오디오 신호의 부호화 방법은 도 8에 도시된 오디오 신호의 부호화 장치에서 시계열적으로 처리되는 단계들로 구성된다. 따라서, 이하 생략된 내용이라 하더라도 도 8에 도시된 오디오 신호의 부호화 장치에 관하여 이상에서 기술된 내용은 본 실시예에 따른 오디오 신호의 부호화 방법에도 적용된다.Referring to FIG. 9, the method for encoding a multichannel audio signal according to the present embodiment includes the steps of time-series processing in the apparatus for encoding an audio signal of FIG. Therefore, even if omitted below, the above description of the audio signal encoding apparatus shown in FIG. 8 is also applied to the audio signal encoding method according to the present embodiment.

900 단계에서 다운믹싱부(900)는 입력된 오디오 신호를 다운믹싱하여 코어 오디오 데이터 및 레지듀얼 데이터를 생성한다.In operation 900, the downmixing unit 900 downmixes the input audio signal to generate core audio data and residual data.

910 단계에서 코어 오디오 데이터 부호화부(910)는 코어 오디오 데이터를 소정의 부호화 방식에 따라 부호화한다. In operation 910, the core audio data encoder 910 encodes the core audio data according to a predetermined encoding method.

920 단계에서 레지듀얼 데이터 부호화부(920)는 코어 오디오 데이터가 부호화된 방식인 코어 오디오 오브젝트 타입에 따른 소정의 부호화 방식에 따라 레지듀얼 데이터를 부호화한다.In operation 920, the residual data encoder 920 encodes the residual data according to a predetermined encoding method according to the core audio object type, which is a method of encoding core audio data.

930 단계에서 다중화부(940)는 부호화된 코어 오디오 데이터 및 부호화된 레지듀얼 데이터를 다중화하여 오디오 신호에 대한 부호화 결과로써 출력한다.In operation 930, the multiplexer 940 multiplexes the encoded core audio data and the encoded residual data and outputs the encoded result of the audio signal.

상기 900 단계는 입력된 오디오 신호를 다운믹싱하여 코어 오디오 데이터, 레지듀얼 데이터, 및 아비트레리 다운 믹스 레지듀얼 데이터를 생성할 수 있다. 이 경우, 본 실시예에 따른 오디오 신호의 부호화 방법은 코어 오디오 오브젝트 타입에 따른 소정의 부호화 방식에 따라 아비트레리 다운 믹스 레지듀얼 데이터를 부호화하는 단계를 더 포함할 수 있다. 이 경우, 다중화부(940)는 부호화된 코어 오디오 데이터, 부호화된 레지듀얼 데이터, 및 부호화된 아비트레리 다운 믹스 레지듀얼 데이터를 다중화하여 오디오 신호에 대한 부호화 결과로써 출력할 수 있다.In operation 900, the input audio signal may be downmixed to generate core audio data, residual data, and abiteri down mix residual data. In this case, the encoding method of the audio signal according to the present embodiment may further include encoding the Abiteri down mix residual data according to a predetermined encoding scheme according to the core audio object type. In this case, the multiplexing unit 940 may multiplex the encoded core audio data, the encoded residual data, and the encoded Abiteri down mix residual data and output the result of encoding the audio signal.

본 발명은 상술한 실시예에 한정되지 않으며, 본 발명의 사상 내에서 당업자에 의한 변형이 가능함은 물론이다.The present invention is not limited to the above-described embodiment, and of course, modifications may be made by those skilled in the art within the spirit of the present invention.

본 발명은 또한 컴퓨터로 읽을 수 있는 기록매체에 컴퓨터가 읽을 수 있는 코드로서 구현하는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록매체는 컴퓨터 시스템에 의하여 읽혀질 수 있는 데이터가 저장되는 모든 종류의 기록장치를 포함한다. 컴퓨터가 읽을 수 있는 기록매체의 예로는 ROM, RAM, CD-ROM, 자기 테이프, 하드디스크, 플로피디스크, 플래쉬 메모리, 광 데이터 저장장치 등이 있으며, 또한 캐리어 웨이브(예를 들어 인터넷을 통한 전송)의 형태로 구현되는 것도 포함한다. 또한 컴퓨터가 읽을 수 있는 기록매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어, 분산방식으로 컴퓨터가 읽을 수 있는 코드로서 저장되고 실행될 수 있다.The invention can also be embodied as computer readable code on a computer readable recording medium. The computer-readable recording medium includes all kinds of recording devices in which data that can be read by a computer system is stored. Examples of computer-readable recording media include ROM, RAM, CD-ROM, magnetic tape, hard disk, floppy disk, flash memory, optical data storage device, and also carrier waves (for example, transmission over the Internet). It also includes the implementation in the form of. The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.

도 2는 본 발명의 일 실시예에 따른 공간 확장 데이터 타입을 검출하기 위한 신택스를 도시한 도면이다. 2 is a diagram illustrating syntax for detecting a spatial extension data type according to an embodiment of the present invention.

도 3은 도 2에 도시된 "bsSacExtType"에 대응하는 값들이 할당된 테이블의 일 실시예를 도시한 도면이다. FIG. 3 is a diagram illustrating an embodiment of a table to which values corresponding to "bsSacExtType" shown in FIG. 2 are assigned.

도 4는 본 발명의 일 실시예에 따른 코어 오디오 오브젝트 타입을 독출하기 위한 신택스를 도시한 도면이다.4 is a diagram illustrating syntax for reading a core audio object type according to an embodiment of the present invention.

도 5는 본 발명의 일 실시예에 따른 레지듀얼 코딩 데이터를 복호화하기 위한 신택스를 도시한 도면이다.5 is a diagram illustrating syntax for decoding residual coded data according to an embodiment of the present invention.

도 6은 본 발명의 일 실시예에 따른 아비트레리 다운 믹스 레지듀얼 데이터를 복호화하기 위한 신택스를 도시한 도면이다.6 is a diagram illustrating syntax for decoding Abiteri down mix residual data according to an embodiment of the present invention.

Claims

Detecting a type of spatial extension data included in an encoding result of the audio signal;

Detecting the core audio object type when the spatial extension data is data representing a core audio object type in which core audio data is encoded;

Decoding the core audio data by a decoding method according to the detected core audio object type;

Decoding the residual coded data by the decoding method according to the core audio object type when the spatial extension data is residual coded data; And

And upmixing the decoded core audio data using the decoded residual coded data.

The method of claim 1,

If the spatial extension data is Abiteri down mix residual coding data, further comprising: decoding the Abiteri down mix residual coding data by the decoding method according to the core audio object type. Method for decoding channel audio signal.

The method of claim 2,

The upmixing step

And up-mixing the decoded core audio data using the decoded residual coding data and the decoded abiteri down mix residual coding data.

The method of claim 2,

When the spatial extension data is data other than the data indicating the core audio object type, the residual coding data and the abiteri down mix coding data, the spatial extension data is decoded by a decoding method according to the type of the spatial extension data. The method of decoding a multi-channel audio signal further comprising the step.

The method of claim 4, wherein

The upmixing step

And decoding the decoded core audio data by using the decoded residual coding data, the decoded bitwise down mix residual coding data, and the decoded spatial extension data. Way.

And upmixing the decoded core audio data using the decoded residual coded data. A computer-readable recording medium having recorded thereon a program for executing a method of decoding a multichannel audio signal.

A spatial extension data type detector for detecting a type of spatial extension data included in an encoding result of the audio signal;

A core audio object type detector for detecting the core audio object type when the spatial extension data is data indicating a core audio object type in which core audio data is encoded;

A core audio data decoder which decodes the core audio data by a decoding method according to the detected core audio object type;

A residual coded data decoder configured to decode the residual coded data by the decoding method according to the core audio object type when the spatial extension data is residual coded data; And

And an upmixing unit configured to upmix the decoded core audio data by using the decoded residual coded data.

The method of claim 7, wherein

If the spatial extension data is Abiteri down mix residual coding data, an Abiteri down mix residual coding data decoding unit for decoding the Abiteri down mix residual coding data by the decoding method according to the core audio object type Apparatus for decoding a multi-channel audio signal comprising a.

The method of claim 8,

The upmixing unit

And up-mixing the decoded core audio data using the decoded residual coding data and the decoded bitwise down mix residual coding data.

The method of claim 8,

When the spatial extension data is data other than the data indicating the core audio object type, the residual coding data and the abiteri down mix coding data, the spatial extension data is decoded by a decoding method according to the type of the spatial extension data. An apparatus for decoding a multichannel audio signal, further comprising a spatial extension data decoder.

The method of claim 10,

The upmixing unit

And decoding the decoded core audio data by using the decoded residual coding data, the decoded bitwise down mix residual coding data, and the decoded spatial extension data. Device.

Downmixing the input audio signal to generate core audio data and residual data;

Encoding the core audio data according to a predetermined encoding method;

Encoding the residual data according to the predetermined encoding method according to a core audio object type, which is a method of encoding the core audio data; And

And outputting the encoded core audio data and the encoded residual data as an encoding result of the audio signal.

The method of claim 12,

The downmixing step

And downmixing the input audio signal to generate the core audio data, the residual data, and the abiteri down-mix residual data.

The method of claim 13,

And encoding the abiteri down mix residual data according to the predetermined encoding scheme according to the core audio object type.

The method of claim 14,

The output as the encoding result is

And encoding the coded core audio data, the coded residual data, and the coded bitwise down mix residual data as an encoding result of the audio signal.

Encoding the core audio data according to a predetermined encoding method;

And outputting the encoded core audio data and the encoded residual data as an encoding result of the audio signal. A program for executing a multi-channel audio signal encoding method may be read by a computer. Recording media.

A downmixing unit for downmixing the input audio signal to generate core audio data and residual data;

A core audio data encoder which encodes the core audio data according to a predetermined encoding method;

A residual data encoder which encodes the residual data according to the predetermined encoding method according to a core audio object type which is a method of encoding the core audio data; And

And a multiplexer for outputting the encoded core audio data and the encoded residual data as an encoding result of the audio signal.

The method of claim 16,

The downmixing unit

And down-mixing the input audio signal to generate the core audio data, the residual data, and the abiteri down mix residual data.

The method of claim 17,

The apparatus for encoding a multi-channel audio signal according to claim 6, further comprising an abiteri down mix residual data encoder for encoding the abiteri down mix residual data according to the predetermined encoding scheme according to the core audio object type.

The method of claim 19,

The multiplexer

And output the coded core audio data, the coded residual data, and the coded abiteri down-mix residual data as an encoding result of the audio signal.

Receiving a bitstream corresponding to a downmixed audio core signal and a bitstream including additional information for generating multichannels;

Detecting a core object type from a bitstream corresponding to the downmixed audio core signal;

Decoding the down-mixed audio core signal by a decoding scheme determined by the detected core object type;

If the spatial extension data included in the additional information for generating the multi-channel is residual coded data, decoding the residual coded data by the decoding method according to the core audio object type; And

And upmixing the decoded core audio data using the decoded residual coded data.