JP2009134115A

JP2009134115A - decoder

Info

Publication number: JP2009134115A
Application number: JP2007310628A
Authority: JP
Inventors: Ko Uchida; 航内田; Hirobumi Muramatsu; 寛文村松; Katsu Nagase; 克永瀬; Makoto Onishi; 誠大西
Original assignee: Oki Semiconductor Co Ltd
Current assignee: Lapis Semiconductor Co Ltd
Priority date: 2007-11-30
Filing date: 2007-11-30
Publication date: 2009-06-18

Abstract

PROBLEM TO BE SOLVED: To provide a decoder capable of correctly decoding a compressed audio data file in a file format such as AAC ADIF or AAC raw data from an arbitrary position, without depending on the encoder, even during the first playback.

SOLUTION: The decoder stores in advance a frame head position detection bit string composed of a bit string corresponding to the TERM bit string defined in the AAC standard and a bit string corresponding to the data padding bit string. It acquires a playback resume position, searches the compressed audio data file at and after the playback resume position for a bit string that matches the frame head position detection bit string, and detects the head position of a frame based on the position of the bit string found by the search.

COPYRIGHT: (C)2009,JPO&INPIT

Description

The present invention relates to a decoder that decodes a compressed audio data file frame by frame.

Audio compression schemes such as MP3, which is used with the MPEG-1 video data compression scheme, and AAC (Advanced Audio Coding), which is used with MPEG-2 or MPEG-4, are known. In the MP3 and AAC ADTS (Audio Data Transport Stream) file formats, every frame has a header, and a decoder can detect the head position of each frame from position information and the like contained in the header and thereby synchronize for playback. In the AAC ADIF (Audio Data Interchange Format) and AAC raw data file formats, on the other hand, a frame consists only of data (a raw data block).

In the AAC ADIF and AAC raw data formats, resuming decoding and playback after playback has been paused for fast-forward or rewind, or because of an error, has required a device known as a parser, which can interpret the MPEG-4 file format. Using a parser, however, consumes playback device resources, for example by increasing the playback processing load and the amount of memory used, and has been an obstacle to miniaturizing playback devices. A technique that allows decoding and playback to be resumed without a parser has therefore been desired. Conventionally, when neither a parser nor a special technique for resuming playback is used, the head position of a frame cannot be detected, so the compressed audio data file cannot be decoded correctly and cannot be played back.

For example, Patent Document 1 discloses a compressed information reproduction method in which an AAC encoder embeds synchronization information in advance in an element called a DSE (data stream element), and an AAC decoder reproduces the compressed information by synchronizing with the AAC encoder based on that information. The DSE is an element defined in the ISO/IEC 14496-3 standard, which permits user extensions to it, so the document states that compressed information can be reproduced without departing from the standard even when the disclosed method is used.

Patent Document 2 discloses a compressed audio signal playback device that, during the first playback of a compressed audio signal, holds the frame number and head address of each played frame in a frame position information table as frame position information, and that, in response to a fast-forward instruction during the second or a later playback, refers to the frame position information table to determine the address from which to start reading the compressed audio signal after the fast-forward instruction. According to the document, this device can perform fast-forward playback of a compressed audio signal in the ADIF format of the AAC standard at high speed.
USP 6718507; JP 2002-041095 A

In the compressed information reproduction method disclosed in Patent Document 1, however, the AAC encoder embeds synchronization information in the DSE in advance and the AAC decoder synchronizes with the AAC encoder based on that information, so the AAC decoder must be configured in advance with what the synchronization information is. The AAC decoder therefore cannot synchronize with an arbitrary encoder, and the combinations of AAC encoder and AAC decoder that can be used are limited.

The compressed audio signal playback device disclosed in Patent Document 2 holds information such as the frame numbers and head addresses obtained by playing the compressed audio signal, and refers to that information during the second and later playbacks. Playback must therefore be performed at least once without interruption, and if playback is interrupted during the first playback, normal playback is not possible even when an attempt is made to resume it.

The present invention has been made in view of the problems described above, and its object is to provide a decoder that can correctly decode a compressed audio data file in a file format such as AAC ADIF or AAC raw data from an arbitrary frame position, without depending on the encoder and even during the first playback.

A decoder according to the present invention is a decoder that obtains audio data by decoding a compressed audio data file consisting of a plurality of consecutive frames, and is characterized by including: a detection bit string storage unit that stores a frame head position detection bit string; a playback resume position acquisition unit that acquires information representing a playback resume position in the compressed audio data file; and a frame head position detection unit that searches the compressed audio data file at and after the playback resume position for a bit string matching the frame head position detection bit string, and detects the head position of a frame based on the position of the bit string found by the search.

Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

<First embodiment>
FIG. 1 is a block diagram of a decoder 10 according to the present invention. The decoder 10 includes a playback resume position acquisition unit 11, a detection bit string storage unit 12, a frame head position detection unit 13, and a decode processing unit 14. The decoder 10 is a device that decodes a compressed audio data file to obtain audio data. The compressed audio data file is, for example, data in the AAC ADIF (Audio Data Interchange Format) or AAC raw data file format defined in the ISO/IEC 14496-3 standard.

The playback resume position acquisition unit 11 acquires information representing the playback resume position in the compressed audio data file (hereinafter, playback resume position information ST). The playback resume position information ST represents the position at which playback is to resume after playback of the compressed audio data file has been interrupted for fast-forward or rewind or because of a playback error. For example, when the user specifies a playback resume position after fast-forward or rewind, the playback resume position acquisition unit 11 acquires the playback resume position information ST entered by the user from an input unit (not shown). When playback has been interrupted because of a playback error, the playback resume position acquisition unit 11 acquires the playback resume position information ST supplied by a control unit (not shown) together with an instruction to resume playback.

The playback resume position acquisition unit 11 also acquires the compressed audio data file AF. For example, when a file storage unit (not shown) stores the compressed audio data file AF, the playback resume position acquisition unit 11 acquires it from that file storage unit. When a file reception unit (not shown) has received the compressed audio data file AF from a transmission device (not shown) over a communication line, the playback resume position acquisition unit 11 acquires it from that file reception unit.

The compressed audio data file AF consists of a series of frames. FIG. 2 shows the components of an AAC ADIF or AAC raw data frame as defined in the ISO/IEC 14496-3 standard. A frame is made up of the elements SCE, CPE, CCE, LFE, DSE, PCE, FIL, and TERM. As shown in the figure, TERM (Terminator) is the element that marks the end of a frame. The standard defines the value of TERM as 0x7, that is, 111 in binary. An element called data padding, which aligns the data size of the frame to a byte boundary, is inserted after TERM. Data padding bits are all 0, and the padding is between 1 and 7 bits long.

The detection bit string storage unit 12 stores a detection bit string table made up of frame head position detection bit strings (hereinafter, simply detection bit strings). FIG. 3 shows the detection bit string table. The table contains seven detection bit strings, each 10 bits long. They are defined to correspond to the values of TERM and data padding. As described above, TERM has the fixed value 111, and data padding consists of 1 to 7 zero bits.

For example, when the data padding in a frame is 7 bits long, the TERM bit string 111 and the data padding bit string 0000000 (seven zeros) make the bit string from TERM through the data padding 1110000000, so this bit string is stored in advance in the detection bit string table as a detection bit string (No. 7 in the figure). When the data padding in a frame is 1 bit long, the TERM bit string 111 and the data padding bit string 0 make the bit string from TERM through the data padding 1110. The 6 bits preceding the TERM bit string may each be either 0 or 1, and the 10-bit detection bit string xxxxxx1110 is stored in advance in the detection bit string table (No. 1 in the figure, where x is 0 or 1). The other detection bit strings are defined in the same way.
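The construction of the table can be sketched as follows. This is a minimal illustration, assuming that entry No. k corresponds to k bits of data padding (the text confirms this only for No. 1 and No. 7). The names `build_detection_table` and `to_mask_value` are hypothetical, and the character `x` stands for a don't-care bit.

```python
TERM = "111"   # TERM bit string (0x7) as given in the text
WIDTH = 10     # every detection bit string is 10 bits long


def build_detection_table():
    """Return the seven detection bit strings, No. 1 first.

    Assumption: entry No. k is TERM followed by k zero padding bits,
    left-filled with don't-care bits ('x') up to 10 bits.
    """
    table = []
    for pad_len in range(1, 8):            # data padding of 1 to 7 bits
        tail = TERM + "0" * pad_len        # TERM through data padding
        table.append("x" * (WIDTH - len(tail)) + tail)
    return table


def to_mask_value(pattern):
    """Convert a pattern into (mask, value) integers for bitwise matching."""
    mask = int("".join("0" if c == "x" else "1" for c in pattern), 2)
    value = int(pattern.replace("x", "0"), 2)
    return mask, value


for number, pattern in enumerate(build_detection_table(), start=1):
    print(f"No. {number}: {pattern}")
```

With the mask and value form, a 10-bit window `w` held as an integer matches a pattern when `w & mask == value`, which may be convenient on a small playback device.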

The frame head position detection unit 13 receives the playback resume position information ST and the compressed audio data file AF from the playback resume position acquisition unit 11 and executes frame head position detection processing. FIG. 4 shows the series of frames making up the compressed audio data file AF, centered on the Nth and (N+1)th frames (N is a positive integer). Here, the playback resume position represented by the playback resume position information ST is assumed to be the position marked ST in the figure, which lies partway through the Nth frame. The frame head position detection unit 13 searches the compressed audio data file AF at and after the playback resume position ST for a bit string that matches any of the seven detection bit strings shown in FIG. 3, and detects the frame head position FR based on the position of the bit string found by the search.

Specifically, the frame head position detection unit 13 first determines whether the bit string beginning at the bit at the playback resume position ST matches any of the seven detection bit strings stored in the detection bit string storage unit 12. If it determines that there is a match, it takes the position of the bit following the matched bit string (the end of the Nth frame) as the head position FR of the (N+1)th frame and notifies the decode processing unit 14 of it. If it determines that there is no match, it changes the detection position within the compressed audio data file AF and detects the frame head position again. In this case, the frame head position detection unit 13 may acquire the playback resume position ST again from the playback resume position acquisition unit 11 and search in the same way. Alternatively, it may determine whether the bit string beginning at the bit a given M bits after the bit at the playback resume position ST matches any of the detection bit strings (M is a positive integer). In the latter case, the frame head position detection unit 13 repeats the same determination while increasing M until a match is found. When it finally determines that there is a match, it takes the position of the bit following the matched bit string as the frame head position FR and notifies the decode processing unit 14 of it.

For example, if the 10-bit bit string beginning at the bit at the playback resume position ST in FIG. 4 is 0101010101, it matches none of the detection bit strings. In this case, the frame head position detection unit 13 repeats the determination, shifting the head of the bit string under test from the bit at the playback resume position ST in the direction in which the frames advance, until a match is found. For example, when the bit string from TERM through the data padding is 1110000000 as shown in the figure, the frame head position detection unit 13 determines that it matches detection bit string No. 7, 1110000000, stored in the detection bit string table. It then takes the position of the bit following the matched bit string 1110000000 (that is, the end of the Nth frame), namely the position of the first bit of the (N+1)th frame, as the frame head position FR and notifies the decode processing unit 14 of it.
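The search described above can be sketched as follows. This is an illustrative sketch only: the bit stream is modeled as a string of `0` and `1` characters, the function names are hypothetical, the table ordering (No. k has k padding bits) is assumed from the text, and the scan step of 1 bit is an assumption because the text does not say by how much M is increased.

```python
WIDTH = 10
# Detection bit strings No. 1 to No. 7: 'x' don't-care bits, TERM (111),
# then 1 to 7 zero padding bits (ordering assumed from the text).
TABLE = ["x" * (7 - k) + "111" + "0" * k for k in range(1, 8)]


def match_number(window):
    """Return the number (1-7) of the first matching pattern, else None."""
    for number, pattern in enumerate(TABLE, start=1):
        if all(p in ("x", b) for p, b in zip(pattern, window)):
            return number
    return None


def find_frame_head(bits, st, step=1):
    """Scan forward from bit offset st; return (FR, pattern number) or None.

    FR is the offset of the bit that follows the matched 10-bit string.
    """
    pos = st
    while pos + WIDTH <= len(bits):
        number = match_number(bits[pos:pos + WIDTH])
        if number is not None:
            return pos + WIDTH, number
        pos += step                      # shift the bit string under test
    return None


print(match_number("0101010101"))        # None: no detection bit string matches
print(match_number("1110000000"))        # 7: TERM plus seven padding bits

# Made-up stream: payload bits, then TERM + 1 padding bit ending at offset 20.
stream = "0101010101" + "001100" + "1110" + "000110"
print(find_frame_head(stream, 0))        # (20, 1)
```

Pattern No. 1 fixes only four bits, so a bit-by-bit scan like this one may also stop on payload bits that happen to end in 1110, or one bit into a longer padding run. The value returned here is therefore best read as a candidate frame head, with the error determination of the decode processing unit 14 deciding whether decoding from it succeeds.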

The decode processing unit 14 has the function of decoding the compressed audio data file AF, which consists of a plurality of consecutive frames, to obtain audio data. It receives the frame head position FR together with the compressed audio data file AF from the frame head position detection unit 13 and can decode the (N+1)th and subsequent frames using that position as the synchronization position. In general, a decoder that starts decoding partway through a frame does not obtain correct audio data. Based on the frame head position FR supplied by the frame head position detection unit 13, however, the decode processing unit 14 can decode the (N+1)th frame correctly from its head, and can also decode the (N+2)th and subsequent frames correctly using that head position as the synchronization position.

The decode processing unit 14 also has an error determination function for the compressed audio data file AF. It determines that an error has occurred in the following two cases. The first is when a data error is detected in the compressed audio data file AF. For example, when the compressed audio data file AF is stored on a hard disk (not shown), the playback resume position acquisition unit 11 acquires it from the hard disk through some hardware circuit (not shown). A read error on the hard disk or noise in the hardware circuit can then cause the acquired compressed audio data file AF to contain erroneous data, and the error determination processing applied to it reports an error. Likewise, when the compressed audio data file AF is delivered to the decoder 10 from a distribution device (not shown) over a wired or wireless communication line, noise or a momentary interruption on the line can cause the playback resume position acquisition unit 11 to acquire a compressed audio data file AF that contains erroneous data, and the error determination processing again reports an error.

The second is when the information needed to decode the audio data is outside the supported range. The compressed audio data file AF normally contains, along with the audio data, information for decoding that audio data, such as the compression method and compression ratio. When the compression method, compression ratio, or the like does not conform to the specifications of the decoder 10, the error determination processing of the decode processing unit 14 reports an error.

The decode processing unit 14 stores the audio data DA obtained by the decoding in a memory (not shown). An audio playback unit (not shown) can read the audio data DA from the memory as needed and play it back as sound.

FIG. 5 is a flowchart of the decode and playback processing routine. The decode and playback processing is described below with reference to FIG. 5.

First, when the user specifies a playback resume position after fast-forward or rewind, the playback resume position acquisition unit 11 acquires the playback resume position information ST entered by the user from an input unit (not shown). Alternatively, when playback has been interrupted because of a playback error, it acquires the playback resume position information ST supplied by a control unit (not shown) together with an instruction to resume playback (step S1). The playback resume position acquisition unit 11 also acquires the compressed audio data file AF, either from a file storage unit (not shown) when that unit stores the file, or from a file reception unit (not shown) when that unit has received the file from a transmission device (not shown) over a communication line.

The frame head position detection unit 13 receives the playback resume position information ST and the compressed audio data file AF from the playback resume position acquisition unit 11 and executes the frame head position detection processing (step S2). It searches the compressed audio data file at and after the playback resume position ST for a bit string that matches any of the seven detection bit strings stored in the detection bit string storage unit 12, and detects the frame head position FR based on the position of the bit string found by the search.

First, the frame head position detection unit 13 determines whether the bit string beginning at the bit at the playback resume position ST matches any of the seven detection bit strings stored in the detection bit string storage unit 12 (step S3). If it determines that there is a match, it takes the position of the bit following the matched bit string (the end of the Nth frame) as the head position FR of the (N+1)th frame and notifies the decode processing unit 14 of it.

If the frame head position detection unit 13 determines that there is no match (step S3), it either acquires the playback resume position ST again from the playback resume position acquisition unit 11 and performs the same determination, or determines whether the bit string beginning at the bit M bits (M is a positive integer) after the bit at the playback resume position ST matches any of the detection bit strings (step S2). It repeats the same determination, re-acquiring the playback resume position ST or increasing M, until a match is found. When it finally determines that there is a match, it takes the position of the bit following the matched bit string (the end of the Nth frame) as the head position FR of the (N+1)th frame and notifies the decode processing unit 14 of it.

The decode processing unit 14 receives the frame head position FR together with the compressed audio data file AF from the frame head position detection unit 13, decodes the (N+1)th frame from its head position, and decodes the (N+2)th and subsequent frames in the same way using that head position as the synchronization position. Each frame is thus decoded from its head position (step S4). At this time, the decode processing unit 14 also applies the error determination processing to the compressed audio data file AF. If decoding does not complete normally, for example because an error is detected (step S5), the playback resume position acquisition unit 11 acquires different playback resume position information ST (step S1), and the same processing is repeated.

If no error is detected and decoding completes normally (step S5), the decode processing unit 14 stores the audio data DA obtained by the decoding in a memory (not shown). An audio playback unit (not shown) can read the audio data DA from the memory as needed and play it back as sound (step S6).
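The control flow of steps S1 to S6 can be sketched as a retry loop. This is a rough sketch, not the routine of FIG. 5 itself: the four callables, the `DecodeError` exception, and the `max_attempts` limit are all hypothetical (the text simply repeats until decoding succeeds), and returning to step S1 when no frame head is found is an assumption.

```python
class DecodeError(Exception):
    """Raised by the (hypothetical) frame decoder when step S5 finds an error."""


def decode_and_play(get_resume_position, find_frame_head, decode_from,
                    store, max_attempts=8):
    """Run steps S1 to S6; return True once decoded audio has been stored."""
    for _ in range(max_attempts):
        st = get_resume_position()        # S1: acquire resume position ST
        fr = find_frame_head(st)          # S2/S3: search for frame head FR
        if fr is None:
            continue                      # no match found: try another ST
        try:
            audio = decode_from(fr)       # S4: decode from the frame head
        except DecodeError:
            continue                      # S5: error detected, back to S1
        store(audio)                      # S6: keep audio data DA for playback
        return True
    return False


# Stub components standing in for units 11, 13, and 14 (illustration only).
positions = iter([0, 40])
memory = []


def fake_decode(fr):
    if fr == 10:
        raise DecodeError("bad frame")
    return f"pcm@{fr}"


ok = decode_and_play(lambda: next(positions), lambda st: st + 10,
                     fake_decode, memory.append)
print(ok, memory)                         # True ['pcm@50']
```

In the stub run, the first resume position leads to a decode error, so a second position is acquired and decoding from its frame head succeeds.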

As described above, the decoder according to this embodiment stores in advance frame head position detection bit strings, each made up of a bit string corresponding to the TERM bit string defined in the AAC standard and a bit string corresponding to the data padding bit string. The decoder acquires a playback resume position, searches the compressed audio data file at and after that position for a bit string that matches a frame head position detection bit string (the end of the Nth frame), and takes the position of the bit following the bit string found as the head position of the (N+1)th frame. Using that head position as the synchronization position, the decoder can correctly decode the (N+1)th and subsequent frames. The decoder according to this embodiment can thus start decoding correctly from an arbitrary frame position and resume playback even when playback of the compressed audio data file has been interrupted for fast-forward or rewind or because of a playback error.

The frame head position detection bit strings are defined so as to conform to the ISO/IEC 14496-3 standard. The encoder that produces the compressed audio data file only needs to conform to the same standard and does not need to perform any special processing such as embedding specific information in the file. The decoder according to this embodiment can therefore decode a compressed audio data file correctly without depending on the encoder, and can do so regardless of whether the playback is the first or a later one.

<Second embodiment>
The decoder 10 of this embodiment is shown in FIG. 1, as in the first embodiment. Only the parts that differ from the first embodiment are described below. The decoder 10 is assumed to be capable of decoding two channels of data.

FIG. 6 shows part of the series of frames of the two channels Ch1 and Ch2 that make up the compressed audio data file AF, centered on the Nth and (N+1)th frames. The playback resume position represented by the playback resume position information ST is assumed to be the position marked ST in the figure, which lies partway through the Nth frame. In the series of frames of channel Ch1, the terminator TERM and data padding of the Nth frame are followed by SCE (Single Channel Element), the first element of the (N+1)th frame. SCE is known as the element that stores the front center signal of the audio signal. The ISO/IEC 14496-3 standard defines SCE as 0x0, that is, 000 as a 3-bit binary value. In the series of frames of channel Ch2, the TERM and data padding of the Nth frame are followed by CPE (Channel Pair Element), the first element of the (N+1)th frame. CPE is known as the element that stores the front left/right signals and the surround signals of the audio signal. The same standard defines CPE as 0x1, that is, 001 as a 3-bit binary value.

FIG. 7 shows the detection bit string table for channel 1. The table lists seven detection bit strings, each obtained by appending the SCE value 000 to the end of one of the detection bit strings shown in FIG. 3. FIG. 8 shows the detection bit string table for channel 2. It likewise lists seven detection bit strings, each obtained by appending the CPE value 001 to the end of one of the detection bit strings shown in FIG. 3.
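As a rough illustration of how the two per-channel tables relate to the base table, the following Python sketch derives them by appending the element value to each base entry. The base entries themselves are an assumption (TERM taken as 111 followed by one to seven zero padding bits), since the contents of FIG. 3 are not reproduced here; only entry No. 7 of the channel 1 table can be checked against the example given in the text. The names `BASE_TABLE`, `CH1_TABLE`, and `CH2_TABLE` are hypothetical.

```python
SCE = "000"  # 3-bit value of SCE stated in the text
CPE = "001"  # 3-bit value of CPE stated in the text

# Assumed stand-in for the seven entries of FIG. 3: TERM (111) plus 1 to 7 padding zeros.
BASE_TABLE = ["111" + "0" * n for n in range(1, 8)]

# Channel 1 table (FIG. 7): each base entry with the SCE value appended.
CH1_TABLE = [entry + SCE for entry in BASE_TABLE]

# Channel 2 table (FIG. 8): each base entry with the CPE value appended.
CH2_TABLE = [entry + CPE for entry in BASE_TABLE]

# Print the tables with 1-based entry numbers.
for number, (ch1, ch2) in enumerate(zip(CH1_TABLE, CH2_TABLE), start=1):
    print(f"No. {number}: Ch1 {ch1}  Ch2 {ch2}")
```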

As in the first embodiment, the frame head position detection unit 13 searches the compressed audio data file AF at and after the playback resume position ST for a bit string that matches a frame head position detection bit string. The bit string found by this search consists of the TERM and data padding of the Nth frame followed by the SCE or CPE of the (N+1)th frame. The frame head position detection unit 13 takes the position of the first bit of the portion of the compressed audio data file AF corresponding to the SCE or CPE bit string as the head position FR of the (N+1)th frame.

For example, when the bit string running from the TERM of the Nth frame to the SCE of the (N+1)th frame is 1110000000000, as shown for Ch1 in FIG. 6, the frame head position detection unit 13 determines that it matches detection bit string No. 7, 1110000000000, stored in the detection bit string table for channel 1. It takes the position of the first bit of the bit string 000 corresponding to the SCE as the head position FR of the (N+1)th frame and notifies the decode processing unit 14 of it. The decode processing unit 14 receives the frame head position FR together with the compressed audio data file AF from the frame head position detection unit 13, decodes the (N+1)th frame from that head position, and, using the head position as the synchronization position, decodes the (N+2)th and subsequent frames in the same way.
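The Ch1 example can be traced with a small Python sketch. It assumes the channel 1 table consists of 111 followed by one to seven zero padding bits and then the SCE value 000 (only entry No. 7 is confirmed by the example in the text), models the stream as a string of "0" and "1" characters, and uses a hypothetical function name. The point it illustrates is that the matched bit string includes the SCE, so the head position FR is the start of the last three matched bits rather than the bit after the match.

```python
ELEMENT_BITS = 3  # width of the SCE / CPE value appended to each table entry

# Assumed channel 1 table: TERM (111) + 1 to 7 padding zeros + SCE (000).
CH1_TABLE = ["111" + "0" * n + "000" for n in range(1, 8)]


def find_head_with_element(bits, resume_pos, table):
    """Return FR, the index of the first SCE/CPE bit of frame N+1, or None."""
    # Longest entry first; this priority is a choice made for the sketch.
    by_length = sorted(table, key=len, reverse=True)
    for pos in range(resume_pos, len(bits)):
        for pattern in by_length:
            if bits.startswith(pattern, pos):
                # The match ends with the element value, which already belongs to frame N+1.
                return pos + len(pattern) - ELEMENT_BITS
    return None


# Tail of frame N, then entry No. 7 (TERM + 7 padding bits + SCE), then more of frame N+1.
stream = "0101" + "1110000000000" + "1101"
fr = find_head_with_element(stream, resume_pos=1, table=CH1_TABLE)  # 14
```

Decoding would then start at `stream[fr:]`, which begins with the SCE value 000.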

Because the decoder according to this embodiment performs frame head position detection based not only on the bit string corresponding to the TERM and data padding of the Nth frame but also on the bit string corresponding to the SCE or CPE of the (N+1)th frame, it can detect the head position of a frame more accurately than the first embodiment.

<Third Embodiment>
The decoder 10 of this embodiment is shown in FIG. 1, as in the second embodiment. Only the differences from the second embodiment are described below.

The frame head position detection unit 13 of this embodiment includes re-search means that searches again, at and after the position of the bit string first obtained by the frame head position detection process, for a bit string that matches a frame head position detection bit string. It also includes determination means that determines whether the bit strings obtained by the first search and by the re-search are identical. Only when the determination means finds them identical does the frame head position detection unit 13 detect the head position of the frame, based on the position of the bit string obtained by the first search.

FIG. 9 shows part of the series of frames making up the compressed audio data file, where SYNC1 denotes the bit string running from the first bit of the TERM of the Nth frame to the last bit of the SCE or CPE of the (N+1)th frame, SYNC2 denotes the bit string running from the first bit of the TERM of the (N+1)th frame to the last bit of the SCE or CPE of the (N+2)th frame, and so on. According to the description, the ISO/IEC 14496-3 standard specifies that the TERM and SCE of the Nth frame are identical to the TERM and SCE of the (N+1)th frame. SYNC1 and SYNC2 are therefore identical in any compressed audio data file AF that conforms to the standard. The playback resume position represented by the playback resume position information ST is assumed to be the position marked ST in the figure, which lies partway through the Nth frame.

As in the second embodiment, the frame head position detection unit 13 searches for bit strings by referring to the detection bit string table for channel 1 shown in FIG. 7 and the detection bit string table for channel 2 shown in FIG. 8. The decode/playback process is described below with reference to the decode/playback processing routine shown in FIG. 5.

First, the frame head position detection unit 13 acquires the playback resume position ST from the playback resume position acquisition unit 11 (step S1) and searches the compressed audio data file AF at and after ST for a bit string that matches a frame head position detection bit string (step S2). The first bit string found by this search is SYNC1. The frame head position detection unit 13 then searches again, after the position of SYNC1, for a bit string that matches a frame head position detection bit string (step S2). The first bit string found by this re-search is SYNC2.

The frame head position detection unit 13 determines whether the bit string SYNC1 obtained by the first search and the bit string SYNC2 obtained by the re-search are identical (step S3). Only when it determines that they are identical does it take the position of the first bit of the portion of the compressed audio data file AF corresponding to the SCE or CPE bit string as the head position FR of the (N+1)th frame. For example, when SYNC1 and SYNC2 are both 1110000000000, as shown in FIG. 9, the frame head position detection unit 13 takes the position of the first bit of the bit string 000 corresponding to the SCE as the head position FR of the (N+1)th frame.

The frame head position detection unit 13 passes the frame head position FR to the decode processing unit 14 together with the compressed audio data file AF. The decode processing unit 14 receives them, decodes the (N+1)th frame from its head position, and, using that head position as the synchronization position, decodes the (N+2)th and subsequent frames in the same way (step S4).
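Steps S1 to S3 of this embodiment can be sketched in Python as follows. The sketch rests on several assumptions that are not taken from the text: the tables are modelled as 111 followed by one to seven zero padding bits plus the SCE or CPE value, the stream is a string of "0" and "1" characters, the re-search starts immediately after the end of SYNC1, and the function names are hypothetical. Returning `None` on a mismatch is also a simplification, because the text does not describe what the decoder does in that case.

```python
ELEMENT_BITS = 3  # width of the SCE / CPE value at the end of each table entry

# Assumed tables: TERM (111) + 1 to 7 padding zeros + SCE (000) or CPE (001).
CH1_TABLE = ["111" + "0" * n + "000" for n in range(1, 8)]
CH2_TABLE = ["111" + "0" * n + "001" for n in range(1, 8)]
TABLE = sorted(CH1_TABLE + CH2_TABLE, key=len, reverse=True)  # longest entry first


def find_match(bits, start):
    """Return (position, matched bit string) of the first match at or after start."""
    for pos in range(start, len(bits)):
        for pattern in TABLE:
            if bits.startswith(pattern, pos):
                return pos, pattern
    return None


def find_verified_head(bits, resume_pos):
    """Return FR only if the first search and the re-search find identical bit strings."""
    first = find_match(bits, resume_pos)              # step S2: finds SYNC1
    if first is None:
        return None
    pos1, sync1 = first
    second = find_match(bits, pos1 + len(sync1))      # step S2 again: finds SYNC2
    if second is None:
        return None
    _, sync2 = second
    if sync1 != sync2:                                # step S3: compare SYNC1 and SYNC2
        return None
    # FR is the first bit of the SCE/CPE value at the end of SYNC1.
    return pos1 + len(sync1) - ELEMENT_BITS


sync = "1110000000000"
good = "01" + sync + "1011" + sync + "11"
bad = "01" + sync + "1011" + "111000000" + "11"   # second boundary has different padding

fr = find_verified_head(good, 0)        # 12
rejected = find_verified_head(bad, 0)   # None
```

In the second stream the re-search still finds a table entry, but it differs from SYNC1, so no head position is reported.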

The decoder according to this embodiment includes means for re-searching for the head position of a frame and detects the head position only when the bit string obtained by the first search and the bit string obtained by the re-search are identical. It can therefore detect the head position of a frame more accurately than the second embodiment.

The first to third embodiments are examples in which the compressed audio data file is in the AAC ADIF or AAC raw data file format. However, the present invention is applicable to any file format in which the bit strings at the head and end of a frame are defined.

FIG. 1 is a block diagram showing a decoder according to the present invention.
FIG. 2 is a diagram showing the components of a frame in AAC ADIF and AAC raw data.
FIG. 3 is a diagram showing a detection bit string table.
FIG. 4 is a diagram showing part of the series of frames making up a compressed audio data file.
FIG. 5 is a flowchart showing the decode/playback processing routine.
FIG. 6 is a diagram showing part of the series of frames making up a compressed audio data file in the two-channel case.
FIG. 7 is a diagram showing the detection bit string table for channel 1.
FIG. 8 is a diagram showing the detection bit string table for channel 2.
FIG. 9 is a diagram showing part of the series of frames making up a compressed audio data file, with specific bit strings denoted by the symbol SYNC.

Explanation of symbols

10 Decoder
11 Playback resume position acquisition unit
12 Detection bit string storage unit
13 Frame head position detection unit
14 Decode processing unit
AF Compressed audio signal
DA Decoded audio data
FR Frame head position
RP Playback data
ST Playback resume position

Claims

1. A decoder for obtaining audio data by decoding a compressed audio data file consisting of a plurality of consecutive frames, the decoder comprising:
a detection bit string storage unit that stores a frame head position detection bit string;
a playback resume position acquisition unit that acquires information indicating a playback resume position of the compressed audio data file; and
a frame head position detection unit that searches the compressed audio data file, at and after the playback resume position, for a bit string that matches the frame head position detection bit string, and detects the head position of a frame based on the position of the bit string obtained by the search.

2. The decoder according to claim 1, wherein the frame head position detection bit string consists of a bit string corresponding to a TERM bit string defined in the AAC standard and a bit string corresponding to a data padding bit string, and
the frame head position detection unit takes the position of the bit following the bit string that matches the frame head position detection bit string as the head position of the frame.

3. The decoder according to claim 1, wherein the frame head position detection bit string consists of a bit string corresponding to a TERM bit string defined in the AAC standard, a bit string corresponding to a data padding bit string, and a bit string corresponding to an SCE bit string or a CPE bit string, and
the frame head position detection unit takes the position of the first bit of the bit string corresponding to the SCE bit string or CPE bit string in the compressed audio data file as the head position of the frame.

4. The decoder according to claim 1, wherein the frame head position detection unit comprises:
re-search means for searching again, at and after the position of the bit string obtained by the first search, for a bit string that matches the frame head position detection bit string; and
determination means for determining whether the bit strings obtained by the first search and the re-search are identical,
and detects the head position of the frame based on the position of the bit string obtained by the first search only when the determination means determines that they are identical.

5. The decoder according to claim 1, further comprising a decode processing unit that decodes the frame and subsequent frames with the head position of the frame as a synchronization position.

6. The decoder according to claim 5, wherein the decode processing unit performs error determination on the compressed audio data file.