CN1870163A - Audio signal encoder, disk and disk reproducing apparatus - Google Patents
Audio signal encoder, disk and disk reproducing apparatus Download PDFInfo
- Publication number
- CN1870163A CN1870163A CN 200610080336 CN200610080336A CN1870163A CN 1870163 A CN1870163 A CN 1870163A CN 200610080336 CN200610080336 CN 200610080336 CN 200610080336 A CN200610080336 A CN 200610080336A CN 1870163 A CN1870163 A CN 1870163A
- Authority
- CN
- China
- Prior art keywords
- audio
- data
- information
- packet
- ats
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000005236 sound signal Effects 0.000 title claims description 72
- 238000005070 sampling Methods 0.000 claims description 73
- 238000000034 method Methods 0.000 claims description 41
- 101100410782 Arabidopsis thaliana PXG1 gene Proteins 0.000 abstract 1
- 101150033568 ats1 gene Proteins 0.000 abstract 1
- 238000010586 diagram Methods 0.000 description 103
- 208000037146 Atypical Timothy syndrome Diseases 0.000 description 74
- 208000037498 atypical type Timothy syndrome Diseases 0.000 description 74
- 238000012545 processing Methods 0.000 description 33
- 238000006243 chemical reaction Methods 0.000 description 26
- 230000008569 process Effects 0.000 description 24
- 230000014759 maintenance of location Effects 0.000 description 22
- 238000013139 quantization Methods 0.000 description 19
- 230000000875 corresponding effect Effects 0.000 description 18
- 238000012986 modification Methods 0.000 description 17
- 230000004048 modification Effects 0.000 description 17
- 102100038246 Retinol-binding protein 4 Human genes 0.000 description 15
- 238000000605 extraction Methods 0.000 description 14
- 101710137011 Retinol-binding protein 4 Proteins 0.000 description 12
- 238000001514 detection method Methods 0.000 description 10
- 230000001360 synchronised effect Effects 0.000 description 9
- 239000011159 matrix material Substances 0.000 description 8
- 230000001276 controlling effect Effects 0.000 description 7
- 230000005540 biological transmission Effects 0.000 description 6
- 238000004891 communication Methods 0.000 description 5
- 230000006835 compression Effects 0.000 description 5
- 238000007906 compression Methods 0.000 description 5
- 101000665882 Homo sapiens Retinol-binding protein 4 Proteins 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 101100173755 Candida albicans (strain SC5314 / ATCC MYA-2876) RBP2 gene Proteins 0.000 description 2
- 102100028472 DNA-directed RNA polymerases I, II, and III subunit RPABC5 Human genes 0.000 description 2
- 101000723805 Homo sapiens DNA-directed RNA polymerases I, II, and III subunit RPABC5 Proteins 0.000 description 2
- 101000969688 Homo sapiens Macrophage-expressed gene 1 protein Proteins 0.000 description 2
- 102100021285 Macrophage-expressed gene 1 protein Human genes 0.000 description 2
- JDZPLYBLBIKFHJ-UHFFFAOYSA-N Sulfamoyldapsone Chemical compound C1=CC(N)=CC=C1S(=O)(=O)C1=CC=C(N)C=C1S(N)(=O)=O JDZPLYBLBIKFHJ-UHFFFAOYSA-N 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 230000009977 dual effect Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000012856 packing Methods 0.000 description 2
- 238000004350 spin decoupling difference spectroscopy Methods 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- UTPYTEWRMXITIN-YDWXAUTNSA-N 1-methyl-3-[(e)-[(3e)-3-(methylcarbamothioylhydrazinylidene)butan-2-ylidene]amino]thiourea Chemical compound CNC(=S)N\N=C(/C)\C(\C)=N\NC(=S)NC UTPYTEWRMXITIN-YDWXAUTNSA-N 0.000 description 1
- 101100126625 Caenorhabditis elegans itr-1 gene Proteins 0.000 description 1
- 101100129500 Caenorhabditis elegans max-2 gene Proteins 0.000 description 1
- 102100038503 Cellular retinoic acid-binding protein 1 Human genes 0.000 description 1
- 101100514485 Drosophila melanogaster Rbp6 gene Proteins 0.000 description 1
- 101000792933 Homo sapiens AT-rich interactive domain-containing protein 4A Proteins 0.000 description 1
- 101001099865 Homo sapiens Cellular retinoic acid-binding protein 1 Proteins 0.000 description 1
- 101000755643 Homo sapiens RIMS-binding protein 2 Proteins 0.000 description 1
- 101000756373 Homo sapiens Retinol-binding protein 1 Proteins 0.000 description 1
- 101000756365 Homo sapiens Retinol-binding protein 2 Proteins 0.000 description 1
- 101001096332 Homo sapiens Retinol-binding protein 5 Proteins 0.000 description 1
- 101100511816 Rattus norvegicus Lrrc59 gene Proteins 0.000 description 1
- 102100022941 Retinol-binding protein 1 Human genes 0.000 description 1
- 102100022942 Retinol-binding protein 2 Human genes 0.000 description 1
- 101100464574 Schizosaccharomyces pombe (strain 972 / ATCC 24843) rbp28 gene Proteins 0.000 description 1
- 238000005102 attenuated total reflection Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000010485 coping Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Images
Landscapes
- Signal Processing For Digital Recording And Reproducing (AREA)
- Management Or Editing Of Information On Record Carriers (AREA)
Abstract
When the disc of this invention is mainly recorded by audio frequency signal, it can be simply played by user again and again; furthermore, it can simplify the real-time management. The format not contains VTS but consisted by ATS. The ATS is consisted by audio frequency manager (AMG), video and audio frequency manage menu (AMGM) and ATS1 which is managed by AMGI in AMG.
Description
This application is a divisional application of the application entitled "encoding apparatus for audio signals, record disc and playback apparatus therefor" filed on 19/11/1998 with application number 98124848.9.
Technical Field
The present invention relates to an audio signal encoding device, a turntable reproducing device, and an audio reproducing device.
Background
As a conventional optical disk for audio reproduction, a CD (compact disk) is known. As an optical disk having higher density than CD, DVD (digital video disk) is known.
However, in a DVD (hereinafter, referred to as a DVD video), since a video signal is mainly recorded and an audio signal is secondarily recorded, there are the following problems:
(1) the audio signal is integrated with the video signal, and the recording capacity of the audio signal is small.
(2) The time of the audio signal cannot be managed.
(3) Simple character information such as a song title cannot be taken out.
Since a user of a disc player has a wider use area than that of a video disc, a simple playback method is obtained by providing a TOC (table of contents) area like a CD. However, in the DVD-video, since the video directory block unit is constituted by the pilot control packet (CONT packet), the plurality of video (V) packets, and the audio (a) packet, and the playback of the V, A packet is controlled by the CONT packet, the user cannot easily play back the recorded audio signal, which results in a problem of poor usability.
In the DVD video disc, since time management is performed only in units of video frames, when recording is performed mainly on audio signals, continuity of audio signals is important as compared with the video disc, and thus there is a problem that real-time management is difficult.
Disclosure of Invention
Accordingly, an object of the present invention is to provide a turntable capable of facilitating reproduction by a user and facilitating real-time management when recording is performed mainly on an audio signal.
The object of the present invention is to provide a playback apparatus for the turntable.
An object of the present invention is to provide an encoding apparatus of an audio signal.
The invention aims to provide an audio reproducing device which can be configured at low cost under the condition of reproducing the following data structure: the data structure includes a first packet having audio data as actual data, a second packet having real-time information related to the audio data as actual data, and a third packet having still picture data related to the audio data, and the still picture is played back in synchronization with the audio signal or page-turned display is performed regardless of the playback of the audio signal.
An audio reproducing apparatus is provided which can be configured inexpensively when audio data and multi-channel audio data are subjected to A/D conversion at two sampling frequencies.
An audio reproducing apparatus capable of effectively coping with copying by recording copyright data relating to audio data and still picture data on a disc.
In a CD, since text information and the like indicating the content of a music source are recorded while being scattered on a disc, it is difficult to display the music source while instantaneously changing while tracking the music source during playback. Therefore, for the music source (a) in playback, the text (V) cannot be displayed with the a-V effect.
Therefore, in the case of recording a moving image such as a movie in a DVD (digital バ - サタイル disc) -video disc, when subtitles are superimposed, the recording is continuously performed. However, in the DVD, if a DVD player which records mainly on a disc such as a music source is considered, there is a problem in that the ratio of audio data is reduced in this method.
In view of the above-described problems, it is an object of the present invention to provide an encoding device, a disc, and a disc playback device capable of efficiently recording and displaying an audio signal of character information indicating the content of audio data such as a music source when the audio data is mainly recorded.
In order to achieve the above object, the present invention provides an audio title set including audio data and still picture data, which does not include a playback control packet, and a management area including information for managing the audio title set, instead of a video title set and a management area including information for managing the video title set.
That is, according to the present invention, there is provided an audio disc recording a data structure having:
an audio title set having audio data and still picture data without including a packet for playback control;
a first management area having information for managing the audio title set;
a second management area not containing the video title set but containing information for managing the video title set.
According to the present invention, there is provided an encoding device for an audio signal, the encoding device having means for formatting into the following data structure: an audio title set including audio data and still picture data, and a first management area including information for managing the audio title set, the audio title set being free from a playback control packet; a second management area not containing the video title set but containing information for managing the video title set.
According to the present invention, there is provided an encoding device for an audio signal, the encoding device having means for formatting into the following data structure: an audio title set including audio data and still picture data, and a first management area including information for managing the audio title set, the audio title set being free from a playback control packet; and a second management area which does not include the video header group but includes information for managing the video header group, wherein the audio data in the audio header group includes at least one of first audio data for A/D conversion at a first sampling frequency and second audio data for A/D conversion at a second sampling frequency.
According to the present invention, there is provided an encoding device for an audio signal, the encoding device having means for formatting into the following data structure: an audio title set including audio data and still picture data, and a first management area including information for managing the audio title set, the audio title set being free from a playback control packet; the audio data is data A/D-converted at a first sampling frequency which is a multiple of 48kHz and has a frame rate of 1/600 seconds and data A/D-converted at a second sampling frequency which is a multiple of 44.1kHz and has a frame rate of 1/551.25 seconds, and a second management area which does not contain a video title group and contains information for managing the video title group.
According to the present invention, there is provided an encoding device for an audio signal, the encoding device having means for formatting into the following data structure: an audio title set including audio data and still picture data, and a first management area including information for managing the audio title set, the audio title set being free from a playback control packet; the audio data is data A/D-converted at a first sampling frequency which is a multiple of 48kHz and has a frame rate of 1/600 seconds and data A/D-converted at a second sampling frequency which is a multiple of 44.1kHz and has a frame rate of 1/551.25 seconds, and playback control information which prohibits playback from being emphasized is recorded in the first management area when the first sampling frequency is 192kHz and the second sampling frequency is 176.4 kHz.
According to the present invention, there is provided an audio disc reproducing apparatus having means for reproducing audio data and still picture data in an audio title set based on information in the first management area, not on a package for reproduction control, for an audio disc on which the above-described data structure is recorded.
In order to achieve the above object, the present invention is: in the case of playing back a data structure having a first packet having audio data as actual data, a second packet having real-time information data related to the audio data as the actual data, and a third packet having still picture data related to the audio data as the actual data, the playback is performed by a buffer and decoder of 3 systems.
That is, according to the present invention, there is provided an audio reproducing apparatus that reproduces a data structure having an Audio Title Set (ATS) containing a plurality of Audio Objects (AOBs) and a Still Picture Set (SPS),
the AOB is composed of the following two AOBs:
a first AOB constituted only of a first packet having audio data as actual data;
a second AOB constituted by a second packet having the above-mentioned first packet and real-time information data related to the above-mentioned audio data as actual data,
and, the SPS has a third packet containing still picture data related to the audio data,
characterized in that the audio playback apparatus further comprises:
a packet distribution means for distributing the first, second and third packets;
a first buffer for storing the first packet distributed by the packet distribution means;
a first decoding means for decoding the audio data based on the first packet stored in the first buffer;
a second buffer for storing the second packet distributed by the packet distribution device;
a second decoder for decoding the real-time information data according to a second packet stored in the second packet;
a third buffer for storing the third packet distributed by the packet distribution device;
and a third decoder for decoding the still picture data according to a third packet stored in the third packet.
The present invention performs D/A conversion in accordance with one system when performing A/D conversion of audio data and multi-channel audio data at a sampling frequency of 2 systems.
That is, according to the present invention, there is provided an audio reproducing apparatus for selectively reproducing a transmitted data structure at least together with sampling frequency data of first audio data subjected to a/D conversion at a sampling frequency of a first system and second audio data subjected to a/D conversion at a sampling frequency of a second system, comprising:
means for determining the sampling frequency of the audio data reproduced by the playback device based on the sampling frequency data;
rate conversion means for rate-converting the audio data of the first system into a sampling frequency of the second system based on the judgment result;
and a D/A conversion device for D/A converting the audio data at the sampling frequency of the second system.
When copyright data relating to audio data and still picture data is recorded on a disc, the still picture data and the copyright data are synthesized to perform telop display.
That is, according to the present invention, there is provided an audio reproducing apparatus for reproducing a data structure including audio data, copyright data related to the audio data, and still picture data related to the audio data,
the apparatus includes means for synthesizing and displaying copyright data and still picture data related to the audio data during reproduction of the audio data.
According to the present invention, there is provided an audio reproducing apparatus for reproducing a data structure including audio data, still picture data related to the audio data, and copyright data related to the still picture data,
the audio reproducing apparatus includes means for combining and displaying still picture data and copyright data related to the still picture data during reproduction of the audio data.
Drawings
These and other objects, advantages and features of the present invention will be further explained by the description of the embodiments of the present invention in conjunction with the accompanying drawings. In these drawings:
FIG. 1 is a schematic diagram illustrating one embodiment of a DVD-video format and a DVD-audio format in accordance with the present invention;
fig. 2 is a diagram showing in detail a format of an Audio Manager (AMG) of fig. 1;
fig. 3 is a diagram illustrating in detail a format of an Audio Title Set (ATS) of fig. 1;
fig. 4 is a diagram showing in detail the format of the audio manager information (AMGI) of fig. 2;
fig. 5 is a diagram showing in detail a format of an audio title set-attribute table (ATS-ATRT) of fig. 4;
fig. 6 is a diagram showing in detail the format of the audio title set-attribute data (ATS-ATR) of fig. 5;
fig. 7 is a diagram illustrating in detail the format of Audio Title Set Information (ATSI) of fig. 3;
fig. 8 is a diagram showing in detail the format of the audio title group information manager table (ATSI-MAT) of fig. 7;
fig. 9 is a diagram showing in detail the audio title set menu/audio stream/attribute data (ATSM-AST-ATR) of fig. 8;
fig. 10 is a diagram illustrating in detail the format of an audio title set-audio stream-attribute table (ATS-AST-ATRT) of fig. 8;
fig. 11 is a diagram showing in detail attribute data (ATS-AST-ATR) of each audio stream of fig. 10;
fig. 12 is a schematic diagram showing an audio directory block unit (ACBU) of fig. 1;
fig. 13 is a diagram showing in detail the formats of the audio and video packets of fig. 12;
fig. 14 is a diagram showing in detail the format of an audio control (a-CONT) packet of fig. 12;
FIGS. 15(A), (B), (C) are diagrams showing in detail the format of the Audio Character Display (ACD) area of FIG. 14 and deformation diagrams;
FIG. 16 is a schematic diagram and a modified schematic diagram showing an example displayed by the namespace information of FIG. 15;
fig. 17(a), (B), and (C) are diagrams showing in detail the format of the Audio Search Data (ASD) area of fig. 14;
fig. 18 is a diagram showing a modification of the audio directory block unit of fig. 1;
fig. 19 is a block diagram showing a reproduction apparatus of a DVD player according to the present invention;
fig. 20 is a block diagram functionally showing the playback apparatus of fig. 19; among them, FIGS. 20(A), (B), and (C) are a block diagram showing a character display circuit and a block diagram showing a modified example of a DVD playback apparatus;
fig. 21 is a diagram showing in detail the format of the audio manager information (AMGI) in the second embodiment;
FIG. 22 is a diagram showing TOC information of FIG. 21 in detail;
fig. 23 is a diagram showing in detail the format of Audio Title Set Information (ATSI) in a modification of the second embodiment;
fig. 24 is a block diagram showing a reproduction apparatus of a DVD player of the second embodiment;
fig. 25 is a block diagram functionally showing the playback apparatus of fig. 24;
FIG. 26 is a block diagram showing TOC information and a playback apparatus in the third embodiment;
FIG. 27 is a flowchart for explaining A-V synchronized playback processing;
FIG. 28 is a flowchart for explaining A-V synchronized playback processing;
fig. 29 is a diagram showing a basic format of a DVD player of the fourth embodiment;
fig. 30 is a diagram showing an audio data configuration of the DVD player of fig. 29;
FIG. 31 is a diagram showing a basic format of a DVD-Van disc;
FIG. 32 is a diagram showing a basic format of a DVD-video disc;
FIG. 33 is a diagram showing a basic format of a DVD-Avd disc;
FIG. 34 is a schematic view showing an AOTT-AOB-ATR in the DVD player of the fourth embodiment;
FIG. 35 is a diagram showing a special header of linear PCM in a DVD-Avd disc of the fourth embodiment;
fig. 36 is a flowchart showing a reproduction process of an ATS and a still picture in the reproduction apparatus of the fourth embodiment;
fig. 37 is a flowchart showing a frame playback process corresponding to the sampling frequency of audio data in the playback apparatus of the fourth embodiment;
fig. 38 is a flowchart showing an emphasis reproduction process of audio data in the reproduction apparatus of the fourth embodiment;
fig. 39 is a flowchart showing an emphasis reproduction process of audio data in the reproduction apparatus of the fourth embodiment; FIGS. 39(A), (B), (C) and (D) are schematic views showing a modified example of the DVD audio playback apparatus, the character display circuit and the recording area of the V-RAM;
fig. 40 is a block diagram showing an embodiment of an apparatus for encoding an audio signal according to the present invention;
fig. 41 is a block diagram showing the signal processing circuit of fig. 40 in detail;
FIG. 42 is a schematic view of a data configuration of the fifth embodiment;
fig. 43 is a diagram showing in detail an audio object group for audio private title (AOTT-AOBS) of fig. 42;
FIG. 44 is a diagram showing an example of the audio packet of FIG. 43 in detail;
FIG. 45 is a diagram illustrating in detail the private header of FIG. 44;
FIG. 46 is a schematic diagram illustrating in detail the UPC/EAN-ISRC data of FIG. 45;
FIG. 47 is a diagram illustrating bit shifting of the audio data of FIG. 44;
fig. 48 is a diagram illustrating in detail a Real Time Information (RTI) packet of fig. 43;
FIG. 49 is a diagram illustrating in detail the still picture group (SPS) of FIG. 43;
fig. 50 is a diagram showing in detail the format of the audio title group information manager table (ATSI-MAT) of fig. 42;
fig. 51 is a diagram showing in detail an audio object attribute for audio private title (AOTT-AOB-ATR) of fig. 50;
fig. 52 is a diagram showing in detail the audio object audio stream attribute for audio-only title (AOTT-VOB-AST-ATR) of fig. 50;
fig. 53 is a diagram illustrating channel allocation information of fig. 51 and 52 in detail;
FIG. 54 is a diagram illustrating in detail the down-mixing coefficients (ATS-DM-COEFT) of FIG. 50;
fig. 55 is a diagram showing in detail the still picture data attribute (ATS-SPCT-ATR) of fig. 50;
fig. 56 is a diagram showing in detail the audio title set program chain information table (ATS-PGCIT) of fig. 42;
FIG. 57 is a diagram showing in detail the ATS-PGCIT information (ATS-PGCITI) of FIG. 56;
FIG. 58 is a diagram showing in detail the ATS-PGCIT search pointer (ATS-PGCI-SPR) of FIG. 56;
FIG. 59 is a schematic diagram showing in detail ATS-PGC カテゴリ -A (ATS-PGCI-CAT) of FIG. 58;
fig. 60 is a diagram showing in detail the audio title set program chain information (ATS-PGCI) of fig. 56;
FIG. 61 is a diagram showing in detail the ATS-PGC general information (ATS-PGCI-GI) of FIG. 60;
FIG. 62 is a diagram showing in detail the ATS-PGC directory (ATS-PGCI-CNT) of FIG. 61;
fig. 63 is a diagram showing in detail the ATS program information table (ATS-PGIT) of fig. 60;
fig. 64 is a diagram showing in detail the ATS program information (ATS-PGI) of fig. 63;
FIG. 65 is a schematic diagram showing in detail the ATS-PG directory (ATS-PG-CNT) of FIG. 64;
FIG. 66 is a diagram showing in detail the ATS unit Play packet information Table (ATS-C-PBIT) of FIG. 63;
fig. 67 is a diagram showing in detail the ATS unit play pack information (ATS-C-PBI) of fig. 66;
FIG. 68 is a schematic diagram showing in detail the ATS-C type (ATS-C-TY) of FIG. 67;
fig. 69 is a block diagram showing an encoding device of the fifth embodiment;
FIG. 70 is a flowchart showing a process of the encoding apparatus of FIG. 69;
fig. 71 is a block diagram showing a decoding apparatus of the fifth embodiment;
fig. 72 is a block diagram functionally representing the decoding device of fig. 71;
fig. 73 is a flowchart showing a process of the decoding device of fig. 71 and 72;
fig. 74 is a block diagram showing in detail the decoding apparatus of the fifth embodiment;
fig. 75 is a flowchart showing a playback process of an audio signal and still picture data by the decoding apparatus of fig. 74;
fig. 76 is a flowchart showing a page turning process of a still picture by the decoding apparatus of fig. 74;
fig. 77 is a block diagram showing a modification of the audio decoder of fig. 74;
FIG. 78 is a process schematic diagram showing the sample rate converter of FIG. 77;
fig. 79 is a block diagram showing in detail the copyright data display apparatus of the decoding apparatus of the fifth embodiment;
fig. 80 is a flowchart showing a copyright data display device of the apparatus of fig. 79;
fig. 81 is a block diagram showing a packetizing apparatus in transmitting an audio signal of the fifth embodiment;
FIG. 82 is a flowchart showing a packing process by the packing apparatus of FIG. 81;
fig. 83 is a flowchart showing in detail the packet generation apparatus of fig. 82;
fig. 84 is a flowchart showing in detail the ATS generation process of fig. 82;
fig. 85 is a flowchart showing a transmission process by the packetizing apparatus of fig. 81;
fig. 86 is a block diagram showing an unpacking device in transmitting an audio signal according to the fifth embodiment;
fig. 87 is a flowchart showing a reception process of the unpacking device in fig. 86;
FIG. 88 is a flowchart showing an unpacking process by the unpacking device of FIG. 86;
fig. 89 is a flowchart showing in detail the ATSI decoding process of fig. 81;
fig. 90 is a flowchart showing in detail the packet decoding process of fig. 88.
Detailed Description
Embodiments of the present invention are described below with reference to the drawings. Fig. 1 is a schematic diagram showing a format of a DVD-video disc and an embodiment of a format of a DVD player according to the present invention, fig. 2 is a schematic diagram showing a format of an Audio Manager (AMG) of fig. 1 in detail, fig. 3 is a schematic diagram showing a format of an audio title group (ATS) of fig. 1 in detail, fig. 4 is a schematic diagram showing a format of audio manager information (AMGI) of fig. 2 in detail, fig. 5 is a schematic diagram showing a format of an audio title group attribute table (ATS-ATRT) of fig. 4 in detail, fig. 6 is a schematic diagram showing a format of audio title group attribute data (ATS-ATR) of fig. 5 in detail, fig. 7 is a schematic diagram showing a format of audio title group information (ATSI) of fig. 3 in detail, fig. 8 is a schematic diagram showing a format of an audio title group information manager table (ATSI-ATR) of fig. 7 in detail, and fig. 9 is a schematic diagram showing an audio title group menu/audio stream/attribute data (ATSI-AST-MAT) of fig. 8 in detail Fig. 10 is a diagram showing in detail the format of the audio title set/audio stream/attribute table (ATS-AST-ATRT) of fig. 8, and fig. 11 is a diagram showing in detail the attribute data (ATS-AST-ATR) of each audio stream of fig. 10.
Fig. 12 is a diagram showing an audio directory block unit (ACBU) of fig. 1, fig. 13 is a diagram showing in detail the formats of audio packets and video packets of fig. 12, fig. 14 is a diagram showing in detail the format of an audio control (a-CONT) packet of fig. 12, fig. 15(a), (B), (C) are a diagram showing in detail the format of an Audio Character Display (ACD) area of fig. 14 and a modification diagram, fig. 16 is a diagram showing an example displayed by name space information of fig. 15, fig. 17(a), (B), (C) are diagrams showing in detail the format of an audio retrieval data (ASD) area of fig. 14, and fig. 18 is a diagram showing a modification example of the audio directory block unit of fig. 1.
Fig. 19 is a block diagram showing a playback apparatus for a DVD disc according to the present invention, fig. 20 is a block diagram functionally showing the playback apparatus of fig. 19, fig. 21 is a diagram showing in detail the format of audio manager information (AMGI) in the second embodiment, fig. 22 is a diagram showing in detail TOC information of fig. 21, fig. 23 is a diagram showing in detail the format of Audio Title Set Information (ATSI) in the modification of the second embodiment, fig. 24 is a block diagram showing a playback apparatus for a DVD disc of the second embodiment, fig. 25 is a block diagram functionally showing the playback apparatus of fig. 24, fig. 26 is a block diagram showing TOC information and a playback apparatus in the third embodiment, and fig. 27 and 28 are flowcharts for explaining a-V synchronized playback processing.
In the DVD player described above, in order to adapt to a transition period when transitioning from a CD to a DVD player, signals of both of two channels for stereo and multiple channels of 5/6/8 channels are recorded as audio signals. When the transition period has elapsed, it may be considered to record only the 5/6/8-channel multi-channel signal.
Fig. 1(a) and (b) show respective formats of a DVD video disc and a DVD audio disc, and areas of the DVD audio disc format are different in name but have compatibility with the DVD video disc. First, the method is roughly divided into: the format of a DVD-video disc is composed of the respective areas of the Video Manager (VMG) at the beginning and the plurality of Video Title Sets (VTS) following it, while the format of a DVD-audio disc is composed of the respective areas of the Audio Manager (AMG) shown in detail in fig. 2 and the plurality of Audio Title Sets (ATS) following the AMG as shown in detail in fig. 3 in correspondence therewith.
The VTSs are respectively composed of the beginning VTS information (VTSI), one or more video directory block settings (VCBS) following the VTSI, and the last VTSI, and, correspondingly, the ATS are respectively composed of the beginning ATS information (ATSI), one or more audio directory block settings (ACBS) following the ATSI, and the last ATSI. The performance time of each track in ACBS is set in real time in ATSI.
In the present invention, menu information for displaying a menu screen is recorded in the initial ACBS. It is the same as the DVD video disc and its explanation is omitted.
Each VCBS is constituted by a plurality of VCBS, and on the other hand, each ACBS is constituted by a plurality of ACBS. Each VCB is a Title (Title) of video, and each ACB is a Title of audio, respectively. Each VCB (title) is constituted by a plurality of segments (chapters), and on the other hand, each ACB (title) is constituted by a plurality of tracks (tracks) in correspondence therewith. A segment contains a header Part (PTT) and a track contains a header Part (PTT).
Each segment is constituted by a plurality of CELLs (CELL), and, correspondingly, each track is constituted by a plurality of indices (Index). Each unit is composed of a plurality of VCB units (VCBU), and each index is composed of a plurality of ACB units (ACBU) corresponding thereto. Each of the VCB unit and the ACB unit is composed of a plurality of packets, and one packet is composed of 2048 bytes.
Each VCB unit is composed of a leading control packet (hereinafter, referred to as a CONT packet), a plurality of video (V) packets, audio (a) packets, and sub-picture (SP) packets following it, and, correspondingly, each ACB unit is composed of a leading audio control packet (hereinafter, referred to as a-CONT packet), a plurality of a packets, and V packets following it.
The CONT packet includes information for controlling the subsequent V packet, and the a-CONT packet includes information for managing the audio signal of the subsequent a packet, like the TOC information of the CD. Audio data is arranged in the a packet, and Closed Caption (CC) data, for example, other than the audio data is arranged in the V packet.
As shown in fig. 2, the AMG (audio manager) has:
audio manager information (AMGI) detailed in FIG. 4
AMG menu is set with an audio directory block (AMGM-ACBS);
AMGI for backup.
The AMGM-ACBS includes, as control information:
presentation Control Information (PCI);
data retrieval information (DSI).
As shown in fig. 3, the ATS (audio title set) has:
audio Title Set Information (ATSI) shown in detail in fig. 7;
ATS Menu Audio directory Block setup (ATSM-ACBS);
ATS title with Audio directory Block setup (ATSA-ACBS);
ATSI for backup.
Both ATSM-ACBS and ATSA-ACBS have the PCI and DSI described above (FIG. 2).
As shown in detail in fig. 4, the AMGI (audio manager information) has:
manager table of AMGI (AMGI-MAT);
a retrieval pointer table (T-SRPT) for the title;
audio manager Menu PGCI Unit Table (AMGM-PGCI-UT);
a dual rental management information table (PTL-MAIT);
audio title set attribute table (ATS-ATRT) detailed in fig. 5;
text data manager (TXTDT-MG);
audio manager Menu element (index) Address Table (AMGM-C-ADT);
audio manager Menu Audio directory Block Unit Address map (AMGM-ACBU-ADMAP).
As shown in detail in fig. 5, the ATS-ATRT (audio title set attribute table) has:
audio title set attribute table information (ATS-ATRTI);
each audio title set attribute retrieval pointer (ATS-ATR-SRP # 1 to # n) of a plurality (n) of ATS;
the audio title set attribute data (ATS-ATR- #1 to # n) of each of the plurality of (n) ATS shown in detail in fig. 6.
As shown in detail in fig. 6, each audio title set attribute data (ATS-ATR- #1 through # n) has:
ATS-ATR-EA (end address);
ATS-CAT (カテゴリ one);
ATS-ATRI (information).
As shown in detail in fig. 7, the ATSI (audio title set information) shown in fig. 3 has:
audio title group information manager table (ATSI-MAT) shown in detail in fig. 8;
audio title set title part search pointer table (ATS-PTT-SRPT);
audio title set program chain information table (ATS-PGCIT);
audio title set menu PGCI unit table (ATSM-PGCI-UT);
audio title set time chart (ATS-TMAPT);
audio title set menu, unit, address table (ATSM-C-ADT);
audio title set menu audio directory block unit address map (ATSM-ACBU-ADMAP);
audio title set unit address table (ATS-C-ADT);
audio title set audio directory block unit address map (ATS-ACBU-ADMAP).
As shown in detail in fig. 8, the ATSI-MAT (audio title group information manager table) shown in fig. 7 has:
ATS-ID (identifier);
ATS-EA (end address);
·ATSI-EA;
VERN (version number of DVD Audio Standard);
ATS-CAT (カテゴリ one);
·ATSI-MAT-EA;
ATSM-ACBS-SA (start address);
·ATSA-ACBS-SA;
·ATS-PTT-SRPT-SA;
·ATS-PGCIT-SA
·ATSM-PGCI-UT-SA;
·ATS-TMAP-SA;
·ATSM-C-ADT-SA;
·ATSM-ACBU-ADMAP-SA;
ATSM-AST-ATR (ATSM audio stream attribute), as shown in detail in FIG. 9;
ATS-AST-Ns (number of audio streams of ATS);
the ATS-AST-ATRT (audio stream attribute table of ATS) shown in detail in fig. 10.
As shown in detail in fig. 9, the ATSM-AST-ATR is composed of 8 bytes (bits b63 to b0), and the following data (1) to (4) (other bits are reserved) are arranged as attributes of the encoded audio signal recorded on the disc.
(1) Audio coding mode (3 bit b 63-b 61)
000 b: dolby-AC-3
010 b: MPEG-1 or MPEG-2 (without extension bit stream)
011 b: MPEG-2 (with extension bit stream)
100 b: linear PCM audio
101 b: linear PCM audio (comprising 2ch +5ch, 2ch +6ch, 2ch +8ch)
(2) quantization/DRC (dynamic Range control) information (2 bits b5, b54)
The audio encoding method is "11 b" in the case of "000 b
In the case of "010 b" or "011 b", the audio encoding method is:
00 b: absence of dynamic range control data in MPEG audio streams
01 b: dynamic range control data in MPEG audio stream
10b, 11 b: retention
The audio encoding method corresponds to stereo 2ch in the case of "100 b" or "101 b":
00 b: 16 bit
01 b: 20 bit
10 b: 24 bit
11 b: retention
(3) The sampling frequency fs (2 bits b53, b52), corresponding to stereo 2ch, is:
00b:48kHz
01b:96kHz
10b:192kHz
(4) number of channels (3 bits b 50-b 48)
000 b: 1ch (Single track)
001 b: 2ch (Dual-track)
010b:3ch
011b:4ch
100 b: (stereo 2ch +5ch)
101 b: (stereo 2ch +6ch)
110b:7ch
111 b: (stereo 2ch +8ch)
As shown in detail in fig. 11, the ATS-AST-ATRT (audio stream attribute table of ATS) shown in fig. 10 has ATS-AST-ATRs for each of audio streams # 1 to #7, and each ATS-AST-ATR is formed of 8 bits (64 bytes in total).
As shown in fig. 11, the ATS-AST-ATR of one audio stream is composed of 8 bits (bits b63 to b0) identical to the audio title set menu, audio stream, attribute data (ATSM-AST-ATR), and the following data are included in addition to the attribute data (1) to (4):
(5) multichannel extension (1 bit b60)
(6) Audio type (2 bit b59, b58)
(7) Audio application (2 bit b57, b56)
(8) Extraction information of its stream (AST) (2 bits b47, b46)
(9) The extraction information (2 bits b45, b44) of LFE (Low Frequency Effect) 1 ch.
Further, in the (7) audio application mode of the DVD player, recording:
11 b: 2ch + sound system
In addition, the (8) stream extraction information and the (9) LFE1ch extraction information have recorded as band information:
00 b: full (1/1)
10 b: half (1/2)
11 b: one quarter (1/4).
However, the number of (4) channels in the ATSM-AST-ATR must be 2ch under audio stream # 0, and audio stream # 1 contains 3ch in front. That is, for example, when 2+6ch is used to record an audio signal of one title, a binaural signal of 2ch is assigned to the audio stream # 0, a front signal of 3ch within 6ch is assigned to the audio stream # 1, and a back signal of 2ch and an LFE1ch signal are assigned to the audio stream # 2. Further, "3" is recorded in both the audio manager information manager table (AMGI-MAT) shown in fig. 4 and the audio title group information manager table (ATSI-MAT) shown in fig. 8 as the use data of streams # 0 to # 2.
Further, the 2+6ch analog audio signal is sampled at a sampling frequency fs as follows, and is quantized by the number of quantization bits as follows for recording:
binaural 2 ch: 48kHz, 20 bit
Back 2ch, LFE1 ch: 48kHz, 16 bit (not extracted)
In this case, in the audio title set menu/audio stream/attribute data (ATSM-AST-ATR) shown in fig. 9, the following are recorded as attributes of the two-channel 2 ch:
(1) audio coding mode
101 b: linear PCM audio (comprising 2+5ch, 2+6ch, 2+8ch)
(2) quantization/DRC
01 b: 20 bits
(3) Sampling frequency fs
00b:48kHz
(4) Number of sound channels
101 b: (stereo 2ch +6ch)
Also, in the ATS-AST-ATR of audio stream # 0, there are recorded:
(1) audio coding mode
101 b: linear PCM audio (comprising 2ch +5ch, 2ch +6ch, 2ch +8ch)
(2) quantization/DRC
01 b: 20 bits
(3) Sampling frequency fs
00b:48kHz
(4) Number of sound channels
001 b: 2ch (stereo)
(7) Audio application mode
11 b: 2ch + sound system
(8) Extracting information of the stream
00 b: all (1/1)
(9) LFE1ch extraction information
00 b: all (1/1).
Also, in the ATS-AST-ATR of audio stream # 1:
(1) audio coding mode
101 b: linear PCM audio (comprising 2ch +5ch, 2ch +6ch, 2ch +8ch)
(2) quantization/DRC
01 b: 16 bits
(3) Sampling frequency fs
00b:96kHz
(4) Number of sound channels
001b:3ch
(7) Audio application mode
11 b: 2ch + sound system
(8) Extracting information of the stream
00 b: all (1/1)
(9) LFE1ch extraction information
00 b: all (1/1).
Also, in the ATS-AST-ATR of audio stream # 2, there are recorded:
(1) audio coding mode
101 b: linear PCM audio (comprising 2ch +5ch, 2ch +6ch, 2ch +8ch)
(2) quantization/DRC
01 b: 16 bits
(3) Sampling frequency fs
00b:48kHz
(4) Number of sound channels
001b:3ch
(7) Audio application mode
11 b: 2ch + sound system
(8) Extracting information of the stream
00 b: all (1/1)
(9) LFE1ch extraction information
00 b: all (1/1).
The following describes a packet recorded in the audio stream and its control packet. As shown in fig. 12, the a-CONT packs in the ACB unit of the DVD player, which is composed of any number of packs of 0.4 to 1.0 second for the VCB unit, and the ACB unit, which is composed of any number of packs of 0.5 to 1.0 second, are arranged in the third pack of the DVD-video disc.
The A-CONT packets are arranged substantially in 0.5 second units of audio time with indexed nicks arranged to end in the range of 0.5-1.0 seconds. Also, the time of Audio (GOF: Group of Audio Frame unit) is represented by the A-CONT packet, and the data position is determined by the Audio Frame number and the number of first access unit pointers and Frame headers. The a packet preceding the a-CONT packet is not compulsorily padded in 0.5 second units of video data.
The adjacent a-packs are arranged such that audio signals are correlated with each other, for example, in the case of two channels, an L-channel pack and an R-channel pack are arranged adjacently, and even in the case of 5/6/8-channel multi-channel, the adjacent arrangement is also performed. The V pack is disposed adjacent to the A pack when an image is displayed when an audio signal is reproduced. As shown in fig. 13, the a-packet and the V-packet are configured by adding 4-byte packet start information, 6-byte SCR (System Clock reference) information, 3-byte Mux rate information, and a total 14-byte packet header of 1-byte padding to 2034-byte user data (a data and V data) (1-packet is 2048 bytes in total). In this case, the SCR information as the time stamp is "1" in the header packet in the ACB unit and is continuous in the same header, whereby the time of the a packet in the same header can be managed.
In contrast, as shown in fig. 14, the a-CONT packet is composed of a packet header of 14 bytes, a system header of 24 bytes, an ACD (audio character display) component of 1003 bytes, and an ASD (audio retrieval data) component of 1007 bytes. The ACD component is composed of a component header of 6 bytes, a substream ID of 1 byte, 636 bytes of ACD (audio character display) information shown in detail in fig. 15a, and a reserved area of 360 bytes. The ASD package is similarly composed of a package header of 6 bytes, a substream ID of 1 byte, and ASD (audio search data) of 1000 bytes shown in detail in fig. 17.
As shown in detail in fig. 15(a), the 636-byte ACD information area has a general information area of 48 bytes, and a 294-byte area for each of the first language character "1" and the second language character "2", and each of the areas is composed of a namespace area of 93 bytes, two free space areas of 93 bytes, and a data pointer area of 15 bytes. As shown in fig. 6, data for representing a song title in japanese is arranged in one of the first language character "1" and the second language character "2", and data for representing in english is arranged in the other one of the first language character "1" and the second language character "2". The presentation language may be decided by the disc distributor.
The 48-byte general information includes, for example, 16-byte service level information, 12-byte language code information, 6-byte character set code information, 6-byte display item information, 2-byte information "different from the ACD information" and 6-byte reservation information. The 16-byte service level information represents display size, display kind, audio/video/SP distinction, stream, etc., and text is legal (necessary) and bitmap is optional (arbitrary). The language code of 12 bytes and the video file respectively use 2 bytes to represent the language of the characters "1", "2", which represents at most 8 languages in 1 file. English is legal.
The 6-byte character set code information may have 15 character codes corresponding to the language code at maximum, and 1 byte may be used to indicate the presence or absence and type of the language of the character "1" or "2". Examples of codes are as follows:
1.ISO646
2.ISO8859-1
3.MS-JIS
the display item information of 6 bytes indicates the free space "1", "2", the presence or absence of a data pointer, and the ID shown in fig. 15. The namespace is legal and must record title names, music names, artist names.
As shown in detail in fig. 17a, the 1000-byte ASD (audio search data) is composed of 16-byte general information, 8-byte current number (No.) information, 16-byte current time information, 8-byte header group search information, 8-byte header search information, 404-byte track search information, 408-byte index search information, 80-byte highlight search information, and 52-byte reserved area.
The 8-byte current number information is composed of a current title number (2 bytes: BCD) of the title group, a current track number (2 bytes: BCD) of the title group, a current search number (2 bytes: BCD) of the track, and a reserved area (2 bytes). The 16-byte current time information is composed of the track play time (4 bytes: BCD), the track remaining play time (4 bytes: BCD), the title absolute time (4 bytes: BCD), and the title remaining absolute time (4 bytes: BCD).
The header group search information of 8 bytes is composed of the first segment number (4 bytes) of the header group and the last segment number (4 bytes) of the header group. The header search information of 8 bytes is composed of a first segment number (4 bytes) of the header and a last segment number (4 bytes) of the header. The track search information of 404 bytes is constituted by the track and the sector number (4 bytes × 99) of the header, the first sector number (4 bytes) of the header, and the last sector number (4 bytes) of the header.
The 408-byte index search information is composed of the index and the section number of the title (4 bytes × 100), the first section number of the title (4 bytes), and the last section number of the title (4 bytes). The 8-byte highlight search information is composed of an inner zone number (4 bytes × 10) of the track and an outer zone number (4 bytes × 10) of the track.
According to such a format, since the a-CONT packet for managing the audio signal of the subsequent a packet is arranged at the head of the plurality of a packets, as in the TOC information of the CD, the audio data is not integrated with the video data or the like, and the recording capacity can be increased. Further, the audio time can be managed by the a-CONT packet, and simple text information such as a song title related to the audio data can be extracted by the a-CONT packet.
Since TOC information such as a title, a start address, and a playing time is arranged in the A-CONT packet, even during audio playback, information corresponding to a user's operation can be extracted from the A-CONT packet and playback can be started. Further, by arranging TOC information in the audio manager information (AMGI) and the Audio Title Set Information (ATSI), it is possible to store necessary TOC information in a memory in the playback apparatus, and immediately take out information corresponding to a user operation from the memory to start playback. Since it is not necessary to store a large amount of information such as program chain information (PGCI) in a DVD video disc, the disc can be managed efficiently.
1. When there is no image (V) data in the directory,
(1) the search and random access corresponding to 3 levels of title, music and index can be realized.
(2) The start of GOF (audio frame) unit, time search and random access can be realized.
(3) The title, music, and time of indexing can be managed in real time.
2. When there is image (V) data in the directory,
in association with the audio data, in addition to the above-described (1) to (3),
(4) the title, the current time during the playing of the music and the remaining time can be displayed and managed in real time.
In connection with the video data, it is,
(1) it is possible to realize retrieval and random access corresponding to the title, PTT, and 3 levels of the unit.
(2) The start, time search, and random access of a video frame unit can be realized.
(3) The time of the title, PTT, unit can be managed in real time.
(4) The current time and the remaining time in the PTT or title playback can be displayed and managed in video frame unit time.
The ACBU of fig. 1(b) contains a-CONT packets and CONT packets, and may be configured without a-CONT packets and CONT packets, as shown in fig. 18. In this case, the video signal is recorded, and the recording capacity of the video signal is increased in proportion, the disc size can be miniaturized, and the playback function can be simplified, so that a playback apparatus suitable for portability can be provided.
Next, a description will be given of modifications of ACD (audio character display) information shown in fig. 15(a) and ASD (audio search data) information shown in fig. 17 (a).
Fig. 15(B) and 17(B) show a first modification, and fig. 15(C) and 17(C) show a second modification. As shown in detail in fig. 15(B), the ACD information area of 676 bytes in the first modification is composed of a general information area of 48 bytes, characters "1" and "2" of 294 bytes, respectively, a display control data area of 16 bytes, and a reserved area of 24 bytes. Each information area of the characters "1" and "2" is composed of a namespace area of 93 bytes, two free space areas of 93 bytes, and a data pointer area of 15 bytes. In order to display the music title shown in fig. 16, 1/4-corner header text data of 31 bytes and full-corner and half-corner text data of 62 bytes are arranged in the namespace area. The address (timing time) of the packet a is configured using 8 bytes in the display time data area of 16 bytes as the display start time and end time of the text.
The 48-byte general information includes, for example, 16-byte service level information, 12-byte language code information, 6-byte character set code information, 6-byte display item information, 2-byte information "different from the ACD information" and 6-byte reservation information. The 16-byte service level information represents display size, display kind, audio/video/SP distinction, stream, etc., and text is legal (necessary) and bitmap is optional (arbitrary). The 6-byte character set code information is in the same language as the video file, in which 2 bytes are used to represent the characters "1", "2", respectively, and represents at most 8 languages in 1 file. English is legal.
As shown in detail in fig. 17B, the 1000-byte ASD (audio search data) as the first modification is composed of 16-byte general information, 8-byte current number information, 16-byte current time information, 8-byte header group search information, 8-byte header search information, 404-byte track search information, 408-byte index search information, 80-byte highlight search information, and 52-byte reserved area.
Text display time data may be recorded in the ASD area instead of the ACD area. That is, as a second modification, as shown in fig. 15(C), 16 bytes of character control data of the ACD area are made a reserved area, and the entire area of 40 bytes is made a reserved area, or as shown in fig. 17(C), 16 bytes of a reserved area of 52 bytes of the ASD area are made character display time data, and the remaining 36 bytes are made a reserved area.
The playback apparatus according to the present invention will be described with reference to fig. 19. The above data structure is recorded in the form of bits by performing EFM modulation on the DVD video disc 1. When the operation section 18 and the remote controller 19 perform the track selection, the playback, the fast forward, and the stop operation, the control section 23 controls the drive device 2 and the playback device 17 according to the operation, and at the time of playback, the bit data recorded in the DVD-video 1 is read by the drive device 2, and then EFM demodulation is performed.
In the playback apparatus 17, the signal is sent to the CONT packet detection unit 3 and the a-CONT packet detection unit 9. A CONT packet detection unit 3 detects a CONT packet in the playback data, sets control parameters for a parameter unit 8, and sequentially writes V packets controlled by the CONT packet into a V packet buffer 4. The user data (video signal, sub-picture information) in the V packet written in the V packet buffer 4 is extracted in packet order by the buffer extraction unit 5 based on the SCR (see fig. 13) in the V packet, extracted in output Time order based on the pts (presentation Time stamp) in the CONT packet, and then outputted as an analog video signal through the video conversion unit 6, the D/a conversion unit 7, and the video output terminals 15 and 15'.
The a-CONT packet detection unit 9 detects an a-CONT packet in the playback data, sets control parameters in the parameter unit 14, and sequentially writes the a packets controlled by the a-CONT packet into the a packet buffer 10. The user data (audio signal) in the a packet written in the a-packet buffer 10 is extracted in packet order by the SCR by the buffer extraction unit 11, extracted in output time order by the current time (see fig. 17 a) of the Audio Search Data (ASD) in the a-CONT packet, and then output as an analog audio signal by the PCM conversion unit 12, the D/a conversion unit 13, and the audio output terminal 16. The display data (audio character display information ACD shown in fig. 15a and 16) in the a-CONT packet is supplied to the display signal generating unit 20, and a display signal is generated and output through the display signal output terminal 22 or to the built-in character display unit 21.
Fig. 20 is a block diagram functionally showing the configuration shown in fig. 19. The playback apparatus 2 corresponds to the driver apparatus 2 shown in fig. 19, the playback signal processing/separating apparatus a (9, 10, 11, 14) corresponds to the a-CONT packet detection unit 9, the a packet buffer 10, the buffer extraction unit 11, and the parameter unit 14, the audio signal output apparatus (12, 13) corresponds to the PCM conversion unit 12 and the D/a conversion unit 13, and the character information output apparatus 20 corresponds to the display signal generation unit 20. The playback signal processing/separating means V (3, 4, 5, 8) corresponds to the CONT packet detection unit 3, the V packet buffer 4, the buffer extraction unit 5, and the parameter unit 8, and the video signal output means and the sub-picture information output means (6, 7) correspond to the image conversion unit 6 and the D/a conversion unit 7. The control device 23 corresponds to the control unit 23.
In fig. 20, when the control device 23 issues a command signal for reproducing a target music piece from the operation section 18 and the remote controller 19, an address control information signal corresponding to the reproduction command is transmitted to the reproduction device 2, whereby the target music piece is reproduced from the DVD video disc 1. The reproduction signal processing/separating device A separates the reproduction data, and transmits A-CONT information to the control device 23, audio signals to the audio signal output devices (12, 13), and character information to the character information output device 20. The playback signal processing and separating device V separates the playback data, sends CONT information to the control device, and sends video signals and sub-picture information to the video signal device and the sub-picture information output devices (6, 7), respectively.
The operation of the character information output device 20 and the flat rectangular display 21 for character display will be described in detail with reference to fig. 20 (a). The display time data within the a-CONT packet is separated into display start time data and display end time data by the display time decoder 251, and the respective data are applied to the start comparator 252 and the end comparator 253. The character data in the a-CONT packet is converted into dot matrix character data for display by the character data decoder 254, and the converted data is stored in the buffer 255.
Next, the start comparator 252 compares the display start time data with the address of the packet a during playback, and when they match, sends a dot matrix read start control signal to the buffer 255 and sends a display on control signal to the display 256 (21'). The end comparator 253 compares the display end time data with the address of the a packet during playback, and when they match, sends a timing signal for outputting the next text time data to the display time decoder 251, and at the same time, sends a control signal for turning off the display to the display 256 (21').
Fig. 20(B) shows, as a modification of the embodiment, an apparatus for playing back a disc having a structure in which V packets and CONT packets but only a-CONT packets are included, and the video processing section is omitted. ACD information in the preceding a-CONT packet is read out in advance from the a-CONT packet of the a packet output from the control audio signal output device 12, 13, and stored in the ACD storage unit 14B, and text information in the ACD information is displayed on the flat rectangular display 21' as the display device 21.
In a disc without CONT pack, no synchronization is required.
Fig. 20C shows, as another modification of the embodiment, a reproduction apparatus in which additional recording is performed in a lead-in area (illustrated TOC area 1a) in the innermost peripheral portion of the disc 1, and when reproduction is started, the TOC detection device 324 accesses the TOC area 1a independently and records the TOC area in a memory constituting the TOC information storage unit 314A. In this configuration, when audio reproduction control information for reproducing with the sound quality recommended by the program mixer is recorded in the ACD component, the sound quality/level control information output means 331 and the sound quality/level control processing means 332 are provided, and the sound quality/level is controlled based on the control information.
The sound quality/level control information can be time-controlled in a block having the configuration shown in fig. 20(a) by using the control start time data and the control end time data.
Next, a playback apparatus according to a second embodiment using toc (table Of contents) information will be described. As shown in fig. 21, the TOC shown in detail in fig. 22 is additionally recorded in the free area of the AMGI (audio manager information), and the playback apparatus accesses the TOC information to play the beginning of the music piece. Fig. 22 shows, as an example, general TOC information recorded in a lead-in area of a CD, and the same information is recorded repeatedly 3 times. When recording is performed on the DVD-video 1 of the present invention, the repetition may be performed in this manner or may not be performed.
In TOC information used in a CD, when a pointer is 00 to 99, the absolute time at which each musical piece indicated by the number starts is indicated by minutes (PMIN), seconds (PSEC), and frames (PFRAME). When the pointer is a0, PMIN denotes the original music chapter, PSEC PFRAME 0. When the pointer is a1, PMIN denotes the last chapter, PSEC PFRAME 0. When the pointer is a2, the absolute time at which the lead-out region starts is represented using minute (PMIN), second (PSEC), and frame (PFRAME). Therefore, the TOC information shown in fig. 22 indicates that 6 music pieces (or 6 music pieces) are recorded on the DVD disk 1 (pointers are 01 to 06). The TOC information may be recorded in a free area of the ATSI (audio title set information) as shown in fig. 23, or may be recorded in a reserved area (360 bytes) within the ACD packet of the a-CONT packet shown in fig. 14, instead of the AMGI.
Next, the TOC information is read at the start of playback, stored in the TOC information storage unit 14A shown in fig. 24 and 25, and when playback of the beginning of a music piece or a musical piece is designated, the disc 1 is accessed with reference to the TOC information storage unit 14A. This TOC information is recorded in the AMGI, ATSI, or a-CONT packet, and as shown in fig. 26, the TOC information is additionally recorded in the lead-in area (TOC area 1a shown in the figure) on the innermost circumference of the disc 1, and at the start of playback, the TOC detection device 24 independently accesses the TOC area 1a and stores it in the TOC information storage unit 14A. The TOC information conforms to the configuration standard of fig. 22, is added with simple playback information such as a down-mix coefficient, and is referred to as sapp (simple audio Play pointer) for distinction.
The a-V synchronized playback process using the reproduction control information such as TOC information and search information will be described with reference to fig. 27 and 28. In fig. 27, when playback of the beginning of a music piece or a musical piece is designated, TOC information corresponding to the designated position is referred to (step S1), and then the positions of a cell (cell) and an index are calculated from the TOC information (step S2). Then, the position is retrieved (step S3), and when confirmed, the synchronous playback of the a packet and the V packet shown in detail in fig. 34 is performed (step S4 → S5).
In fig. 28, the CONT packet is reproduced ((step S11), the a-CONT packet is reproduced next (step S12), next, it is checked whether or not the time information in the CONT packet and the a-CONT packet is the same time (step S13), in the case where the time information is not the same time, the CONT packet and the a-CONT packet are both adjusted to ± 1 packet (step S14), and then the process returns to step S11 to reproduce the CONT packet and the a-CONT packet, and in step S14, the CONT packet and the a-CONT packet may be both adjusted to ± 1 packet to reproduce the packet.
In step S13, when the time information in the CONT packet and the a-CONT packet is the same time, the a packet controlled by the a-CONT packet is reproduced while the a packet address (SCR information) is incremented by one (step S15), and the V packet controlled by the CONT packet is reproduced while the V packet address (SCR information) is incremented by one (step S16). Then, it is checked whether the playback of the a-packet is ended (step S17), and when not ended, it proceeds to step S18, and on the other hand, when ended, it proceeds to step S20.
In step S18, it is checked whether playback of the V packet is finished, and when not finished, it returns to step S13, and on the other hand, when finished, it proceeds to step S19. In step S19, the next V packet controlled by the CONT packet is played back while incrementing the V packet address by one, and the process returns to step S13. In step S20, the next a packet controlled by its a-CONT packet is played back while incrementing the a packet address by one, and then it is checked whether the playback of the V packet is finished (step S21), and when not finished, it returns to step S16, on the other hand, when finished, it proceeds to step S23.
In step S22, it is checked whether the a packet controlled by the a-CONT packet is the final packet, and if not, the process returns to step S16, and if it is, the process returns to step S12 to play back the next a-CONT packet. In step S23, the next V packet controlled by the CONT packet is replayed while incrementing the V packet address by one, and then it is checked whether the end of frame (EOF) exists (step S24). When there is no EOF, the flow returns to step S13, and on the other hand, when there is an EOF, the a-V synchronized playback processing ends. Thus, the audio data a and the video data V accommodated in the packets are subjected to seamless synchronized playback in accordance with the CONT packet and the time management information in the a-CONT packet.
The fourth embodiment will be explained below. Fig. 29 shows a format of a DVD player according to a fourth embodiment of the present invention, and includes only an ATS without including the VTS shown in fig. 31 to 33. The ATS is composed of an Audio Manager (AMG) shown in FIG. 1(b), an audio management menu (AMGM) for video and audio, and ATS <1> and ATS <2> managed by the AMGI within the AMG. As shown in FIG. 30, ATS <1> and ATS <2> do not include an A-CONT packet, and are composed of an A packet and a still picture packet. The still picture pack is not arranged in plural relative to the a pack, but one pack is arranged per track.
For reference, fig. 31 shows a format of a DVD-Van (video + audio pilot) disc, which is roughly composed of a Video Title Set (VTS) as DVD-video data and an ANV title set (ANV-TS) as data pilot (pilot) data. Specifically, the VTS has the same configuration as the DVD-video shown in fig. 1(a) and fig. 32 described below, and the ANV-TS is composed of the Audio Manager (AMG) shown in fig. 1(b), and ATS <1> and ATS <2> paired with VTS <1> and VTS <2> on the VTS side and managed by the AMGI in the AMG, respectively.
The format of the DVD-video is, as shown in fig. 32 and fig. 1(a), composed of only the VTS without ATS and ANV-TS.
Fig. 33 shows a format of a DVD-Avd (audio + AV data) disc, which is roughly composed of a Video Title Set (VTS) as DVD-video data and an Audio Title Set (ATS) as DVD-audio data. Specifically, the VTS is composed of a Video Manager (VMG) shown in fig. 1(a), a video management menu (VMGM) for video and audio, and VTS <1> managed by the VMGI in the VMG.
On the other hand, the ATS is composed of an Audio Manager (AMG) shown in fig. 1(b), an audio management menu (AMGM) of video and audio, an ATS <1> managed by the AMGI in the AMG in a pair with audio data in the VTS <1> on the VTS side, and an ATS <2> managed by the AMGI in the AMG in the same manner as that not in a pair with the VTS side. The ATS <2>, as shown in FIG. 30, has no A-CONT packet and is composed of an A packet and a still picture packet.
Fig. 34 shows an audio-specific title audio object attribute (AOTT-AOB-ATR) of the disc of the fourth embodiment, which is recorded in the disc as data representing the configuration of audio data of the DVD-Avd disc. This attribute data is composed of 8 bytes (64 bits b63 to b0), and if specified in order from the MSB side, is composed of the following:
4-bit (b 63-b 60) Audio coding method
1 bit (b59) down-mixing (D-M) mode
3-bit (b 58-b 56) multichannel System
4 bits (b55 to b52) of the quantization bit number Q1 of the channel group 1
4 bits (b51 to b48) of the quantization bit number Q2 of the channel group 2
Sampling frequency fs1 of channel group 1 with 4 bits (b47 to b44)
Sampling frequency fs2 of channel group 2 with 4 bits (b43 to b40)
3 reserved regions of bits (b 43-b 40)
5-bit (b 36-b 32) channel assignment
The remaining reserved area of 32 bits (b31 to b 0).
The remaining 32 bits (b 31-b 0) are used for attribute data of each channel.
The above data is described in more detail below.
(1) Audio coding mode (b 63-b 60)
0000 b: linear PCM mode
0001 b: reserved for compressed audio (Dolby number)
0010 b: reserved for compressed audio (without MPEG2 extension)
0011 b: reserved for compressed audio (with MPEG2 extension)
0100 b: reserved for compressed audio (DTS)
0101 b: reserved for compressed audio (SDDS)
And others: reserved for other coding modes
(2) Frequency mixing mode (b59)
0 b: down-conversion mixing binaural output permission
1 b: down-conversion mixing binaural output disable
(3) Multichannel method (b 58-b 56)
0000 b: type 1
And others: retention
(4) Number of quantization bits Q1(b 55-b 52) of channel group 1
0000 b: 16 bits
0001 b: 20 bits
0010 b: 24 bits
And others: retention
(5) Number of quantization bits Q2(b 51-b 48) of channel group 2
When the number of quantization bits Q of the channel group 1 is "0000 b", it is "0000 b
When the number of quantization bits Q of the channel group 1 is "0001 b", it is "0000 b" or "0001 b
When the number of quantization bits Q of the channel group 1 is "0010 b", it is "0000 b", "0001 b", or "0010 b
Wherein,
0000 b: 16 bits
0001 b: 20 bits
0010 b: 24 bits
And others: retention
(6) Sampling frequency fs1(b 47-b 44) of channel set 1
0000b:48kHz
0001b:96kHz
0010b:192kHz
1000b:44.1kHz
1001b:88.2kHz
1010b:176.4kHz
And others: retention
(7) Sampling frequency fs2(b 43-b 40) of channel set 2
When the sampling frequency fs1 of the channel group 1 is "0000 b", it is "0000 b
When the sampling frequency fs1 of the channel set 1 is "0001 b", it is "0000 b" or "0001 b
When the sampling frequency fs1 of the channel set 1 is "0010 b", it is "0000 b", "0001 b", or "0010 b
When the sampling frequency fs1 of the channel set 1 is "1000 b", it is "1000 b
When the sampling frequency fs1 of the channel set 1 is "1001 b", it is "1000 b" or "1001 b
When the sampling frequency fs1 of the channel set 1 is "1010 b", it is "1000 b", "1001 b", or "1010 b
A linear PCM scheme is used in the disc of this fourth embodiment. The special header of the linear PCM, as shown in fig. 35, is composed of the following parts:
8 bits of sub-stream ID
Reserved region of 4 bits
4 bits of ISRC numbering
8 bits of ISRC data
8 bits of dedicated header Length
16 bits of the first Access Unit pointer
1-bit audio emphasis flag F1
4 bits of audio emphasis flag F2, and so on.
The audio emphasis flag F1 is described as "emphasis off" (0 b) when the sampling frequency fs is 96kHz or 88.2kHz, and is described as "emphasis on" (1 b) when the sampling frequency fs is other than 96 kHz. The audio emphasis flag F2 is described as "emphasis off" (0 b) when the sampling frequency fs is 192kHz or 176.4kHz, and is described as "emphasis on" (1 b) when the sampling frequency fs is other than 192 kHz.
The playback processing by the playback apparatus of this fourth embodiment is described below with reference to fig. 36 to 39. First, in fig. 36, it is determined whether the reproduced signal is data of an Audio Title Set (ATS) alone or has still picture data (step S500), and if it is data alone, only ATS is reproduced (step S501); on the other hand, if both are present, the ATS and the still picture are played back (step S502).
In fig. 37, first, it is determined whether the first sampling frequency is a multiple of 48kHz or the second sampling frequency is a multiple of 44.1kHz (step S600), and in the case of the first sampling frequency, the frame rate is set to the first frame rate (1/600 seconds) (step S601), and in the case of the second sampling frequency, the frame rate is set to the second frame rate (1/551.25 seconds) (step S602). At this time, the determined sampling frequency may be displayed (step S603).
In fig. 38, first, it is determined whether or not the sampling frequency fs is 192kHz (step S700), and if it is 192kHz, the emphasis filter circuit is turned off (step S703). If the frequency is not 192kHz, it is determined whether or not the audio emphasis flag is on (step S701), and if so, the emphasis circuit is turned on (step S702), while if not, the process proceeds to step S703 to turn on the emphasis circuit.
In fig. 39, first, it is determined whether or not the sampling frequency fs is 176.4kHz (step S800), and if it is 176.4kHz, the emphasis filter operation is turned off (step S803). If the frequency is not 176.4kHz, it is judged whether or not the audio emphasis flag is on (step S801), and if so, the emphasis filter is turned on (step S802), and if not, the operation proceeds to step S803 to turn on the emphasis circuit.
In the DVD audio playback apparatus according to the modification shown in fig. 39(a) and 39(B), the D-pack is processed and displayed by the display signal generation unit 20 and the character display unit 21'.
Fig. 39(B) further shows that in the case where audio reproduction control information is recorded, sound quality/level control information output means 331 and sound quality/level control processing means 332 are provided, and sound quality/level is controlled based on the control information.
Fig. 39(C) shows a character display circuit for displaying characters in a format, and the display time decoder 251, the start comparator 252, the end comparator 253, the character decoder 254, the buffer 255, and the display 256 (21') have the same configuration as fig. 20 (a). As shown in fig. 16, the buffer 255 and the display 256 can display characters of 15 characters (31 characters at a half angle) × 2.5 lines in a group for each line. Further, as shown in fig. 39(D), the V-RAM459a in the video display processor 459 has a capacity capable of storing 2 × 8 sets of character information.
The character display device shown in fig. 39(C) has a normal mode in which one set of character information is displayed on the display 256, and a special mode in which 2 × 8 sets of character information are displayed on an unshown display device through the external output terminal 15 ″, and the operation of the normal mode is substantially the same as that of fig. 20 (a). That is, in the normal mode, the display time data in the D packet is separated into the display start time data and the display end time data by the display time decoder 251, and the data is applied to the start comparator 252 and the end comparator 253. The text data in the D packet is converted into display dot matrix text data by the text data decoder 254, and the data is stored in the buffer 255.
Next, the start comparator 252 compares the display start time data with the address of the packet a during playback, and when they match, sends a dot matrix read start control signal to the buffer 255 and sends a display on control signal to the display 256 (21'). The end comparator 253 compares the display end time data with the address of the a packet during playback, and when they match, sends a timing signal for outputting the next text time data to the display time decoder 251, and at the same time, sends a control signal for turning off the display to the display 256 (21').
On the other hand, in the special mode, the display time data in the D packet is separated into display start time data and display end time data by the display time decoder 251, and the respective data are applied to the start comparator 252 and the end comparator 253 and, at the same time, to the CPU 60. The character data in the D packet is converted into dot matrix character data for display by the character data decoder 254, and the converted data is stored in the buffer 255. Also, the display start address (address corresponding to the display start address of the ACD) and the division number data within the a-CONT packet are decoded by the collective display control data decoder 58 and applied to the CPU 60.
Next, the start comparator 252 compares the display start time data with the address of the a packet during reproduction, and when they match, sends a read start control signal for the dot matrix to the buffer 255, and the end comparator 253 compares the display end time data with the address of the a packet during reproduction, and when they match, sends a timing signal for outputting the next character time data to the display time decoder 251.
The CPU 60 checks the display start time data of each group decoded by the display time decoder 251 based on the display start address and the division number data of each group decoded by the collective display control data decoder 458, and when they match, designates a group of write addresses to the V-RAM459a in the video display processor 459.
Thus, the dot matrix data of one set read out from the buffer 255 is written to the designated address by the CPU 60 in the V-RAM459a in the video display processor 459 based on the display start address and the division number data, and the same is performed as described below, and dot matrix data of 16 sets is written to the V-RAM459 a. The video display processor 459 develops the 16 sets of dot data into a video signal of one screen, and outputs the video signal to an external display device, not shown, through the external output terminal 15 ″.
Fig. 40 and 41 show an encoding device. Fig. 40 is a block diagram showing an embodiment of an audio signal encoding device according to the present invention, and fig. 41 is a block diagram showing the signal processing circuit of fig. 40 in detail.
In fig. 40, an analog audio signal a is sampled at a sufficiently high sampling frequency (sampling period Δ t), for example, 192kHz by an a/D converter 31, and converted into a high-resolution PCM signal of, for example, 24 bits, and converted into a data string corresponding to a high-resolution curve α:
xb1,x1,xa1,x2,xb2、x3,xa2,
…,xbi,x2i-1,xai,x2i,…。
the data string (xbi, x2i-1, xai, x2i) is encoded by the signal processing circuit 32 and the memory 33 shown in detail in fig. 41, and then applied to the DVD formatter 34.
The configuration of the signal processing circuit 32 will be described in detail with reference to fig. 41. First, a data string corresponding to the band-limited low-resolution curve β is obtained from the data string (xbi, x2i-1, xai, x2i) corresponding to the high-resolution curve α by the low-pass filter 36, for example, an FIR filter, passing through the 1/2 band:
xc1,*,*,*,xc2,*,*,*,xc3,*,*,*,…xci,*,*,*,…,
next, in the data string, the extraction circuit 37 extracts data "", thereby generating a data string:
xc1,xc2,xc3,…,xci,…
the data string xci is a data string in which the digital data a/D converted by the a/D converter 31 is subjected to channel limitation to reduce the sampling frequency to 1/4.
In the data string (xbi, x2i-1, xai, x2i), the data xi is extracted by the extraction circuit 38, thereby generating a data string:
xb1,xa1,xb2,xa2,…,xbi,xai,…。
then, from these data strings xci, xbi, xai, the difference is calculated by an adder 39 as a difference calculator:
xbi-xci=Δ1i
xai-xci=Δ2i。
here, the differential data Δ 1i and Δ 2i are, for example, 24 bits or less, and the number of bits may be fixed or variable.
The distribution circuit 40 packs the data string xci and the differential data Δ 1i and Δ 2i into user data (see fig. 13) (1 component is 2034 bytes), and outputs the user data to the DVD formatter 34.
The video signal V is converted into a digital signal by the a/D converter 31, then the digital video signal is V-encoded into an MPEG format by the V encoder 32, then packetized into user data shown in fig. 13, and applied to the DVD formatting section 34. The DVD formatter 34 performs packetization to have a format as shown in fig. 1 to 18. The data formatted by the DVD formatter 34 is modulated by a modulation scheme corresponding to the disc by a modulation circuit 35, and the disc is manufactured based on the modulated data.
A DVD player of the fifth embodiment will be described with reference to fig. 42 to 82. First, as shown in fig. 42(a), the data configuration of the fifth embodiment roughly has an AMG (audio manager), an SPS (still picture group) shown in detail in fig. 49, and a plurality of ATSs (audio title groups).
The ATS is composed of the following parts in order from the beginning:
ATSI (ATS information);
audio object group for audio private title (AOTT-AOBS) shown in detail in fig. 43 to 48;
ATSI for backup.
The ATSI is composed of the following parts from the beginning in sequence:
ATSI-MAT (ATSI manager Table) shown in detail in FIGS. 50 to 55;
ATS-PGCIT (ATS program chain information Table) shown in detail in FIGS. 56 to 68.
As shown in detail in fig. 43, the AOTT-AOBS is composed of a plurality of audio objects for audio private title (AOTT-AOB). The AOTT-AOB is respectively composed of a plurality of Programs (PG), each program being composed of a plurality of audio units (ATS-C).
The AOTT-AOB is constituted of two kinds of AOTT-AOBs containing audio data only and containing audio data and real-time information data (RTI data). More than one AOTT-AOB is configured in one disc and one song.
Each program of the first AOTT-AOB containing only audio data is composed of a plurality of audio units (ATS-C) composed of only a plurality of audio packets.
Each program of the second AOTT-AOB containing audio data and RTI data is constituted by a plurality of audio units (ATS-C) constituted by RTI packets arranged at the second packet position and audio packets arranged at the other packet positions.
The a packet of the linear PCM is composed of 2048 bytes or less, and its breakdown is composed of a packet header of 14 bytes and an a component as shown in fig. 44. The a component is composed of a component header of 17, 9, or 14 bytes, a private header shown in detail in fig. 45, and audio data (linear PCM) of 1 to 2011 bytes.
As shown in fig. 45, the private header is composed of the following parts:
8 bits of sub-stream ID
3 reserved area of bits
5-bit UPC/EAN-ISRC (Universal Product Code/European article Number-International Standard Recording Code) Number
8-bit UPC/EAN-ISRC data
8 bits of dedicated header Length
16 bits of the first Access Unit pointer
8 bits of Audio Data Information (ADI)
0 to 8 bits of padding bytes.
ADI is composed of the following moieties:
1-bit audio emphasis flag
1 bit reserved area
1-bit down-conversion mixing scheme
1-bit downmix code significance
4-bit downmix code
4 bits of quantization word length "1" of group "1
4 bits of quantization word length "2" of group "2
4-bit audio sampling frequency fs1 for group "1
4-bit audio sampling frequency fs2 for group "2
Reserved region of 4 bits
4-bit multichannel types
Bit shift data of 3-bit channel group "2" (see FIG. 47)
5-bit channel assignment information (see FIG. 53)
8 bits of dynamic Range control information
Reserved area of 8 × 2 bits.
As shown in fig. 46, data different in number from one UPC/EAN-ISRC is configured in an 8-bit UPC/EAN-ISRC data area. That is to say that the first and second electrodes,
(1) in the case of UPC/EAN-ISRC numbering equal to 1
First 2 bits b7, b 6: retention
Last 6 bits b 5-b 0: national code (ISRC #1)
(2) In case of UPC/EAN-ISRC number ═ 2
First 2 bits b7, b 6: retention
Last 6 bits b 5-b 0: national code (ISRC #2)
(3) In case of UPC/EAN-ISRC numbering ═ 3
First 2 bits b7, b 6: retention
Last 6 bits b 5-b 0: copyright owner code (ISRC #3)
(4) In case of UPC/EAN-ISRC number ═ 4
First 2 bits b7, b 6: retention
Last 6 bits b 5-b 0: copyright owner code (ISRC #4)
(5) In case of UPC/EAN-ISRC number ═ 5
First 2 bits b7, b 6: retention
Last 6 bits b 5-b 0: copyright owner code (ISRC #5)
(6) In case of UPC/EAN-ISRC number ═ 6
The first 4 bits b 7-b 4: retention
Last 4 bits b 3-b 0: recording year (ISRC #6)
(7) In case of UPC/EAN-ISRC number ═ 7
The first 4 bits b 7-b 4: retention
Last 4 bits b 3-b 0: recording year (ISRC #7)
In order to increase the S/N ratio and reduce bits, bits of data of each channel of the arrangement group "2" are reduced in a region of linear PCM data as actual data in the A packet. Fig. 47 a shows PCM data for 6 channels (group "1" Ch1 to Ch3, and group "2" Ch4 to Ch6), as an example, in which the level ranges from MAX 0dB to MIN 144dB (24 bits), and the values of the channels Ch are as follows:
Lmax2>Lmax1=Lmax3>Lmax4>Lmax5>Lmax6
ch1 to Ch3 of group "1" have the same word length, and Ch2 has the largest value in this example, so that Ch4 to Ch6 of group "2" are used for each word lengthMove in the horizontal direction (0-L)max2) dB, and 0-4 bytes on the LSB side are reduced. In the example shown in fig. 47, there are shown: each level of Ch4 to Ch6 is shifted upward by the maximum number of bits of 4, and is reduced to 20 bits.
The constitution of the RTI packet is explained in detail below with reference to fig. 48. The pack is composed of a pack header of 4 bytes and an RTI component composed of a component header of 17 or 14 bytes, a special header, and RTI data of 1 to 2015 bytes. The RTI data is text information and playback control information associated with the audio data.
The special header of the RTI component consists of the following parts:
1 bit sub-stream ID
2-bit UPC/EAN-ISRC numbering and data (in the figure, they are abbreviated as ISRC)
1-bit dedicated header Length
1-bit RTI information ID
0 to 7 bits of padding bytes.
The above-mentioned UPC/EAN-ISRC number and data are the UPC/EAN-ISRC number and data related to the copyright of the still picture accommodated in the SPCT packet.
As shown in fig. 49 a, an SPS (still picture group) includes SP address information (SPAI) and a plurality of SPUs (still picture units) #1 to # n, the SPUs # 1 to # n include a plurality of SP # 1 to # n, respectively, and the plurality of SP # 1 to # n include a plurality of SPCT (still picture) packets, respectively. As shown in detail in fig. 49(B), the SPCT packet is composed of a 14-byte packet header and SPCT components. The SPCT component consists of a component header of 22 or 19 or 9 bytes and SPCT data of 2015 bytes or less. One still picture is compressed in the MPEG1 or MPEG2 system, and is composed of I pictures and intra coded pictures, and is divided into one picture unit and arranged as SPCT data of an SPCT packet.
The UPC/EAN-ISRC numbers and data relating to the copyright of the still picture may be contained in the component header within the SPCT packet as described in the RTI packet.
As shown in detail in fig. 50, the ATSI-MAT shown in fig. 42(a) is composed of 2048 bytes (related byte positions RBP0 to 2047), and in order from the beginning:
an AST identifier (ATS-ID) of 12 bytes (RBP0 ~ 11);
4 bytes (RBP12 ~ 15) AST end address (ATS-EA);
a reserved area of 12 bytes (RBP16 ~ 27);
4 bytes (RBP28 ~ 31) of ASTI end address (ATSI-EA);
a version number (VERN) of 2 bytes (RBPs 32-33);
a reserved area of 94 bytes (RBP34 ~ 127);
4 bytes (RBP128 ~ 131) of the ASTI-MAT end address;
60 bytes (RBP132 ~ 191) reserved area;
4 bytes (RBP192 ~ 195) of AOTT start address of VTS;
4 bytes (RBP196 ~ 199) of AOTT start address of AOBS or AOTT start address of VOBS;
a reserved area of 4 bytes (RBPs 200-203);
4 bytes (RBP204 ~ 207) of ATS-PGCIT start address;
48 bytes (RBP208 ~ 255) reserved area;
attributes of AOB for AOTT (AOTT-AOB-ATR) or audio stream for VOB for AOTT (AOTT-VOB-AST-ATR) of 128(16 × 8) bytes (RBPs 256 to 383);
288(18 × 8) bytes (RBP384 ~ 671) of coefficients for down-mixing multi-channel audio data into 2 channels (ATS-DM-COEFT # 0 ~ 15)
A reserved area of 32 bytes (RBP 672-703);
attribute of still picture data (AOTT-SPCT-ATR) in AOBS for AOTT of 2 bytes (RBPs 704 to 705);
1342 bytes (RBPs 706 to 2047) of reserved area.
When the ATS has AOBS for AOTT, the AOTT-AOB-ATR shown in detail in FIG. 51 is described in an area of 128 (16X 8) bytes (RBPs 256 to 383). The AOTT-AOB-ATR (b127 to b0) is composed of the following parts in order from the MSB side:
8-bit (b127 to b120) Audio coding System
8-bit (b119 to b112) reserved region
4-bit (b111 to b108) quantization bit number Q1 of channel group 1
4-bit (b107 to b104) quantization bit number Q2 of channel group 2
Sampling frequency fs1 for channel group 1 of 4 bits (b103 to b100)
Sampling frequency fs2 of channel group 2 with 4 bits (b99 to b96)
Type of 3-bit (b 95-b 93) multichannel Structure
5-bit (b 92-b 88) channel assignment
8-bit × 11(b87 to b0) reserved region.
On the other hand, when the ATS does not have an AOBS for AOTT, the AOTT-VOB-AST-ATR shown in detail in FIG. 52 is described. The AOTT-VOB-AST-ATR (b127 to b0) is composed of the following parts in order from the MSB side:
8-bit (b127 to b120) Audio coding System
8-bit (b119 to b112) reserved region
4 bits (b111 to b108) of quantization bit number Q
Reserved region of 4 bits (b107 to b104)
Sampling frequency fs of 4 bits (b103 to b100)
Reserved region of 4 bits (b 99-b 96)
Type of 3-bit (b 95-b 93) multichannel Structure
5-bit (b 92-b 88) channel assignment
Number of decoded audio streams of 3 bits (b 87-b 85)
5-bit (b 84-b 80) reserved region
2-bit (b79, b78) MPEG Audio DRC
Reserved region of 2 bits (b77, b76)
4 compressed channel number of bits (b 75-b 72)
8-bit × 9(b71 to b0) reserved region.
The above data are shown in detail below. The number of quantization bits, sampling frequency, and multi-channel type are the same as those in fig. 34, and therefore, description thereof will be omitted.
(1) Audio coding mode (b 63-b 60)
00000000 b: linear PCM mode
00000001 b: reserved for compressed audio (Dolby number)
00000010 b: reserved for compressed audio (without MPEG2 extension)
00000011 b: reserved for compressed audio (with MPEG2 extension)
00000100 b: reserved for compressed audio (DTS)
00000101 b: reserved for compressed audio (SDDS)
And others: reserved for other coding modes
(8) Sound channel distribution (b 92-b 88)
Fig. 53 shows channel assignment information for groups "1" and "2" from 1 channel (monaural) to 6 channels. Reference numerals shown in the drawings are explained below.
C (mono): single sound channel
L, R: two-channel stereo
Lf: left front of multiple sound channels
Rf: multichannel right front
C: center of multi-channel
LFE: multi-channel Low Frequency Effect
S: multi-channel surround sound
Ls: left surround sound of multiple sound channels
Rs: decoding the number of audio streams (b 87-b 85) '0' or '1' for the right surround sound (9) of multiple channels
(10) DRC for MPEG audio (b79, b78)
00 b: absence of DRC data in MPEG audio streams
01 b: DRC data in MPEG audio stream
(11) Number of compressed channels (b 75-b 72)
The audio coding scheme is "1111 b" in the case of linear PCM audio
0000 b: 1ch (Single track)
0001 b: 2ch (Dual-track)
0010b:3ch
0011b:4ch
0100b:5ch
0101b:6ch
0110b:7ch
0111b:8ch
And others: retention
In order to down-mix multi-channel audio data into two channels, as shown in fig. 54, 18 bytes are used to describe down-mix coefficients (ATS-DM-COEFT # 0 to #15) having table numbers "0" to "15" in a region of 288(18 × 16) bytes (RBPs 384 to 671) shown in fig. 50
In order to describe the attributes of still picture data (AOTT-SPCT-ATR) in AOBS for AOTT, a 2-byte (RBP704, 705) region shown in fig. 50 is composed of the following parts in order from the MSB side as shown in detail in fig. 55:
2-bit (b15, b14) video compression method
TV system of 2 bits (b13, b12)
Aspect ratio of 2 bits (b11, b10)
Display mode of 2 bits (b9, b8)
Reserved region of 2 bits (b7, b6)
Resolution of 3-bit (b 5-b 3) Source pictures
3-bit (b 2-b 0) reserved regions.
The AST-SPCT-ATR will be described in detail below.
(1) Video compression mode (b15, b14)
00 b: corresponding to MPEG1
01 b: corresponding to MPEG2
And others: retention
(2) TV system (b13, b12)
00b:525/60
01b:625/60
And others: retention
(3) Aspect ratio (b11, b10)
00b:4∶3
01b:16∶9
And others: retention
(4) Display mode (b9, b8)
00 b: retention
01 b: retention
10 b: permission-only mailbox
11 b: is not described
(5) Resolution of source picture (b 5-b 3)
000 b: 720X 480(525/60 standard)
720X 576(625/60 standard)
And others: retention
The ATS-PGCIT (ATS program chain information table) shown in fig. 42(a) is composed of the following parts in order from the head as shown in detail in fig. 56:
the audio title set PGCI table information (ATS-PGCITI) shown in detail in FIG. 57
N audio title sets PGCI search pointers (ATS-PGCI-ARP # 1 to # n) shown in detail in FIGS. 58 and 59
A plurality of audio title sets PGCI shown in detail in fig. 60.
The ATS-PGCITI is constituted of 8 bytes as shown in detail in fig. 57, and is constituted of the following parts in order from the beginning:
2 byte reserved area
4 bytes of the end address of ATS-PGCIT.
ATS-PGCI-SRP # 1 to # n are each constituted of 8 bytes as shown in detail in FIG. 58, and are constituted of the following parts in order from the beginning:
カテゴリ one (ATS-PGC-CAT) for 4-byte ATS-PGC as shown in detail in FIG. 59
4 bytes of the ending address of the ATS-PGCI.
カテゴリ of the 4-bit (b 31-b 0) ATS-PGC is constituted of the following parts in order from the head as shown in detail in FIG. 59:
1-byte entry type of (b31)
7 bytes (b 30-b 24) ATS Audio title number (ATS-TTN)
2-byte (b23, b22) block mode
Block type of 2 bytes (b21, b20)
4-byte number of channels (b 19-b 16)
8-byte (b 15-b 8) Audio coding scheme
Reserved areas of 8 bytes (b7 to b 0).
The カテゴリ A (ATS-PGC-CAT) content is described in detail below.
(1) Type of inlet (b31)
0 b: without inlet PGC
1 b: inlet PGC
(2) ATS Audio title number (b 30-b 24)
Describing the number of ATS audio titles in the range of "" 1 "" to "" 99 ""
(3) Block type (b23, b22)
00 b: ATS-PGC without ATS-PGC block
01 b: initial ATS-PGC of ATS-PGC Block
10 b: retention
11 b: last ATS-PGC of ATS-PGC block
(4) Block type (b21, b20)
00 b: without a part of the block
01 b: difference block of recording-only mode
10 b: channel-only differential block
11 b: block of difference between recording mode and sound channel
(5) Number of channels (b 19-b 16)
0000 b: below two sound tracks
0001 b: over dual track
The audio title set PGCI (ATS-PGCI) shown in fig. 56 is constituted by the following parts in order from the beginning as shown in detail in fig. 60, respectively:
ATS-PGC general information (ATS-PGC-GI) shown in detail in FIGS. 61 and 62
ATS program information Table (ATS-PGIT) shown in detail in FIGS. 55 to 57
The ATS unit broadcast information table (ATS-C-PBIT) shown in detail in fig. 58 to 60.
The ATS-PGC-GI is composed of 16 bytes (RBPs 0-15) as shown in detail in FIG. 61, and is composed of the following parts in order from the beginning:
FIG. 62 shows ATS-PGC directory (ATS-PGC-CNT) of 4 bytes (RBP0 ~ 3) in detail
ATS-PGC Play time of 4 bytes (RBP4 ~ 7) (ATS-PGC-PB-TM)
Reserved area of 2 bytes (RBP8, 9)
2 bytes (RBP10, 11) ATS-PGIT Start Address
2 bytes (RBP12, 13) ATS-C-PBIT Start Address
Reserved area of 2 bytes (RBP14, 15).
The 4-byte (RBP0 ~ 3) ATS-PGC directory is composed of the following parts in order from the beginning as shown in detail in FIG. 62:
reserved region of 17 bits (b 31-b 15)
7 bits (b 14-b 8) of program number
The number of elements of 8 bits (b7 to b 0).
The process number is in the range of "1" to "99", and the cell number is in the range of "1" to "255".
The ATS program information table (ATS-PGIT) shown in fig. 60 is configured by n ATS program information (ATS-PGI) #1 through # n as shown in detail in fig. 63. ATS-PGI # 1 to # n are each constituted of 20 bytes (RBPs 0 to 19) as shown in detail in FIG. 64, and are constituted of the following parts in order from the beginning:
byte (RBP0 ~ 3) ATS-PGC directory (ATS-PGC-CNT)
1 byte (RBP4) of entry cell number of ATS-PG
Reserved area of 1 byte (RBP5)
4 bytes (RBP6 ~ 9) of start presentation time of first Audio Unit of ATS-PG (FAC-S-PTM)
4-byte (RBP10 ~ 13) ATS-PG playback time
4-byte (RBP14 ~ 17) ATS-PG pause time
1 byte (RBP18) reserved area (for copyright management data CMI)
Reserved area of 1 byte (RBP 19).
The 32-bit (b31 ~ 0) ATS-PG directory is composed of the following parts in order from the beginning as shown in detail in FIG. 65:
relationship (R/A) between the last PG and the PG of 1 bit (b31)
STC discontinuity indicator (STC-F) of 1 bit (b30)
Attribute number (ATRN) of 3 bits (b 29-b 27)
Bit-shift data of 3-bit (b 26-b 24) channel group (ChGr)' 2
Reserved region of 2 bits (b23, b22)
1 bit (b21) down-mixing mode (D-M)
Effectiveness of 1-bit (b20) downmix coefficient (shown in the attached color.)
4 bits (b 19-b 16) Down-conversion coefficient Table number (DM-COEFTN)
The RTI flags F15 to F0 each having 16 bits (b15 to b0) in total of 1 bit.
The ATS unit broadcast information table (ATS-C-PBIT) shown in fig. 60 is configured of n ATS unit broadcast information (ATS-C-PBI) #1 through # n as shown in detail in fig. 66. ATS-C-PBI # 1 to # n are each constituted of 12 bytes (RBPs 0 to 11) as shown in detail in FIG. 67, and are constituted of the following parts in order from the beginning:
index number of ATS-C of 1 byte (RBP0)
ATS-C type (ATS-C-TY) of 1 byte (RBP1) shown in detail in FIG. 65
Reserved area of 2 bytes (RBP2, 3)
4 bytes (RBP4 ~ 7) ATS-C start address
4 bytes (RBP8 ~ 11) of ATS-C end address.
The 1-byte (b 7-b 0) ATS-C type is constituted by the following parts in order from the beginning as shown in detail in fig. 68:
2 bits (b7, b6) of ATS unit element (ATS-C-COMP)
Reserved region of 2 bits (b5, b4)
4 bits (b 3-b 0) of ATS unit use (ATS-C-Usage).
The contents of the above data are shown in detail below.
(1) ATS Unit elements (b7, b6)
00 b: audio unit composed of audio data only
01 b: audio unit consisting of audio data and real-time information
10 b: mute unit composed of audio data for muting only
11 b: picture unit composed of still pictures only
(2) ATS Unit use (b 3-b 0)
0000 b: is not described
0001 b: light focusing part
And others: retention
The coding apparatus of the fifth embodiment is explained below. Fig. 69 and 70 show the configuration and processing of the encoding device, respectively. The analog audio signal a is sampled by the a/D converter 31 at a sufficiently high sampling frequency (sampling period Δ t), for example, 192kHz, and converted into a high-resolution PCM signal of, for example, 24 bits. Next, when the bit shift/signal processing circuit 32 does not perform compression, the PCM data converted by the a/D converter 31 is applied to the DVD formatter 34 as it is. On the other hand, when the compression is performed, the PCM data converted by the a/D converter 31 is compressed by the bit shift/signal processing circuit 32 according to the encoding method thereof, and then applied to the DVD formatter 34 (steps S5 and S6). The bit shift/signal processing circuit 32 bit shifts each channel of the group "2".
The video signal V is converted into a digital signal by the a/D converter 31, and the digital video signal is encoded into an MPEG format by the V encoder 32V and applied to the DVD formatting section 34 (steps S1, S2). The still picture signal SP is converted into a digital signal by the a/D converter 31SP, and the digital still picture signal SP is encoded into an MPEG format by the compression encoder 32SP and applied to the DVD formatter 34 (steps S3 and S4). The copyright information and the real-time text information (RTI) pass through the interface (I/F)40 (steps S7, S8), and the text information and the disc identifier EX are applied to the DVD formatting section 34 (steps S9, S10).
Next, the DVD formatter 34 performs packetization to obtain the above-described format (step S11). The data formatted by the DVD formatter 34 is modulated by the modulation circuit 35 in a modulation scheme corresponding to the disc, and the disc is manufactured based on the modulated data or recorded in the recording unit 38 and transmitted through the communication I/F39 (step S12).
Fig. 71 shows a specific configuration of a decoding device according to the fifth embodiment, and fig. 72 functionally shows the configuration of fig. 71. Fig. 73 shows the processing. In fig. 71 and 72, first, as in the case shown in fig. 19, when the operation section 18 and the remote controller 19 perform the track selection, the reproduction, the fast forward, and the stop operation, the control section 23 controls the drive device 2 and the reproduction device 17 in accordance with the operation, and at the time of reproduction, the drive device 2 reads the bit data recorded in the DVD player 1 and performs the EFM demodulation.
In the playback apparatus 17, the signal is sent to the still picture and V packet detection section 3 and the a and RTI packet detection section 9. When a still picture packet and a V packet are recorded on the disk 1, the still picture and V packet detecting section 3 detects the still picture packet and the V packet in the reproduced data, sets control parameters in the parameter section 8, and writes the still picture packet and the V packet in the still picture and V packet buffer 4 in this order. The still picture pack written in the still picture and V-pack buffer 4 and the user data (video signal, still picture information) in the V-pack are sequentially extracted in the pack order and output time order by the buffer extracting unit 5 based on the SCR (see fig. 13) in the still picture pack and V-pack, and then outputted as analog video signals by the extension/map converting unit 6, the D/a converting unit 7, and the video output terminals 15 and 15'.
The A and RTI packet detecting section 9 detects an A packet and an RTI packet in the playback data, sets control parameters in the parameter section 14, and sequentially writes the A packet and the RTI packet in the A and RTI packet buffer 10. User data (audio signal, real-time information) in an A packet and an RTI packet written in an A and RTI packet buffer 10 are sequentially extracted in the packet order and output time order by a buffer extraction unit 11. Next, the audio signal is output as an analog audio signal through the PCM conversion/bit shift/signal processing unit 12, the D/a conversion unit 13, and the audio output terminal 16. The real-time information is sent to the display signal generating unit 20 to generate a display signal, and the display signal is output through the display signal output terminal 22 or to the built-in character display unit 21.
The processing of the decoding apparatus is described with reference to fig. 73. First, recorded data is read by accessing the disc 1 (step S20), and then, in each of the separation steps S21 to S29, the recorded data is separated into a video signal, a still picture signal, an audio signal, copyright information, real-time information (RTI), character information, and a disc identifier EX. Next, in each of the decoding steps S22 to S30, the separated data is decoded and then synchronized playback is performed (steps S31, S32).
Among them, there are the following 3 steps in the process of playing back the still picture SP:
1) when the still picture SP is obtained, the audio signal a is interrupted to mute.
2) When the still picture SP is obtained, playback is performed together with the audio signal a according to the time control signal.
3) When the still picture SP is obtained, page-turning reproduction is performed in accordance with a page-turning instruction instructed by the user. At this time, the audio signal a is reproduced as it is.
When there is a need to keep the still picture synchronized with the sound, a time control signal for real-time synchronization is placed in the time control data information (SPCIT-TCDI) additionally set in the still picture control information table APCIT in the ATSI of fig. 42 (B).
And further, still picture page control instruction information (SPPI) containing a page-turning instruction is placed under SPCIT. As described above, SPCIT is composed of SPCIT general information (SPCIT-GI), temporal control data information (SPCIT-TCDI), and still picture page control command information (SPPI), which are general information.
The still picture data of the SPCT packet of fig. 49 may include size information for controlling a page of the still picture. The page control information defined by the size information can be interpreted while referring to the SPPI.
In the case where there is no spare capacity when accommodated in the still picture data, it is permissible to include the above-described size information for controlling the page of the still picture in the RTI data of the RTI packet.
Fig. 74 shows a device for reproducing a signal recorded on a disc 110 such as a DVD player or a DVD-video disc in which time information and a page turn command are recorded in order to synchronize a still picture with sound as shown in fig. 42 (B). The disk drive device 111 is controlled by a drive control circuit 112, and the disk 110 is driven by the disk drive device 111 to read a recording signal. This signal is subjected to EFM demodulation by the demodulation circuit/error correction circuit 113, and then, after error correction, the transport stream signal excluding the control data and the DSI data is written in the drive buffer 114 by the write control circuit 115, and the control data and the DSI data are written in the system buffer 117 and the DSI buffer 122, respectively. The data written in the DSI buffer 122 is decoded by the DSI decoder 151 and output.
The system controller 132 performs playback control in accordance with the control data written to the system buffer 117. To control playback, an operation unit 130, a display unit 131, a readable/writable system parameter memory 133, a playback-dedicated system parameter memory 134, a readable/writable general parameter memory 135, and a system timer 136 are connected to the system controller 132.
The transport stream signal written in the drive buffer 114 is read out by the read-out control circuit 116, and then separated into still picture packets, RTI packets, VBV packets, sub picture packets, VBI packets, and audio packets by the demultiplexer 128, and the respective packets are stored in the still picture buffer 147, RTI packet buffer 148, VBV packet 118, sub picture buffer 119, VBI buffer 120, and audio packet buffer 121. Next, the still picture pack and the RTI pack are decoded and output by the still picture decoder 149 and the RTI decoder 150, respectively, and the RTI data decoded by the RTI decoder 150 is stored in the buffer 150'.
The VBV packets are decoded by a video decoder 123 and then sent to an adder 127 via a mailbox transformer 126. The sub-picture packet and the VBI packet are decoded by a sub-picture decoder 124 and a VBI decoder 125, respectively, and sent to an adder 127, and the video signals are combined in the adder 127. The audio packets are supplied to an audio decoder 129, and converted into analog signals by a format decomposer 141, a channel separator 142, and D/a converters 144, 145 therein.
Next, the reproduction processing of the still picture will be described with reference to fig. 75 and 76. There are 2 (types "1" and "2") methods for the playback process. In fig. 75, first, when playback is started, still picture data is accessed and stored in the still picture buffer 147 (step S61). In this case, the audio data is not reproduced for 1 to 3 seconds, and the state is silent. Next, it is determined whether or not the type is "1" (step S62), and if the type is "1", the audio data is accessed and reproduced in synchronization with the still picture data based on several pieces of time control information (SPCIT-TCDI) shown in fig. 42(B) (step S63). On the other hand, when the type "1" is indicated, the audio data is accessed and reproduced (step S64).
In the processing of type "2", when the insertion is generated by the user' S command as shown in fig. 76, the command is interpreted (step S65), and the replay processing such as "go to next", "return to previous", and "cut", "enlarge" of the still picture is performed based on the command and the still picture page control command information (SPPI) (step S66). In the processing of the still picture in the type "2", synchronization with the audio signal is not maintained, and playback of the audio signal is not affected.
Next, a modification of the audio decoder 129 will be described in detail with reference to fig. 77 and 78. In this modification, a sample rate converter 143 is added, and the audio packet is first decomposed by the format decomposer 141, and then separated into PCM data of each channel by the channel separator 142. Next, the system controller 132 judges the sampling frequency fs of the audio data decomposed by the format decomposer 141, and if the sampling frequency fs is 48kHz of the first system, the audio data is supplied from the channel separator 142 to the D/a converters 144 and 145 through the switches 146 and 147 as it is, while if the sampling frequency fs is 44.1kHz of the second system, the sampling rate converter 143 increases the sampling frequency to 48kHz of the first system as shown in fig. 78(a), and the audio data is supplied to the D/a converters 144 and 145 through the switches 146 and 147.
In the case of 96kHz of the first system, the audio channel separator 142 is fed to the D/a converters 44 and 45 through the switches 146 and 147 as it is, while in the case of 88.2kHz of the second system, the sampling rate converter 143 performs up-sampling to 96kHz of the first system as shown in fig. 78(a) and feeds the data to the D/a converters 44 and 45 through the switches 146 and 147. Therefore, the D/a converters 144 and 145 can be constituted by only the first system. The sampling frequency fs of the D/a converters 144, 145 is controlled by the system controller 132.
Instead of converting the sampling frequency fs from the second system to the first system, 48kHz and 96kHz of the first system may be converted to 44.1kHz and 88.2kHz of the second system as shown in fig. 78 (B). Furthermore, even when the sampling frequency fs is the first system, as shown in fig. 78(C), in the case where one channel is 48kHz, the sampling is increased to 96kHz, and all channels are matched to 96kHz to perform D/a conversion, thereby improving the reproduction sound quality.
Next, a playback apparatus that displays copyright information on a superimposed screen will be described with reference to fig. 79 and 80. In fig. 79, the still picture packet is sent to the still picture decoder 149 via the still picture buffer 147 to decode the still picture data, which is applied to the adder 201. The RTI packets are sent to the RTI decoder 150 for decoding through the RTI packet buffer 148, and the decoded data is stored in the buffer 150'. In this case, when the RTI packet contains copyright data (UPC/EAN-ISRC data shown in fig. 45 and 46), the text character information is converted into character image data by the image conversion unit 200, and is applied to the adder 201, and is synthesized with the still picture and output to the outside, or is output to the outside through the switch 203.
The audio packet is decomposed by the format decomposer 141 through the audio packet buffer 121, and each data is stored in the buffer 141'. In this case, when the audio packet contains copyright data (UPC/EAN-ISRC data shown in fig. 45 and 46), the copyright data is converted into text character information by the transcoder 202 and is output to the outside through the switch 203.
The operation thereof will be described with reference to fig. 80. First, when a display instruction of the copyright information of the still picture is issued from the operation unit 130, the adder 201 adds the still picture decoded from the still picture pack and the copyright information decoded from the RTI pack and converted into image data, and outputs the result (step S71 → S72). On the other hand, when there is no indication of displaying the copyright information of the still picture, the addition operation is not performed on the copyright information decoded from the RTI packet (step S71 → S73), and it is determined whether or not the indication of displaying the copyright information of the still picture is transmitted from the operation unit 130 (step S74). Next, when a display instruction is given, the transcoder 202 converts the copyright information decoded from the audio packet into text character information, and the switch 203 outputs an RTI (step S74 → S75), whereas when no display instruction is given, the RTI is not output. This processing is performed in units of cells or tracks.
In addition to the method of incorporating and using the audio decoder having the configuration shown in fig. 77 and the copyright information reproducing apparatus (and audio decoder) having the configuration shown in fig. 79 as a part of the disk reproducing apparatus shown in fig. 74, the audio decoder may be configured as a product unit by using an IC chip and used as a component of a DVD video reproducing apparatus and a personal computer.
An embodiment in which the digital audio signal formatted as described above is transmitted via a communication line will be described below. First, a transmitting-side packetizing apparatus will be described with reference to fig. 81 to 85. The packetizing apparatus has a packetizing processing section 30, a buffer memory 30B, a control circuit 29, an operation section 27, and a display 28 as shown in fig. 81. In fig. 82 to 85, when the video signal V, the still picture signal SP, the audio signal a, the real-time information RTI, and the disc identifier EX are input, an audio packet is generated as shown in detail in fig. 83 in step S100 (step S101), a video packet is generated (step S102), a still picture packet is generated (step S103), and a real-time text is generated (step S104).
Then manages the audio unit (ATS-C) (step S200), then manages the PTT (title part) (step S300), then manages the title (AOTT-AOB) (step S400), and then manages the title group (AOTT-AOBs) (step S500). Next, in step S600, in order to generate the ATS, a header group is generated as shown in detail in fig. 84 (step S601), and then a menu is generated (step S602). カテゴリ of ATS-PGCI is described (step S603), PGIT consisting of a PG directory containing bit shifts is generated, and ATS-PGCIT is generated by generating PGIT (step S604). Next, ATSI is generated by generating MAT of attributes and coefficients (step S605). Next, AMG is generated (step S700), and finally TOC is generated (step S800).
Next, when the digital audio signal formatted as described above is transmitted via a communication line, as shown in fig. 85, the transmission data stored in the transmission buffer is divided into predetermined lengths and modularized (step S41), and then a header including the transmission destination address is given to the head of the module (step S42), and then the head is output to the network (step S43).
The data reception side is explained below with reference to fig. 86 to 90. As shown in fig. 86, the unpacking device on the data receiving side includes an unpacking processing unit 60, a buffer memory 60B, a parameter memory 56, a control circuit 59, an operation unit 57, and a display 58. First, as shown in fig. 87, a header is removed from a component received by the network (step S51), and then the received data is restored (step S52), and then transferred to the memory (step S53).
Next, as shown in fig. 88 to fig. 90, first, AMG is decoded to detect ATS (step S1100), next, in step S1200, to decode ATSI of the target ATS, ATS-PGCI カテゴリ is decoded as shown in fig. 89 in detail (step S1201), next, PGIT composed of PG directory containing bit shift is decoded (step S1202), next, MAT attribute and coefficient are decoded (step S1203), and then, these decoded parameters are set to the parameter memory 56 (step S1204).
When playback is started, the packet is identified (step S1300), and then in step S1400, in order to decode the packet, the audio packet is decoded as shown in detail in fig. 90 (step S1401), the video packet is decoded (step S1402), the still picture packet is decoded (step S1403), and the real-time text is decoded (step S1401). Next, the audio signal, video signal, still picture signal, and real-time text signal decoded from these packets are output (step S1500), and during playback, steps S1300 to S1500 are repeated.
The digital encoding apparatus and the decoding apparatus can store the above-described encoding method and decoding method as a computer program on an IC chip such as a ROM, and realize the functions thereof by operating a CPU (central processing unit) of a computer by the program. The present invention is not limited to the case of transmission via a recording medium such as a DVD, and may be applied to the case of transmission via a communication line such as the internet or a karaoke communication line, and processing on the playback side is performed by hardware or an application on a PC.
As described above, according to the present invention, by providing an audio title set having audio data and still picture data without a playback control packet and a management area containing information for managing the DVD-video title set instead of the DVD-video title set and the management area containing information for managing the DVD-video title set, it is possible to easily perform playback by a user when recording is performed mainly on an audio signal, and it is possible to simplify the use and the management of the actual time.
According to the present invention, in the case of reproducing a data structure having a first packet having audio data as real data, a second packet having real-time information data related to the audio data as the real data, and a third packet having still picture data related to the audio data as the real data, since the data structure is reproduced by a buffer and a decoder of 3 systems, it is not necessary to perform decoding processing on a still picture at high speed, and thus, the data structure can be configured at low cost. Further, the still picture may be reproduced in synchronization with the audio signal, or may be displayed by turning the page regardless of the reproduction of the audio signal.
According to the present invention, when audio data and multi-channel audio data are a/D converted at a sampling frequency of 2 systems, D/a conversion is performed in accordance with one system, and thus, it is not necessary to provide an a/D converter of 2 systems, and thus, the configuration can be made at low cost.
According to the present invention, when copyright data relating to audio data and still picture data is recorded on a disc, the still picture data and the copyright data are synthesized and telop display is performed, thereby effectively dealing with the duplication problem.
According to the present invention, by arranging character information related to audio data and display time control data for controlling the display time thereof in a management packet including information for managing an audio packet and displaying the character information based on the display time control data, it is possible to efficiently record and display character information indicating the content of audio data such as a music source when the audio data is mainly recorded.
According to the present invention, character information related to audio data and display time control data for controlling the display time thereof are arranged in a character display packet different from the audio packet and the management packet, and the character information is displayed based on the display time control data, whereby character information indicating the contents thereof can be efficiently recorded and displayed when audio data such as a music source is mainly recorded.
According to the present invention, character information related to audio data is arranged in a character display packet, information for managing the audio packet and display time control data for controlling display time of the character information are arranged in another management packet different from the audio packet and the character display packet, and the character information is displayed based on the display time control data, whereby it is possible to efficiently record and display character information indicating the content of audio data such as a music source when the audio data is mainly recorded.
According to the present invention, by arranging character information related to audio data and display time control data for controlling the display time thereof in a character display packet, arranging display control data other than the display time control data in a management packet, and displaying the character information based on the display control data other than the display time control data, it is possible to efficiently record and display character information indicating the content thereof when recording is performed mainly based on audio data such as a music source.
Claims (2)
1. A method of encoding a sound signal, comprising the steps of:
quantizing a first digital sound signal of a first channel group and a second digital sound signal of a second channel group, which are respectively allocated to a front channel group except a center channel and a rear channel group including the center channel, and have different sampling frequencies according to different channels of the first channel group and the second channel group, respectively, to generate a first digital sound signal of the first channel group and a second digital sound signal of the second channel group, respectively;
bit-shifting the second digital audio signal by the same bit-shift amount for the channels of the second channel group to generate a third digital audio signal; and
a data structure having an audio header group including a plurality of audio objects composed of audio packets in which the first digital sound signal and the third digital sound signal are arranged, the data structure being formatted such that: sampling frequencies of respective channels of the first channel group and the second channel group; the same bit shift amount for each channel of the second channel group; and channel assignment information for designating each channel of the assigned first channel group and second channel group, respectively, and the audio title group information of the audio title group is configured with: sampling frequencies of respective channels of the first channel group and the second channel group; the same bit shift amount for each channel of the second channel group; and channel allocation information for designating the channels of the first channel group and the second channel group allocated as described above, respectively.
2. A method for decoding an audio signal, which decodes data encoded by the audio signal encoding method according to claim 1, comprising the steps of:
extracting the first digital sound signal and the third digital sound signal configured in the audio packet;
extracting the sampling frequency, the same bit shift amount, and the channel allocation information of each channel, which are arranged in a predetermined area of the audio packet or/and audio header group information of the audio header group; and
decoding the first digital audio signal and the third digital audio signal based on the extracted sampling frequency of each channel, the same bit shift amount, and the channel allocation information.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP337700/97 | 1997-11-21 | ||
JP33770097A JP3248679B2 (en) | 1997-11-21 | 1997-11-21 | Disc playback device |
JP343916/97 | 1997-11-28 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB981248489A Division CN1261927C (en) | 1997-11-21 | 1998-11-19 | Audio signal decoding device, disc and disc replaying apparatus |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1870163A true CN1870163A (en) | 2006-11-29 |
CN100578640C CN100578640C (en) | 2010-01-06 |
Family
ID=18311144
Family Applications (8)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200610080335A Expired - Lifetime CN100580791C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
CNB2006100803335A Expired - Lifetime CN100481241C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
CNB200610080334XA Expired - Lifetime CN100481237C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoding and reproducing apparatus |
CN200610080339A Expired - Lifetime CN100580794C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
CN200610080336A Expired - Lifetime CN100578640C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
CNB2006100803320A Expired - Lifetime CN100481240C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
CN200610080337A Expired - Lifetime CN100580792C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
CN200610080338A Expired - Lifetime CN100580793C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
Family Applications Before (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200610080335A Expired - Lifetime CN100580791C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
CNB2006100803335A Expired - Lifetime CN100481241C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
CNB200610080334XA Expired - Lifetime CN100481237C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoding and reproducing apparatus |
CN200610080339A Expired - Lifetime CN100580794C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
Family Applications After (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2006100803320A Expired - Lifetime CN100481240C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
CN200610080337A Expired - Lifetime CN100580792C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
CN200610080338A Expired - Lifetime CN100580793C (en) | 1997-11-21 | 1998-11-19 | Audio signal encoder, disk and disk reproducing apparatus |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP3248679B2 (en) |
CN (8) | CN100580791C (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100457512B1 (en) | 2001-11-29 | 2004-11-17 | 삼성전자주식회사 | Optical recording medium, apparatus and method for playing the optical recoding medium |
KR100457511B1 (en) | 2001-11-29 | 2004-11-17 | 삼성전자주식회사 | Optical recording medium, apparatus and method for playing the optical recoding medium |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
NL8900934A (en) * | 1989-04-14 | 1990-11-01 | Philips Nv | DEVICE FOR RECORDING AN AUDIO SIGNAL. |
JP3404837B2 (en) * | 1993-12-07 | 2003-05-12 | ソニー株式会社 | Multi-layer coding device |
KR100194928B1 (en) * | 1995-09-29 | 1999-06-15 | 윤종용 | Audio signal decoding apparatus and method of disk drive system |
-
1997
- 1997-11-21 JP JP33770097A patent/JP3248679B2/en not_active Expired - Lifetime
-
1998
- 1998-11-19 CN CN200610080335A patent/CN100580791C/en not_active Expired - Lifetime
- 1998-11-19 CN CNB2006100803335A patent/CN100481241C/en not_active Expired - Lifetime
- 1998-11-19 CN CNB200610080334XA patent/CN100481237C/en not_active Expired - Lifetime
- 1998-11-19 CN CN200610080339A patent/CN100580794C/en not_active Expired - Lifetime
- 1998-11-19 CN CN200610080336A patent/CN100578640C/en not_active Expired - Lifetime
- 1998-11-19 CN CNB2006100803320A patent/CN100481240C/en not_active Expired - Lifetime
- 1998-11-19 CN CN200610080337A patent/CN100580792C/en not_active Expired - Lifetime
- 1998-11-19 CN CN200610080338A patent/CN100580793C/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
CN1870162A (en) | 2006-11-29 |
CN100481241C (en) | 2009-04-22 |
CN100481240C (en) | 2009-04-22 |
CN100578640C (en) | 2010-01-06 |
CN1870165A (en) | 2006-11-29 |
CN100580794C (en) | 2010-01-13 |
JPH11162120A (en) | 1999-06-18 |
CN100580791C (en) | 2010-01-13 |
CN1870168A (en) | 2006-11-29 |
CN100580793C (en) | 2010-01-13 |
CN100580792C (en) | 2010-01-13 |
CN1870161A (en) | 2006-11-29 |
CN1870166A (en) | 2006-11-29 |
JP3248679B2 (en) | 2002-01-21 |
CN100481237C (en) | 2009-04-22 |
CN1870164A (en) | 2006-11-29 |
CN1870167A (en) | 2006-11-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1127059C (en) | Coding, decoding device for gramophone disc and audio frequency signal, and transmission method and recording medium | |
CN1261927C (en) | Audio signal decoding device, disc and disc replaying apparatus | |
CN1291410C (en) | Encoding apparatus of audio signal, audio disc and disc reproducing apparatus | |
CN1604219A (en) | Disk for audio data and reproduction apparatus | |
CN1870163A (en) | Audio signal encoder, disk and disk reproducing apparatus |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: JVC KENWOOD CORPORATION Free format text: FORMER OWNER: VICTORY CO. LTD. Effective date: 20140228 |
|
TR01 | Transfer of patent right |
Effective date of registration: 20140228 Address after: Kanagawa Patentee after: JVC Kenwood Corp. Address before: Yokohama City, Kanagawa Prefecture, Japan Patentee before: Victory Co., Ltd. |
|
TR01 | Transfer of patent right | ||
CX01 | Expiry of patent term |
Granted publication date: 20100106 |
|
CX01 | Expiry of patent term |