WO2016132977A1 - Transmission device, transmission method, reception device, and reception method - Google Patents
- Publication number
- WO2016132977A1 (PCT/JP2016/053869)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- subtitle
- stream
- time stamp
- packet
- video
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/236—Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
- H04N21/23605—Creation or processing of packetized elementary streams [PES]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L69/00—Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
- H04L69/22—Parsing or analysis of headers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/236—Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
- H04N21/23614—Multiplexing of additional data and video streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/4302—Content synchronisation processes, e.g. decoder synchronisation
- H04N21/4307—Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
- H04N21/43072—Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/434—Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
- H04N21/4343—Extraction or processing of packetized elementary streams [PES]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/434—Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
- H04N21/4348—Demultiplexing of additional data and video streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
- H04N21/47217—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for controlling playback functions for recorded or on-demand content, e.g. using progress bars, mode or play-point indicators or bookmarks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/488—Data services, e.g. news ticker
- H04N21/4884—Data services, e.g. news ticker for displaying subtitles
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8456—Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/8543—Content authoring using a description language, e.g. Multimedia and Hypermedia information coding Expert Group [MHEG], eXtensible Markup Language [XML]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/8547—Content authoring involving timestamps for synchronizing content
Definitions
- the present technology relates to a transmission device, a transmission method, a reception device, and a reception method, and more particularly to a transmission device that transmits subtitle information together with image data.
- subtitle information is transmitted as bitmap data.
- subtitle information is transmitted as text character codes, that is, on a text basis.
- font development corresponding to the resolution is performed on the receiving side.
- the text information has timing information.
- TTML: Timed Text Markup Language
- W3C: World Wide Web Consortium
- the purpose of this technology is to simplify the subtitle display process in the variable speed playback mode on the receiving side.
- a video encoding unit that generates a video stream composed of video packets having encoded image data in the payload;
- a subtitle encoding unit that generates a subtitle stream composed of subtitle packets having subtitle information in the payload;
- a multiplexed stream generating unit that generates a multiplexed stream including the video stream and the subtitle stream;
- in the transmission device, the multiplexed stream generation unit arranges the subtitle packet at a random access position.
- a video stream including video packets having encoded image data in the payload is generated by the video encoding unit.
- the subtitle encoding unit generates a subtitle stream including subtitle packets having subtitle information in the payload.
- the subtitle information may be text information of a subtitle having a predetermined format having display timing information.
- the text information of the subtitle in a predetermined format may be TTML or a derived format of this TTML.
- the multiplexed stream generation unit generates a multiplexed stream including at least a video stream and a subtitle stream.
- the subtitle packet is arranged at a random access position.
- the video packet and the subtitle packet may be PES packets, and the multiplexed stream may be an MPEG2 transport stream.
- the multiplexed stream is transmitted by the transmission unit.
- when the video stream and the subtitle stream are multiplexed, the subtitle packet is arranged at the random access position. Therefore, on the receiving side, the subtitle packet can be extracted from the multiplexed stream together with the video packet at the random access position, and the subtitle display process in the variable speed playback mode can be simplified.
- the subtitle encoding unit may insert, at the time stamp insertion position of the header of the subtitle packet, a time stamp whose value is the same as or close to the time stamp value inserted at the time stamp insertion position of the header of the video packet at the random access position.
- the subtitle packet is arranged at the random access position.
- the subtitle encoding unit may insert a unique time stamp indicating the display time of the subtitle into the header or payload of the subtitle packet.
- the receiving side can easily control the display timing of the subtitle based on the unique time stamp.
- the multiplexed stream generation unit may insert identification information indicating that a unique time stamp is inserted into the subtitle packet into the multiplexed stream.
- the receiving side can easily recognize that the unique time stamp is inserted based on the identification information.
- information indicating an insertion position may be added to the identification information.
- the reception side can easily acquire the unique time stamp based on the information indicating the insertion position.
- the subtitle packet is arranged at a random access position
- the receiving apparatus further includes a processing unit that extracts the subtitle packet from the multiplexed stream together with the video packet at the random access position and performs a subtitle display process.
- the reception unit receives a multiplexed stream including a video stream composed of video packets having encoded image data in the payload and a subtitle stream composed of subtitle packets having subtitle information in the payload.
- the subtitle packet is arranged at a random access position.
- the processing unit extracts the subtitle packet together with the video packet at the random access position from the multiplexed stream, and performs subtitle display processing.
- the subtitle packet is arranged at the random access position in the multiplexed stream, the subtitle packet is extracted from the multiplexed stream together with the video packet at the random access position, and the subtitle display process is performed. Therefore, the subtitle display process in the variable speed playback mode can be simplified.
- a first time stamp, whose value is the same as or close to the time stamp value inserted at the time stamp insertion position of the header of the video packet at the random access position, is inserted at the time stamp insertion position of the header of the subtitle packet, and a second time stamp indicating the display time of the subtitle is inserted in the header or payload of the subtitle packet.
- the processing unit may perform the subtitle display process based on the second time stamp. In this case, the display timing of the subtitle can be easily controlled.
- identification information indicating that the second time stamp is inserted into the subtitle packet is inserted into the multiplexed stream, and the processing unit may extract the second time stamp from the subtitle packet based on the identification information and use it.
- FIG. 1 shows a configuration example of a transmission / reception system 10 as an embodiment.
- the transmission / reception system 10 includes a broadcast transmission system 100 and a television receiver 200.
- the broadcast transmission system 100 transmits a transport stream TS as a multiplexed stream on a broadcast wave.
- the transport stream TS has a video stream and a subtitle stream.
- the video stream is composed of video PES packets having image data in the payload.
- the subtitle stream is composed of subtitle PES packets having subtitle (caption) information in the payload.
- a time stamp whose value is the same as or close to the time stamp value inserted at the time stamp insertion position of the header of the video PES packet at the random access position is inserted at the time stamp insertion position of the header of the subtitle PES packet. Also, a unique time stamp indicating the display time of the subtitle is inserted into the header or payload of the subtitle PES packet.
- a TS packet including a subtitle PES packet (hereinafter simply referred to as a PES packet) is arranged at a random access position.
- with reference to the time stamp inserted at the time stamp insertion position of the header, the subtitle PES packet is placed at the random access position, that is, at a position close to the position of the video PES packet at the random access position.
- identification information indicating that a unique time stamp is inserted in the subtitle PES packet is inserted.
- This identification information is inserted, for example, under the program map table. As described above, a unique time stamp is inserted into the header or payload of the subtitle PES packet. Information indicating the insertion position is added to the identification information.
- the television receiver 200 receives the transport stream TS transmitted from the broadcast transmission system 100 by broadcast waves.
- this transport stream TS includes at least a video stream and a subtitle stream.
- the transport stream TS may include an audio stream.
- the video stream is composed of video PES packets having image data in the payload.
- the subtitle stream is composed of subtitle PES packets having subtitle information in the payload.
- the subtitle PES packet is arranged at a random access position.
- the television receiver 200 extracts the subtitle PES packet from the transport stream TS together with the video PES packet at the random access position, and performs subtitle (caption) display processing. For example, in the variable speed playback mode, the subtitle PES packet extracted from the random access position of the transport stream TS is immediately processed to display the subtitle.
- alternatively, the subtitle PES packet extracted from the random access position of the transport stream TS is processed based on the unique time stamp, inserted in its header or payload, that indicates the display time of the subtitle, and the subtitle is displayed at that display time.
- the television receiver 200 extracts and uses a unique time stamp from the subtitle PES packet based on the identification information inserted in the transport stream TS.
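The receiver behaviour described above can be sketched as follows. This is only an illustration under stated assumptions: the `Packet` record, the function names, and the rule that subtitle packets found between a RAP video packet and the next video packet belong to that random access position are all hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Packet:
    """Hypothetical, simplified view of a PES packet in the multiplexed stream."""
    kind: str                      # "video", "audio" or "subtitle"
    timestamp: int                 # value at the header time stamp insertion position
    is_rap: bool = False           # True for a video packet at a random access position
    display_time: Optional[Tuple[int, int]] = None  # unique (start, end) time stamp, if carried


def packets_at_random_access_positions(stream: List[Packet]):
    """Group each RAP video packet with the subtitle packets placed next to it.

    Assumption: subtitle packets between a RAP video packet and the next
    video packet are treated as located at that random access position.
    """
    groups = []
    collecting = False
    for pkt in stream:
        if pkt.kind == "video":
            collecting = pkt.is_rap
            if collecting:
                groups.append((pkt, []))
        elif pkt.kind == "subtitle" and collecting:
            groups[-1][1].append(pkt)
    return groups


def display_start(subtitle: Packet, now: int) -> int:
    """Use the unique time stamp when present, otherwise display immediately."""
    if subtitle.display_time is not None:
        return subtitle.display_time[0]
    return now
```

In variable speed playback, a player built this way would decode only the grouped RAP video packets and hand the accompanying subtitle packets to the subtitle decoder without scanning the rest of the stream.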
- FIG. 2 shows a configuration example of the stream generation unit 110 of the broadcast transmission system 100.
- the stream generation unit 110 includes a control unit 111, a video encoder 112, an audio encoder 113, a text format conversion unit 114, a subtitle encoder 115, and a TS formatter (multiplexer) 116.
- the control unit 111 is configured to include, for example, a CPU (Central Processing Unit), and controls the operation of each unit of the stream generation unit 110.
- the video encoder 112 receives the image data DV, encodes the image data DV, and generates a video stream (PES stream) composed of video PES packets having encoded image data in the payload.
- the audio encoder 113 receives the audio data DA, encodes the audio data DA, and generates an audio stream (PES stream) composed of audio PES packets having encoded audio data.
- the text format conversion unit 114 inputs text data (character code) DT as subtitle information, and obtains text information of a subtitle having a predetermined format having display timing information.
- as this text information, for example, TTML or a derived format of TTML can be considered. In this embodiment, TTML is assumed.
- FIG. 3 shows a TTML structure.
- TTML is described on an XML basis.
- the TTML head contains the elements metadata (metadata), styling (styling), and layout (layout).
- FIG. 4A shows an example of the structure of metadata (TTM: TTML Metadata). This metadata includes metadata title information and copyright information.
- FIG. 4B shows a structural example of styling (TTS: TTML Styling).
- This styling includes information such as a color (color), a font (fontFamily), a size (fontSize), and an alignment (textAlign) in addition to the identifier (id).
- FIG. 4C shows a structural example of a layout (region: TTML layout). This layout includes information such as an extent (extent), an offset (padding), a background color (backgroundColor), and an alignment (displayAlign) in addition to the identifier (id) of the region in which the subtitle is arranged.
- FIG. 5 shows an example of the structure of the body.
- in this example, information on three subtitles, subtitle 1 (subtitle 1), subtitle 2 (subtitle 2), and subtitle 3 (subtitle 3), is included.
- for each subtitle, a display start timing and a display end timing are described, and text data is described.
- for example, the display start timing is “0.76 s”, the display end timing is “3.45 s”, and the text data is “It seems a paradox, does it not,”.
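As a rough illustration of how such a body can be read, the sketch below parses a minimal TTML fragment with Python's standard `xml.etree.ElementTree`. The sample document is hypothetical: only the timing values and the text are taken from the example above, and the remaining subtitles and the head are omitted. The function name `read_subtitles` is likewise an assumption.

```python
import xml.etree.ElementTree as ET

# Namespace of TTML documents as defined by W3C.
TTML_NS = "http://www.w3.org/ns/ttml"

# Hypothetical minimal document; only one subtitle is shown.
SAMPLE = """<tt xmlns="http://www.w3.org/ns/ttml">
  <body>
    <div>
      <p begin="0.76s" end="3.45s">It seems a paradox, does it not,</p>
    </div>
  </body>
</tt>"""


def read_subtitles(ttml_text):
    """Return (begin, end, text) for every <p> element in the TTML body."""
    root = ET.fromstring(ttml_text)
    subtitles = []
    for p in root.iter(f"{{{TTML_NS}}}p"):
        text = "".join(p.itertext()).strip()
        subtitles.append((p.get("begin"), p.get("end"), text))
    return subtitles
```

The `begin` and `end` strings returned here are the relative times that a later stage would convert to time stamp accuracy.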
- the subtitle encoder 115 converts the TTML obtained by the text format conversion unit 114 into various segments, and generates a subtitle stream (PES stream) composed of subtitle PES packets in which those segments (subtitle information) are arranged in the payload.
- the subtitle encoder 115 inserts, at the time stamp insertion position of the header of the subtitle PES packet, a time stamp (first time stamp) whose value is the same as or close to the time stamp value inserted at the time stamp insertion position of the header at the head of the video PES packet at the random access position, that is, the video PES packet targeted for RAP (Random Access Point).
- the head part of the video PES packet targeted for RAP means, for example, the head part of a video PES packet having encoded image data of an intra picture (I picture) in the payload.
- in RAP reproduction, only the encoded image data of the intra picture included in the video PES packet targeted for RAP is decoded from the transport stream TS and displayed as an image.
- under the control of the control unit 111, the subtitle encoder 115 inserts a unique time stamp (second time stamp) indicating the display time (display start time, display end time) of the subtitle into the header or payload of the subtitle PES packet.
- FIG. 6 schematically shows a configuration example of the timing management unit 120 in the control unit 111.
- the timing management unit 120 includes a video timing management unit 121, an audio timing management unit 122, a RAP timing management unit 123, and a subtitle display timing management unit 124.
- the video timing management unit 121 manages a time stamp (time stamp of a video access unit) to be inserted at a time stamp insertion position of each video PES packet.
- the video encoder 112 inserts a time stamp at the time stamp insertion position of each video PES packet based on the management information of the video timing management unit 121.
- the audio timing management unit 122 manages the time stamp (time stamp of the audio access unit) to be inserted at the time stamp insertion position of each audio PES packet.
- the audio encoder 113 inserts a time stamp at the time stamp insertion position of each audio PES packet based on the management information of the audio timing management unit 122.
- the RAP timing management unit 123 manages the time stamp of the video PES packet targeted for RAP and manages the audio PES packet targeted for RAP among the audio PES packets. In this case, an audio PES packet having a time stamp close to the time stamp of the RAP target video PES packet is treated as a playback target at the time of RAP.
- based on the management information of the RAP target audio PES packet, the TS formatter 116 places the RAP target audio PES packet at the random access position, that is, at a position close to the position of the RAP target video PES packet, as will be described later.
- the subtitle display timing management unit 124 manages the first time stamp (the time stamp inserted in the time stamp insertion area of the header) and the second time stamp (the unique time stamp, inserted in the header or payload, that indicates the display time of the subtitle) of the subtitle PES packet.
- the subtitle encoder 115 inserts a first time stamp and a second time stamp into the subtitle PES packet based on the management information of the subtitle display timing management unit 124.
- FIG. 7 shows a detailed configuration of the subtitle display timing management unit 124.
- the subtitle display timing management unit 124 obtains a first time stamp (subtitle stream time stamp) based on the RAP timing information from the RAP timing management unit 123. Further, the subtitle display timing management unit 124 performs time stamp accuracy conversion on the relative start time “begin” and the relative end time “end” included in the TTML body, and obtains the second time stamp (display start time stamp, display end time stamp).
- the time stamp accuracy conversion is performed on the relative start time “begin” and the relative end time “end” of the region with the earliest timing, to obtain a display start time stamp and a display end time stamp as the second time stamp.
- the display start time stamp and the display end time stamp as the second time stamp are used on the receiving side.
- a display time stamp (display start time stamp, display end time stamp), converted to time stamp accuracy based on the difference from the relative time of the earliest region supplied in the TTML body, is generated and used.
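A minimal sketch of the time stamp accuracy conversion follows. The patent does not state the clock rate; the sketch assumes the 90 kHz resolution that MPEG-2 Systems uses for presentation time stamps and wraps the result to the 33-bit field width. The function name `to_timestamp` and the `base_timestamp` parameter (a reference value to which the relative time is added) are assumptions for illustration only.

```python
from fractions import Fraction

PTS_CLOCK_HZ = 90_000   # assumption: MPEG-2 PTS resolution, not stated in the source
PTS_MODULO = 1 << 33    # the time stamp fields are 33 bits wide


def to_timestamp(relative_time: str, base_timestamp: int = 0) -> int:
    """Convert a TTML relative time such as "0.76s" to 33-bit clock ticks.

    Fraction is used so that decimal values convert exactly, without
    floating point rounding error.
    """
    seconds = Fraction(relative_time.strip().rstrip("s").strip())
    ticks = round(seconds * PTS_CLOCK_HZ)
    return (base_timestamp + ticks) % PTS_MODULO
```

Under these assumptions, `to_timestamp("0.76s")` gives 68400 and `to_timestamp("3.45s")` gives 310500.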
- FIG. 8 shows a configuration example (Syntax) of the PES packet.
- the first time stamp is inserted into a 33-bit area indicated by an arrow P, which is a time stamp insertion area of the header.
- This time stamp insertion area exists in “optional fields” under “optional PES HEADER” constituting the header.
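The source only states that the first time stamp occupies a 33-bit area of the header. In MPEG-2 Systems (ISO/IEC 13818-1) a 33-bit PTS is carried in five bytes, split 3 + 15 + 15 bits with a marker bit after each part. The sketch below assumes that this standard layout applies here; the function names are hypothetical.

```python
def encode_pts(pts: int, prefix: int = 0b0010) -> bytes:
    """Pack a 33-bit time stamp into the 5-byte PES header PTS field.

    prefix 0b0010 is the 4-bit pattern used when only a PTS (no DTS) is present.
    """
    if not 0 <= pts < (1 << 33):
        raise ValueError("time stamp must fit in 33 bits")
    return bytes([
        (prefix << 4) | (((pts >> 30) & 0x07) << 1) | 1,  # bits 32..30 + marker
        (pts >> 22) & 0xFF,                               # bits 29..22
        (((pts >> 15) & 0x7F) << 1) | 1,                  # bits 21..15 + marker
        (pts >> 7) & 0xFF,                                # bits 14..7
        ((pts & 0x7F) << 1) | 1,                          # bits 6..0 + marker
    ])


def decode_pts(field: bytes) -> int:
    """Recover the 33-bit time stamp from the 5-byte field."""
    if len(field) != 5:
        raise ValueError("PTS field is 5 bytes long")
    return (
        (((field[0] >> 1) & 0x07) << 30)
        | (field[1] << 22)
        | ((field[2] >> 1) << 15)
        | (field[3] << 7)
        | (field[4] >> 1)
    )
```

Because the subtitle packet and the RAP video packet would use the same field format, a receiver can compare the two decoded values directly.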
- the second time stamp is inserted into the header or payload as described above.
- when it is inserted into the header, for example, it is inserted into the area of “PES extension fields data” indicated by the arrow Q.
- This “PES extension fields data” exists in “optional fields” under “PES extension”.
- “PES extension” exists in “optional fields” under “optional PES HEADER” that constitutes the header.
- when it is inserted into the payload, it is inserted into the area “PES packet data bytes” indicated by the arrow R.
- FIGS. 9 and 10 show a configuration example (Syntax) of the PES extension “PES extension”.
- FIG. 11 shows a configuration example (Syntax) of the PES payload (PES_payload) arranged in the area of “PES packet data bytes”.
- FIG. 12 shows the contents (Semantics) of the new element in each configuration example.
- flag information of “PES_extension_negflag_new” is newly defined. This flag information indicates that newly defined data is arranged in the extension area (PES extension) of the PES header. “0” indicates that newly defined data is arranged in the extension area of the PES header. “1” indicates that the newly defined data is not arranged in the extension area of the PES header.
- when “PES_extension_negflag_new” is “0”, an 8-bit field of “extension_type” exists. This field indicates the type of newly defined data. “1” indicates a time stamp indicating the start / end of subtitle display. When “extension_type” is “1”, a 33-bit field of “subtitle_start_time” and a 33-bit field of “subtitle_end_time” exist.
- the 33-bit field of “subtitle_start_time” indicates a value (display start time stamp) indicating the subtitle display start time with the clock accuracy of the system.
- the 33-bit field of “subtitle_end_time” indicates a value (display end time stamp) representing the subtitle display end time with the clock accuracy of the system.
- the subtitle PES packet is defined as a private PES packet.
- a time stamp indicating display start / end is inserted into a part thereof.
- the 8-bit field of “private_type” indicates the type of the private PES packet.
- “EC” indicates a subtitle stream with a time stamp.
- the flag information of “timestamp_insertion_flag” indicates that a time stamp is arranged at the beginning of the packet payload.
- “1” indicates that a time stamp is arranged.
- “0” indicates that no time stamp is arranged.
- when “timestamp_insertion_flag” is “1”, a 33-bit field of “subtitle_start_time” and a 33-bit field of “subtitle_end_time” exist.
- the TS formatter 116 packetizes into transport packets and multiplexes the video stream generated by the video encoder 112, the audio stream generated by the audio encoder 113, and the subtitle stream generated by the subtitle encoder 115, and obtains a transport stream TS as a multiplexed stream.
- based on the first time stamp (the time stamp inserted at the time stamp insertion position in the header), the TS formatter 116 places the subtitle PES packet at the random access position, that is, at a position close to the position of the video PES packet targeted for RAP. At this time, based on the management information of the RAP target audio PES packet, the TS formatter 116 also places the RAP target audio PES packet at the random access position, that is, at a position close to the position of the RAP target video PES packet.
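The placement rule can be sketched as below. This is not the patent's multiplexer: the `PesPacket` record, the `multiplex` function, and the `tolerance` parameter (how far a first time stamp may differ from the RAP video time stamp and still count as "close") are hypothetical, TS packetization is omitted, and 33-bit wraparound is ignored for brevity.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PesPacket:
    """Hypothetical, simplified PES packet record."""
    kind: str             # "video" or "subtitle"
    timestamp: int        # value at the header time stamp insertion position
    is_rap: bool = False  # True for a video packet targeted for RAP


def multiplex(video: List[PesPacket], subtitles: List[PesPacket],
              tolerance: int = 0) -> List[PesPacket]:
    """Place each subtitle packet directly after the RAP video packet whose
    time stamp is the same as, or within `tolerance` of, its first time stamp."""
    pending = sorted(subtitles, key=lambda p: p.timestamp)
    out: List[PesPacket] = []
    for v in video:
        out.append(v)
        if v.is_rap:
            near = [s for s in pending if abs(s.timestamp - v.timestamp) <= tolerance]
            pending = [s for s in pending if abs(s.timestamp - v.timestamp) > tolerance]
            out.extend(near)
    if pending:
        raise ValueError("subtitle packet without a matching random access position")
    return out
```

The same comparison could be applied to the RAP target audio PES packets, which the text says are positioned in the same way.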
- the TS formatter 116 inserts identification information indicating that the second time stamp (unique time stamp indicating the display time of the subtitle) is inserted in the subtitle PES packet in the transport stream TS.
- the TS formatter 116 inserts a subtitle wrap descriptor (Subtitle_rap_descriptor) into a subtitle elementary stream loop corresponding to a subtitle stream under a program map table (PMT: Program Map Table). To do.
- FIG. 13A shows a structure example (Syntax) of the subtitle RAP descriptor.
- FIG. 13B shows the content (Semantics) of main information in the structure example.
- The 8-bit field of “descriptor_tag” indicates the descriptor type; here it indicates the subtitle RAP descriptor.
- The 8-bit field of “descriptor_length” indicates the length (size) of the descriptor as the number of subsequent bytes.
- The flag information of “subtitle_presentation_time_flag” indicates whether subtitle display is performed at a timing different from that of the conventional time stamp (first time stamp), mainly at a delayed timing, that is, whether a display time stamp unique to the subtitle is present. “1” indicates that delayed display (arrangement of a display time stamp unique to the subtitle) is performed. “0” indicates that it is not performed.
- “time_insertion_type” indicates the insertion type of the display time stamp unique to the subtitle, that is, where the time stamp is inserted. “01” indicates insertion into the PES header extension (PES header extension) portion. “10” indicates insertion into the payload of the PES private stream (PES private stream payload).
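The descriptor semantics above can be sketched as a small parser. This is a minimal sketch, not the syntax of FIG. 13A, which is not reproduced in this text: the placement of the 1-bit flag and the 2-bit `time_insertion_type` in the first body byte is an assumption, and the class name, the function name, and the tag value in the usage note are hypothetical.

```python
from dataclasses import dataclass

# Meaning of time_insertion_type as described in the text.
INSERTION_LOCATIONS = {
    0b01: "pes_header_extension",
    0b10: "pes_private_stream_payload",
}


@dataclass
class SubtitleRapDescriptor:
    descriptor_tag: int
    subtitle_presentation_time_flag: bool
    time_insertion_type: int

    @property
    def insertion_location(self):
        """Where the unique display time stamp sits, or None if it is absent."""
        if not self.subtitle_presentation_time_flag:
            return None
        return INSERTION_LOCATIONS.get(self.time_insertion_type, "reserved")


def parse_subtitle_rap_descriptor(data):
    """Parse tag, length and an ASSUMED body byte: flag(1) type(2) reserved(5)."""
    if len(data) < 3:
        raise ValueError("descriptor too short")
    tag, length = data[0], data[1]
    if length < 1 or len(data) < 2 + length:
        raise ValueError("descriptor_length does not match the data")
    body = data[2]
    return SubtitleRapDescriptor(
        descriptor_tag=tag,
        subtitle_presentation_time_flag=bool(body >> 7),
        time_insertion_type=(body >> 5) & 0b11,
    )
```

For example, `parse_subtitle_rap_descriptor(bytes([0xF0, 0x01, 0xC0]))` (with 0xF0 as a made-up tag value) reports that the time stamp is carried in the PES private stream payload.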
- FIG. 14 shows a configuration example of the transport stream TS.
- a video PES packet “Video PES” that is a PES packet of the video stream identified by PID1.
- an audio PES packet “Audio PES” that is a PES packet of the audio stream identified by PID2.
- a subtitle PES packet “Subtitle PES” that is a PES packet of the subtitle stream identified by PID3.
- Each PES packet is composed of a PES header and a PES payload.
- In the video PES packet, a DTS / PTS time stamp is inserted in the PES header, and a video encoded stream (encoded image data) is inserted in the PES payload.
- In the audio PES packet, a PTS time stamp is inserted in the PES header, and an audio encoded stream (encoded audio data) is inserted in the PES payload.
- In the subtitle PES packet, a PTS time stamp (first time stamp) is inserted in the PES header, and a subtitle encoded stream (a plurality of subtitle segments as subtitle information) is inserted in the PES payload.
- the PES header or the PES payload includes a display start time stamp “subtitle_start_time” and a display end time stamp “subtitle_end_time” as a second time stamp (unique time stamp).
- The transport stream TS includes a PMT (Program Map Table) as PSI (Program Specific Information).
- This PSI is information describing to which program each elementary stream included in the transport stream TS belongs.
- The transport stream TS also includes an EIT (Event Information Table) as SI (Service Information) for managing each event.
- The PMT includes a program descriptor (Program Descriptor) that describes information related to the entire program.
- The PMT includes an elementary stream loop having information related to each elementary stream.
- In each loop, information such as a packet identifier (PID) is arranged, and a descriptor describing information related to the elementary stream is also arranged.
- The above-described subtitle RAP descriptor (Subtitle_rap_descriptor) is arranged as one of these descriptors.
- the operation of the stream generation unit 110 shown in FIG. 2 will be briefly described.
- the image data DV is supplied to the video encoder 112.
- the image data DV is encoded, and a video stream (PES stream) including a video PES packet having encoded image data in a payload is generated.
- This video stream is supplied to the TS formatter 116.
- the audio data DA is supplied to the audio encoder 113.
- the audio encoder 113 encodes the audio data DA, and generates an audio stream (PES stream) composed of audio PES packets having encoded audio data. This audio stream is supplied to the TS formatter 116.
- text data (character code) DT as subtitle information is supplied to the text format conversion unit 114.
- This text format conversion unit 114 obtains subtitle text information in a predetermined format having display timing information, here, TTML. This TTML is supplied to the subtitle encoder 115.
- the subtitle encoder 115 converts the TTML into various segments, and generates a subtitle stream composed of subtitle PES packets in which those segments are arranged in the payload. This subtitle stream is supplied to the TS formatter 116.
- At this time, under the control of the control unit 111, the subtitle encoder 115 inserts, at the time stamp insertion position of the header of the subtitle PES packet, a time stamp (first time stamp) having a value that is the same as or close to the value of the time stamp inserted at the time stamp insertion position of the header of the RAP target video PES packet.
- When generating the subtitle stream, the subtitle encoder 115 also inserts, under the control of the control unit 111, a unique time stamp (second time stamp) indicating the subtitle display time (display start time, display end time) into the header or payload of the subtitle PES packet.
- In the TS formatter 116, the video stream generated by the video encoder 112, the audio stream generated by the audio encoder 113, and the subtitle stream generated by the subtitle encoder 115 are packetized into transport packets and multiplexed, and a transport stream TS is generated as a multiplexed stream.
- At this time, under the control of the control unit 111, the subtitle PES packet is arranged at a random access position, that is, at a position close to the position of the RAP target video PES packet, based on the first time stamp (the time stamp inserted at the time stamp insertion position of the header).
- Likewise, the RAP target audio PES packet is arranged at the random access position, that is, at a position close to the position of the RAP target video PES packet.
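The arrangement rule described above can be sketched as follows. This is a simplified model, not the TS formatter 116 itself: it assumes that the first time stamp of a subtitle is set to the time stamp of the latest RAP target video packet at or before the display start (consistent with the "delayed display" wording, but still an assumption), and that packets are ordered by first time stamp with video, audio, and subtitle packets of equal time stamp kept adjacent. All names are hypothetical.

```python
import bisect
from dataclasses import dataclass


@dataclass(frozen=True)
class PesPacket:
    kind: str       # "video", "audio" or "subtitle"
    first_ts: int   # time stamp carried in the PES header
    label: str


# Tie-break order for packets that share the same first time stamp.
KIND_ORDER = {"video": 0, "audio": 1, "subtitle": 2}


def first_timestamp_for(display_start, rap_timestamps):
    """Pick the latest RAP time stamp at or before the display start.

    rap_timestamps must be sorted in ascending order.
    """
    i = bisect.bisect_right(rap_timestamps, display_start) - 1
    if i < 0:
        raise ValueError("no RAP at or before the display start")
    return rap_timestamps[i]


def multiplex(packets):
    """Order packets so that a subtitle lands next to its RAP video packet."""
    return sorted(packets, key=lambda p: (p.first_ts, KIND_ORDER[p.kind]))
```

With RAPs at time stamps 0 and 20, a subtitle whose display starts at 15 gets the first time stamp 0 and is therefore multiplexed directly after the video and audio packets of that RAP, while its display timing is carried separately by the second time stamp.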
- identification information is inserted into the transport stream TS.
- This identification information indicates that the second time stamp (unique time stamp indicating the display time of the subtitle) is inserted in the subtitle PES packet. That is, in the TS formatter 116, a subtitle RAP descriptor (Subtitle_rap_descriptor) is arranged in the subtitle elementary stream loop corresponding to the subtitle stream under the program map table (PMT: Program Map Table).
- FIG. 15 shows a configuration example of the television receiver 200.
- the television receiver 200 includes a receiving unit 201, a TS analysis unit (demultiplexer) 202, a video decoder 203, a video superimposing unit 204, a panel drive circuit 205, and a display panel 206.
- the television receiver 200 includes an audio decoder 207, an audio output circuit 208, a speaker 209, and a subtitle decoder 210.
- the television receiver 200 also includes a CPU 221, a flash ROM 222, a DRAM 223, an internal bus 224, a remote control receiver 225, and a remote control transmitter 226.
- the CPU 221 controls the operation of each part of the television receiver 200.
- the flash ROM 222 stores control software and data.
- the DRAM 223 constitutes a work area for the CPU 221.
- The CPU 221 loads the software and data read from the flash ROM 222 into the DRAM 223, starts the software, and controls each unit of the television receiver 200.
- the remote control receiving unit 225 receives the remote control signal (remote control code) transmitted from the remote control transmitter 226 and supplies it to the CPU 221.
- the CPU 221 controls each part of the television receiver 200 based on this remote control code.
- the CPU 221, flash ROM 222, and DRAM 223 are connected to the internal bus 224.
- the receiving unit 201 receives the transport stream TS transmitted from the broadcast transmission system 100 on a broadcast wave.
- the transport stream TS includes a video stream, an audio stream, and a subtitle stream.
- the TS analysis unit 202 extracts PES packets of video, audio, and subtitle streams from the transport stream TS.
- The transport stream TS received by the receiving unit 201 is temporarily stored in a storage medium (buffer or storage) 201a, and the part corresponding to the reproduction mode is taken out and sent to the TS analysis unit 202.
- In the normal playback mode, all accumulated data is sent to the TS analysis unit 202.
- In the variable speed reproduction mode, intermittent parts corresponding to the speed multiple, including the part at the RAP position (random access position), are sent to the TS analysis unit 202.
- In the RAP playback mode, only the part at the RAP position (random access position) is sent to the TS analysis unit 202.
- the TS analysis unit 202 extracts various information inserted in the transport stream TS and sends it to the CPU 221.
- This information includes the subtitle RAP descriptor (Subtitle_rap_descriptor).
- From this descriptor, the CPU 221 can easily recognize that the second time stamp (unique time stamp) is inserted in the subtitle PES packet, as well as its insertion position, and can use this effectively to control the subtitle decoder 210.
- the audio decoder 207 performs decoding processing on the audio PES packet extracted by the TS analysis unit 202 to obtain audio data.
- the audio output circuit 208 performs necessary processing such as D / A conversion and amplification on the audio data, and supplies the audio data to the speaker 209.
- the video decoder 203 performs a decoding process on the video PES packet extracted by the TS analysis unit 202 to obtain image data.
- The decoding and output processing for each PES packet in the audio decoder 207 and the video decoder 203 is controlled by the time stamp inserted in the header in the normal playback mode; in the variable speed playback mode it is not controlled by the time stamp and is performed immediately after the packet is supplied from the TS analysis unit 202.
- the subtitle decoder 210 performs a decoding process on the subtitle PES packet extracted by the TS analysis unit 202 to obtain bitmap data of each region to be superimposed on the image data.
- The decoding and output processing for the subtitle PES packet in the subtitle decoder 210 is controlled by the second time stamp (unique time stamp) inserted in the header or payload in the normal playback mode; in the variable speed playback mode it is not controlled by the second time stamp and is performed immediately after the packet is supplied from the TS analysis unit 202.
- FIG. 16 schematically shows, for example, the decoding and output processing timing for the subtitle PES packet read at the RAP position.
- In the normal reproduction mode, decoding is completed by the time indicated by the display start time stamp, and output of the bitmap data as the decoding result starts from that time and continues until the time indicated by the display end time stamp.
- In this case, the subtitle display period is from the time indicated by the display start time stamp to the time indicated by the display end time stamp.
- In the variable speed reproduction mode, decoding is performed immediately after reading from the storage medium 201a, and output of the bitmap data as the decoding result is started.
- The duration of this output is the subtitle display period. It is determined depending on the speed multiple of the variable speed reproduction and on the subtitle display maintenance function.
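The two timing rules above can be summarized in a small sketch. It is only a model of the behaviour described for FIG. 16, under assumptions that the text does not state: time stamps are treated as 33-bit counters that wrap around, a 90 kHz tick rate is assumed (the usual MPEG-2 PTS resolution; the text only says "the clock accuracy of the system"), and `hold_ticks` stands in for whatever duration the receiver derives from the speed multiple and its subtitle display maintenance function. The function names are hypothetical.

```python
TS_MODULO = 1 << 33   # the time stamps are 33-bit values
CLOCK_HZ = 90_000     # ASSUMPTION: 90 kHz ticks, as for MPEG-2 PTS


def display_window(mode, now, start_ts, end_ts, hold_ticks):
    """Return (output start tick, output duration in ticks) for one subtitle.

    mode "normal": follow the display start / end time stamps.
    any other mode (variable speed, RAP): start output right away and keep
    it for hold_ticks, ignoring the time stamps.
    """
    if mode == "normal":
        duration = (end_ts - start_ts) % TS_MODULO  # tolerate wrap-around
        return start_ts, duration
    return now, hold_ticks


def ticks_to_seconds(ticks):
    return ticks / CLOCK_HZ
```

In the normal mode the decoder has to finish decoding before the returned start tick; in the other modes the window begins as soon as the packet has been read and decoded.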
- the video superimposing unit 204 superimposes the bitmap data of each region obtained from the subtitle decoder 210 on the image data obtained by the video decoder 203.
- the panel drive circuit 205 drives the display panel 206 based on the display image data obtained by the video superimposing unit 204.
- the display panel 206 includes, for example, an LCD (Liquid Crystal Display), an organic EL display (organic electroluminescence display), and the like.
- the receiving unit 201 receives the transport stream TS transmitted from the broadcast transmission system 100 on the broadcast wave.
- the transport stream TS includes a video stream, an audio stream, and a subtitle stream.
- the transport stream TS is supplied to the TS analysis unit 202 through the storage medium 201a.
- The part corresponding to the playback mode is extracted from the storage medium 201a and sent to the TS analysis unit 202.
- For example, in the normal playback mode, all accumulated data is sent to the TS analysis unit 202.
- In the variable speed reproduction mode, intermittent parts corresponding to the speed multiple, including the part at the RAP position (random access position), are sent to the TS analysis unit 202.
- In the RAP playback mode, only the part at the RAP position (random access position) is sent to the TS analysis unit 202.
- the video PES packet extracted by the TS analysis unit 202 is supplied to the video decoder 203.
- In the video decoder 203, the video PES packet is subjected to decoding processing to obtain image data.
- The decoding and output processing for each video PES packet is controlled by the time stamp inserted in the header in the normal playback mode; in the variable speed playback mode it is not controlled by the time stamp and is performed immediately after the packet is supplied from the TS analysis unit 202. This image data is supplied to the video superimposing unit 204.
- the subtitle stream (PES stream) extracted by the TS analysis unit 202 is supplied to the subtitle decoder 210.
- In the subtitle decoder 210, segment data of each region is processed, and bitmap data of each region to be superimposed on the image data is obtained.
- The decoding and output processing for each subtitle PES packet is controlled by the second time stamp (unique time stamp) inserted in the header or payload in the normal reproduction mode; in the variable speed reproduction mode it is not controlled by the second time stamp and is performed immediately after the packet is supplied from the TS analysis unit 202 (see FIG. 16).
- the bitmap data of each region output from the subtitle decoder 210 is supplied to the video superimposing unit 204.
- the bitmap data of each region output from the subtitle decoder 210 is superimposed on the image data obtained by the video decoder 203.
- Display image data obtained by the video superimposing unit 204 is supplied to the panel drive circuit 205.
- the panel drive circuit 205 drives the display panel 206 based on the display video data.
- An image on which the subtitle is superimposed is displayed on the display panel 206. In the normal reproduction mode this is a normal playback image at normal speed, and in the variable speed reproduction mode it is a variable-speed playback image corresponding to the speed multiple.
- the audio PES packet extracted by the TS analysis unit 202 is supplied to the audio decoder 207.
- In the audio decoder 207, the audio PES packet is decoded and audio data is obtained.
- The decoding and output processing for each audio PES packet is controlled by the time stamp inserted in the header in the normal playback mode; in the variable speed playback mode it is not controlled by the time stamp and is performed immediately after the packet is supplied from the TS analysis unit 202.
- the audio data is supplied to the audio output circuit 208.
- the audio output circuit 208 performs necessary processing such as D / A conversion and amplification on the audio data.
- the processed audio data is supplied to the speaker 209. Thereby, an audio output corresponding to the display image on the display panel 206 is obtained from the speaker 209.
- the subtitle PES packet is arranged at the random access position. Therefore, on the receiving side, when the video PES packet at the random access position is extracted from the multiplexed stream, the subtitle PES packet can be extracted together, and the subtitle display process in the variable speed playback mode is simplified.
- FIG. 17 shows conventional multiplexing.
- FIG. 17A shows a video PES packet sequence constituting a video stream.
- “V_TS00”, “V_TS10”, “V_TS20”, “V_TS30”, ... indicate time stamps inserted in the header, and “V0”, “V10”, “V20”, “V30”, ... indicate encoded image data arranged in the payload.
- FIG. 17B shows an audio PES packet sequence constituting an audio stream.
- “A_TS00”, “A_TS10”, “A_TS20”, “A_TS30”, ... indicate time stamps inserted in the header, and “A0”, “A10”, “A20”, “A30”, ... indicate encoded audio data arranged in the payload.
- FIG. 17C shows a subtitle PES packet sequence constituting the subtitle stream.
- “S_TS15”, “S_TS19”, “S_TS35”, “S_TS39”, ... indicate time stamps inserted in the header, and “S15”, “S19”, “S35”, “S39”, ... indicate subtitle information arranged in the payload.
- a time stamp is inserted into the header of the subtitle PES packet regardless of the time stamp inserted into the header of the video PES packet or the audio PES packet.
- FIG. 17 (d) shows an example of the arrangement of each PES packet in the multiplexed stream (transport stream TS).
- the time stamp inserted in the header of each PES packet is referred to determine the arrangement of each PES packet.
- the subtitle PES packet is arranged at a position slightly before the display period regardless of the RAP position (random access position).
- In RAP playback, for example, as shown in playback method 1 in FIG. 17 (e), PES packets of a certain length are read at each RAP position and decoded, and image display and audio output are performed. In this case, since the subtitle PES packet is not necessarily arranged at the RAP position, the subtitle is not displayed.
- FIG. 18 shows the multiplexing of the present technology.
- FIG. 18 (a) shows a video PES packet sequence constituting a video stream.
- “V_TS00”, “V_TS10”, “V_TS20”, “V_TS30”, ... indicate time stamps inserted in the header, and “V0”, “V10”, “V20”, “V30”, ... indicate encoded image data arranged in the payload.
- FIG. 18B shows an audio PES packet sequence constituting the audio stream.
- FIG. 18 (c) shows a subtitle PES packet sequence constituting the subtitle stream.
- In the header of each subtitle PES packet, a time stamp (first time stamp) having a value that is the same as or close to the time stamp inserted in the header of the RAP target video PES packet is inserted.
- FIG. 18D shows an example of the arrangement of each PES packet in the multiplexed stream (transport stream TS).
- the time stamp inserted in the header of each PES packet is referred to determine the arrangement of each PES packet.
- the subtitle PES packet is arranged at the RAP position (random access position) in the same manner as the RAP target video PES packet and audio PES packet.
- In RAP playback, for example, as shown in playback method 3 in FIG. 18 (e), PES packets of a certain length are read at each RAP position and decoded.
- Since the subtitle PES packet is also read and decoded together with the video PES packet and the audio PES packet, the subtitle is displayed properly together with the image display and the audio output.
- Unlike reproduction method 2 shown in FIG. 17 (f), it is not necessary to repeat a jump for reading the subtitle PES packet in addition to the jump for reading the RAP position, so the variable speed reproduction process on the receiving side for enabling subtitle display is simplified.
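The difference between the two multiplexing schemes during RAP playback can be shown with a toy simulation. The packet orders below are hypothetical layouts loosely modeled on the labels used for FIG. 17 and FIG. 18 (the figures themselves are not reproduced here), and the assumption that RAPs fall on V0 and V20 and that three packets are read per RAP is purely illustrative.

```python
def rap_playback(mux, window):
    """Read `window` consecutive packets starting at every RAP video packet.

    mux is a list of (kind, label, is_rap) tuples in multiplex order.
    Returns the labels of all packets that were read.
    """
    read = []
    for i, (kind, _label, is_rap) in enumerate(mux):
        if kind == "video" and is_rap:
            read.extend(label for _, label, _ in mux[i:i + window])
    return read


# Conventional multiplexing: subtitles sit slightly before their display period.
conventional = [
    ("video", "V0", True), ("audio", "A0", False),
    ("video", "V10", False), ("audio", "A10", False), ("subtitle", "S15", False),
    ("video", "V20", True), ("audio", "A20", False),
    ("video", "V30", False), ("audio", "A30", False), ("subtitle", "S35", False),
]

# Multiplexing of the present technology: subtitles sit at the RAP position.
present = [
    ("video", "V0", True), ("audio", "A0", False), ("subtitle", "S15", False),
    ("video", "V10", False), ("audio", "A10", False),
    ("video", "V20", True), ("audio", "A20", False), ("subtitle", "S35", False),
    ("video", "V30", False), ("audio", "A30", False),
]
```

Calling `rap_playback(conventional, 3)` returns only video and audio labels, whereas `rap_playback(present, 3)` also returns the subtitle packets, without any extra jump.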
- a second time stamp (unique time stamp) indicating the display time of the subtitle is inserted into the header or payload of the subtitle PES packet. Therefore, the receiving side can easily control the display timing of the subtitle based on the unique time stamp.
- identification information indicating that a unique time stamp is inserted into the subtitle PES packet is inserted into the transport stream TS. Therefore, the receiving side can easily recognize that the unique time stamp is inserted based on this identification information, and can extract the unique time stamp efficiently.
- TTML is used as text information of a subtitle having a predetermined format having display timing information.
- the present technology is not limited to this, and other timed text information having information equivalent to TTML may be used.
- a TTML derivative format may be used.
- Naturally, the present technology applies in exactly the same way when the conventional bitmap format is used as the subtitle format.
- The transmission / reception system 10 including the broadcast transmission system 100 and the television receiver 200 has been shown, but the configuration of the transmission / reception system to which the present technology can be applied is not limited to this.
- For example, the television receiver 200 may instead be configured as a set-top box and a monitor connected by a digital interface such as HDMI (High-Definition Multimedia Interface).
- The present technology can also take the following configurations.
- (1) A transmission device including: a video encoding unit that generates a video stream composed of video packets having encoded image data in a payload; a subtitle encoding unit that generates a subtitle stream composed of subtitle packets having subtitle information in a payload; a multiplexed stream generation unit that generates a multiplexed stream including the video stream and the subtitle stream; and a transmission unit that transmits the multiplexed stream, in which the multiplexed stream generation unit arranges the subtitle packet at a random access position.
- (2) The transmission device according to (1), in which the subtitle encoding unit inserts, at the time stamp insertion position of the header of the subtitle packet, a time stamp having a value that is the same as or close to the value of the time stamp inserted at the time stamp insertion position of the header of the video packet at the random access position.
- (3) The transmission device according to (2), in which the subtitle encoding unit inserts a unique time stamp indicating the display time of the subtitle into the header or payload of the subtitle packet.
- (4) The transmission device according to (3), in which the multiplexed stream generation unit inserts, into the multiplexed stream, identification information indicating that the unique time stamp is inserted in the subtitle packet.
- (5) The transmission device according to (4), in which information indicating the insertion position is added to the identification information.
- (6) The transmission device according to any one of (1) to (5), in which the subtitle information is subtitle text information in a predetermined format having display timing information.
- (7) The transmission device according to (6), in which the subtitle text information in the predetermined format is TTML or a derivative format of TTML.
- (8) The transmission device according to any one of (1) to (7), in which the video packet and the subtitle packet are PES packets, and the multiplexed stream is an MPEG2 transport stream.
- (9) A transmission method including: a video encoding step of generating a video stream composed of video packets having encoded image data in a payload; a subtitle stream generation step of generating a subtitle stream composed of subtitle packets having subtitle information in a payload; a multiplexed stream generation step of generating a multiplexed stream including the video stream and the subtitle stream; and a transmission step of transmitting the multiplexed stream by a transmission unit, in which, in the multiplexed stream generation step, the subtitle packet is arranged at a random access position.
- (10) A receiving device including: a receiving unit that receives a multiplexed stream including a video stream composed of video packets having encoded image data in a payload and a subtitle stream composed of subtitle packets having subtitle information in a payload, the subtitle packet being arranged at a random access position in the multiplexed stream; and a processing unit that extracts the subtitle packet from the multiplexed stream together with the video packet at the random access position and performs a subtitle display process.
- (11) The receiving device according to (10), in which a first time stamp having a value that is the same as or close to the value of the time stamp inserted at the time stamp insertion position of the header of the video packet at the random access position is inserted at the time stamp insertion position of the header of the subtitle packet, a second time stamp indicating the display time of the subtitle is inserted in the header or payload of the subtitle packet, and the processing unit performs the subtitle display process with reference to the second time stamp in the normal playback mode.
- (12) The receiving device according to (11), in which identification information indicating that the second time stamp is inserted in the subtitle packet is inserted in the multiplexed stream, and the processing unit extracts and uses the second time stamp from the subtitle packet based on the identification information.
- (13) A receiving method including: a receiving step of receiving, by a receiving unit, a multiplexed stream including a video stream composed of video packets having encoded image data in a payload and a subtitle stream composed of subtitle packets having subtitle information in a payload, the subtitle packet being arranged at a random access position in the multiplexed stream; and a processing step of extracting the subtitle packet from the multiplexed stream together with the video packet at the random access position and performing a subtitle display process.
- The main feature of this technology is that, when generating a transport stream TS as a multiplexed stream, the subtitle PES packet is arranged at a random access position, which simplifies the subtitle display process in the variable speed playback mode on the receiving side (see FIG. 18).
- DESCRIPTION OF SYMBOLS: 10 ... Transmission / reception system; 100 ... Broadcast transmission system; 110 ... Stream generation unit; 111 ... Control unit; 112 ... Video encoder; 113 ... Audio encoder; 114 ... Text format conversion unit; 115 ... Subtitle encoder; 116 ... TS formatter (multiplexer); 120 ... Timing management unit; 121 ... Video timing management unit; 122 ... Audio timing management unit; 123 ... RAP timing management unit; 124 ... Subtitle display timing management unit; 200 ... Television receiver; 201 ... Receiving unit; 201a ... Storage medium; 202 ... TS analysis unit; 203 ... Video decoder; 204 ... Video superimposing unit; 205 ... Panel drive circuit; 206 ... Display panel; 207 ... Audio decoder; 208 ... Audio output circuit; 209 ... Speaker; 210 ... Subtitle decoder; 221 ... CPU
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Computer Security & Cryptography (AREA)
- Databases & Information Systems (AREA)
- Human Computer Interaction (AREA)
- Computer Networks & Wireless Communication (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Studio Circuits (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Television Systems (AREA)
Abstract
Description
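The concept of the present technology resides in a transmission device including:
a video encoding unit that generates a video stream composed of video packets having encoded image data in a payload;
a subtitle encoding unit that generates a subtitle stream composed of subtitle packets having subtitle information in a payload;
a multiplexed stream generation unit that generates a multiplexed stream including the video stream and the subtitle stream; and
a transmission unit that transmits the multiplexed stream,
in which the multiplexed stream generation unit arranges the subtitle packet at a random access position.
Another concept of the present technology resides in a receiving device including:
a receiving unit that receives a multiplexed stream including a video stream composed of video packets having encoded image data in a payload and a subtitle stream composed of subtitle packets having subtitle information in a payload,
the subtitle packet being arranged at a random access position in the multiplexed stream; and
a processing unit that extracts the subtitle packet from the multiplexed stream together with the video packet at the random access position and performs a subtitle display process.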
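1. Embodiment
2. Modifications
[Configuration example of transmission / reception system]
FIG. 1 shows a configuration example of a transmission / reception system 10 as an embodiment. The transmission / reception system 10 includes a broadcast transmission system 100 and a television receiver 200. The broadcast transmission system 100 transmits a transport stream TS as a multiplexed stream on a broadcast wave. The transport stream TS has a video stream and a subtitle stream. The video stream is composed of video PES packets having image data in the payload. The subtitle stream is composed of subtitle PES packets having subtitle (caption) information in the payload.
FIG. 2 shows a configuration example of the stream generation unit 110 of the broadcast transmission system 100. The stream generation unit 110 includes a control unit 111, a video encoder 112, an audio encoder 113, a text format conversion unit 114, a subtitle encoder 115, and a TS formatter (multiplexer) 116.
Here, a specific example of the insertion position of the second time stamp (unique time stamp) will be described. FIG. 8 shows a structure example (Syntax) of a PES packet. The first time stamp is inserted into the 33-bit area indicated by arrow P, which is the time stamp insertion area of the header. This time stamp insertion area is located in the “optional fields” under the “optional PES HEADER” that constitutes the header.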
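For reference, in the standard MPEG-2 PES syntax (ISO/IEC 13818-1) a 33-bit PTS in the header's optional fields is carried in five bytes with marker bits interleaved. The sketch below shows that coding for the PTS-only case (prefix bits '0010'); it is a general illustration of the header time stamp area, not code from this application, and the function names are illustrative.

```python
PTS_MASK = (1 << 33) - 1


def encode_pts(pts):
    """Encode a 33-bit PTS into the 5-byte PES header form (PTS-only case)."""
    pts &= PTS_MASK
    return bytes([
        (0b0010 << 4) | (((pts >> 30) & 0x07) << 1) | 1,  # '0010', PTS[32..30], marker
        (pts >> 22) & 0xFF,                               # PTS[29..22]
        (((pts >> 15) & 0x7F) << 1) | 1,                  # PTS[21..15], marker
        (pts >> 7) & 0xFF,                                # PTS[14..7]
        ((pts & 0x7F) << 1) | 1,                          # PTS[6..0], marker
    ])


def decode_pts(field):
    """Recover the 33-bit PTS from its 5-byte PES header form."""
    if len(field) != 5:
        raise ValueError("a PTS field is exactly 5 bytes long")
    return (
        (((field[0] >> 1) & 0x07) << 30)
        | (field[1] << 22)
        | ((field[2] >> 1) << 15)
        | (field[3] << 7)
        | (field[4] >> 1)
    )
```

Giving the subtitle packet the same or nearly the same value in this field as the RAP target video packet is what lets a multiplexer that orders packets by header time stamp place the two next to each other.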
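FIG. 15 shows a configuration example of the television receiver 200. The television receiver 200 includes a receiving unit 201, a TS analysis unit (demultiplexer) 202, a video decoder 203, a video superimposing unit 204, a panel drive circuit 205, and a display panel 206. It also includes an audio decoder 207, an audio output circuit 208, a speaker 209, and a subtitle decoder 210, as well as a CPU 221, a flash ROM 222, a DRAM 223, an internal bus 224, a remote control receiving unit 225, and a remote control transmitter 226.
In the above embodiment, TTML is used as the subtitle text information in a predetermined format having display timing information. However, the present technology is not limited to this, and other timed text information having information equivalent to TTML may also be used. For example, a derivative format of TTML may be used. Naturally, the present technology also applies in exactly the same way when the conventional bitmap method is used as the subtitle format.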
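(1) A transmission device including:
a video encoding unit that generates a video stream composed of video packets having encoded image data in a payload;
a subtitle encoding unit that generates a subtitle stream composed of subtitle packets having subtitle information in a payload;
a multiplexed stream generation unit that generates a multiplexed stream including the video stream and the subtitle stream; and
a transmission unit that transmits the multiplexed stream,
in which the multiplexed stream generation unit arranges the subtitle packet at a random access position.
(2) The transmission device according to (1), in which the subtitle encoding unit inserts, at the time stamp insertion position of the header of the subtitle packet, a time stamp having a value that is the same as or close to the value of the time stamp inserted at the time stamp insertion position of the header of the video packet at the random access position.
(3) The transmission device according to (2), in which the subtitle encoding unit inserts a unique time stamp indicating the display time of the subtitle into the header or payload of the subtitle packet.
(4) The transmission device according to (3), in which the multiplexed stream generation unit inserts, into the multiplexed stream, identification information indicating that the unique time stamp is inserted in the subtitle packet.
(5) The transmission device according to (4), in which information indicating the insertion position is added to the identification information.
(6) The transmission device according to any one of (1) to (5), in which the subtitle information is subtitle text information in a predetermined format having display timing information.
(7) The transmission device according to (6), in which the subtitle text information in the predetermined format is TTML or a derivative format of TTML.
(8) The transmission device according to any one of (1) to (7), in which the video packet and the subtitle packet are PES packets, and the multiplexed stream is an MPEG2 transport stream.
(9) A transmission method including:
a video encoding step of generating a video stream composed of video packets having encoded image data in a payload;
a subtitle stream generation step of generating a subtitle stream composed of subtitle packets having subtitle information in a payload;
a multiplexed stream generation step of generating a multiplexed stream including the video stream and the subtitle stream; and
a transmission step of transmitting the multiplexed stream by a transmission unit,
in which, in the multiplexed stream generation step, the subtitle packet is arranged at a random access position.
(10) A receiving device including:
a receiving unit that receives a multiplexed stream including a video stream composed of video packets having encoded image data in a payload and a subtitle stream composed of subtitle packets having subtitle information in a payload,
the subtitle packet being arranged at a random access position in the multiplexed stream; and
a processing unit that extracts the subtitle packet from the multiplexed stream together with the video packet at the random access position and performs a subtitle display process.
(11) The receiving device according to (10), in which
a first time stamp having a value that is the same as or close to the value of the time stamp inserted at the time stamp insertion position of the header of the video packet at the random access position is inserted at the time stamp insertion position of the header of the subtitle packet,
a second time stamp indicating the display time of the subtitle is inserted in the header or payload of the subtitle packet, and
the processing unit performs the subtitle display process with reference to the second time stamp in the normal playback mode.
(12) The receiving device according to (11), in which
identification information indicating that the second time stamp is inserted in the subtitle packet is inserted in the multiplexed stream, and
the processing unit extracts and uses the second time stamp from the subtitle packet based on the identification information.
(13) A receiving method including:
a receiving step of receiving, by a receiving unit, a multiplexed stream including a video stream composed of video packets having encoded image data in a payload and a subtitle stream composed of subtitle packets having subtitle information in a payload,
the subtitle packet being arranged at a random access position in the multiplexed stream; and
a processing step of extracting the subtitle packet from the multiplexed stream together with the video packet at the random access position and performing a subtitle display process.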
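100 ... Broadcast transmission system
110 ... Stream generation unit
111 ... Control unit
112 ... Video encoder
113 ... Audio encoder
114 ... Text format conversion unit
115 ... Subtitle encoder
116 ... TS formatter (multiplexer)
120 ... Timing management unit
121 ... Video timing management unit
122 ... Audio timing management unit
123 ... RAP timing management unit
124 ... Subtitle display timing management unit
200 ... Television receiver
201 ... Receiving unit
201a ... Storage medium
202 ... TS analysis unit
203 ... Video decoder
204 ... Video superimposing unit
205 ... Panel drive circuit
206 ... Display panel
207 ... Audio decoder
208 ... Audio output circuit
209 ... Speaker
210 ... Subtitle decoder
221 ... CPU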
Claims (13)
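- 1. A transmission device comprising:
a video encoding unit that generates a video stream composed of video packets having encoded image data in a payload;
a subtitle encoding unit that generates a subtitle stream composed of subtitle packets having subtitle information in a payload;
a multiplexed stream generation unit that generates a multiplexed stream including the video stream and the subtitle stream; and
a transmission unit that transmits the multiplexed stream,
wherein the multiplexed stream generation unit arranges the subtitle packets at random access positions.
- 2. The transmission device according to claim 1, wherein the subtitle encoding unit inserts, at the time stamp insertion position of the header of the subtitle packet, a time stamp whose value is the same as or close to that of the time stamp inserted at the time stamp insertion position of the header of the video packet at the random access position.
- 3. The transmission device according to claim 2, wherein the subtitle encoding unit inserts a unique time stamp indicating the display time of the subtitle into the header or the payload of the subtitle packet.
- 4. The transmission device according to claim 3, wherein the multiplexed stream generation unit inserts, into the multiplexed stream, identification information indicating that the unique time stamp is inserted in the subtitle packet.
- 5. The transmission device according to claim 4, wherein information indicating the insertion position is added to the identification information.
- 6. The transmission device according to claim 1, wherein the subtitle information is subtitle text information in a predetermined format having display timing information.
- 7. The transmission device according to claim 6, wherein the subtitle text information in the predetermined format is TTML or a format derived from TTML.
- 8. The transmission device according to claim 1, wherein the video packets and the subtitle packets are PES packets, and the multiplexed stream is an MPEG-2 transport stream.
- 9. A transmission method comprising:
a video encoding step of generating a video stream composed of video packets having encoded image data in a payload;
a subtitle stream generation step of generating a subtitle stream composed of subtitle packets having subtitle information in a payload;
a multiplexed stream generation step of generating a multiplexed stream including the video stream and the subtitle stream; and
a transmission step of transmitting the multiplexed stream by a transmission unit,
wherein, in the multiplexed stream generation step, the subtitle packets are arranged at random access positions.
- 10. A reception device comprising a reception unit that receives a multiplexed stream including a video stream composed of video packets having encoded image data in a payload and a subtitle stream composed of subtitle packets having subtitle information in a payload,
wherein, in the multiplexed stream, the subtitle packets are arranged at random access positions,
the reception device further comprising a processing unit that extracts the subtitle packet from the multiplexed stream together with the video packet at the random access position and performs subtitle display processing.
- 11. The reception device according to claim 10, wherein a first time stamp is inserted at the time stamp insertion position of the header of the subtitle packet, the first time stamp having a value that is the same as or close to that of the time stamp inserted at the time stamp insertion position of the header of the video packet at the random access position,
a second time stamp indicating the display time of the subtitle is inserted in the header or the payload of the subtitle packet, and
the processing unit, in a normal playback mode, performs the subtitle display processing with reference to the second time stamp.
- 12. The reception device according to claim 11, wherein identification information indicating that the second time stamp is inserted in the subtitle packet is inserted in the multiplexed stream, and
the processing unit extracts the second time stamp from the subtitle packet and uses it on the basis of the identification information.
- 13. A reception method comprising a reception step of receiving, by a reception unit, a multiplexed stream including a video stream composed of video packets having encoded image data in a payload and a subtitle stream composed of subtitle packets having subtitle information in a payload,
wherein, in the multiplexed stream, the subtitle packets are arranged at random access positions,
the reception method further comprising a processing step of extracting the subtitle packet from the multiplexed stream together with the video packet at the random access position and performing subtitle display processing.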
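The transmitter side of claims 1 to 4 can be sketched in Python as follows. This is only a sketch under stated assumptions: `VideoPacket`, `SubtitlePacket`, `multiplex`, and the `subtitle_has_own_timestamp` flag are hypothetical names, and the policy of attaching the next pending subtitle to each random access position is a simplification chosen for illustration rather than the scheduling the specification describes.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple, Union


@dataclass
class VideoPacket:
    pts: int               # time stamp in the packet header
    is_rap: bool = False   # True at a random access position


@dataclass
class SubtitlePacket:
    pts: int               # header time stamp, copied from the random-access video packet
    display_ts: int        # separately carried time stamp giving the subtitle display time
    text: str              # subtitle payload (for example a TTML fragment)


def multiplex(
    video: List[VideoPacket],
    subtitles: List[Tuple[int, str]],
) -> Tuple[List[Union[VideoPacket, SubtitlePacket]], Dict[str, bool]]:
    """Interleave subtitle packets directly after random-access video packets.

    subtitles is a list of (display_ts, text) pairs in display order.
    Assumption: each random access position takes the next pending subtitle.
    """
    out: List[Union[VideoPacket, SubtitlePacket]] = []
    pending = list(subtitles)
    for v in video:
        out.append(v)
        if v.is_rap and pending:
            display_ts, text = pending.pop(0)
            # Header time stamp equals the video time stamp at the random
            # access position; the display time travels separately.
            out.append(SubtitlePacket(pts=v.pts, display_ts=display_ts, text=text))
    # Hypothetical stand-in for the identification information of claim 4.
    info = {"subtitle_has_own_timestamp": True}
    return out, info


video = [VideoPacket(0, is_rap=True), VideoPacket(100), VideoPacket(200, is_rap=True)]
stream, info = multiplex(video, [(50, "Hello"), (250, "World")])
for pkt in stream:
    print(pkt)
```

Copying the video time stamp into the subtitle header is what lets a receiver pull both packets out together at a random access position, while the separate display time keeps normal playback timing intact.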
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP16752363.8A EP3261353B1 (en) | 2015-02-20 | 2016-02-09 | Transmission device, transmission method, reception device, and reception method |
US15/543,424 US10225589B2 (en) | 2015-02-20 | 2016-02-09 | Transmission apparatus, transmission method, reception apparatus, and reception method |
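CN201680009873.6A CN107211170B (zh) | 2015-02-20 | 2016-02-09 | Transmission device, transmission method, reception device, and reception method |
JP2017500621A JP6729547B2 (ja) | 2015-02-20 | 2016-02-09 | Transmission device, transmission method, reception device, and reception method |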
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2015031800 | 2015-02-20 | ||
JP2015-031800 | 2015-02-20 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016132977A1 (ja) | 2016-08-25 |
Family
ID=56689236
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2016/053869 WO2016132977A1 (ja) | 2015-02-20 | 2016-02-09 | Transmission device, transmission method, reception device, and reception method |
Country Status (5)
Country | Link |
---|---|
US (1) | US10225589B2 (ja) |
EP (1) | EP3261353B1 (ja) |
JP (5) | JP6729547B2 (ja) |
CN (1) | CN107211170B (ja) |
WO (1) | WO2016132977A1 (ja) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP4322238A3 (en) | 2010-01-19 | 2024-05-15 | Sirigen II Limited | Novel reagents for directed biomarker signal amplification |
US10225600B2 (en) * | 2015-04-13 | 2019-03-05 | Sony Corporation | Transmission device, transmission method, reproduction device and reproduction method |
EP3470976A1 (en) * | 2017-10-12 | 2019-04-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method and apparatus for efficient delivery and usage of audio messages for high quality of experience |
CN108111896B (zh) * | 2018-01-16 | 2020-05-05 | 北京三体云联科技有限公司 | Subtitle synchronization method and device |
CN114302215B (zh) * | 2021-12-29 | 2023-09-29 | 北京奕斯伟计算技术股份有限公司 | Video data stream decoding system, method, electronic device, and medium |
EP4221169A1 (en) * | 2022-01-31 | 2023-08-02 | Koa Health B.V. Sucursal en España | System and method for monitoring communication quality |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
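JP2008193203A (ja) * | 2007-02-01 | 2008-08-21 | Mitsubishi Electric Corp | Digital video information data generation device, digital video information recording device, digital video information reproduction device, and digital video information data generation method |
WO2009057298A1 (ja) * | 2007-11-01 | 2009-05-07 | Panasonic Corporation | Recording medium, reproduction device, recording device, reproduction method, and recording method |
JP2009165169A (ja) * | 2009-04-20 | 2009-07-23 | Sony Corp | Information processing device and method, program, and recording medium |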
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004032607A (ja) * | 2002-06-28 | 2004-01-29 | Sanyo Electric Co Ltd | Digital video reproduction device |
JP4781600B2 (ja) * | 2002-09-24 | 2011-09-28 | ソニー株式会社 | Information processing device and method, program, and recording medium |
JP4275611B2 (ja) * | 2004-12-02 | 2009-06-10 | ソニー株式会社 | Reproduction device, reproduction method and reproduction program, recording medium, and data structure |
KR100782835B1 (ko) * | 2005-01-29 | 2007-12-06 | 삼성전자주식회사 | Method and apparatus for adjusting the output time and output priority of caption information |
US8745687B2 (en) * | 2005-02-15 | 2014-06-03 | Sony Corporation | Digital closed caption transport in standalone stream |
JP4311570B2 (ja) * | 2005-07-01 | 2009-08-12 | 株式会社ソニー・コンピュータエンタテインメント | Reproduction device, video decoding device, and synchronized reproduction method |
US20070154176A1 (en) * | 2006-01-04 | 2007-07-05 | Elcock Albert F | Navigating recorded video using captioning, dialogue and sound effects |
JP5685969B2 (ja) | 2011-02-15 | 2015-03-18 | ソニー株式会社 | Display control method and display control device |
US9584837B2 (en) * | 2011-12-16 | 2017-02-28 | Sony Corporation | Receiving device and method of controlling the same, distribution device and distribution method, program, and distribution system |
US20140078248A1 (en) * | 2012-01-13 | 2014-03-20 | Sony Corporation | Transmitting apparatus, transmitting method, receiving apparatus, and receiving method |
KR101781887B1 (ko) * | 2014-02-23 | 2017-09-26 | 엘지전자 주식회사 | Method and apparatus for transmitting and receiving broadcast signals |
2016
- 2016-02-09 WO PCT/JP2016/053869 patent/WO2016132977A1/ja active Application Filing
- 2016-02-09 CN CN201680009873.6A patent/CN107211170B/zh active Active
- 2016-02-09 US US15/543,424 patent/US10225589B2/en active Active
- 2016-02-09 JP JP2017500621A patent/JP6729547B2/ja active Active
- 2016-02-09 EP EP16752363.8A patent/EP3261353B1/en active Active
2020
- 2020-06-25 JP JP2020110019A patent/JP6927374B2/ja active Active
2021
- 2021-08-04 JP JP2021127893A patent/JP7176598B2/ja active Active
2022
- 2022-11-09 JP JP2022179816A patent/JP7578129B2/ja active Active
2024
- 2024-08-26 JP JP2024144537A patent/JP2024152974A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
JP7176598B2 (ja) | 2022-11-22 |
JP2024152974A (ja) | 2024-10-25 |
JP7578129B2 (ja) | 2024-11-06 |
JP2020174367A (ja) | 2020-10-22 |
EP3261353A1 (en) | 2017-12-27 |
EP3261353A4 (en) | 2018-07-18 |
CN107211170B (zh) | 2020-07-10 |
JP2023001309A (ja) | 2023-01-04 |
US20180007406A1 (en) | 2018-01-04 |
US10225589B2 (en) | 2019-03-05 |
CN107211170A (zh) | 2017-09-26 |
JP6927374B2 (ja) | 2021-08-25 |
JP6729547B2 (ja) | 2020-07-22 |
JPWO2016132977A1 (ja) | 2017-11-30 |
EP3261353B1 (en) | 2021-03-31 |
JP2021184626A (ja) | 2021-12-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7176598B2 (ja) | Transmission method | |
JP6825558B2 (ja) | Transmission device, transmission method, reproduction device, and reproduction method | |
RU2701172C2 (ru) | Transmission device, transmission method, reception device, and reception method | |
JP6760296B2 (ja) | Transmission device, transmission method, reproduction device, and reproduction method | |
US11470296B2 (en) | Transmitter, transmission method, receiver, and reception method | |
EP3306941B1 (en) | Transmission device, transmission method, media processing device and media processing method | |
WO2017126509A1 (ja) | Transmission device, transmission method, reception device, and reception method |
Legal Events
Code | Title | Description |
---|---|---|
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 16752363; Country of ref document: EP; Kind code of ref document: A1 |
ENP | Entry into the national phase | Ref document number: 2017500621; Country of ref document: JP; Kind code of ref document: A |
WWE | WIPO information: entry into national phase | Ref document number: 15543424; Country of ref document: US |
REEP | Request for entry into the European phase | Ref document number: 2016752363; Country of ref document: EP |
NENP | Non-entry into the national phase | Ref country code: DE |