WO2016042765A1 - 映像音声処理装置、映像音声処理方法およびプログラム - Google Patents
映像音声処理装置、映像音声処理方法およびプログラム Download PDFInfo
- Publication number
- WO2016042765A1 WO2016042765A1 PCT/JP2015/004718 JP2015004718W WO2016042765A1 WO 2016042765 A1 WO2016042765 A1 WO 2016042765A1 JP 2015004718 W JP2015004718 W JP 2015004718W WO 2016042765 A1 WO2016042765 A1 WO 2016042765A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- video
- moving image
- audio
- sound output
- volume
- Prior art date
Links
- 238000003672 processing method Methods 0.000 title claims description 5
- 230000005236 sound signal Effects 0.000 claims abstract description 74
- 230000007704 transition Effects 0.000 claims description 17
- 230000008859 change Effects 0.000 claims description 16
- 238000000034 method Methods 0.000 description 27
- 230000004048 modification Effects 0.000 description 27
- 238000012986 modification Methods 0.000 description 27
- 238000010586 diagram Methods 0.000 description 24
- 230000008569 process Effects 0.000 description 22
- 238000000926 separation method Methods 0.000 description 16
- 230000015572 biosynthetic process Effects 0.000 description 10
- 238000003786 synthesis reaction Methods 0.000 description 10
- 230000002123 temporal effect Effects 0.000 description 6
- 230000007423 decrease Effects 0.000 description 5
- 238000007792 addition Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000010365 information processing Effects 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44008—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/485—End-user interface for client configuration
- H04N21/4852—End-user interface for client configuration for modifying audio parameters, e.g. switching between mono and stereo
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
- H04N5/445—Receiver circuitry for the reception of television signals according to analogue transmission standards for displaying additional information
- H04N5/45—Picture in picture, e.g. displaying simultaneously another television channel in a region of the screen
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
- H04N5/60—Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals
Definitions
- the present disclosure relates to a video / audio processing apparatus and a video / audio processing method for generating a video for displaying a plurality of moving images on one display screen.
- Patent Document 1 discloses an information processing apparatus using a technique for displaying a plurality of moving images in one display screen.
- the information processing apparatus determines the audio output coordinates of each program so as to be interlocked with the program display scroll operation, and synthesizes the audio of each program based on the audio output coordinates.
- a video / audio processing apparatus that generates video that displays a plurality of moving images on one display screen, it is desirable that the user can generate audio that is easy to hear.
- the present disclosure provides a video / audio processing apparatus and a video / audio processing method capable of generating audio that can be easily heard by a user.
- the video / audio processing apparatus includes a video generation unit, a selection unit, and a volume adjustment unit.
- the video generation unit generates a video signal of a display video in which a region where a plurality of moving images are displayed automatically moves in a predetermined direction within the display screen.
- the selection unit selects an audio signal of one moving image from the plurality of moving images according to the positions of the plurality of moving images in the display screen.
- the volume adjustment unit adjusts the volume of each audio signal of the plurality of moving images so that the audio signal selected by the selection unit is output at a volume higher than that of the other audio signals.
- the video / audio processing device can generate audio that is easy for the user to hear when a plurality of moving images are displayed in one display screen.
- FIG. 1 is a block diagram illustrating a configuration example of a video / audio processing apparatus according to the first embodiment.
- FIG. 2 is a diagram schematically illustrating an example of a display image generated by the video / audio processing apparatus according to the first embodiment.
- FIG. 3 is a diagram schematically showing an outline of the audio adjustment process performed by the video / audio processing apparatus according to the first embodiment.
- FIG. 4 is a diagram schematically showing an example of the operation of the video / audio processing apparatus when two moving images are included in the sound output area in the first embodiment.
- FIG. 5 is a diagram schematically illustrating an example of a temporal change in volume of each moving image when two moving images are included in the sound output area in the first embodiment.
- FIG. 1 is a block diagram illustrating a configuration example of a video / audio processing apparatus according to the first embodiment.
- FIG. 2 is a diagram schematically illustrating an example of a display image generated by the video / audio processing apparatus according to the first embodiment.
- FIG. 3 is
- FIG. 6 is a flowchart illustrating an example of a video / audio generation process executed by the video / audio processing apparatus according to the first embodiment.
- FIG. 7 is a flowchart illustrating an example of the initial volume setting process according to the first embodiment.
- FIG. 8 is a diagram schematically illustrating an example of a temporal change in volume of a moving image included in the sound output area in Modification 1 of the other embodiment.
- FIG. 9 is a diagram schematically illustrating an example of the operation of the video / audio processing device when two moving images are included in the sound output area in the second modification of the other embodiment.
- FIG. 10 is a diagram schematically illustrating an example of a temporal change in volume of each moving image when two moving images are included in the sound output area in the second modification of the other embodiment.
- FIG. 11 is a flowchart illustrating an example of a video / audio generation process executed by the video / audio processing apparatus according to Modification 2 of the other embodiment.
- FIG. 12 is a diagram schematically illustrating an outline of the audio adjustment processing in the third modification of the other embodiment.
- FIG. 13 is a flowchart illustrating an example of a video / audio generation process executed by the video / audio processing apparatus according to Modification 3 of the other embodiment.
- FIG. 14 is a block diagram illustrating a configuration example of a video / audio processing apparatus according to Modification 4 of the other embodiment.
- FIG. 15 is a diagram schematically illustrating an example of a display image in Modification 4 of the other embodiment.
- FIG. 1 is a block diagram showing a configuration example of the video / audio processing apparatus 100 according to the first embodiment.
- the video / audio processing apparatus 100 shown in FIG. 1 generates a video signal in which a plurality of moving images are displayed in one display screen.
- the video / audio processing apparatus 100 is mounted on a television, a recorder, a personal computer, a portable terminal, a smartphone, or the like.
- the video / audio processing apparatus 100 includes a video generation unit 101, an operation reception unit 102, a control unit 103, a video output unit 104, a selection unit 105, a volume adjustment unit 106, and an audio output unit 107.
- the video generation unit 101 displays a video signal of a display video (hereinafter referred to as “scroll”) in which a region where a plurality of moving images are displayed automatically moves in a predetermined direction within the display screen (hereinafter referred to as “scroll”). Generated and output "display video signal”. In addition, the video generation unit 101 outputs a plurality of audio signals corresponding to the plurality of moving images.
- the video generation unit 101 includes a channel selection unit 111, a broadcast signal separation unit 112, a content data separation unit 113, a video reproduction unit 114, an audio reproduction unit 115, an OSD (On Screen Display) generation unit 116, and a display composition. Part 117.
- the channel selection unit 111 selects a broadcast program signal to be reproduced from a plurality of broadcast signals received by the antenna 131, and outputs the selected broadcast program signal to the broadcast signal separation unit 112. For example, the channel selection unit 111 outputs a TS (transport stream) to the broadcast signal separation unit 112.
- the broadcast signal separation unit 112 separates video data and audio data from the TS output from the channel selection unit 111, outputs the video data to the video reproduction unit 114, and outputs the audio data to the audio reproduction unit 115.
- the content data separation unit 113 acquires moving image content from the storage device 132, outputs video data of the acquired moving image content to the video reproduction unit 114, and outputs audio data to the audio reproduction unit 115.
- the storage device 132 is a large-capacity storage device such as an HDD (Hard Disk Drive). The storage device 132 accumulates a plurality of moving image contents and outputs the moving image content selected by the user.
- the video reproduction unit 114 reproduces the video data of the broadcast program output from the broadcast signal separation unit 112 and the video data of the moving image content output from the content data separation unit 113 to generate a plurality of video signals, The generated plurality of video signals are output to the display composition unit 117.
- the audio reproduction unit 115 reproduces the audio data of the broadcast program output from the broadcast signal separation unit 112 and the audio data of the moving image content output from the content data separation unit 113 to generate a plurality of audio signals, The plurality of generated audio signals are output to the volume adjustment unit 106.
- the audio / video processing apparatus 100 is configured to be able to reproduce both video content and broadcast programs, but the present disclosure is not limited to this configuration.
- the video / audio processing apparatus 100 may be configured to reproduce only one of them.
- the video reproduction unit 114 and the audio reproduction unit 115 reproduce only one of the plurality of moving image contents and the plurality of broadcast programs.
- the video reproduction unit 114 and the audio reproduction unit 115 may reproduce only one of a plurality of moving image contents and a plurality of broadcast programs.
- FIG. 1 illustrates a configuration example in which the antenna 131 and the storage device 132 are both installed outside the video / audio processing device 100, but the present disclosure is not limited to this configuration. At least one of the antenna 131 and the storage device 132 may be included in the video / audio processing device 100.
- the video / audio processing apparatus 100 may be configured to hold the moving image content by itself and reproduce the stored moving image content. Further, the video / audio processing apparatus 100 may be configured to acquire, for example, moving image content stored in a moving image server or the like via the Internet or the like.
- the OSD generation unit 116 generates an OSD video signal for displaying the OSD video on the display screen.
- the display synthesis unit 117 generates a display video signal by synthesizing a plurality of video signals and OSD video signals reproduced by the video reproduction unit 114, and outputs the generated display video signal.
- the generated display video signal is a video signal for displaying a display video including a plurality of moving images on the display screen. In this way, the display synthesis unit 117 generates a display video signal in which a plurality of video signals reproduced by the video reproduction unit 114 are superimposed on each other.
- the operation reception unit 102 receives a user operation.
- the user operation includes, for example, a direct operation on the video / audio processing apparatus 100 by a user, a remote operation on the video / audio processing apparatus 100 using a remote controller (not shown), and the like.
- the control unit 103 controls the video generation unit 101 according to the user operation received by the operation receiving unit 102.
- the control unit 103 specifies the broadcast signal to be selected by the channel selection unit 111, specifies the moving image content to be acquired by the content data separation unit 113, specifies the broadcast program to be processed by the broadcast signal separation unit 112, and the OSD generation unit 116.
- the video output unit 104 outputs the display video signal generated by the display synthesis unit 117 to a display unit such as a monitor.
- FIG. 1 illustrates a configuration example in which the monitor is installed outside the video / audio processing apparatus 100, but the present disclosure is not limited to this configuration.
- the video / audio processing apparatus 100 may be configured to include a display unit and display a display video on the display unit.
- the selection unit 105 selects one audio signal from among the plurality of audio signals output from the audio reproduction unit 115 based on the display video signal output from the display synthesis unit 117. That is, the selection unit 105 selects an audio signal of one moving image from the plurality of moving images according to the positions of the plurality of moving images in the display screen.
- the audio signal is also simply referred to as “audio”.
- the volume adjusting unit 106 adjusts the volume of the plurality of audio signals output from the audio reproducing unit 115 to generate an output audio signal, and outputs the generated output audio signal to the audio output unit 107. At this time, the volume adjustment unit 106 adjusts the volume of each audio signal of the plurality of moving images so that the audio signal selected by the selection unit 105 is output at a higher volume than the other audio signals.
- the audio output unit 107 outputs the output audio signal generated by the volume adjustment unit 106 to the speaker.
- 1 illustrates a configuration example in which the speaker is installed outside the audio / video processing apparatus 100, the present disclosure is not limited to this configuration.
- the video / audio processing apparatus 100 may include a speaker and be configured to output sound from the speaker.
- FIG. 2 is a diagram schematically illustrating an example of a display image generated by the video / audio processing apparatus 100 according to the first embodiment.
- FIG. 2 shows an example in which an image in which the moving image 202A, the moving image 202B, and the moving image 202C move from the right to the left of the screen 201 as time passes is displayed on the screen 201.
- the moving images 202A to 202C correspond to a plurality of video signals generated by the video playback unit 114.
- Each of the moving images 202A to 202C is, for example, a broadcast program, moving image content recorded or shot by a user, or moving image content acquired from an external moving image server or the like via the Internet or the like.
- an image including a plurality of moving images (for example, moving images 202A to 202C) is automatically scrolled from the right to the left on the screen 201. . Accordingly, a plurality of moving images are sequentially displayed on the screen 201.
- other information may be displayed on the screen 201 outside the moving image display area.
- the other information may be, for example, a still image, text information, menu, icon, or link information (for example, URL (Uniform Resource Locator)).
- URL Uniform Resource Locator
- FIG. 2 shows an example in which three moving images 202A to 202C are displayed on the screen 201.
- the number of moving images displayed on the screen 201 may be two or less, and may be four or more. It may be.
- the sizes of the moving images displayed on the screen 201 may be the same or different from each other.
- the arrangement positions of the moving images 202A to 202C on the screen 201 shown in FIG. 2 are merely an example, and each moving image may be arranged appropriately.
- FIG. 2 shows an example in which an image including a plurality of moving images (for example, moving images 202A to 202C) scrolls from the right to the left of the screen 201 in the display video generated by the video / audio processing apparatus 100.
- the image may be scrolled from the left to the right of the screen 201, or may be scrolled from the top to the bottom of the screen 201 or from the bottom to the top.
- the image may be scrolled in an oblique direction.
- FIG. 2 shows an operation example in which the image is linearly scrolled. However, the image may be scrolled while drawing a predetermined locus such as a wave shape.
- the entire screen may be scrolled, or only a part of the area including the plurality of moving images may be scrolled.
- the video / audio processing apparatus 100 operates, for example, as shown in FIG. 2 in which a moving image is automatically scrolled (hereinafter referred to as “automatic scrolling operation”) in a display video when a user operation is not performed for a certain period of time. May be started).
- the audio / video processing apparatus 100 may cancel the automatic scroll operation when a user operation is performed during the automatic scroll operation.
- the video / audio processing apparatus 100 may start or cancel the automatic scroll operation when a predetermined user operation is received.
- FIG. 3 is a diagram schematically showing an outline of the audio adjustment process performed by the video / audio processing apparatus 100 according to the first embodiment.
- (A) of FIG. 3 is a figure which shows typically a mode that the moving image 202A and a part of moving image 202B are displayed on the screen 201.
- FIG. 3A schematically shows the display area of the screen 201.
- the horizontal axis represents the horizontal direction (longitudinal direction) of the screen 201, and the vertical axis represents the vertical direction (short side) of the screen 201. Direction).
- FIG. 3 is a diagram schematically showing an outline of the audio adjustment process performed by the video / audio processing apparatus 100 according to the first embodiment.
- (A) of FIG. 3 is a figure which shows typically a mode that the moving image 202A and a part of moving image 202B are displayed on the screen 201.
- FIG. 3A schematically shows the display area of the screen 201.
- the horizontal axis represents the horizontal direction (longit
- FIG. 3A is a diagram schematically showing the volume of the moving image included in the sound output area.
- the vertical axis of FIG. 3B represents the volume level.
- the audio / video processing apparatus 100 provides a sound output area 203 in the screen 201 as shown in FIG.
- the sound output area 203 is an area from the position X1 to the position X2 shown in FIG. 3A, and is an area where the audio / video processing apparatus 100 outputs the sound of a moving image.
- the sound output area 203 is set near the center of the screen 201 in the audio / video processing apparatus 100, but the present disclosure does not limit the sound output area 203 to the range illustrated in FIG. 3 at all.
- the sound output area 203 may be set so as to cover the entire screen 201.
- the moving image 202 ⁇ / b> A and a part of the moving image 202 ⁇ / b> B are displayed on the screen 201, the moving image 202 ⁇ / b> A is included in the sound output region 203, and the moving image 202 ⁇ / b> B is included in the sound output region 203. Absent.
- the video / audio processing apparatus 100 selects the moving image 202A included in the sound output area 203 as a moving image (hereinafter, also referred to as “target moving image”) to be output with sound.
- volume V2 the sound volume of the target moving image
- volume V1 the sound volume of the moving image other than the target moving image
- the volume V1 may be 0 (zero). That is, the video / audio processing apparatus 100 may operate so as to output only the sound of the target moving image and not output the sound other than the target moving image.
- the video / audio processing apparatus 100 when a moving image is included in the sound output region 203 and becomes a target moving image, gradually increases the volume of the moving image from the volume V1 to the volume V2. When the target moving image moves from the sound output area 203 to the outside of the sound output area 203 and is no longer the target moving image, the video / audio processing apparatus 100 gradually decreases the volume of the moving image from the volume V2 and returns it to the volume V1. . It is desirable that the time until the volume of the target moving image reaches the volume V2 from the volume V1 and the time until the volume reaches the volume V1 from the volume V2 are set appropriately so that the user does not feel uncomfortable.
- volume graph shown in FIG. 3 indicates the gain that the volume adjustment unit 106 multiplies to the original audio signal.
- the position of the moving image used for determination is the left end of the moving image. That is, when the left end of the moving image is included in the sound output region 203, the selection unit 105 of the video / audio processing apparatus 100 determines that the moving image is included in the sound output region 203, and the left end of the moving image is the sound output region. When moving out of the sound output area 203 from 203, it is determined that the moving image is no longer included in the sound output area 203.
- the position of the moving image used for determination is not limited to the left end of the moving image. For example, the center of the moving image may be used for the determination, or the right end of the moving image may be used for the determination. Alternatively, positions other than those may be used for the determination.
- volume V1 is set to 0 (zero)
- the volume V1 is not limited to 0 at all, and may be another numerical value.
- the video / audio processing apparatus 100 When two moving images are simultaneously included in the sound output area 203, the video / audio processing apparatus 100 outputs the sound of the moving image included in the sound output area 203 first.
- the video / audio processing apparatus 100 moves a moving image that is outputting sound from the sound output region 203 to the outside of the sound output region 203 and is not included in the sound output region 203. Then, switching to the moving image included in the sound output area 203 is performed later.
- the selection unit 105 selects the sound signal of the moving image included in the sound output region 203 earliest from among the plurality of moving images.
- the selection unit 105 outputs the sound at that time.
- the audio signal of the moving image included in the sound output region 203 is selected earliest among the plurality of moving images included in the region 203.
- FIG. 4 is a diagram schematically illustrating an example of the operation of the video / audio processing apparatus 100 when two moving images (the moving image 202A and the moving image 202B) are included in the sound output area 203 in the first embodiment.
- FIG. 4 schematically shows a display image on the screen 201, and it is assumed that time elapses in the order of (a), (b), and (c).
- the scroll direction of the moving image is indicated by a white arrow. Also, in FIG.
- the moving image 202A is first included in the sound output area 203, and the moving image 202B (the left end of the moving image 202B) is moved before the moving image 202A (the left end of the moving image 202A) moves from the sound output area 203 to the outside of the sound output area 203.
- Is included in the sound output area 203 and then an operation example when the moving image 202 ⁇ / b> A (the left end of the moving image 202 ⁇ / b> A) moves out of the sound output area 203 and is not included in the sound output area 203 is shown.
- the white arrow indicating the scroll direction in FIG. 4 is shown for convenience and is not displayed on the screen 201.
- FIG. 5 is a diagram schematically illustrating an example of a temporal change in volume of each moving image when two moving images (moving image 202A and moving image 202B) are included in the sound output area 203 in the first embodiment.
- FIG. 4A shows a state from time T1 to time T2 in FIG. 4 and 5, the moving image 202A (the left end of the moving image 202A) is included in the sound output region 203 at time T1, and the moving image 202B (the left end of the moving image 202B) is included in the sound output region 203 at time T2.
- the moving image 202A (the left end of the moving image 202A) moves out of the sound output region 203 and is not included in the sound output region 203 at T3. Therefore, in the period from time T1 to time T2, the moving image 202A is included in the sound output region 203, and the moving image 202B is outside the sound output region 203 and is not included in the sound output region 203.
- the audio / video processing apparatus 100 outputs the audio of the moving image 202A.
- FIG. 4B shows a state from time T2 to time T3 in FIG.
- the video / audio processing apparatus 100 continues to output the sound of the moving image 202A and does not output the sound of the moving image 202B.
- FIG. 4C shows a state after time T3 in FIG. Since the moving image 202A moves from the sound output area 203 to the outside of the sound output area 203 at time T3 and is not included in the sound output area 203, the video / audio processing apparatus 100 selects a moving image to be output as an audio at time T3. Switching from the moving image 202A to the moving image 202B. At this time, the volume adjustment unit 106 of the video / audio processing apparatus 100 gradually lowers the sound of the moving image 202A to fade out, and gradually increases the sound of the moving image 202B to fade in.
- the volume adjustment unit 106 gradually decreases the volume of the unselected audio and gradually increases the volume of the newly selected audio when switching the audio selected by the selection unit 105 to another audio. To do.
- FIG. 6 is a flowchart illustrating an example of a video / audio generation process executed by the video / audio processing apparatus 100 according to the first embodiment.
- the video reproduction unit 114 starts reproduction of video data of a plurality of moving image contents
- the audio reproduction unit 115 starts reproduction of audio data of the plurality of moving image contents (step S101).
- the display composition unit 117 generates a display image including a plurality of moving images reproduced in step S101 (step S102).
- the display video generated in step S102 may include, for example, an OSD video.
- step S103 the selection unit 105 and the volume adjustment unit 106 perform an initial volume setting process. Note that, at the time when step S103 is executed, image scrolling has not started in the display video.
- step S103 the initial volume setting process in step S103 will be described with reference to FIG.
- FIG. 7 is a flowchart showing an example of the initial volume setting process in the first embodiment.
- the selection unit 105 determines whether or not a moving image exists in the sound output area 203 (step S121).
- step S121 If it is determined in step S121 that there is no moving image in the sound output area 203 (No in step S121), the selection unit 105 does not select any moving image sound, and the volume adjustment unit 106 Also does not output the audio of the video.
- this operation is an operation when the volume V1 is set to zero. If the volume V1 is not 0, the volume adjustment unit 106 adjusts the volume of the sound of the moving image outside the sound output area 203 (that is, the sound not selected by the selection unit 105) to a preset volume V1. .
- the selection unit 105 selects the sound of the moving image included in the sound output area 203 (step S122). If a plurality of moving images are included in the sound output area 203, the selection unit 105 determines that the sound output area 203 is included in the sound output area 203 earliest based on the scroll direction of the display video among the plurality of moving images. Select the audio of the video to be played. In the example illustrated in FIG. 4, the selection unit 105 selects the sound of the moving image located on the leftmost side.
- the volume adjustment unit 106 sets the volume of the audio selected by the selection unit 105 to the volume V2, and sets the volume of the audio not selected by the selection unit 105 to a volume V1 (for example, 0) smaller than the volume V2. Setting is made (step S123).
- the volume adjusting unit 106 generates an output audio signal by synthesizing the plurality of audio signals after the volume adjustment, and outputs the output audio signal to the audio output unit 107.
- the above processing is the initial volume setting processing.
- step S103 the processing after step S103 will be described.
- the display composition unit 117 scrolls the display video in a predetermined direction (for example, from the right to the left of the screen 201), and updates the display position of the display video (step S104).
- the selection unit 105 determines whether or not the current sound output moving image is no longer included in the sound output region 203 by the processing executed in step S104 (that is, updating of the display position of the moving image in the display video). Is determined (step S105).
- the determination in step S105 is performed based on, for example, whether or not the left end of the moving image is included in the sound output area 203.
- This determination criterion is the same in the determination performed in step S109 described later. This determination criterion is an example, and another determination criterion may be set.
- step S105 If it is determined in step S105 that the moving image currently being output has moved from the output region 203 outside the output region 203 and is no longer included in the output region 203 (Yes in step S105), the selection unit 105 cancels the selection of the sound of the moving image, and the volume adjusting unit 106 gradually decreases the volume of the moving image currently being output (step S106).
- the selection unit 105 determines whether or not the sound output area 203 includes a moving image other than the current moving image (hereinafter also referred to as “other moving images”) (step S107). ).
- step S107 If it is determined in step S107 that the sound output area 203 includes another moving image (Yes in step S107), the selection unit 105 selects the sound of the other moving image included in the sound output area 203. To do. Then, the volume adjustment unit 106 gradually increases the volume of the sound selected by the selection unit 105 (step S108).
- step S108 if there are a plurality of other moving images, the selection unit 105 determines that, among the plurality of other moving images, the earliest included in the sound output region 203 based on the scroll direction of the display video. Select the audio for the video to be played. For example, the selection unit 105 selects the sound of the moving image located on the leftmost side from among the plurality of other moving images.
- step S105 When it is determined in step S105 that the moving image currently being output is included in the output region 203 (No in step S105), or in step S107, another moving image is included in the output region 203. If it is determined that it is not (No in Step S107), or after Step S108, the selection unit 105 determines whether or not a new moving image is included in the sound output area 203 (Step S109).
- step S109 If it is determined in step S109 that a new moving image is included in the sound output region 203 (Yes in step S109), the selection unit 105 determines whether another sound image is included in the sound output region 203 or not. Is determined (step S110). That is, the selection unit 105 determines whether or not a moving image that is currently sounding is present in the sound output region 203.
- step S110 If it is determined in step S110 that the sound output region 203 does not include a moving image currently being output (No in step S110), the selection unit 105 newly adds a moving image included in the sound output region 203. Select the sound. Then, the volume adjustment unit 106 gradually increases the volume of the sound selected by the selection unit 105 (step S111).
- step S110 If it is determined in step S110 that the sound output area 203 includes a moving image that is currently being output (Yes in step S110), the selection unit 105 selects the current output included in the sound output area 203. Continue sound selection of moving images in sound. That is, the selection unit 105 does not select the sound of the moving image newly included in the sound output area 203.
- control unit 103 determines whether or not the automatic scrolling operation is continued (step S112).
- step S112 when it is determined that the automatic scrolling moving image is continued (Yes in step S112), the process returns to step S104, and the processes after step S104 are executed.
- step S112 If it is determined in step S112 that the automatic scrolling video has ended (No in step S112), the video / audio processing apparatus 100 ends the process. For example, the video / audio processing device 100 ends the process when receiving an operation by the user.
- step S106 the operation example in which the sound is faded in (the sound volume is gradually increased) in steps S108 and S111 and the sound is faded out (the sound volume is gradually decreased) in step S106 has been described.
- the present disclosure is not limited to this operation example.
- At least one of the fade-in of step S108, step S111 and the fade-out of step S106 may not be performed.
- processing similar to the initial voice setting processing in step S103 may be performed instead of the processing in steps S105 to S111.
- the operation example in which it is determined that a moving image is included in the sound output region 203 when the left end of the moving image is included in the sound output region 203 has been described. It is not limited to this operation example. This determination may be performed based on the center or the right end of the moving image. Alternatively, this determination may be performed based on other determination criteria (for example, the area of a moving image, etc.).
- the video / audio processing apparatus includes the video generation unit, the selection unit, and the volume adjustment unit.
- the video generation unit generates a video signal of a display video in which a region where a plurality of moving images are displayed automatically moves in a predetermined direction within the display screen.
- the selection unit selects an audio signal of one moving image from the plurality of moving images according to the positions of the plurality of moving images in the display screen.
- the volume adjustment unit adjusts the volume of each audio signal of the plurality of moving images so that the audio signal selected by the selection unit is output at a volume higher than that of the other audio signals.
- the video / audio processing apparatus 100 and the video / audio processing apparatus 100A described later are examples of the video / audio processing apparatus.
- the video generation unit 101 is an example of a video generation unit.
- the selection unit 105 is an example of a selection unit.
- the volume adjustment unit 106 is an example of a volume adjustment unit.
- the moving image 202A, the moving image 202B, and the moving image 202C are examples of a plurality of moving images.
- a screen 201 is an example of a display screen.
- the audio / video processing apparatus selects the audio signal of one moving image in the display image in which a plurality of moving images are automatically scrolled, and makes it easy for the user to hear the audio from the audio signal. You can adjust the volume of multiple voices. That is, the video / audio processing apparatus according to the present embodiment can generate audio that is easy for the user to hear when a plurality of moving images are displayed in one display screen.
- the selection unit may select the audio signal of the moving image included in the predetermined sound output area in the display screen from the plurality of moving images.
- the sound output area 203 is an example of a sound output area.
- the video / audio processing apparatus can select and output a sound signal of a moving image existing in an area easily recognized by the user. That is, the video / audio processing apparatus can appropriately select one sound from a plurality of moving image sounds and output the sound.
- the selection unit may select the audio signal of the moving image included in the sound output region 203 earliest among the plurality of moving images included in the sound output region.
- the video / audio processing apparatus can prevent the audio of the moving image in which the user is paying attention from being switched to the audio of another moving image in the middle.
- the volume adjustment unit gradually reduces the volume of the audio signal that has been deselected when the audio signal selected by the selection unit is switched to another audio signal, and is newly selected.
- the volume of the audio signal may be gradually increased.
- the video / audio processing apparatus can realize switching of audio that is easier for the user to hear when the audio signal selected by the selection unit is switched to another audio signal.
- the first embodiment has been described as an example of the technique disclosed in the present application.
- the technology in the present disclosure is not limited to this, and can also be applied to embodiments in which changes, replacements, additions, omissions, and the like are performed.
- the volume adjusting unit 106 of the video / audio processing apparatus 100 takes time required for the audio to fade out in accordance with the scrolling speed of the moving image in the display video (time until the audio is gradually reduced from the volume V2 to the volume V1). Alternatively, the time required for fade-in (the time until the sound is gradually increased from the volume V1 to the volume V2) may be changed. That is, the volume adjustment unit 106 may change the amount of change in volume per unit time when the sound is faded out or faded in according to the scrolling speed of the moving image in the display video. In the first modification, a video / audio processing apparatus 100 configured to perform such an operation will be described.
- FIG. 8 is a diagram schematically illustrating an example of a temporal change in volume of a moving image included in the sound output area 203 in Modification 1 of the other embodiment.
- FIG. 8A shows the time change of the volume when the moving image scroll speed is relatively slow
- FIG. 8B shows the volume when the moving image scroll speed is relatively fast. The time change of is shown.
- the volume adjusting unit 106 determines the time t0 required for the audio to fade out or fade in when the moving image scrolling speed is relatively slow. If it is fast, it may be longer than the time t1 required to fade out or fade in the sound.
- the sound volume adjusting unit moves a plurality of moving images within the display screen for the time required to change the sound signal volume. You may change according to speed.
- the volume adjustment unit increases the time for gradually decreasing the volume of the sound that has been deselected when switching the sound to be selected by the selection unit as the moving image scrolling speed in the display video is faster, and newly selects The time for gradually increasing the volume of the generated voice may be shortened.
- the audio can be appropriately faded out or faded in according to the scrolling speed.
- the volume adjusting unit 106 is configured to select a sound when a new sound is selected from a state where no sound of a moving image is selected by the selection unit 105 and when a sound selected by the selection unit 105 is switched.
- the time required for fade-in or fade-out may be changed.
- the volume adjustment unit 106 selects a new sound from the state in which no sound is selected in the selection unit 105 as the time required for the sound to fade in or fade out when the sound selected in the selection unit 105 is switched. In this case, it may be shorter than the time required for the audio to fade in or fade out.
- the volume adjusting unit 106 configured in this way can continuously switch the sound when the sound selected by the selecting unit 105 is switched.
- the selection unit 105 and the volume adjustment unit 106 include a moving image previously included in the sound output region 203 among the plurality of moving images.
- the operation example of selecting the voice and outputting the voice has been described.
- the present disclosure is not limited to this configuration.
- the selection unit 105 and the volume adjusting unit 106 select the sound of the moving image included in the sound output region 203 later and output the sound. May be configured.
- the selection unit 105 selects the audio signal of the moving image included in the sound output area 203 latest among the plurality of moving images. Also good. In other words, when a new moving image is included in the sound output area 203, the selection unit 105 may operate so as to select an audio signal of the moving image.
- the volume V1 is not limited to zero.
- FIG. 9 is a diagram schematically illustrating an example of the operation of the video / audio processing device 100 when two moving images (moving image 202A and moving image 202B) are included in the sound output region 203 in the second modification of the other embodiment. It is.
- FIG. 9 schematically shows a display image on the screen 201, and it is assumed that time elapses in the order of (a), (b), and (c). Further, in FIG. 9, the scroll direction of the moving image is indicated by a white arrow.
- FIG. 9 schematically shows a display image on the screen 201, and it is assumed that time elapses in the order of (a), (b), and (c). Further, in FIG. 9, the scroll direction of the moving image is indicated by a white arrow.
- the moving image 202A is first included in the sound output region 203, and the moving image 202B (the left end of the moving image 202B) is moved before the moving image 202A (the left end of the moving image 202A) moves from the sound output region 203 to the outside of the sound output region 203.
- Is included in the sound output area 203 and then an operation example when the moving image 202 ⁇ / b> A (the left end of the moving image 202 ⁇ / b> A) moves out of the sound output area 203 and is not included in the sound output area 203 is shown.
- the white arrow indicating the scroll direction in FIG. 9 is shown for convenience and is not displayed on the screen 201.
- FIG. 10 schematically illustrates an example of a temporal change in volume of each moving image when two moving images (moving image 202A and moving image 202B) are included in the sound output area 203 in Modification 2 of the other embodiment.
- FIG. 10 schematically illustrates an example of a temporal change in volume of each moving image when two moving images (moving image 202A and moving image 202B) are included in the sound output area 203 in Modification 2 of the other embodiment.
- FIG. 9A shows a state from time T1 to time T2 in FIG. 9 and 10, the moving image 202A (the left end of the moving image 202A) is included in the sound output area 203 at time T1, and the moving image 202B (the left end of the moving image 202B) is included in the sound output area 203 at time T2.
- the moving image 202A (the left end of the moving image 202A) moves out of the sound output region 203 and is not included in the sound output region 203 at T3. Therefore, in the period from time T1 to time T2, the moving image 202A is included in the sound output region 203, and the moving image 202B is outside the sound output region 203 and is not included in the sound output region 203.
- the audio / video processing apparatus 100 outputs the audio of the moving image 202A.
- FIG. 9B shows a state from time T2 to time T3 in FIG.
- the moving image 202A is included in the sound output region 203, but at time T2, the moving image 202B (the left end of the moving image 202B) moves from outside the sound output region 203 into the sound output region 203 and outputs sound. Since it is included in the region 203, the video / audio processing apparatus 100 switches the moving image to be output from the moving image 202A to the moving image 202B. At this time, the volume adjustment unit 106 of the video / audio processing apparatus 100 gradually lowers the sound of the moving image 202A to fade out, and gradually increases the sound of the moving image 202B to fade in.
- FIG. 9C shows the state after time T3 in FIG. Since the moving image 202B is included in the sound output area 203 from time T3 to time T4, the video / audio processing apparatus 100 continues to output the sound of the moving image 202B.
- FIG. 11 is a flowchart illustrating an example of a video / audio generation process executed by the video / audio processing apparatus 100 according to the second modification of the other embodiment.
- the processes in steps S101 to S105 and step S112 are substantially the same as the processes shown in the same step of the flowchart in FIG.
- step S105 in the flowchart of FIG. 6 it is determined whether or not the moving image currently being output has moved from the sound output area 203 to the outside of the sound output area 203 and is no longer included in the sound output area 203. explained. However, in step S105 in the flowchart of FIG. 11, the moving image is simply moved from the sound output area 203 to the outside of the sound output area 203 and is not included in the sound output area 203 regardless of whether or not it is a sound output sound. It is determined whether or not. This determination is performed based on, for example, whether or not the left end of the moving image is included in the sound output area 203 as in the first embodiment. The same applies to the other modifications described below. However, the present disclosure is not limited to this operation example. For example, this determination may be performed based on the center or right end of the moving image. Alternatively, this determination may be performed based on other determination criteria (for example, the area of a moving image, etc.).
- step S105 If it is determined in step S105 that the moving image has moved out of the sound output area 203 from the sound output area 203 and is no longer included in the sound output area 203 (Yes in step S105), the selection unit 105 outputs the sound. It is determined whether another moving image is included in the area 203 (step S106A).
- step S106A when it is determined that no other moving images are included in the sound output area 203 (No in step S106A), the sound output area 203 moves from the sound output area 203 to the outside of the sound output area 203 in step S105.
- the moving image determined not to be included in 203 is a moving image that is currently being produced. Therefore, the selection unit 105 moves from the sound output area 203 to the outside of the sound output area 203 and cancels the sound selection of the moving image that is no longer included in the sound output area 203. Then, the volume adjustment unit 106 gradually decreases the volume of the sound whose selection is canceled by the selection unit 105 (step S107A).
- step S106A If it is determined in step S106A that another moving image is included in the sound output area 203 (Yes in step S106A), the sound output area 203 is output from the sound output area 203 in step S105 in the flowchart shown in FIG. A moving image determined to have moved out of 203 and no longer included in the sound output area 203 is not a moving image currently being output. Therefore, the selection unit 105 and the volume adjustment unit 106 do not change the currently selected voice (without switching the voice selection) and continue the current state.
- step S105 When it is determined in step S105 that the moving image has not moved from the sound output area 203 to the outside of the sound output area 203 (No in step S105), or in step S106A, another moving image is present in the sound output area 203.
- the selection unit 105 determines whether or not a new moving image is included in the sound output area 203 (step S108A). .
- step S108A If it is determined in step S108A that a new moving image is included in the sound output region 203 (Yes in step S108A), the selection unit 105 selects the sound of the moving image newly included in the sound output region 203. . Then, the volume adjustment unit 106 gradually increases the volume of the voice newly selected by the selection unit 105 (step S109A).
- the selection unit 105 determines whether or not a moving image (another moving image) other than the moving image newly included in the sound output area 203 is included in the sound output area 203 (step S110A).
- step S110A when it is determined that another moving image is included in the sound output area 203 (Yes in step S110A), a sound is currently being output in addition to the moving image newly included in the sound output area 203. Are present in the sound output area 203. Therefore, the selection unit 105 cancels the sound selection of the moving image currently being sounded other than the moving image newly included in the sound output area 203. Then, the volume adjustment unit 106 gradually decreases the volume of the sound whose selection is canceled by the selection unit 105 (step S111A).
- the selection unit may select the audio signal of the moving image included in the sound output region the latest among the plurality of moving images included in the sound output region.
- the video / audio processing apparatus configured as described above can always output the sound of a moving image newly included in the sound output area.
- the selection unit 105 moves to the sound output region 203 when a moving image corresponding to the sound signal being selected by the selection unit 105 moves from the sound output region 203 to the outside of the sound output region 203 and is not included in the sound output region 203. Audio signals of moving images that are not included in the set transition target area 204 (see FIG. 12) may be excluded from the next selection target.
- FIG. 12 is a diagram schematically showing an outline of the audio adjustment processing in the third modification of the other embodiment.
- the sound selected by the selection unit 105 is immediately switched from the sound of the moving image 202A to the sound of the moving image 202B.
- 202B moves out of the sound output area 203 from the sound output area 203. For this reason, the period during which the audio of the moving image 202B is output is relatively short.
- the transition target area 204 may be provided in the sound output area 203 based on the scroll direction of the moving image.
- the transition target area 204 is set in the sound output area 203 excluding the area on the exit side of the moving image to be scrolled.
- the region on the right side of the sound output region 203 is the transition target region 204.
- the transition target area 204 is an area set as follows. That is, the selection unit 105 moves the moving image 202A from the sound output region 203 to the outside of the sound output region 203 and does not include the sound output region 203. If the moving image 202B is included in the transition target region 204, The voice of 202B is selected. However, if the moving image 202B is not included in the transition target area 204, the sound of the moving image 202B is not selected.
- FIG. 13 is a flowchart illustrating an example of a video / audio generation process executed by the video / audio processing apparatus 100 according to Modification 3 of the other embodiment.
- Each process shown in the flowchart of FIG. 13 is different from the process shown in the flowchart of FIG. 6 in that step S107 is replaced with step S107B.
- step S107B is replaced with step S107B.
- the two are substantially the same, so detailed description will be omitted, and only the processing of step S107B will be described.
- the selection unit 105 determines whether or not the transition target area 204 includes a moving image other than the moving image currently being output (step S107B).
- step S107B If it is determined in step S107B that the transition target area 204 includes a moving image other than the moving image currently being output (Yes in step S107B), the same processing as step S108 shown in the flowchart of FIG. 6 is performed. Step S108 is executed.
- step S107B If it is determined in step S107B that the transition target area 204 does not include a moving image other than the currently output moving image (No in step S107B), the same processing as step S109 shown in the flowchart of FIG. 6 is performed. Step S109 is executed.
- the selection unit 105 includes, for example, two sound images in the sound output area 203, and one of the moving images (the moving image previously included in the sound output area 203) is output from the sound output area 203.
- the other moving image moving image included later in the sound output area 203
- the sound of the other moving image is selected
- the other moving image is not included in the transition target area 204
- an operation of not selecting the sound of the other moving image may be performed. In this operation, when two sound images are included in the sound output area 203 and one of the moving images moves out of the sound output area 203 from the sound output area 203, the other moving image is set to a predetermined time. In other words, when moving outside the sound output area 203 from the sound output area 203, the other moving image is not selected.
- the selection unit moves the sound output region when the moving image corresponding to the sound signal being selected by the selection unit moves from the sound output region to the outside of the sound output region 203 and is not included in the sound output region 203.
- a moving image that is not included in the transition target area set in the above may be excluded from the next selection target.
- transition target area 204 is an example of a transition target area.
- the audio of the moving image included in the sound output area is output only for a short period later. Can be prevented from occurring.
- the video generation unit may generate an icon indicating the volume of the audio signal selected by the selection unit.
- an icon indicating the volume of each moving image may be superimposed and displayed on each moving image that scrolls in the display screen.
- FIG. 14 is a block diagram illustrating a configuration example of the video / audio processing device 100A according to Modification 4 of the other embodiment.
- the video / audio processing apparatus 100A includes a video generation unit 101A, an operation reception unit 102, a control unit 103, a video output unit 104, a selection unit 105, a volume adjustment unit 106, and an audio output unit 107.
- the video generation unit 101A includes a channel selection unit 111, a broadcast signal separation unit 112, a content data separation unit 113, a video reproduction unit 114, an audio reproduction unit 115, an OSD generation unit 116, and a display synthesis unit 117A. Prepare.
- the video / audio processing device 100A shown in FIG. 14 is different from the video / audio processing device 100 shown in FIG. 1 in that the function of the display synthesis unit 117A of the video generation unit 101A is the same as that of the display synthesis unit 117 of the video generation unit 101. Different from function. However, except for this point, the two are substantially the same, so detailed description will be omitted, and only the display composition unit 117A will be described.
- the display synthesis unit 117A of the video / audio processing device 100A shown in FIG. 14 indicates the volume of a moving image that scrolls in the display screen, in addition to the functions of the display synthesis unit 117 of the video / audio processing device 100 shown in FIG. It has a function of generating an icon and displaying it superimposed on each moving image.
- FIG. 1 An example of an icon indicating the volume of a moving image generated by the display composition unit 117A is shown in FIG.
- FIG. 15 is a diagram schematically illustrating an example of a display image in Modification 4 of the other embodiment.
- the display composition unit 117A generates an icon 205A as an icon indicating the volume of the moving image 202A, and generates an icon 205B as an icon indicating the volume of the moving image 202B. Then, the display composition unit 117A superimposes the icon 205A on the moving image 202A and the icon 205B on the moving image 202B, and synthesizes them to generate a display video signal.
- the moving image 202A on which the icon 205A is superimposed and the moving image 202B on which the icon 205B is superimposed are displayed on the screen 201, and these moving images scroll in the screen 201 from right to left.
- the display composition unit 117A generates an icon with a size corresponding to the volume level. That is, the display composition unit 117A generates a relatively large icon for a moving image with a relatively large volume and superimposes the icon on the moving image, and relatively small for a moving image with a relatively small volume. An icon is generated and superimposed on the moving image. Therefore, for example, in the example shown in FIG. 15, the user can easily understand that the currently output sound is the sound of the moving image 202A by comparing the icon 205A and the icon 205B displayed on the screen 201. .
- the display composition unit 117A may generate an icon indicating the volume only for the sound selected by the selection unit 105, and may not generate an icon indicating the volume for a sound not selected by the selection unit 105.
- the display composition unit 117A may display only the icon 205A of the moving image 202A on the screen 201 and may not display the icon 205B of the moving image 202B on the screen 201. In that case, the user can easily understand that the currently output sound is the sound of the moving image 202 ⁇ / b> A by looking at the icon 205 ⁇ / b> A displayed on the screen 201.
- the display composition unit 117A may represent the volume level by the icon color instead of the icon size.
- the display composition unit 117A may change the icon design in conjunction with the effect at the time of sound fade-in or sound fade-out.
- the display composition unit 117A may display an icon indicating the volume level not on the moving image but on the periphery of the moving image so that the moving image is not hidden by the icon.
- the video generation unit may generate an icon indicating the volume of the audio signal selected by the selection unit. Thereby, the user can visually confirm the volume of the moving image that scrolls the display screen.
- these general or specific aspects may be realized by an apparatus, a system, a method, an integrated circuit, a computer program, or a recording medium such as a computer-readable CD-ROM. You may implement
- each component may be configured by dedicated hardware or may be realized by executing a software program suitable for each component.
- Each component may be realized by a program execution unit such as a CPU or a processor reading and executing a software program recorded on a recording medium such as a hard disk or a semiconductor memory.
- division of functional blocks in the block diagram is an example, and a plurality of functional blocks can be realized as one functional block, a single functional block can be divided into a plurality of functions, or some functions can be transferred to other functional blocks. May be.
- the functions of a plurality of functional blocks having similar functions may be processed in parallel or in time division by a single hardware or software.
- This disclosure is applicable to a video / audio processing apparatus. Specifically, the present disclosure can be applied to a television, a recorder, a personal computer, a tablet terminal device, or the like.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Human Computer Interaction (AREA)
- Television Receiver Circuits (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Controls And Circuits For Display Device (AREA)
Abstract
Description
以下、図1~図15を用いて、実施の形態1を説明する。
図1は、実施の形態1における映像音声処理装置100の一構成例を示すブロック図である。
図2は、実施の形態1における映像音声処理装置100で生成される表示映像の一例を模式的に示す図である。
図3は、実施の形態1における映像音声処理装置100で行う音声調整処理の概要を模式的に示す図である。図3の(a)は、画面201に、動画202Aと、動画202Bの一部と、が表示されている様子を模式的に示す図である。なお、図3の(a)には、画面201の表示領域を模式的に示しており、横軸は画面201の横方向(長手方向)を表し、縦軸は画面201の縦方向(短手方向)を表す。また、図3の(a)に示す例では、動画202Bの約1/3は画面201に表示され、約2/3は画面201外にあるが、動画202Bはスクロールにより左方向に徐々に移動して画面201に表示される面積が徐々に大きくなることを示している。図3の(b)は、出音領域に含まれる動画像の音量を模式的に示す図である。図3の(b)の縦軸は音量の大きさを表す。
図6は、実施の形態1における映像音声処理装置100で実行する映像音声生成処理の一例を示すフローチャートである。
以上のように、本実施の形態における映像音声処理装置は、映像生成部と、選択部と、音量調整部と、を備える。映像生成部は、複数の動画像が表示される領域が、表示画面内を、予め定められた方向に自動的に移動する表示映像の映像信号、を生成する。選択部は、複数の動画像の表示画面内における位置に応じて、それら複数の動画像の中から1つの動画像の音声信号を選択する。音量調整部は、選択部で選択された音声信号が、他の音声信号より大きい音量で出力されるように、複数の動画像のそれぞれの音声信号の音量を調整する。
以上のように、本出願において開示する技術の例示として、実施の形態1を説明した。しかしながら、本開示における技術は、これに限定されず、変更、置き換え、付加、省略等を行った実施の形態にも適用できる。また、上記実施の形態1で説明した各構成要素を組み合わせて、新たな実施の形態とすることも可能である。
映像音声処理装置100の音量調整部106は、表示映像における動画像のスクロールの速度に応じて、音声のフェードアウトに要する時間(音声を音量V2から徐々に小さくして音量V1にするまでの時間)またはフェードインに要する時間(音声を音量V1から徐々に大きくして音量V2にするまでの時間)を変更してもよい。すなわち、音量調整部106は、音声をフェードアウトまたはフェードインするときの単位時間当たりの音量の変化量を、表示映像における動画像のスクロールの速度に応じて変更してもよい。変形例1では、そのような動作をするように構成された映像音声処理装置100を説明する。
実施の形態1では、選択部105および音量調整部106は、複数の動画像が出音領域203に含まれる場合、それら複数の動画像のうち、先に出音領域203に含まれた動画像の音声を選択し、その音声を出力する動作例を説明した。しかし、本開示は何らこの構成に限定されない。例えば、選択部105および音量調整部106は、複数の動画像が出音領域203に含まれる場合、後から出音領域203に含まれた動画像の音声を選択し、その音声を出力するように構成されてもよい。
変形例3では、スクロールする複数の動画像間の距離が相対的に短い場合の、映像音声処理装置100の動作を説明する。
実施の形態1では、音量の表示について特に触れなかったが、映像生成部は、選択部によって選択された音声信号の音量を示すアイコンを生成してもよい。
101,101A 映像生成部
102 操作受付部
103 制御部
104 映像出力部
105 選択部
106 音量調整部
107 音声出力部
111 選局部
112 放送信号分離部
113 コンテンツデータ分離部
114 映像再生部
115 音声再生部
116 OSD生成部
117,117A 表示合成部
131 アンテナ
132 記憶装置
201 画面
202A,202B,202C,A,B,C 動画
203 出音領域
204 遷移対象領域
205A,205B アイコン
Claims (10)
- 複数の動画像が表示される領域が、表示画面内を予め定められた方向に自動的に移動する表示映像の映像信号を生成する映像生成部と、
複数の前記動画像の前記表示画面内における位置に応じて、複数の前記動画像の中から1つの動画像の音声信号を選択する選択部と、
前記選択部で選択された音声信号が、他の音声信号より大きい音量で出力されるように、複数の前記動画像のそれぞれの音声信号の音量を調整する音量調整部と、を備える、
映像音声処理装置。 - 前記選択部は、複数の前記動画像のうち、前記表示画面内の予め定められた出音領域に含まれる動画像の音声信号を選択する、
請求項1に記載の映像音声処理装置。 - 前記選択部は、前記出音領域に含まれる前記複数の動画像のうち、最も早く前記出音領域に含まれた動画像の音声信号を選択する、
請求項2に記載の映像音声処理装置。 - 前記選択部は、前記出音領域に含まれる前記複数の動画像のうち、最も遅く前記出音領域に含まれた動画像の音声信号を選択する、
請求項2に記載の映像音声処理装置。 - 前記選択部は、前記選択部が選択中の音声信号に対応する動画像が前記出音領域から前記出音領域外に移動して前記出音領域に含まれなくなるときに、前記出音領域に設定された遷移対象領域に含まれない動画像は次の選択の対象外とする、
請求項3に記載の映像音声処理装置。 - 前記音量調整部は、前記選択部が選択する音声信号を他の音声信号に切り替えるときに、前記選択を外れた音声信号の音量を徐々に小さくするとともに、新たに前記選択がなされた音声信号の音量を徐々に大きくする、
請求項1に記載の映像音声処理装置。 - 前記音量調整部は、前記選択部が選択する音声信号を他の音声信号に切り替えるときに、音声信号の音量を変化させるのに要する時間を、複数の前記動画像が前記表示画面内を移動する速さに応じて変更する、
請求項6記載の映像音声処理装置。 - 前記映像生成部は、前記選択部によって選択された音声信号の音量を示すアイコンを生成する、
請求項1に記載の映像音声処理装置。 - 複数の動画像が表示される領域が、表示画面内を予め定められた方向に自動的に移動する表示映像の映像信号を生成し、
複数の前記動画像の前記表示画面内における位置に応じて、複数の前記動画像の中から1つの動画像の音声信号を選択し、
前記選択がなされた音声信号が、他の音声信号より大きい音量で出力されるように、複数の前記動画像のそれぞれの音声信号の音量を調整する、
映像音声処理方法。 - 請求項9記載の映像音声処理方法をコンピュータに実行させるための、
プログラム。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/303,774 US20170034568A1 (en) | 2014-09-19 | 2015-09-16 | Video audio processing device, video audio processing method, and program |
JP2016548562A JP6609795B2 (ja) | 2014-09-19 | 2015-09-16 | 映像音声処理装置、映像音声処理方法およびプログラム |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2014-190783 | 2014-09-19 | ||
JP2014190783 | 2014-09-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016042765A1 true WO2016042765A1 (ja) | 2016-03-24 |
Family
ID=55532820
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2015/004718 WO2016042765A1 (ja) | 2014-09-19 | 2015-09-16 | 映像音声処理装置、映像音声処理方法およびプログラム |
Country Status (3)
Country | Link |
---|---|
US (1) | US20170034568A1 (ja) |
JP (1) | JP6609795B2 (ja) |
WO (1) | WO2016042765A1 (ja) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109314833A (zh) * | 2016-05-30 | 2019-02-05 | 索尼公司 | 音频处理装置和音频处理方法以及程序 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2009004999A (ja) * | 2007-06-20 | 2009-01-08 | Panasonic Corp | 映像データ管理装置 |
JP2009212678A (ja) * | 2008-03-03 | 2009-09-17 | Canon Inc | 表示制御装置、方法、およびプログラム |
JP2010074258A (ja) * | 2008-09-16 | 2010-04-02 | Sony Corp | 表示装置及び表示方法 |
Family Cites Families (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6072480A (en) * | 1997-11-05 | 2000-06-06 | Microsoft Corporation | Method and apparatus for controlling composition and performance of soundtracks to accompany a slide show |
US6538665B2 (en) * | 1999-04-15 | 2003-03-25 | Apple Computer, Inc. | User interface for presenting media information |
JP4127750B2 (ja) * | 2000-05-30 | 2008-07-30 | 富士フイルム株式会社 | 音楽再生機能付デジタルカメラ |
US20040095379A1 (en) * | 2002-11-15 | 2004-05-20 | Chirico Chang | Method of creating background music for slideshow-type presentation |
US7734154B2 (en) * | 2003-02-14 | 2010-06-08 | Lg Electronics Inc. | Recording medium having data structure for managing reproduction duration of still pictures recorded thereon and recording and reproducing methods and apparatuses |
US20050275805A1 (en) * | 2004-06-15 | 2005-12-15 | Yu-Ru Lin | Slideshow composition method |
US7236226B2 (en) * | 2005-01-12 | 2007-06-26 | Ulead Systems, Inc. | Method for generating a slide show with audio analysis |
US7952535B2 (en) * | 2005-02-20 | 2011-05-31 | Mediatek Singapore Pte Ltd | Electronic visual jockey file |
US20060204214A1 (en) * | 2005-03-14 | 2006-09-14 | Microsoft Corporation | Picture line audio augmentation |
JP4717734B2 (ja) * | 2006-06-30 | 2011-07-06 | キヤノン株式会社 | データ再生装置及びデータ再生方法 |
US7844354B2 (en) * | 2006-07-27 | 2010-11-30 | International Business Machines Corporation | Adjusting the volume of an audio element responsive to a user scrolling through a browser window |
US9158776B2 (en) * | 2007-08-06 | 2015-10-13 | Apple Inc. | Slideshows comprising various forms of media |
US8381086B2 (en) * | 2007-09-18 | 2013-02-19 | Microsoft Corporation | Synchronizing slide show events with audio |
WO2009081478A1 (ja) * | 2007-12-21 | 2009-07-02 | Fujitsu Limited | 電子装置、制御方法及びプログラム |
JP5033098B2 (ja) * | 2008-10-16 | 2012-09-26 | シャープ株式会社 | 画像表示装置、画像表示方法および画像表示プログラム |
US8626322B2 (en) * | 2008-12-30 | 2014-01-07 | Apple Inc. | Multimedia display based on audio and visual complexity |
US20130317951A1 (en) * | 2012-05-25 | 2013-11-28 | Rawllin International Inc. | Auto-annotation of video content for scrolling display |
US20140219634A1 (en) * | 2013-02-05 | 2014-08-07 | Redux, Inc. | Video preview creation based on environment |
-
2015
- 2015-09-16 JP JP2016548562A patent/JP6609795B2/ja active Active
- 2015-09-16 US US15/303,774 patent/US20170034568A1/en not_active Abandoned
- 2015-09-16 WO PCT/JP2015/004718 patent/WO2016042765A1/ja active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2009004999A (ja) * | 2007-06-20 | 2009-01-08 | Panasonic Corp | 映像データ管理装置 |
JP2009212678A (ja) * | 2008-03-03 | 2009-09-17 | Canon Inc | 表示制御装置、方法、およびプログラム |
JP2010074258A (ja) * | 2008-09-16 | 2010-04-02 | Sony Corp | 表示装置及び表示方法 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109314833A (zh) * | 2016-05-30 | 2019-02-05 | 索尼公司 | 音频处理装置和音频处理方法以及程序 |
CN109314833B (zh) * | 2016-05-30 | 2021-08-10 | 索尼公司 | 音频处理装置和音频处理方法以及程序 |
Also Published As
Publication number | Publication date |
---|---|
JP6609795B2 (ja) | 2019-11-27 |
JPWO2016042765A1 (ja) | 2017-07-06 |
US20170034568A1 (en) | 2017-02-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4170808B2 (ja) | 情報表示装置、情報表示方法及びプログラム | |
JP4735991B2 (ja) | 画像処理装置および方法、プログラム並びに記録媒体 | |
US8434006B2 (en) | Systems and methods for adjusting volume of combined audio channels | |
JP2006135851A (ja) | 映像機器一体型映像表示装置 | |
JP2007336593A (ja) | 情報表示装置、情報表示方法及びプログラム | |
JP5215077B2 (ja) | コンテンツ再生装置、コンテンツ再生方法、プログラムおよび記録媒体 | |
JPWO2009050903A1 (ja) | オーディオミキシング装置 | |
JP2010074258A (ja) | 表示装置及び表示方法 | |
JP6039108B2 (ja) | 電子機器、制御方法およびプログラム | |
JP6609795B2 (ja) | 映像音声処理装置、映像音声処理方法およびプログラム | |
JP2009094796A (ja) | テレビジョン受信機 | |
JP5071040B2 (ja) | 情報処理装置、情報処理方法、プログラム並びに記録媒体 | |
KR20160093404A (ko) | 캐릭터 선택적 오디오 줌인을 제공하는 멀티미디어 콘텐츠 서비스 방법 및 장치 | |
JP5886431B2 (ja) | マルチメディアコンテンツを再生するための方法、関連システム、および関連する再生モジュール | |
JP4529495B2 (ja) | 映像音声再生システムおよびアンプ装置 | |
JP2006211488A (ja) | 映像再生装置 | |
JP5213630B2 (ja) | 映像信号再生装置 | |
JP2009027430A (ja) | 動画再生装置 | |
JP2004336430A (ja) | 再生装置 | |
JP4264028B2 (ja) | 要約番組生成装置、及び要約番組生成プログラム | |
JP6590221B2 (ja) | 映像音声出力装置 | |
JP2007180662A (ja) | 映像音声再生装置、方法およびプログラム | |
JP2005166188A (ja) | ディジタルオーディオ信号処理装置及びディジタルオーディオ信号処理方法 | |
JP4193821B2 (ja) | 光ディスク装置 | |
JP2010183534A (ja) | 記録再生装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15841908 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2016548562 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 15303774 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 15841908 Country of ref document: EP Kind code of ref document: A1 |