TWI665663B - Video and audio reverse playback device and system and method thereof - Google Patents
Video and audio reverse playback device and system and method thereof
- Publication number
- TWI665663B
- Authority
- TW
- Taiwan
- Prior art keywords
- reverse playback
- words
- video
- playback
- audio
- Prior art date
Landscapes
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Television Signal Processing For Recording (AREA)
Abstract
An audio/video reverse playback device includes a transceiver module that receives a sound signal, a recognition module connected to the transceiver module, a processing module connected to the recognition module, and a playback module connected to the processing module. The recognition module recognizes the sound signal and generates text information corresponding to the sound signal; the text information includes a plurality of text elements. The processing module rearranges the text elements according to a reverse playback condition and generates a reverse playback text file. The playback module plays the reverse playback text file in synchronization with the reversed sound signal.
Description
The present invention relates to an audio/video reverse playback device and system, and in particular to a device and system in which the sound and the subtitle track are also played in synchronization during reverse playback.
In conventional audio/video playback, functions such as forward, seek, reverse, and playback are used to present multimedia content in a media player. As technology has advanced, many consumer electronic devices additionally support enhanced playback modes such as variable-speed playback and reverse playback.
In ordinary reverse playback, however, the audio track is usually muted. This not only deprives viewers of the fun of reverse playback, but also fails to reflect how the audio track ought to be presented when played in reverse.
Therefore, an object of the present invention is to provide an audio/video reverse playback device in which the sound and the subtitle track are also played in synchronization during reverse playback.
Accordingly, the audio/video reverse playback device of the present invention includes a transceiver module that receives a sound signal, a recognition module connected to the transceiver module, a processing module connected to the recognition module, and a playback module connected to the processing module. The recognition module recognizes the sound signal and generates text information corresponding to the sound signal; the text information includes a plurality of text elements. The processing module rearranges the text elements according to a reverse playback condition and generates a reverse playback text file. The playback module plays the reverse playback text file in synchronization with the reversed sound signal.
In one embodiment, the transceiver module also receives an image signal related to the sound signal, and the playback module plays the reverse playback text file in synchronization with the reversed sound signal and image signal.
In one embodiment, the audio/video reverse playback device further includes a storage module connected to the recognition module and the processing module, for storing the text information and the reverse playback text file.
In one embodiment, the processing module arranges the text elements in reverse order in units of words (字), phrases (詞), or sentences (句) according to the reverse playback condition, which includes a setting for the word/phrase/sentence representation and a setting for the number of words/phrases/sentences.
Further, the text information also includes a plurality of durations corresponding to the text elements, and the reverse playback condition also includes a setting for whether the duration or the basic word/phrase/sentence unit is used as the basis.
In one embodiment, the reverse playback text file is a plain text file, a subtitle-track file, or metadata with hierarchy and time data fields.
Another object of the present invention is to provide an audio/video reverse playback system in which the sound and the subtitle track are also played in synchronization during reverse playback.
The audio/video reverse playback system of the present invention includes an audio/video reverse playback device, a recognition server, and a speech analysis server. The device communicates with the recognition server and the speech analysis server, and includes a transceiver module that receives a sound signal and a playback module connected to the transceiver module. The recognition server recognizes the sound signal and generates text information corresponding to the sound signal; the text information includes a plurality of text elements. The speech analysis server rearranges the text elements according to a reverse playback condition and generates a reverse playback text file, so that the playback module plays the reverse playback text file in synchronization with the reversed sound signal.
In one embodiment, the recognition server and the speech analysis server are integrated into a single server.
In one embodiment, the transceiver module also receives an image signal related to the sound signal, and the playback module plays the reverse playback text file in synchronization with the reversed sound signal and the image signal.
In one embodiment, the speech analysis server arranges the text elements in reverse order in units of words, phrases, or sentences according to the reverse playback condition, which includes a setting for the word/phrase/sentence representation and a setting for the number of words/phrases/sentences.
Further, the text information also includes a plurality of durations corresponding to the text elements, and the reverse playback condition also includes a setting for whether the duration or the basic word/phrase/sentence unit is used as the basis.
In one embodiment, the reverse playback text file may be a plain text file, a subtitle-track file, or metadata with hierarchy and time data fields.
A further object of the present invention is to provide an audio/video reverse playback method in which the sound and the subtitle track are also played in synchronization during reverse playback.
The audio/video reverse playback method of the present invention recognizes a sound signal, generates text information corresponding to the sound signal, rearranges the plurality of text elements in the text information according to a reverse playback condition, and plays the rearranged text elements in synchronization with the reversed sound signal.
In one embodiment, the method is executed in an audio/video reverse playback device.
In one embodiment, the method is applied to an audio/video reverse playback system that includes an audio/video reverse playback device, a recognition server, and a speech analysis server. The device communicates with the recognition server and the speech analysis server, and includes a transceiver module that receives the sound signal and a playback module connected to the transceiver module. The recognition server recognizes the sound signal and generates the text information. The speech analysis server rearranges the text elements in the text information according to the reverse playback condition, so that the playback module plays the rearranged text elements in synchronization with the reversed sound signal.
In one embodiment, the method rearranges the plurality of text elements in the text information according to the reverse playback condition to generate a reverse playback text file, and plays the reverse playback text file in synchronization with the reversed sound signal. The reverse playback text file is a plain text file, a subtitle-track file, or metadata with hierarchy and time data fields.
In one embodiment, the method arranges the text elements in reverse order in units of words, phrases, or sentences according to the reverse playback condition, which includes a setting for the word/phrase/sentence representation and a setting for the number of words/phrases/sentences.
Further, the text information includes the text elements and a plurality of durations corresponding to them, and the reverse playback condition also includes a setting for whether the duration or the basic word/phrase/sentence unit is used as the basis.
The effect of the present invention is that, during reverse playback, the audio track and the subtitle track are played in reverse in synchronization with the image, which makes playback more entertaining.
Before the present invention is described in detail, note that in the following description similar elements are denoted by the same reference numerals.
Referring to FIG. 1, a first embodiment of the audio/video reverse playback device 100 of the present invention allows the sound and the subtitle track of multimedia information (for example, a movie file or an online video) to be played in synchronization during reverse playback, making playback more entertaining.
In this embodiment, the audio/video reverse playback device 100 may be a smartphone that implements speech recognition and playback software. It includes a transceiver module 10, a recognition module 20 connected to the transceiver module 10, a processing module 30 connected to the recognition module 20, and a playback module 40 connected to the processing module 30.
The transceiver module 10 receives multimedia information that includes a sound signal and an image signal related to the sound signal, for example the video and audio in a video file. Depending on the application, the multimedia information may also include only a sound signal (for example, a music file); it is not limited to this embodiment.
The recognition module 20 recognizes the sound signal in the multimedia information and generates text information corresponding to the sound signal. The text information includes a plurality of text elements, which can be represented at three levels: word (字), phrase (詞), and sentence (句). In the example of FIG. 2 there are 33 first-level basic units (words), 19 second-level basic units (phrases), and 3 third-level basic units (sentences). Because phrases are made up of words and sentences are made up of phrases, each second-level phrase consists of first-level words, and each third-level sentence consists of second-level phrases.
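In addition, in some embodiments the text information may include a plurality of text elements and a plurality of durations corresponding to them. In the example of FIG. 3, the text information includes eight text elements, "I", "will", "be", "back", "soon", "you", "are", and "fired", whose durations are T1 to T8 respectively. When these are represented at the first level (words), their correspondence with time is listed in Table 1.
When they are represented at the second level (phrases), their correspondence with time is listed in Table 2.
Note that the above is only an example; the representation and duration of each text element in the text information can be adjusted for different settings and applications and are not limited to the above.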
Referring to FIG. 1, the processing module 30 rearranges the text elements according to a reverse playback condition and generates a reverse playback text file. In this embodiment, the reverse playback condition covers the level of representation described above, the number of words/phrases/sentences, and so on, and the user can set it through an application (app) or other software. In the screen of FIG. 4, the reverse playback condition includes a playback unit, a limit unit, and a limit count. The playback unit selects the level at which text is represented, that is, whether subtitles are displayed with the word, phrase, or sentence as the basic unit during reverse playback. The limit unit selects whether reverse playback is based on time or on units: if time is selected (for example, 2 seconds), the subtitles are played in reverse on a time basis (for example, in 2-second segments); if unit is selected, the subtitles are played in reverse on the basis of the word, phrase, or sentence unit. The limit count sets the number of word/phrase/sentence units, or the length of the time segment, used for reverse playback.
Taking the second sentence of FIG. 2 as an example, if the playback unit is set to the second level (phrases, shown with solid lines), the limit unit is set to unit, and the limit count is set to 1, the text elements are arranged in reverse order as shown in FIG. 5. Each phrase forms one reverse playback unit, and within a reverse playback unit the content is played in forward order. If the playback unit is set to the second level (phrases), the limit unit is set to unit, and the limit count is set to 2 (shown with dot-dash lines), every 2 phrases form one reverse playback unit, and the rearranged result is as shown in FIG. 6.
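The unit-based rearrangement described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `reverse_by_units` and the sample phrases are hypothetical, and the sketch assumes that grouping starts from the first unit, so a final group may hold fewer units than the limit count (the text does not say how a remainder is handled).

```python
def reverse_by_units(units, limit_count=1):
    """Reverse the order of groups of `limit_count` units.

    `units` is a list of words, phrases, or sentences (the chosen
    playback unit). Each group is one reverse playback unit: the groups
    are emitted last-to-first, while the units inside a group keep
    their forward order.
    """
    if limit_count < 1:
        raise ValueError("limit_count must be at least 1")
    # Assumption: groups are formed from the start of the list.
    groups = [units[i:i + limit_count] for i in range(0, len(units), limit_count)]
    return [unit for group in reversed(groups) for unit in group]


# Hypothetical phrase-level input (not taken from FIG. 2).
phrases = ["A", "B", "C", "D", "E"]
print(reverse_by_units(phrases, 1))  # limit count 1: plain reversal
print(reverse_by_units(phrases, 2))  # limit count 2: pairs reversed as blocks
```

With a limit count of 1 the result is a plain reversal; with a limit count of 2 each pair moves as a block and keeps its internal order.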
Taking FIG. 3 as an example, if the playback unit is set to the first level (words), the limit unit is set to time, and the limit count is set to 900 ms, the rearranged result is as shown in FIG. 7. Because the limit unit is time, each 900 ms segment forms one reverse playback unit, within which the content is played in forward order. Some words (for example, "be") are played twice because they fall within two time segments.
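A minimal sketch of the time-based case, under stated assumptions: the durations below are invented stand-ins for T1 to T8 (the source gives no numeric values), words are assumed to be contiguous in time, and a word is assigned to every segment it overlaps. The names `TimedWord`, `build_timeline`, and `reverse_by_time` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class TimedWord:
    text: str
    start_ms: int
    end_ms: int


def build_timeline(words_with_durations):
    """Turn (text, duration_ms) pairs into words with start/end times."""
    timeline, t = [], 0
    for text, duration in words_with_durations:
        timeline.append(TimedWord(text, t, t + duration))
        t += duration
    return timeline


def reverse_by_time(words, segment_ms):
    """Split the timeline into fixed segments and reverse the segment order.

    A word that overlaps two segments appears in both, which is why some
    words are played twice. Words inside a segment keep forward order.
    """
    if segment_ms <= 0:
        raise ValueError("segment_ms must be positive")
    if not words:
        return []
    total = max(w.end_ms for w in words)
    segments, start = [], 0
    while start < total:
        end = start + segment_ms
        segments.append([w.text for w in words if w.start_ms < end and w.end_ms > start])
        start = end
    return segments[::-1]


# Hypothetical durations in ms; the patent only names them T1..T8.
SAMPLE = [("I", 300), ("will", 400), ("be", 400), ("back", 500),
          ("soon", 600), ("you", 300), ("are", 300), ("fired", 700)]
for segment in reverse_by_time(build_timeline(SAMPLE), 900):
    print(segment)
```

With these invented durations, "be" spans 700 ms to 1100 ms and therefore lands in both the first and second 900 ms segments, mirroring the repetition the text describes.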
The processing module 30 thus rearranges the text elements according to the default or user-set reverse playback condition, generates the reverse playback text file, and outputs the file to the playback module 40. The file may be a plain text file, a subtitle-track file (for example, WebVTT), or any metadata with hierarchy and time data fields.
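As an illustration of the subtitle-track option, the sketch below serializes rearranged cues as WebVTT text. The cue tuples and the helper names `format_timestamp` and `to_webvtt` are assumptions made for this example; the source does not specify how the file is written.

```python
def format_timestamp(ms):
    """Format milliseconds as a WebVTT timestamp (hh:mm:ss.ttt)."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}.{millis:03d}"


def to_webvtt(cues):
    """Serialize (start_ms, end_ms, text) cues as a WebVTT document."""
    lines = ["WEBVTT", ""]  # header line followed by a blank line
    for start_ms, end_ms, text in cues:
        lines.append(f"{format_timestamp(start_ms)} --> {format_timestamp(end_ms)}")
        lines.append(text)
        lines.append("")  # a blank line terminates each cue
    return "\n".join(lines)


# Hypothetical cues for two 900 ms reverse playback units.
print(to_webvtt([(0, 900, "are fired"), (900, 1800, "soon you are")]))
```

A plain text output would simply drop the timing lines, while a richer metadata format could add the word/phrase/sentence level to each cue.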
In addition, the audio/video reverse playback device 100 of this embodiment further includes a storage module 50 connected to the recognition module 20 and the processing module 30. It stores the text information that the recognition module 20 generates after recognizing the sound signal, and the reverse playback text file generated by the processing module 30. Both may be stored as a plain text file, a subtitle-track file, or any metadata with hierarchy and time data fields. Furthermore, the reverse playback text file may be merged with the text information generated by the recognition module 20 into a single file, saved as a new file, or stored as a data track of the original multimedia file; it is not limited to these options.
The playback module 40 may be a display screen, a speaker, a circuit capable of running playback software, or a combination of these. The playback module 40 plays the reverse playback text file in synchronization with the reversed sound signal and image signal. As a result, when the user performs reverse playback, not only the image but also the sound and the corresponding text are reversed, which makes producing or watching videos more entertaining.
Referring to FIG. 8, in a second embodiment of the audio/video reverse playback device 100 of the present invention, the device 100 is applied in an audio/video reverse playback system. The system includes the audio/video reverse playback device 100, and a recognition server 200 and a speech analysis server 300 that communicate with the device 100. The recognition server 200 functions and operates like the recognition module 20 of the first embodiment (FIG. 1), or contains a circuit equivalent to that recognition module 20. The speech analysis server 300 includes a processing module 30 and a storage module 50 connected to the processing module 30; their functions and operation are the same as in the first embodiment and are not repeated here.
The audio/video reverse playback device 100 of this embodiment may be a smartphone, a smart TV, or any electronic device with a multimedia player; its transceiver module 10 and playback module 40 function and operate as in the first embodiment. When the transceiver module 10 receives multimedia information, it can send that information over a network to the recognition server 200, which recognizes the sound signal in the multimedia information, generates text information corresponding to the sound signal, and returns the text information to the device 100. When the user wants to perform reverse playback, the device 100 sends the text information to the speech analysis server 300, which rearranges the text elements according to a reverse playback condition, generates a reverse playback text file, and returns the file to the device 100 so that the playback module 40 can play it in synchronization with the reversed sound signal. In this way, through the cloud services of the recognition server 200 and the speech analysis server 300, the audio track and the subtitle track are likewise played in reverse in synchronization with the image, making playback more entertaining.
Note that the recognition server 200 and the speech analysis server 300 may be integrated into a single server (for example, an audio/video server) that lets users connect to upload or download multimedia videos and have speech analysis performed. In addition, the reverse playback text file generated by the speech analysis server 300 may be stored in the storage module 50, returned to the device 100 for storage, or both, depending on the use case.
Referring to FIG. 9, the audio/video reverse playback method of the present invention can be applied to the audio/video reverse playback device 100 and the audio/video reverse playback system of the two embodiments above, or to any electronic device capable of executing the method. The flow of the method is described in detail below.
In step S10, the audio/video reverse playback device 100 obtains multimedia information, for example over a network. The multimedia information may include a sound signal and an image signal related to the sound signal, or only a sound signal.
In step S20, the audio/video reverse playback device 100 (or the recognition server 200) recognizes the sound signal and generates text information corresponding to the sound signal. The text information may include only a plurality of text elements (a plain text file), or a plurality of text elements together with a plurality of durations corresponding to them.
In step S30, the audio/video reverse playback device 100 (or the speech analysis server 300) rearranges the text elements in the text information according to a reverse playback condition and generates a reverse playback text file. The reverse playback condition may include a playback unit, a limit unit, a limit count, and so on, and the reverse playback text file may be a plain text file, a subtitle-track file, or metadata with hierarchy and time data fields.
In step S40, the audio/video reverse playback device 100 plays the reverse playback text file in synchronization with the reversed sound signal, so that during reverse playback the audio track and the subtitle track are played in reverse in synchronization with the image, making playback more entertaining.
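To make the synchronization in step S40 concrete, here is a rough sketch for the simplest setting (playback unit: word, limit unit: unit, limit count: 1). It rests on an assumption the text does not state: a word that occupies the interval [start, end] in the original audio is taken to occupy [total - end, total - start] once the audio is reversed. The names `reverse_samples` and `mirror_cues` are hypothetical, and speech recognition (step S20) is replaced by hand-written timings.

```python
def reverse_samples(samples):
    """Reverse a sequence of audio samples (stand-in for reversing the sound signal)."""
    return samples[::-1]


def mirror_cues(words, total_ms):
    """Map word timings from the original audio onto the reversed audio.

    `words` holds (text, start_ms, end_ms) tuples measured on the original
    signal. Assumption: reversing a signal of length `total_ms` moves the
    interval [start, end] to [total_ms - end, total_ms - start].
    """
    mirrored = [(total_ms - end, total_ms - start, text) for text, start, end in words]
    return sorted(mirrored)  # cue order now follows the reversed timeline


# Hand-written timings standing in for the recognition result of step S20.
recognized = [("I", 0, 300), ("will", 300, 700), ("be", 700, 1100)]
print(reverse_samples([1, 2, 3, 4]))
print(mirror_cues(recognized, total_ms=1100))
```

For larger limit counts or time-based units, the same mirroring would be applied per reverse playback unit rather than per word.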
In addition, in certain applications the audio/video reverse playback method of the present invention may take the form of a computer program product, so that a computer or any electronic device that loads and runs the program can execute the method described above.
In summary, the audio/video reverse playback device 100 and the audio/video reverse playback system of the present invention use speech recognition and text analysis so that, when multimedia information is played in reverse, its audio track and subtitle track are also played in synchronization, making playback more entertaining. The object of the present invention is therefore achieved.
The above, however, describes only embodiments of the present invention and does not limit the scope of its implementation. All simple equivalent changes and modifications made in accordance with the claims and the specification of the present invention remain within the scope covered by this patent.
100‧‧‧Audio/video reverse playback device
200‧‧‧Recognition server
300‧‧‧Speech analysis server
10‧‧‧Transceiver module
20‧‧‧Recognition module
30‧‧‧Processing module
40‧‧‧Playback module
50‧‧‧Storage module
S10~S40‧‧‧Steps
Other features and effects of the present invention will be clearly presented in the embodiments described with reference to the drawings, in which:
FIG. 1 is a circuit block diagram of a first embodiment of the audio/video reverse playback device of the present invention;
FIG. 2 is a schematic diagram of the text information in the first embodiment;
FIG. 3 is a schematic diagram of another form of the text information in the first embodiment;
FIG. 4 is a schematic diagram of the screen for setting the reverse playback condition in the first embodiment;
FIG. 5 shows the reverse playback result for the second sentence of FIG. 2 with the playback unit set to the second level, the limit unit set to unit, and the limit count set to 1;
FIG. 6 shows the reverse playback result for the second sentence of FIG. 2 with the playback unit set to the second level, the limit unit set to unit, and the limit count set to 2;
FIG. 7 shows the reverse playback result for FIG. 3 with the playback unit set to the first level, the limit unit set to time, and the limit count set to 900 ms;
FIG. 8 is a circuit block diagram of a second embodiment of the audio/video reverse playback device of the present invention; and
FIG. 9 is a flowchart of the audio/video reverse playback method of the present invention.
Claims (19)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW107129265A TWI665663B (en) | 2018-08-22 | 2018-08-22 | Video and audio reverse playback device and system and method thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW107129265A TWI665663B (en) | 2018-08-22 | 2018-08-22 | Video and audio reverse playback device and system and method thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
TWI665663B (en) | 2019-07-11 |
TW202009928A TW202009928A (en) | 2020-03-01 |
Family
ID=68049312
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW107129265A TWI665663B (en) | 2018-08-22 | 2018-08-22 | Video and audio reverse playback device and system and method thereof |
Country Status (1)
Country | Link |
---|---|
TW (1) | TWI665663B (en) |
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0725399A2 (en) * | 1995-01-31 | 1996-08-07 | Sony Corporation | Decoding and reverse playback of encoded signals |
US5661846A (en) * | 1995-02-23 | 1997-08-26 | Lg Electronics Inc. | Reversely reproducing apparatus for DVCR |
US5828995A (en) * | 1995-02-28 | 1998-10-27 | Motorola, Inc. | Method and apparatus for intelligible fast forward and reverse playback of time-scale compressed voice messages |
WO2005106875A1 (en) * | 2004-04-28 | 2005-11-10 | Matsushita Electric Industrial Co., Ltd. | Moving picture stream generation apparatus, moving picture coding apparatus, moving picture multiplexing apparatus and moving picture decoding apparatus |
CN101340676A (en) * | 2008-08-21 | 2009-01-07 | 深圳华为通信技术有限公司 | Method, apparatus and mobile terminal implementing simultaneous interpretation |
US20100247066A1 (en) * | 2009-03-30 | 2010-09-30 | Samsung Electronics Co., Ltd. | Method and apparatus for reverse playback of encoded multimedia content |
TW201446007A (en) * | 2013-05-31 | 2014-12-01 | Taiwan Secom Co Ltd | Data playback device and operating method for data playback |
TW201517017A (en) * | 2013-10-18 | 2015-05-01 | Via Tech Inc | Method for building language model, speech recognition method and electronic apparatus |
US20150281633A1 (en) * | 2014-03-26 | 2015-10-01 | Vivotek Inc. | Method for reverse video playback and computer-readable medium |
CN104702791A (en) * | 2015-03-13 | 2015-06-10 | 安徽声讯信息技术有限公司 | Smart phone recording sound for a long time and synchronously transliterating text, information processing method thereof |
Also Published As
Publication number | Publication date |
---|---|
TW202009928A (en) | 2020-03-01 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MM4A | Annulment or lapse of patent due to non-payment of fees |