TWI535281B

TWI535281B - System device and method of compositing a real-time selfie music video with effects adapted for karaoke

Info

Publication number: TWI535281B
Application number: TW102143277A
Authority: TW
Inventors: 莊嘉賓
Original assignee: 音圓國際股份有限公司
Priority date: 2013-11-27
Filing date: 2013-11-27
Publication date: 2016-05-21
Also published as: TW201521437A

Description

System device and method for real-time self-timer special effect synthesis MV applied to phonograph

本發明有關於一種系統裝置及其方法，特別是指應用於伴唱機之即時自拍特效合成MV之系統裝置及其方法。 The invention relates to a system device and a method thereof, in particular to a system device and a method for realizing a self-timer special effect synthesis MV of a phonograph.

一般卡拉OK設備在播放時，多使用內建預置或外部儲存的風景、劇情、情境影片或圖片當背景，並在其上方疊上歌詞字幕，或直接以原歌星的音樂錄影帶(MV)播放，提供使用者伴唱。然而，使用者在進行歌唱時，大多會想成為音樂錄影帶(MV)中的主角，而目前較進階的卡拉OK設備雖然可以讓使用者連接外部影像輸入裝置，例如攝影機，但只能擷取現場單調原始之使用者影像，並無法即時產生類似MV剪輯含有使用者影像搭配各式歌曲情境變化的動態視覺效果，然而若想利用電腦後製達到此效果，不僅僅缺少即時性，又是一筆龐大的花費。 Generally, when playing karaoke equipment, use the built-in presets or externally stored landscapes, plots, situational movies or pictures as the background, and stack the lyrics subtitles on top of them, or directly use the original singer's music video (MV). Play, provide user sing. However, when users are singing, they mostly want to be the protagonists in the music video (MV). Currently, the more advanced karaoke equipment allows users to connect external video input devices, such as cameras, but only Taking the original monotonous user image on the spot, it is not possible to instantly generate a dynamic visual effect similar to the MV clip containing the user's image and various songs. However, if you want to use the computer to achieve this effect, not only lack of immediacy, but also A huge expense.

本發明的目的之一在於提供一種應用於伴唱機(點歌機/卡拉OK/Karaoke)之即時自拍特效合成MV(Music Video)之系統裝置及其方法，能即時顯示或錄製含有使用者搭配各式各樣的情境及視覺特效。 One of the objects of the present invention is to provide a system device and a method for real-time self-timer MV (Music Video) applied to a karaoke machine (karaoke machine/karaoke/Karaoke), which can be displayed or recorded in real time. A variety of situations and visual effects.

為了達到上面所描述的，本發明提供之應用於伴唱機之即時自拍特效合成MV之系統裝置，包括一影像輸入介面，用以輸入一影像輸出裝置所輸出之影像，以產生一原始影像資料；一特效影像庫，係用以儲放至少一特效影像資料；一歌詞字幕庫，係用以儲放至少一首歌曲之歌詞字幕資料；一動態影像合成處理單元，耦接該影像輸入介面及該特效影像庫，用以接收及處理該原始影像資料，並由該特效影像庫讀取所述至少一特效影像資料與該原始影像資料進行即時合成處理，以產生一合成影像資料；一核心處理單元，係耦接該歌詞字幕庫並與該動態影像合成處理單元通訊連接，用以接收及處理該合成影像資料，並由該歌詞字幕庫讀取所述至少一首歌曲之歌詞字幕資料進行疊加處理及塗字處理；及一影像輸出介面，耦接該核心處理單元，用以輸出該合成影像資料及所述至少一首歌曲之歌詞字幕資料至一顯示裝置，使該顯示裝置顯示該合成影像資料及所述至少一首歌曲之歌詞字幕資料。 In order to achieve the above, the present invention provides a system device for real-time self-timer effect synthesis MV for a phonograph, comprising an image input interface for inputting an image output by an image output device to generate an original image data; a special effect The image library is used for storing at least one special effect image data; a lyrics subtitle library is used for storing lyrics subtitle data of at least one song; a dynamic image synthesis processing unit is coupled to the image input interface and the special effect image a library for receiving and processing the original image data, and reading, by the special effect image library, the at least one special effect image data and the original image data for real-time synthesis processing to generate a synthetic image data; a core processing unit The lyrics subtitle library is coupled and communicatively coupled to the dynamic image synthesizing processing unit for receiving and processing the synthesized image data, and the lyrics subtitle library reads the lyrics subtitle data of the at least one song for superposition processing and coating And the image processing interface is coupled to the core processing unit for outputting the synthesized image data and the lyrics subtitle data of the at least one song to a display device, so that the display device displays the synthesized image data and the The lyrics subtitle data of at least one song.

為了達到上面所描述的，本發明提供之應用於伴唱機之即時自拍特效合成MV之方法，以即時合成至少一使用者之影像，該應用於伴唱機之即時自拍特效合成MV之方法包括：接收該使用者之一原始影像資料；判別並選擇至少一特效影像資料；即時合成該原始影像資料與該特效影像資料，以形成一合成影像資料；疊加上一歌詞字幕資料；及顯示該合成影像資料及該歌詞字幕資料。 In order to achieve the above description, the present invention provides a method for synthesizing a MV of an instant self-timer effect of a phonograph to instantly synthesize at least one user's image, and the method for synthesizing a MV of an instant self-timer effect applied to a phonograph includes: receiving The original image data of the user; discriminating and selecting at least one special effect image data; synthesizing the original image data and the special effect image data to form a synthetic image data; superimposing the lyrics subtitle data; and displaying the synthesized image data And the lyrics subtitles.

為使能更進一步瞭解本發明之特徵及技術內容，請參閱以下有關本發明之詳細說明與附圖，然而所附圖式僅提供參考與說明用，並非用來對本發明加以限制者。 For a better understanding of the features and technical aspects of the present invention, reference should be made to the accompanying drawings.

A‧‧‧使用者 A‧‧‧ user

M‧‧‧伴唱機 M‧‧‧ record player

100‧‧‧伴唱機即時影音特效合成裝置 100‧‧‧ phonograph real-time audio and video special effects synthesizer

C‧‧‧影像輸出裝置 C‧‧‧Image output device

D‧‧‧顯示裝置 D‧‧‧ display device

D1‧‧‧畫面 D1‧‧‧ screen

11‧‧‧影像輸入介面 11‧‧‧Image input interface

12‧‧‧影像輸出介面 12‧‧‧Image output interface

13‧‧‧核心處理單元 13‧‧‧Core Processing Unit

14‧‧‧動態影像合成處理單元 14‧‧‧Dynamic image synthesis processing unit

15‧‧‧特效影像庫 15‧‧‧Special Effects Image Library

16‧‧‧歌詞字幕庫 16‧‧‧ Lyrics subtitle library

17‧‧‧內部儲存媒體 17‧‧‧Internal storage media

18‧‧‧外部儲存媒體 18‧‧‧External storage media

19‧‧‧操作單元 19‧‧‧Operating unit

191‧‧‧操作介面 191‧‧‧Operator interface

20‧‧‧音訊處理單元 20‧‧‧Optical Processing Unit

21‧‧‧錄製單元 21‧‧‧recording unit

22‧‧‧網路介面 22‧‧‧Network interface

圖1為本發明一實施例之架構示意圖。 FIG. 1 is a schematic structural diagram of an embodiment of the present invention.

圖2為本發明一實施例之功能方塊示意圖。 2 is a functional block diagram of an embodiment of the present invention.

圖3為本發明一實施例之示意圖。 Figure 3 is a schematic illustration of an embodiment of the invention.

圖4為本發明一方法流程圖。 4 is a flow chart of a method of the present invention.

請參閱圖1，本發明主要在於提供一種應用於伴唱機之即時自拍特效合成MV之系統裝置。如圖1所示，本發明之應用於伴唱機之即時自拍特效合成MV之系統裝置包括有一伴唱機即時影音特效合成裝置100、一影像輸出裝置C、及一顯示裝置D。伴唱機即時影音特效合成裝置100可耦接在一影像輸出裝置C及一顯示裝置D之間。本發明所指伴唱機M係具有處理多媒體資料及重現多媒體資料並具有伴唱功能之裝置，而伴唱機M亦可稱為點歌機/卡拉OK/Karaoke。另外，需強調的是，有關本發明中「影音」一詞，實質上可包含影像或聲音至少之一者，並非侷限需同時包含影像或聲音之兩種格式。 Referring to FIG. 1, the present invention mainly provides a system device for real-time self-timer effect synthesis MV applied to a phonograph. As shown in FIG. 1, the system device for real-time self-timer special effect synthesis MV applied to a phonograph comprises a phonograph real-time audio and video special effects synthesizing device 100, an image output device C, and a display device D. The phonograph real-time audio and video special effects synthesizing device 100 can be coupled between an image output device C and a display device D. The phonograph M of the present invention has a device for processing multimedia materials and reproducing multimedia materials and having a vocal function, and the phonograph M can also be called a karaoke machine/karaoke/Karaoke. In addition, it should be emphasized that the term "video" in the present invention may substantially include at least one of an image or a sound, and is not limited to two formats including an image or a sound.

請參閱圖2及圖3。圖2係本發明一較佳實施例之功能方塊示意圖。如圖2所示，伴唱機即時影音特效合成裝置100基本上包括一影像輸入介面11、一影像輸出介面12、一核心處理單元13、一動態影像合成處理單元14、及一特效影像庫15。更進一步地，伴唱機即時影音特效合成裝置100還可包括一歌詞字幕庫16、一內部儲存媒體17、一外部儲存媒體18、一操作單元19、一音訊處理單元20、一錄製單元21、及一網路介面22。 Please refer to Figure 2 and Figure 3. 2 is a functional block diagram of a preferred embodiment of the present invention. As shown in FIG. 2, the phonograph real-time audio and video special effects synthesizing device 100 basically includes an image input interface 11, an image output interface 12, a core processing unit 13, a motion image synthesis processing unit 14, and a special effect image library 15. Further, the phonograph real-time audio and video special effects synthesizing device 100 may further include a lyrics subtitle library 16, an internal storage medium 17, an external storage medium 18, an operation unit 19, an audio processing unit 20, a recording unit 21, and A network interface 22.

影像輸入介面11可連結至影像輸出裝置C，用以輸入影像輸出裝置C所輸出之影像。其中，影像輸出裝置C1可包含但不限於電荷耦合元件(CCD)、攝影機(camera)、電腦或網路攝影機(PC-cam,Web-cam)、或照相機等具有擷取影像功能的裝置，亦可為任何多媒體播放器(Media player)。影像輸出介面11可連結至顯示裝置D，用以輸出影像至顯示裝置D。其中，顯示裝置D可包含但不限於CRT、液晶顯示幕(LED,LCD)或電漿顯示幕、投影機等。其中，影像輸入介面10與影像輸出介面20可包含但不限於HDMI介面、CVBS介面、色差端子介面、AV端子介面、VGA介面、DVI介面、無線訊號傳輸介面等。 The image input interface 11 can be connected to the image output device C for inputting the image output by the image output device C. The image output device C1 may include, but is not limited to, a charge coupled device (CCD), a camera, a computer or a webcam (PC-cam), or a camera having a function of capturing images. Can be any multimedia player (Media player). The image output interface 11 can be coupled to the display device D for outputting images to the display device D. The display device D may include, but is not limited to, a CRT, a liquid crystal display (LED, LCD) or a plasma display screen, a projector, and the like. The image input interface 10 and the image output interface 20 may include, but are not limited to, an HDMI interface, a CVBS interface, a color difference terminal interface, an AV terminal interface, a VGA interface, a DVI interface, a wireless signal transmission interface, and the like.

核心處理單元13可為一中央處理器(CPU)或是一單晶片系統 (SOC)且可為控制伴唱機M之主控元件。 The core processing unit 13 can be a central processing unit (CPU) or a single chip system (SOC) and can be the main control element for controlling the phonograph M.

動態影像合成處理單元14耦接至影像輸入介面11並與核心處理單元13通訊連接。動態影像合成處理單元14可為一單獨的數位信號處理器(DSP)，亦可集成至核心處理單元13或者是設置在機上盒(setup-box)。另外，動態影像合成處理單元14內可寫入可執行之程序。動態影像合成處理單元14可經由影像輸入介面11接收影像輸出裝置C所擷取之影像所產生之一原始影像資料。影像輸出裝置C所擷取之影像可經由影像輸出介面12呈現在顯示裝置D的畫面D1中。在本實施例中，影像輸出裝置C為一攝影機，其係可以擷取一連續動態影像。影像輸出裝置C所擷取之連續動態影像於顯示裝置D的畫面D1中呈現即時入鏡於影像輸出裝置C的至少一使用者A(或可稱為演唱者)，亦即影像輸出裝置C所擷取之連續動態影像為使用者之即時影像。也可以說，影像輸出裝置C可即時擷取使用者A之即時影像並經由類比/數位轉換後成為動態影像合成處理單元14所接收的一原始影像資料。 The dynamic image synthesis processing unit 14 is coupled to the image input interface 11 and communicatively coupled to the core processing unit 13. The motion picture synthesis processing unit 14 can be a separate digital signal processor (DSP), can be integrated into the core processing unit 13 or can be placed in a set-box. In addition, an executable program can be written in the motion picture synthesis processing unit 14. The motion image synthesis processing unit 14 can receive one of the original image data generated by the image captured by the image output device C via the image input interface 11. The image captured by the image output device C can be presented in the screen D1 of the display device D via the image output interface 12. In this embodiment, the image output device C is a camera that can capture a continuous motion image. The continuous motion image captured by the image output device C presents at least one user A (or may be referred to as a singer) that is immediately incident on the image output device C in the screen D1 of the display device D, that is, the image output device C The continuous motion image captured is a real-time image of the user. It can also be said that the image output device C can instantly capture the real-time image of the user A and convert it into an original image data received by the dynamic image synthesis processing unit 14 via analog/digital conversion.

特效影像庫15可建置於內部儲存媒體17或是直接置入於動態影像合成處理單元14。內部儲存媒體17可包含但不限於一快閃記憶體、一暫存記憶體、一隨機存取記憶體。特效影像庫15預先儲放有多個特效影像資料以供選擇與應用。特效影像庫15至少包含以下：雪花、星星特效影像資料、泡泡特效影像資料、場景特效影像資料、臉部特效影像等常常出現在MV影片中各式各樣具有情境效果及視覺效果之特效影像資料。上述特效影像資料可包含一動態物件、一靜態物件。 The effect image library 15 can be built into the internal storage medium 17 or directly placed in the motion picture synthesis processing unit 14. The internal storage medium 17 can include, but is not limited to, a flash memory, a temporary memory, and a random access memory. The special effect image library 15 pre-stores a plurality of special effect image materials for selection and application. The special effect image library 15 includes at least the following: snowflake, star special effect image data, bubble special effect image data, scene special effect image data, facial special effect image, and the like, and various special effects images with visual effects and visual effects often appear in the MV film. data. The above special effect image data may include a dynamic object and a static object.

動態影像合成處理單元14經由影像輸入介面11接收影像輸出裝置C所擷取之影像所產生之原始影像資料後，依據指令自特效影像庫15讀取一特效影像資料或一個以上的特效影像資料加以搭配應用，使原始影像資料與特效影像資料進行即時合成處理，以產生一合成影像資料輸出至核心處理單元13，並經由影像輸出介面12即時顯示在顯示裝置D的畫面D1中。細部來說，動態影像合成處理單元14可依據其寫入的程序對原始影像資料自動進行各種預定的動態影像處理及預定的動態合成處理，例如：亮度、色調、對比度、飽和度，畫面分割、堆疊、旋轉處理等。亦可進行臉部辨識去背景合成處理、或臉部複製去背景合成處理、或者人體辨識去背景合成處理等，以使原始影像資料呈現出類似MV影片效果的畫面。進而使原始影像資料經處理後與特效影像資料進行即時合成，以產生合成影像資料。 The dynamic image synthesis processing unit 14 receives the original image data generated by the image captured by the image output device C via the image input interface 11, and then reads a special effect image data or one or more special effect image data from the special effect image library 15 according to the instruction. The application image is combined with the original image data and the special effect image data to generate a composite image data output to the core processing unit 13 and output through the image. The interface 12 is instantly displayed in the screen D1 of the display device D. In detail, the motion image synthesis processing unit 14 can automatically perform various predetermined motion image processing and predetermined dynamic composition processing on the original image data according to the program written therein, for example, brightness, hue, contrast, saturation, screen segmentation, Stacking, rotation processing, etc. It is also possible to perform face recognition to background synthesis processing, or face copying to background synthesis processing, or human body recognition to background synthesis processing, etc., so that the original image data presents a picture similar to the MV movie effect. The original image data is processed and then combined with the special effect image data to generate synthetic image data.

歌詞字幕庫16可建置於內部儲存媒體17且預先儲放有多首歌曲的歌詞字幕資料。核心處理單元13係耦接歌詞字幕庫16，且當核心處理單元13接收由動態影像合成處理單元14所輸出之合成影像資料後，可由歌詞字幕庫16讀取對應歌曲的歌詞字幕資料，並將對應歌曲的歌詞字幕資料與合成影像資料進行疊加處理及塗字處理，而塗字處理係指將歌詞字幕資料顯示出的歌詞顏色隨著歌曲進行由白色塗滿成藍色，當然也可以由白色塗滿成紅色或綠色，在此並不對塗字顏色作任何的限定，更可以藉由不同的視覺效果明白地區隔目前歌曲的進度(例如：依歌曲進度所呈現的圓點效果)，然後將合成影像資料及歌詞字幕資料經由影像輸出介面12即時呈現於顯示裝置D的畫面D1中，以即時呈現出類似MV影片效果以及含有歌詞塗字效果的畫面。 The lyrics subtitle library 16 can be built on the internal storage medium 17 and pre-stored lyrics subtitle data of a plurality of songs. The core processing unit 13 is coupled to the lyrics subtitle library 16, and after the core processing unit 13 receives the synthesized image data output by the dynamic image synthesis processing unit 14, the lyrics subtitle data of the corresponding song can be read by the lyrics subtitle library 16 and The lyrics subtitle data of the corresponding song and the synthetic image data are superimposed and painted, and the word processing refers to the color of the lyrics displayed by the lyrics subtitle data is white-colored with the song, and may of course be white Painted in red or green, there is no limit to the color of the painting. You can also understand the progress of the current song (for example, the dot effect according to the progress of the song) by different visual effects, and then The synthesized image data and the lyrics subtitle data are instantly presented in the screen D1 of the display device D via the image output interface 12, so as to instantly display a picture similar to the MV movie effect and the lyric composition effect.

操作單元19耦接核心處理單元19。操作單元19可包含但不限於一遙控器、一平板、一智慧手機、一感測器等。操作單元19具有一操作介面191，其可為多個功能性按鍵、或一使用者圖形介面、一人體動作感知介面、一聲控介面等。因此，經由人體動作或聲音下達指令，或是經由按壓或觸碰操作介面191上的功能性按鍵或功能性圖像可供使用者輸入指令，使核心處理單元13及動態影像合成處理單元14根據使用者自操作介面191所輸入之指令而進行相對應之操作。因此，當顯示裝置D的畫面D1中呈現出即時入鏡於影像輸出裝置C的使用者A後，使用者A可以透過操作介面191下達指令而從特效影像庫15中選擇一特效影像資料或兩個以上的特效影像資料與原始影像資料進行即時特效合成，並即時呈現於顯示單元D的畫面D1中，以供使用者A即時觀看。舉例來說，當使用者A想在畫面D1中呈現置身在下雪的情境，即可透過操作介面191下達指令從特效影像庫15選擇雪花特效影像資料進行即時合成，或者可再搭配改變色調之特效影像資料進行即時合成，即可猶如置身在下雪又變色的情境裡。另外，也可由特效影像庫15選擇其他的特效影像資料，例如臉部特效影像資料進行即時合成，即可呈現臉部特效的視覺效果。除此之外，使用者A還可透過操作介面191之功能設定下達指令，以加入美術效果處理，例如油畫效果、水彩效果、素描效果等處理。藉此，使用者A於歌曲歡唱中透過操作介面191下達指令選擇偏好的特效影像資料與包含有使用者A之原始影像資料進行即時合成而即時呈現各式各樣的情境效果及視覺效果於顯示裝置D的畫面D1中，讓使用者猶如MV影片導演一般。 The operating unit 19 is coupled to the core processing unit 19. The operating unit 19 can include, but is not limited to, a remote controller, a tablet, a smart phone, a sensor, and the like. The operating unit 19 has an operation interface 191, which can be a plurality of functional buttons, or a user graphical interface, a human motion sensing interface, a voice control interface, and the like. Therefore, the core processing unit 13 and the motion image synthesis processing unit 14 are configured according to a human body motion or sound command, or by pressing or touching a functional button or a functional image on the operation interface 191 for the user to input an instruction. The user performs an operation corresponding to the instruction input by the operation interface 191. Therefore, when the display device D is displayed in the screen D1 Immediately after the user A of the image output device C, the user A can select an effect image data or two or more special effect image data and the original image data from the special effect image library 15 through the operation interface 191 to issue an instruction. The special effects are synthesized and instantly presented in the screen D1 of the display unit D for the user A to view instantly. For example, when the user A wants to present the situation in the snow D1 in the screen D1, the user can select the snowflake effect image data from the special effect image library 15 to perform instant synthesis through the operation interface 191, or can be combined with the effect of changing the color tone. The instant synthesis of the image data is like being in a snowy and discolored situation. In addition, other special effect image materials, such as facial special effect image data, can be selected by the special effect image library 15 for real-time synthesis, and the visual effects of the facial effects can be presented. In addition, the user A can also set the instruction through the function of the operation interface 191 to add art effect processing, such as oil painting effect, watercolor effect, sketch effect and the like. In this way, the user A performs the instant synthesis of the special effect image data selected by the operation interface 191 and the original image data including the user A in the song singer to instantly present various various situational effects and visual effects. In the screen D1 of the display device D, the user is made to be like a MV film director.

音訊處理單元20可內建在核心處理單元13內或為單獨的音效晶片或音效卡耦接至核心處理單元13。音訊處理單元20用以接收多個或一個使用者A之人聲(或者說使用者之歌聲)並從內部儲存媒體17中選出一對應歌曲之背景音樂進行疊加混合之混音處理，以產生一混音資料。 The audio processing unit 20 can be built into the core processing unit 13 or coupled to the core processing unit 13 as a separate audio or sound card. The audio processing unit 20 is configured to receive a plurality of or one user A voice (or a user's voice) and select a background music of the corresponding song from the internal storage medium 17 to perform superimposition and mixing processing to generate a mixture. Audio data.

錄製單元21可內建在核心處理單元13內或為單獨的錄製晶片耦接至核心處理單元13。錄製單元21用以選擇性地錄製合成影像資料、混音資料、及歌曲之字幕資料以形成一影音檔。藉此，使用者A可經由操作單元19的操作介面191設定錄製功能選項，選擇只錄影或同時錄影及錄音，以將含有使用者A自拍並合成後的即時影像或歌聲錄製起來，並進一步儲存在外部儲存媒體18。外部儲存媒體18可包含有一硬碟、一光碟、一隨身碟、及一記憶卡。 The recording unit 21 can be built into the core processing unit 13 or coupled to the core processing unit 13 for a separate recording chip. The recording unit 21 is configured to selectively record the synthesized image data, the mixed data, and the subtitle data of the song to form a video file. Thereby, the user A can set the recording function option via the operation interface 191 of the operation unit 19, and select only the video recording or the simultaneous recording and recording to record the instant image or the singing voice containing the self-photographed and synthesized by the user A, and further store the image. The medium 18 is stored externally. The external storage medium 18 can include a hard disk, a compact disc, a flash drive, and a memory. card.

網路介面22可以為一以太網路介面卡或者一無線網路介面卡耦接至核心處理單元13。網路介面22用以有線或無線連結至一網路。上述網路可以為無線網路、區域網路或網際網路。藉此，使用者A可經由操作單元19的操作介面191設定網路功能選項以進行網路連結，進而使儲存在外部儲存媒體18的影音檔被核心處理單元13讀出後上傳至網路進行分享。另外，也可透過網路介面22直接進行無線傳輸，進而使影音檔無限傳輸至無線訊號之距離範圍內的平板或智慧手機進行分享。讓使用者A自拍並合成後的即時影像及聲音所錄製起來的影音檔可立即分享給其親友、特定人或大眾觀賞。 The network interface 22 can be coupled to the core processing unit 13 for an Ethernet interface card or a wireless network interface card. The network interface 22 is used to connect to a network by wire or wirelessly. The above network can be a wireless network, a regional network or an internet network. Therefore, the user A can set the network function option to perform network connection via the operation interface 191 of the operation unit 19, so that the video file stored in the external storage medium 18 is read by the core processing unit 13 and then uploaded to the network. share it. In addition, the wireless transmission can be directly performed through the network interface 22, so that the audio and video files can be transmitted to the tablet or the smart phone within the distance range of the wireless signal for sharing. The video files recorded by the user A's self-portrait and synthesized instant video and sound can be immediately shared with their friends, relatives or the public.

請參考圖4，並配合參考圖2。如圖4所示，為本發明之一種應用於伴唱機之即時自拍特效合成MV之方法之主要流程步驟，本方法用以即時合成至少一使用者A之影像，而使用者之影像可由任一影像輸出裝置C所即時擷取並經過編解碼及壓縮等處理而產生使用者之原始影像資料。首先，由動態影像處理單元14接收上述使用者之原始影像資料(S201)，此時可視需要將該原始影像資料進行動態影像前處理(例如：亮度、色調、對比度、飽和度、畫面分割、堆疊、旋轉處理等)(S203)，然後可由動態影像處理單元14判別並從特效影像庫15選取至少一特效影像資料(S205)(例如：雪花特效影像資料、下雨特效影像資料、星星特效影像資料、場景特效影像資料)。其中，判別選取特效影像資料的方法可根據一亂數法則從一特效影像庫15中隨機地選出至少一特效影像資料，或根據特定的順序從一特效影像庫15中依序地選出該至少一特效影像資料，或者是根據使用者喜好由操作單元19所輸入之指令從一特效影像庫15中選擇該至少一特效影像資料，或者是根據系統裝置偵測音樂的類型或節拍，自動選擇對應的特效影像資料。之後，動態影像合成處理單元14將處理後的原始影像資料與特效影像資料進行即時動態合成而形成一合成影像資料(S207)。其中，處理後的原始影像資料與特效影像資料即時動態合成的方法可利用臉部辨識技術，擷取使用者之臉部作為前景，然後與特效影像資料進行即時去背影合成，以形成合成影像資料，或者利用人體辨識技術，擷取使用者之全身作為前景，然後與特效影像資料進行即時去背景合成，以形成合成影像資料。之後，可視需要將合成影像資料進行動態影像後處理(S209)(例如：油畫效果、水彩效果、素描效果等美術效果處理)，換言之，可依據指令對合成影像資料進行運算以使合成影像資料呈現出具有油畫、水彩或素描等美術效果的畫面。最後，疊加上歌詞字幕資料(含塗字效果)(S211)，並顯示經處理後的合成影像資料及歌詞字幕資料於一顯示裝置D的畫面D1中(S211)，以即時呈現出類似MV影片效果以及含有塗字效果的畫面。另外，還可接收一包含使用者之歌聲及一歌曲之背影音樂的混音資料，並選擇只錄製合成影像資料、歌詞字幕資料、或混音資料以形成一影音檔，進而儲存該影音檔或傳送該影檔以供分享。 Please refer to FIG. 4 with reference to FIG. 2 . As shown in FIG. 4, it is a main flow step of a method for synthesizing an MV of an instant self-timer effect applied to a phonograph, and the method is used for synthesizing at least one image of the user A, and the image of the user can be any The image output device C immediately captures and performs processing such as encoding and decoding and compression to generate original image data of the user. First, the original image data of the user is received by the dynamic image processing unit 14 (S201), and the original image data may be subjected to dynamic image pre-processing (eg, brightness, hue, contrast, saturation, picture segmentation, stacking). (S203), and then the motion image processing unit 14 can determine and select at least one special effect image data (S205) from the special effect image library 15 (for example, snowflake effect image data, rain effect image data, star effect image data) , scene effects video data). The method for determining the selected effect image data may randomly select at least one special effect image data from a special effect image library 15 according to a random number rule, or sequentially select the at least one from a special effect image library 15 according to a specific order. The special effect image data, or the instruction input by the operation unit 19 according to the user's preference, selects the at least one special effect image data from a special effect image library 15, or automatically selects the corresponding type according to the type or beat of the music detected by the system device. Special effects image data. After that, the motion image synthesis processing unit 14 processes the processed original image data and special effect image resources. The material is subjected to real-time dynamic synthesis to form a synthetic image data (S207). The method for real-time dynamic synthesis of the processed original image data and the special effect image data can utilize the face recognition technology to capture the face of the user as a foreground, and then perform instant back-shadow synthesis with the special effect image data to form a synthetic image data. Or use the human body identification technology to capture the user's whole body as a foreground, and then perform instant background synthesis with the special effect image data to form a synthetic image data. After that, the synthetic image data may be subjected to dynamic image post-processing (S209) (for example, oil painting effect, watercolor effect, sketch effect, etc.), in other words, the synthetic image data may be calculated according to the instruction to render the synthetic image data. A picture with artistic effects such as oil painting, watercolor or sketch. Finally, the lyrics subtitle data (including the wording effect) is superimposed (S211), and the processed synthetic image data and the lyrics subtitle data are displayed on the screen D1 of a display device D (S211) to instantly display an MV-like movie. The effect and the screen with the matching effect. In addition, a sound mixing material including a user's singing voice and a back music of a song may be received, and only the synthesized image data, the lyrics subtitle data, or the mixed data may be recorded to form a video file, and then the video file may be stored or The image file is transferred for sharing.

以上所述僅為本發明之較佳可行實施例，非因此侷限本發明之專利範圍，故舉凡運用本發明說明書及圖示內容所為之等效技術變化，均包含於本發明之範圍內。 The above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention, and the equivalents of the present invention are intended to be included within the scope of the present invention.

C‧‧‧影像輸出裝置 C‧‧‧Image output device

D‧‧‧顯示裝置 D‧‧‧ display device

D1‧‧‧畫面 D1‧‧‧ screen

11‧‧‧影像輸入介面 11‧‧‧Image input interface

12‧‧‧影像輸出介面 12‧‧‧Image output interface

13‧‧‧核心處理單元 13‧‧‧Core Processing Unit

15‧‧‧特效影像庫 15‧‧‧Special Effects Image Library

16‧‧‧歌詞字幕庫 16‧‧‧ Lyrics subtitle library

17‧‧‧內部儲存媒體 17‧‧‧Internal storage media

18‧‧‧外部儲存媒體 18‧‧‧External storage media

19‧‧‧操作單元 19‧‧‧Operating unit

191‧‧‧操作介面 191‧‧‧Operator interface

20‧‧‧音訊處理單元 20‧‧‧Optical Processing Unit

21‧‧‧錄製單元 21‧‧‧recording unit

22‧‧‧網路介面 22‧‧‧Network interface

Claims

A system device for real-time self-timer special effect synthesis MV for a phonograph machine, comprising: an image input interface for inputting an image output by an image output device to generate an original image data; and a special effect image library for storing Having at least one special effect image data; a dynamic image synthesis processing unit coupled to the image input interface and the special effect image library for receiving and processing the original image data, and reading the at least one special effect image by the special effect image library The data is combined with the original image data to generate a synthetic image data; a lyrics subtitle library is used to store at least one song lyrics subtitle data; a core processing unit is coupled to the lyrics subtitle library and Communicating with the dynamic image synthesizing processing unit for receiving and processing the synthesized image data, and reading the lyrics subtitle data of the at least one song from the lyrics subtitle library and superimposing and smearing the synthesized image data Processing and an image output interface coupled to the core processing unit for outputting the synthesized image data and And displaying, by the display device, the synthesized image data and the lyrics subtitle data of the at least one song; wherein the dynamic image synthesis processing unit is based on the at least one song Randomly selecting the at least one special effect image data from the special effect image library; or sequentially selecting the at least one special effect image data from the special effects image library according to a specific order; or according to an instruction input by an operation unit The at least one special effect image data is selected in the special effect image library.

The system device for applying the instant self-timer effect synthesis MV of the karaoke machine according to claim 1, wherein the image output by the image output device is an instant image of a user.

The system device for applying the instant self-timer effect synthesis MV of the phonograph according to claim 1, wherein the special effect image data is at least one of the following: snowflake effect image data, star effect image data, bubble special effect image data, scene Special effects image data.

The system device for applying the instant self-timer effect synthesis MV of the phonograph according to claim 1, wherein the special effect image data comprises at least one of the following: a dynamic object and a static object.

The system device for applying the instant self-timer effect synthesis MV of the karaoke machine according to claim 1, wherein the operation unit is coupled to the core processing unit, and the operation unit has an operation interface for the user to input an instruction to enable the core processing The unit and the motion picture synthesis processing unit perform corresponding operations according to instructions input from the operation interface.

The system device for the instant self-timer effect synthesis MV of the karaoke machine according to claim 1, further comprising an audio processing unit coupled to the core processing unit, the audio processing unit for receiving at least one use The vocal of the person is superimposed and mixed with the background music of a corresponding song to generate a mixed material.

The system device for the instant self-timer effect synthesis MV of the karaoke machine according to claim 6, further comprising a recording unit coupled to the core processing unit, the recording unit for selectively recording the composite image The data, the mixing material, and the lyrics subtitle data to form a video file.

The system device for the instant self-timer effect synthesis MV of the karaoke machine according to claim 7, further comprising a storage medium, the storage medium comprising an internal storage medium and an external storage medium, wherein the internal storage medium is coupled to the motion image a processing unit and the core processing unit, wherein the internal storage medium is used to store the lyrics subtitle library and the special effect image library, the external storage medium is coupled to the recording unit and the core processing unit, and the external storage medium is used for storing The video file.

The system device for applying the instant self-timer effect synthesis MV of the karaoke machine according to claim 8, further comprising a network interface coupled to the core processing unit for connecting to a network, thereby enabling The video file is read by the core processing unit and uploaded to the network for sharing.

A method for synthesizing MVs by using a real-time self-timer effect of a phonograph machine to instantly synthesize at least one user's image, and the method for synthesizing MVs by using an instant self-timer effect of a phonograph includes: receiving original image data of one of the users; And selecting at least one special effect image data; wherein the method for determining and selecting at least one special effect image data is: randomly selecting the special effect image data from a special effect image library according to at least one song; or selecting a special effect image according to a specific order Selecting the special effect image data in the library sequentially; or selecting the special effect image data from a special effect image library according to an instruction input by an operation unit; synthesizing the original image data and the special effect image data to form a synthetic image data Superimposing the lyrics subtitle data; and displaying the synthesized image data and the lyrics subtitle data.

The method for synthesizing an MV of an instant self-timer effect applied to a phonograph according to claim 10, further comprising performing dynamic image pre-processing on the original image data.

The method for synthesizing a MV for an instant self-timer effect applied to a phonograph according to claim 10, further comprising performing dynamic image post-processing on the synthesized image data.

The method for synthesizing an MV of an instant self-timer effect applied to a karaoke machine according to claim 10, wherein the method of synthesizing the original image data and the special effect image data is performed by performing facial recognition on the original image data of the user. The user's face is captured as a foreground and the special effect image data is instantly combined to form a background image to form the synthesized image data.

The method for synthesizing an MV of an instant self-timer effect applied to a phonograph according to claim 10, wherein the method for synthesizing the original image data and the special effect image data in real time Performing human body recognition on the original image data of the user and extracting the whole body of the user as a foreground and the special effect image data for immediate background synthesis to form the synthesized image data.