WO2008116400A1

WO2008116400A1 - A terminal, method and system for realizing video communication

Info

Publication number: WO2008116400A1
Application number: PCT/CN2008/070237
Authority: WO
Inventors: Jing Lv
Original assignee: Tencent Technology (Shenzhen) Company Limited
Priority date: 2007-03-28
Filing date: 2008-02-01
Publication date: 2008-10-02
Also published as: CN101193261A; CN101193261B

Abstract

A terminal for realizing video communication is provided, it includes: an image analyzing unit for dividing the video images of video sequence into at least two portions (202), an image processing unit for encoding each portion of the at least two portions of the video images respectively (203); a data transmitting unit for outputting the encoded data of the video images (204). A method and system for realizing video communication are also provided.

Description

Terminal, method and system for realizing video communication

The present invention relates to the field of computer graphics technology, and more particularly to a terminal, method and system for implementing video communication. Background of the invention

With the development of video coding technology and digital image processing technology, it is now possible to perform some automatic or semi-automatic sub-regional processing on video sequences to meet the different needs of different applications.

The above digital image processing techniques include techniques such as image segmentation, image description, and recognition. Image segmentation is the extraction of meaningful features from images. The meaningful features include edges, regions, etc. in the image, which is the basis for further image recognition, analysis and understanding. Image description is a necessary prerequisite for image recognition and understanding. As the most compact binary image, its geometric characteristics can be used to describe the characteristics of the object. The general image description method uses two-dimensional shape description, which has two kinds of methods: boundary description and region description. Image classification (recognition) belongs to the category of pattern recognition. Its main content is image segmentation and feature extraction after some preprocessing (enhancement, restoration, compression), or matching identification through some a priori features. Conduct a classification of judgments.

Most existing video communications, such as video communications in Instant Messenger (IM), use traditional video encoding modes. When performing video communication using an existing video communication system, the following process is generally included: the transmitting terminal picks up a video image through a camera; encodes the taken video image into video data; transmits the encoded video data to a receiving terminal; and receives the terminal Decode and play the received encoded data.

However, in the above video communication, a single frame (single image) in the video sequence is The entire video sequence is rendered in the same resolution, with no different processing for different regions. There is a problem with this: When users are reluctant to expose an area of the video, such as their own environment, to the other party, and they want to use the video function, there is no suitable solution. Summary of the invention

In view of this, the embodiments of the present invention provide a terminal, a method, and a system for implementing video communication, which solve the problem that the prior art cannot perform different resolution processing for different areas in the video.

A terminal for implementing video communication according to an embodiment of the present invention includes:

An image analyzing unit, configured to divide the video image in the video sequence into at least two parts;

An image processing unit, configured to respectively encode each of the at least two portions of the video image;

a data transmission unit, configured to output encoded data of the video image.

A method for implementing video communication according to an embodiment of the present invention includes:

Segmenting the video image in the video sequence into at least two parts;

Each of the at least two portions of the video image is encoded separately; the encoded data of the video image is output.

A system for implementing video communication according to an embodiment of the present invention includes:

a transmitting terminal, configured to divide a video image in the video sequence into at least two parts; respectively encode each of the at least two portions of the video image, and output encoded data of the video image;

And a receiving terminal, configured to receive encoded data of the video image from the transmitting terminal, and decode and play the video image. According to the foregoing technical solution, a terminal, a method, and a system for implementing video communication according to an embodiment of the present invention provide a method for dividing a video image into different regions and different regions according to characteristics of the existing video communication. The use of different resolution processing methods satisfies the individualized requirements in the video communication process, such as privacy protection in the video communication process, and solves the problem that the prior art cannot perform different resolution processing for different areas in the video. BRIEF DESCRIPTION OF THE DRAWINGS

1 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention; FIG. 2 is a schematic flowchart of a method for implementing video communication according to an embodiment of the present invention; FIG. 3 is a schematic structural diagram of a terminal for implementing video communication according to Embodiment 1 of the present invention; FIG. 4 is a schematic flowchart of a method for implementing video communication according to Embodiment 1 of the present invention; FIG. 5 is a schematic structural diagram of an image analyzing unit according to Embodiment 2 of the present invention;

6 is a schematic flowchart of a method for implementing video communication according to Embodiment 2 of the present invention; FIG. 7 is a schematic structural diagram of an image analyzing unit according to Embodiment 3 of the present invention;

8 is a schematic flowchart of a method for implementing video communication according to Embodiment 3 of the present invention; FIG. 9 is a schematic structural diagram of an image processing unit according to Embodiment 4 of the present invention;

10 is a schematic flowchart of a method for implementing video communication according to Embodiment 4 of the present invention; FIG. 11 is a schematic structural diagram of an image processing unit according to Embodiment 5 of the present invention; FIG. 12 is a flowchart of a method for implementing video communication according to Embodiment 5 of the present invention; FIG. 13 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention; FIG. 14 is a schematic structural diagram of a method for implementing video communication according to an embodiment of the present invention; FIG. 16 is a schematic flowchart diagram of a system method for implementing video communication according to an embodiment of the present invention. Mode for carrying out the invention

In order to make the objects, the technical solutions and the advantages of the present invention more comprehensible, the present invention will be further described in detail below with reference to the accompanying drawings. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

The terminal, the method and the system for implementing video communication provided by the embodiment of the present invention, after acquiring the video sequence, dividing the video image in the video sequence into at least two parts; each part of at least two parts of the video image The encoding is performed separately, and the encoded video image data is output.

Referring to FIG. 1, FIG. 1 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention. As shown in FIG. 1, the terminal includes: an image analyzing unit 102, an image processing unit 103, and a data transfer unit 104.

The image analyzing unit 102 is configured to divide the video image in the video sequence into at least two parts.

The image processing unit 103 is for encoding each of the at least two portions of the video image, respectively.

The data transmission unit 104 is for outputting encoded data of the above video image.

Referring to FIG. 2, FIG. 2 is a schematic flowchart of a method for implementing video communication in an embodiment of the present invention. As shown in Figure 2, the method includes:

Step 202: Divide the video image in the video sequence into at least two parts.

Step 203: Encode each of the at least two parts of the video image separately.

Step 204: Output encoded data of the video image. Method and system.

Embodiment 1 Referring to FIG. 3, FIG. 3 is a schematic structural diagram of a terminal for implementing video communication according to Embodiment 1 of the present invention. As shown in FIG. 3, the terminal includes: an image analysis unit 302, an image processing unit 303, and a data transmission unit 304. The terminal further includes: a video collection unit 301.

The video collection unit 301 is configured to collect a video sequence, and send the video sequence to the image analysis unit 302. The video image in the sequence is split into at least two parts.

The image processing unit 303 is for encoding each of the at least two portions of the video image, respectively.

The data transmission unit 304 is for outputting encoded data of the above video image.

Referring to FIG. 4, FIG. 4 is a schematic flowchart of a method for implementing video communication according to Embodiment 1 of the present invention. As shown in Figure 4, the method includes:

Step 401: Collect a video sequence.

Step 402: Analyze a video image in the video sequence, and divide the video image into at least two parts.

Step 403: Encode each of the at least two parts of the video image separately.

Step 404: Output encoded data of the video image.

Embodiment 2

This embodiment describes the image analysis unit in detail based on the first embodiment. As shown in FIG. 3, the terminal for implementing video communication in the second embodiment of the present invention includes: a video capture unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.

The video collection unit 301 is configured to collect a video sequence. In the present embodiment, the video capture unit 301 can include a video capture device (eg, a camera connected to a computer, etc.) and corresponding video image processing software. The image analyzing unit 302 is configured to acquire a video image from the video sequence collected by the video capturing unit 301, analyze and identify the video image, and divide the video image into a first portion and a second portion.

Referring to FIG. 5, FIG. 5 is a schematic structural diagram of an image analyzing unit according to Embodiment 2 of the present invention. As shown in FIG. 5, in the second embodiment of the present invention, the image analyzing unit 302 includes an image recognizing unit 502, configured to analyze a video image in the video sequence, and extract a foreground portion and a background from the video image. Part, where for example the foreground part is the first part and the background part is the second part.

In a specific implementation, the image recognition unit 502 can analyze the pixel changes in the before and after image frames by comparing the front and rear image frames (usually bitmaps) in the video sequence to obtain the foreground portion and the background portion in the video image. The image analyzing unit 302 compares the first 3 - 5 image frames of the current image frame in the video sequence. If the pixels in the same position in the preceding and following image frames are unchanged, the position is a stationary point; if the image in the same position in the preceding and succeeding frames If the elements are different, the position is a moving point; the image analyzing unit 302 takes a plurality of moving points and a still point surrounded by the plurality of moving points as a foreground portion, and the other portion is a background portion.

Of course, those skilled in the art will appreciate that the present invention is exemplified by the above-described foreground and background recognition manners, and is not intended to limit the scope of the present invention. Image analysis unit 302 can also use any of the existing foreground and background recognition methods to obtain the foreground portion and the background portion of the video image.

The image processing unit 303 is for encoding the first portion and the second portion of the video image transmitted by the image analyzing unit 302.

The data transmission unit 304 is for outputting the encoded data of the video image transmitted by the image processing unit 303.

Referring to FIG. 6, FIG. 6 is a schematic flowchart diagram of a method for implementing video communication according to Embodiment 2 of the present invention. As shown in FIG. 6, the method includes: Step 601: The terminal collects a video sequence by using a camera or the like.

Step 602: The video image in the video sequence is divided into a first part and a second part by a foreground and a background recognition manner.

In analyzing the video image in the above video sequence, the foreground portion and the background portion are extracted from the above video image. In a second embodiment of the invention, the first portion is the foreground portion of the video image and the second portion is the background portion of the video image. In the specific implementation, the foreground and background portions in the video image can be obtained by comparing the front and rear image frames (usually bitmaps) in the video sequence to analyze the pixel changes in the image frames before and after.

Of course, any existing foreground and background recognition methods can be used to obtain the foreground and background portions of the video image.

Step 603: Encode the first part and the second part of the video image.

Step 604: Output the above encoded data of the video image.

The terminal and the method for realizing video communication in this embodiment segment the video image by using foreground and background recognition modes, which satisfies the problem of individualization in the video communication process and solves the problem of resolution processing.

Embodiment 3

This embodiment describes the image analysis unit in detail based on the first embodiment. As shown in FIG. 3, the terminal for implementing video communication in Embodiment 3 of the present invention includes: a video capture unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.

The video collection unit 301 is the same as the first embodiment.

The image analyzing unit 302 is configured to obtain a video image from the video sequence acquired by the video capturing unit 301, and divide the video image into the first portion and the second portion by manual selection.

Referring to FIG. 7, FIG. 7 is a schematic structural diagram of an image analyzing unit according to Embodiment 3 of the present invention. As shown in FIG. 7, in the third embodiment of the present invention, the image analyzing unit 302 includes an image selecting unit 702. The image selection unit 702 is configured to select a partial region from the video image of the video sequence, for example, a rectangular, circular, or irregularly shaped region by a mouse frame. And, for example, the selected area is taken as the first part, and the unselected area is taken as the second part.

Referring to FIG. 8, FIG. 8 is a schematic flowchart of a method for implementing video communication according to Embodiment 3 of the present invention. As shown in Figure 8, the method includes:

Step 801: The terminal collects a video sequence by using a camera or the like.

Step 802: The area selected in the video image is taken as the first part, and the unselected area is taken as the second part. That is, a partial region is selected from the video images of the above video sequence, and the selected region can be taken as the first portion and the unselected region as the second portion.

Step 803: Encode the first part and the second part of the video image.

Step 804: Output the above encoded data of the video image.

The terminal and method for realizing video communication in this embodiment solves the existing problem by segmenting the video image by manual selection, which satisfies the individualized requirements in the video communication process.

Embodiment 4

This embodiment describes the image processing unit in detail based on the first embodiment. As shown in FIG. 3, the terminal for implementing video communication in Embodiment 4 of the present invention includes: a video collection unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304. The video collection unit 301 is the same as the first embodiment.

The image analyzing unit 302 is configured to obtain a video image from the video sequence acquired by the video capturing unit 301, and divide the video image into a first portion and a second portion.

Referring to FIG. 9, FIG. 9 is a schematic structural diagram of an image processing unit according to Embodiment 4 of the present invention. As shown in Fig. 9, in the fourth embodiment of the present invention, the image processing unit 303 includes an image encoding unit 903. The image encoding unit 903 is configured to encode the video image divided by the image analyzing unit 302: encoding the first portion of the video image using the first quality parameter, and using the second quality parameter for the second portion of the video image coding. The first quality parameter and the second quality parameter are used to control the compression ratio at the time of encoding. For example, it may be specified that the larger the quality parameter value is, the smaller the data amount is, and the corresponding image quality is degraded. Of course, in practical applications, it can also be specified that the smaller the quality parameter value is, the smaller the data amount is, and the corresponding image quality is degraded.

The quality parameter strategy can be processed by setting a default value, or by the user within a certain range, that is, the user himself controls the clarity of the foreground and the background separately. Therefore, the image encoding unit 903 may further include: a parameter setting unit (not shown) for setting the first quality parameter and the second quality parameter. The image coding unit 903 performs video coding of different quality levels on the foreground and the background according to different quality parameters. The corresponding code stream of each area carries the quality level parameter corresponding to the area.

For the receiving end, after receiving the encoded data of the video image sent by the terminal, the code stream corresponding to the area without the quality level carries the quality level information of the local area, so only the corresponding processing parameters are required according to the general processing flow. Just restore the image.

The data transmission unit is configured to output encoded data of the video image transmitted by the image processing unit. Referring to FIG. 10, FIG. 10 is a schematic flowchart diagram of a method for implementing video communication according to Embodiment 4 of the present invention. As shown in FIG. 10, the method includes:

Step 1001: The terminal collects a video sequence by using a camera or the like.

Step 1002: The video image in the video sequence is divided into a first part and a second part.

Step 1003: Encode the first portion of the video image using the first quality parameter and encode the second portion of the video image using the second quality parameter.

The quality parameter strategy can be processed with default values or by the user within a certain range. Therefore, preferably, the step further includes: setting the first quality parameter and the second quality parameter.

Step 1004: Output the above encoded data of the video image.

The terminal and method for realizing video communication in this embodiment, for the characteristics of the existing video communication, identify and segment the video sequence collected by the camera, for example, and perform different quality coding on different parts, thereby satisfying the video communication user. Personalized requirements solve the problem that the prior art cannot handle different resolutions for different areas in the video. Embodiment 5

This embodiment describes the image processing unit in detail based on the first embodiment. As shown in FIG. 3, the terminal for implementing video communication in Embodiment 5 of the present invention includes: a video capture unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.

The video collection unit 301 is the same as the first embodiment.

Referring to FIG. 11, FIG. 11 is a schematic structural diagram of an image processing unit according to Embodiment 5 of the present invention; Figure. As shown in FIG. 11, in the fifth embodiment of the present invention, the image processing unit 303 includes: a pre-processing unit 11031, and an image encoding unit 11032.

The pre-processing unit 11031 is configured to perform fuzzy pre-processing on the second part of the video image provided by the image analyzing unit 302. That is, after reading the video image, the pre-processing unit 11031 performs blurring processing on the second portion before encoding, and sends the processed image to the image encoding unit 11032 for encoding at a uniform quality level.

The image encoding unit 11032 is configured to encode the first portion of the video image supplied through the image analyzing unit 302 and the second portion subjected to the blurring processing by the pre-processing unit 11031 to perform uniform quality level (same quality parameter).

For the receiving end, after receiving the encoded data of the video image sent by the terminal, the video image is encoded according to the quality parameter carried in the encoded data according to a general operation procedure.

Referring to FIG. 12, FIG. 12 is a schematic flowchart of a method for implementing video communication according to Embodiment 5 of the present invention. As shown in Figure 12, the method includes:

Step 1201: The terminal collects a video sequence by using a camera or the like.

Step 1202: Divide the video image in the video sequence into a first part and a second part.

Step 1203: Perform blur preprocessing on the second part of the video image, and encode the first part of the video image and the second part of the video image after the blur preprocessing using the same quality parameter.

Among them, fuzzy preprocessing is a prior art, similar to the common "mosaic" technology, that is, the part that needs to be processed is processed by a pre-selected algorithm, and the most simple solution is to put all the pixels and surrounding pixels in the image. Weighted mean to achieve fuzzy processing Effect. However, those skilled in the art will appreciate that the present invention is described by way of example only, and is not intended to limit the scope of the present invention.

After the second part of the video image is subjected to blur preprocessing, the same quality parameter is used to encode the first part of the video image and the second part of the video image after blur preprocessing.

Step 1204: Output the above encoded data of the video image.

The terminal and method for implementing video communication in this embodiment, for the characteristics of the existing video communication, identify and segment the video sequence collected by the camera, for example, and perform blur preprocessing on some parts, and then perform video sequence on the video sequence. The same quality coding, thereby satisfying the individualized requirements of the video communication user, solves the problem that the prior art cannot perform different resolution processing for different areas in the video.

Hereinafter, the present invention will be described in more detail with reference to a specific embodiment. However, those skilled in the art should understand that the present invention is not limited by the scope of the present invention.

At present, IM video communication adopts the traditional video coding mode, and uses the same quality parameters for a single frame (single image) or even the entire video sequence in the video sequence, and presents with the same definition, without different regions. deal with. This way, when users are not willing to expose their environment to each other and want to use the video function, there is no suitable solution.

Accordingly, the present invention provides a terminal and method for implementing video communication for solving the above technical problems. Referring to FIG. 13, FIG. 13 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention. As shown in FIG. 13, the terminal includes: a video collection unit 1301, an image analysis unit 1302, an image processing unit 1303, and a data transmission unit 1304.

The video collection unit 1301 is configured to collect a video sequence. In this embodiment, Video capture unit 1301 may include a video capture device (eg, a camera coupled to a computer, etc.) and corresponding video image processing software.

The image analyzing unit 1302 is for dividing the video image in the video sequence into a foreground portion and a background portion, wherein the foreground portion is the first portion and the background portion is the second portion.

The technical solution for dividing the video image in the video sequence into the foreground portion and the background portion can be divided into two parts: automatic implementation according to settings or interaction between users in the video process.

1) Automatically implemented according to user settings: This program provides the option "Automatic Blur Background" in the video settings. The basic idea is to highlight only the foreground information (persons) in the user's video process, and to blur the background information. The software automatically divides the foreground and background, and handles the foreground and background differently.

2) Users need to interact in the video process: This program provides the option "Manual Blur Background" in the video settings. The basic idea is to receive real-time participation of users during the interaction process, and adjust the foreground and background regions according to user participation.

In a specific implementation, the image analyzing unit 1302 detects whether the user selects the "automatic blur background" setting or the "manual blur background" setting. If the user does not check the two options, the same quality level is followed according to the general operation flow. The video image is encoded; the quality parameters are transmitted along with the code stream.

If the user selects the "Auto Blur Background" setting, the image analysis unit 1302 can analyze the pixel changes in the image frames before and after, by comparing the front and rear image frames (usually bitmaps) in the video sequence to obtain the foreground portion of the video image. And the background section. The image analyzing unit 1302 compares the first 3 - 5 image frames of the current image frame in the video sequence. If the pixels in the same position in the preceding and following image frames are unchanged, the position is a stationary point; if the image in the same position in the preceding and succeeding frames If the elements are different, the position is a moving point; the image analyzing unit 1302 will be more The moving points and the still points surrounded by the plurality of moving points are the foreground portions, and the other portions are the background portions.

Of course, image analysis unit 1302 can also use any existing foreground and background recognition methods to obtain foreground and background portions of the video image.

If the user selects "Manually Blur Background," the user can select the background area by using the selection tool provided by the image analysis unit 1302 (frame selection, rectangle, circle, or custom shape), video image The rest of the portion is the foreground portion. The image analysis unit 1302 may further provide a reset option for canceling the background area selected by the user and restoring the default video output.

The image processing unit 1303 is for encoding the video image divided by the image analyzing unit 1302. The encoding method is either of the following two:

1) The image processing unit 1303 can determine different quality parameters for the foreground and the background according to a pre-defined strategy, and perform different processing on different regions. That is, the first portion of the video image is encoded using the first quality parameter and the second portion of the video image is encoded using the second quality parameter. The first quality parameter and the second quality parameter are used to control the compression ratio at the time of encoding. The larger the quality parameter value is, the smaller the data amount is, and the corresponding image quality is degraded.

When the first quality parameter value is equal to the second quality parameter value, the resolution of the decoded video sequence is consistent; when the first quality parameter value is lower than the second quality parameter value, the first part of the decoded video sequence is clear The degree is higher than the sharpness of the second part; when the first quality parameter value is higher than the second quality parameter value, the resolution of the first part in the decoded video sequence is lower than the sharpness of the second part.

For example, the first quality parameter is lower than the second quality parameter. At this time, in the video sequence played by the receiving end, the background portion (ie, the corresponding portion of the second quality parameter value) has poor definition (similar to the mosaic effect), so that part of the content in the video sequence can be hidden. Hidden view The surrounding environment when chatting.

The above-mentioned quality parameter strategy can be processed by default values. In addition, the user can also select within a certain range, that is, the user himself and the user can separately control the clarity of the foreground and the background, that is, in the above system, a parameter setting module can also be included. The values of the first quality parameter and the second quality parameter are set.

2) The image processing unit 1303 may also pre-process the background area after reading the video image, that is, perform blur processing before encoding, and perform uniform quality level encoding on the processed background image and the unprocessed foreground image.

Of course, in practical applications, any other existing encoding methods may be used to separately encode the foreground portion and the background portion of the video image.

For the receiving end, since the area corresponding to the code stream does not carry the quality level information of the area, it is only necessary to restore the image according to the corresponding parameters of each area according to the general processing flow.

Preferably, for the restored image, in order to make the quality difference of the whole picture not appear too disparate, the receiving end can record whether the quality parameters carried in different areas are different, and the transition can be appropriately performed at the boundary between the foreground area and the background area. deal with. This over-processing can be smoothed to avoid the visually inconspicuous effect of the blurred area boundaries being too obvious.

The data transmission unit 1304 is for outputting the encoded data of the video image transmitted by the image processing unit 1303.

Referring to FIG. 14, FIG. 14 is a schematic structural diagram of a method for implementing video communication according to an embodiment of the present invention. As shown in FIG. 14, it is a flowchart of an embodiment of a video communication method according to the present invention. The method is used to implement video sequence transmission between terminals, and specifically includes the following steps:

Step 1401: The terminal collects a video sequence through a camera or the like.

Step 1402: In step 1401, by automatic identification mode or manual selection mode The video image in the acquired video sequence is divided into a foreground portion and a background portion, wherein the foreground portion is the first portion and the background portion is the second portion.

Step 1403: Code the foreground portion and the background portion of the video image separately. The encoding method can be either of the following two:

1) encoding the first portion of the video image using the first quality parameter and encoding the second portion of the video image using the second quality parameter.

At this time, this step may further include a parameter setting step for setting specific values of the first quality parameter and the second quality parameter.

2) Pre-processing the background area, that is, performing blurring processing before encoding, and performing uniform quality level encoding on the processed background image and the unprocessed foreground image.

Step 1404: Send the encoded video image encoded data to the receiving end to decode the broadcast. When the first quality parameter value is equal to the second quality parameter value or not pre-processed, the resolution of the decoded video sequence is consistent; when the first quality parameter value is lower than the second quality parameter value or the second part (background part) When performing fuzzy preprocessing, the resolution of the first part of the decoded video sequence is higher than the resolution of the second part; when the first quality parameter value is higher than the second quality parameter value or the first part (foreground part) is performed In the fuzzy preprocessing, the resolution of the first part of the decoded video sequence is lower than the resolution of the second part.

Preferably, in order to improve the visual effect of the video sequence decoding and playing, when the receiving end decodes the encoded data of the video image, a boundary equalization step may be further included for the foreground part and the background of the video image when outputting the video image. Part of the transition process. Therefore, when decoding the video sequence, the boundary between the first part and the second part is smoothed, and the boundary of the blurred area is prevented from being too obvious.

The terminal and method for realizing video communication in the embodiment, according to the characteristics of the existing video communication, by automatically identifying or manually selecting a video image collected by a camera, and separately encoding different parts, thereby satisfying Video communication The user's personalized needs solve the problem that the prior art cannot handle different resolutions for different areas in the video.

The present invention also provides a system for implementing video communication. Hereinafter, the system will be described in detail by way of examples.

Referring to FIG. 15, FIG. 15 is a schematic structural diagram of a system for implementing video communication according to an embodiment of the present invention. As can be seen from Fig. 15, the system includes: a transmitting terminal 1501 and a receiving terminal 1502.

The transmitting terminal 1501 is configured to divide the video image in the video sequence into at least two parts; separately encode each of the at least two parts of the video image, and output the encoded data of the video image. The sending terminal 1501 is further configured to collect a video sequence.

The receiving terminal 1502 is configured to receive the encoded data of the video image from the transmitting terminal 1501 and decode the playing video sequence. Of course, the video transmission described above can be bidirectional.

Specifically, the transmitting terminal 1501 is configured to collect a video sequence by means of a camera or the like, and encode the video sequence and transmit the video sequence, and the receiving terminal 1502 can receive the video sequence from the transmitting terminal 1501 through the network, for example, and decode the playing video sequence. Of course, the video transmission described above can be bidirectional. In an actual application, in order to improve the visual effect of video sequence decoding and playback, the receiving terminal 1502 may further include a boundary equalization unit (not shown) for outputting the video image to the video image. At least two parts of the transition process. Therefore, when decoding the video sequence, the boundary of at least two parts is smoothed, and the boundary of the blurred area is prevented from being too obvious.

Referring to FIG. 16, FIG. 16 is a schematic flowchart diagram of a system method for implementing video communication according to an embodiment of the present invention. As shown in Figure 16, the method includes:

Step 1602, dividing the video image in the video sequence into at least two parts. Step 1603, encoding each of the at least two portions of the video image separately.

Step 1604: Output encoded data of the video image.

Step 1605: Decode and play the above video image.

Of course, in the actual application, before step 1602, the method further includes: Step 1601: Acquire a video sequence.

In step 1605, the method further includes: performing a transition processing on at least two portions of the video image when the video image is output. Therefore, when decoding the video sequence, the boundary of at least two parts is smoothed, and the blurred area is prevented from being too obvious.

For the restored image, in order to make the quality difference of the whole picture not appear too disparate, the receiving terminal needs to record whether the quality parameters carried in different areas are different when decoding, and appropriately perform the transition processing at the boundary between the foreground area and the background area.

The above description is only a preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or within the technical scope disclosed by the present invention. Alternatives are intended to be covered by the scope of the present invention. Therefore, the scope of protection of the present invention should be determined by the scope of the claims.

The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the protection of the present invention. Within the scope.

Claims

Claim

A terminal for implementing video communication, comprising:

a data transmission unit, configured to output encoded data of the video image.

2. The terminal according to claim 1, further comprising: a video collection unit, configured to collect a video sequence, and send the video sequence to the image analysis unit.

3. The terminal according to claim 2, wherein the image analysis unit comprises:

And an image recognition unit, configured to analyze a video image in the video sequence, extract a foreground portion from the video image as a first portion, and extract a background portion from the video image as a second portion.

4. The terminal according to claim 2, wherein the image analysis unit comprises:

And an image selecting unit, configured to select a partial region from the video image of the video sequence, and select the selected region as the first portion and the unselected region as the second portion.

The terminal according to claim 3 or 4, wherein the image processing unit comprises:

An image encoding unit for encoding the first portion of the video image using the first quality parameter and encoding the second portion in the video image using the second quality parameter.

The terminal according to claim 5, further comprising: a number setting unit, configured to set the first quality parameter and the second quality parameter.

a pre-processing unit, configured to perform fuzzy pre-processing on the second part of the video image provided by the image analyzing unit;

An image encoding unit for encoding the first portion of the video image provided by the image analyzing unit and the second portion subjected to blur preprocessing using the same quality parameter.

8. A method for implementing video communication, comprising:

Segmenting the video image in the video sequence into at least two parts;

9. The method of claim 8 further comprising: acquiring a video sequence.

10. The method according to claim 9, wherein the dividing the video image in the video sequence into at least two parts comprises:

A video image in the video sequence is analyzed, a foreground portion is extracted from the video image as a first portion, and a background portion is extracted from the video image as a second portion.

The method according to claim 9, wherein the dividing the video image in the video sequence into at least two parts comprises:

Selecting a partial region from a video image of the video sequence;

The selected area is taken as the first part, and the unselected area is taken as the second part.

12. The method according to claim 10 or 11, wherein the encoding each of the at least two portions of the video image separately comprises: The image encoding unit encodes the first portion of the video image using the first quality parameter and encodes the second portion of the video image using the second quality parameter.

13. The method according to claim 12, further comprising: a parameter setting unit setting the first quality parameter and the second quality parameter.

14. The method of claim 10 or 11, wherein the encoding each of the at least two portions of the video image separately comprises:

The pre-processing unit performs fuzzy pre-processing on the second part of the video image;

The image encoding unit encodes the first portion of the video image and the second portion of the blurred preprocessed video image using the same quality parameter.

15. The method of claim 8 further comprising: decoding and playing the video image.

16. The method of claim 15, further comprising: performing a transition process on at least two portions of the video image when the video image is decoded.

17. A system for implementing video communication, comprising:

And a receiving terminal, configured to receive encoded data of the video image from the transmitting terminal, and decode and play the video image.

The system according to claim 17, wherein the receiving terminal is further configured to perform transition processing on at least two parts of the video image when the video image is decoded.

The system according to claim 17 or 18, wherein the transmitting terminal comprises: the terminal according to any one of claims 1 to 7.