[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2008116400A1 - A terminal, method and system for realizing video communication - Google Patents

A terminal, method and system for realizing video communication Download PDF

Info

Publication number
WO2008116400A1
WO2008116400A1 PCT/CN2008/070237 CN2008070237W WO2008116400A1 WO 2008116400 A1 WO2008116400 A1 WO 2008116400A1 CN 2008070237 W CN2008070237 W CN 2008070237W WO 2008116400 A1 WO2008116400 A1 WO 2008116400A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
image
video image
unit
quality parameter
Prior art date
Application number
PCT/CN2008/070237
Other languages
French (fr)
Chinese (zh)
Inventor
Jing Lv
Original Assignee
Tencent Technology (Shenzhen) Company Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology (Shenzhen) Company Limited filed Critical Tencent Technology (Shenzhen) Company Limited
Publication of WO2008116400A1 publication Critical patent/WO2008116400A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/142Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/142Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
    • H04N2007/145Handheld terminals

Definitions

  • Terminal method and system for realizing video communication
  • the present invention relates to the field of computer graphics technology, and more particularly to a terminal, method and system for implementing video communication. Background of the invention
  • the above digital image processing techniques include techniques such as image segmentation, image description, and recognition.
  • Image segmentation is the extraction of meaningful features from images.
  • the meaningful features include edges, regions, etc. in the image, which is the basis for further image recognition, analysis and understanding.
  • Image description is a necessary prerequisite for image recognition and understanding.
  • the general image description method uses two-dimensional shape description, which has two kinds of methods: boundary description and region description.
  • Image classification recognition belongs to the category of pattern recognition. Its main content is image segmentation and feature extraction after some preprocessing (enhancement, restoration, compression), or matching identification through some a priori features. Conduct a classification of judgments.
  • the transmitting terminal picks up a video image through a camera; encodes the taken video image into video data; transmits the encoded video data to a receiving terminal; and receives the terminal Decode and play the received encoded data.
  • the embodiments of the present invention provide a terminal, a method, and a system for implementing video communication, which solve the problem that the prior art cannot perform different resolution processing for different areas in the video.
  • An image analyzing unit configured to divide the video image in the video sequence into at least two parts
  • An image processing unit configured to respectively encode each of the at least two portions of the video image
  • a data transmission unit configured to output encoded data of the video image.
  • Each of the at least two portions of the video image is encoded separately; the encoded data of the video image is output.
  • a transmitting terminal configured to divide a video image in the video sequence into at least two parts; respectively encode each of the at least two portions of the video image, and output encoded data of the video image;
  • a terminal, a method, and a system for implementing video communication provide a method for dividing a video image into different regions and different regions according to characteristics of the existing video communication.
  • the use of different resolution processing methods satisfies the individualized requirements in the video communication process, such as privacy protection in the video communication process, and solves the problem that the prior art cannot perform different resolution processing for different areas in the video.
  • FIG. 1 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention
  • FIG. 2 is a schematic flowchart of a method for implementing video communication according to an embodiment of the present invention
  • FIG. 3 is a schematic structural diagram of a terminal for implementing video communication according to Embodiment 1 of the present invention
  • FIG. 4 is a schematic flowchart of a method for implementing video communication according to Embodiment 1 of the present invention
  • FIG. 5 is a schematic structural diagram of an image analyzing unit according to Embodiment 2 of the present invention
  • FIG. 6 is a schematic flowchart of a method for implementing video communication according to Embodiment 2 of the present invention
  • FIG. 7 is a schematic structural diagram of an image analyzing unit according to Embodiment 3 of the present invention.
  • FIG. 8 is a schematic flowchart of a method for implementing video communication according to Embodiment 3 of the present invention
  • FIG. 9 is a schematic structural diagram of an image processing unit according to Embodiment 4 of the present invention.
  • FIG. 10 is a schematic flowchart of a method for implementing video communication according to Embodiment 4 of the present invention
  • FIG. 11 is a schematic structural diagram of an image processing unit according to Embodiment 5 of the present invention
  • FIG. 12 is a flowchart of a method for implementing video communication according to Embodiment 5 of the present invention
  • FIG. 13 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention
  • FIG. 14 is a schematic structural diagram of a method for implementing video communication according to an embodiment of the present invention
  • FIG. 16 is a schematic flowchart diagram of a system method for implementing video communication according to an embodiment of the present invention. Mode for carrying out the invention
  • the terminal, the method and the system for implementing video communication provided by the embodiment of the present invention, after acquiring the video sequence, dividing the video image in the video sequence into at least two parts; each part of at least two parts of the video image
  • the encoding is performed separately, and the encoded video image data is output.
  • FIG. 1 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention.
  • the terminal includes: an image analyzing unit 102, an image processing unit 103, and a data transfer unit 104.
  • the image analyzing unit 102 is configured to divide the video image in the video sequence into at least two parts.
  • the image processing unit 103 is for encoding each of the at least two portions of the video image, respectively.
  • the data transmission unit 104 is for outputting encoded data of the above video image.
  • FIG. 2 is a schematic flowchart of a method for implementing video communication in an embodiment of the present invention. As shown in Figure 2, the method includes:
  • Step 202 Divide the video image in the video sequence into at least two parts.
  • Step 203 Encode each of the at least two parts of the video image separately.
  • Step 204 Output encoded data of the video image.
  • FIG. 3 is a schematic structural diagram of a terminal for implementing video communication according to Embodiment 1 of the present invention.
  • the terminal includes: an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.
  • the terminal further includes: a video collection unit 301.
  • the video collection unit 301 is configured to collect a video sequence, and send the video sequence to the image analysis unit 302.
  • the video image in the sequence is split into at least two parts.
  • the image processing unit 303 is for encoding each of the at least two portions of the video image, respectively.
  • the data transmission unit 304 is for outputting encoded data of the above video image.
  • FIG. 4 is a schematic flowchart of a method for implementing video communication according to Embodiment 1 of the present invention. As shown in Figure 4, the method includes:
  • Step 401 Collect a video sequence.
  • Step 402 Analyze a video image in the video sequence, and divide the video image into at least two parts.
  • Step 403 Encode each of the at least two parts of the video image separately.
  • Step 404 Output encoded data of the video image.
  • the terminal for implementing video communication in the second embodiment of the present invention includes: a video capture unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.
  • the video collection unit 301 is configured to collect a video sequence.
  • the video capture unit 301 can include a video capture device (eg, a camera connected to a computer, etc.) and corresponding video image processing software.
  • the image analyzing unit 302 is configured to acquire a video image from the video sequence collected by the video capturing unit 301, analyze and identify the video image, and divide the video image into a first portion and a second portion.
  • FIG. 5 is a schematic structural diagram of an image analyzing unit according to Embodiment 2 of the present invention.
  • the image analyzing unit 302 includes an image recognizing unit 502, configured to analyze a video image in the video sequence, and extract a foreground portion and a background from the video image. Part, where for example the foreground part is the first part and the background part is the second part.
  • the image recognition unit 502 can analyze the pixel changes in the before and after image frames by comparing the front and rear image frames (usually bitmaps) in the video sequence to obtain the foreground portion and the background portion in the video image.
  • the image analyzing unit 302 compares the first 3 - 5 image frames of the current image frame in the video sequence. If the pixels in the same position in the preceding and following image frames are unchanged, the position is a stationary point; if the image in the same position in the preceding and succeeding frames If the elements are different, the position is a moving point; the image analyzing unit 302 takes a plurality of moving points and a still point surrounded by the plurality of moving points as a foreground portion, and the other portion is a background portion.
  • Image analysis unit 302 can also use any of the existing foreground and background recognition methods to obtain the foreground portion and the background portion of the video image.
  • the image processing unit 303 is for encoding the first portion and the second portion of the video image transmitted by the image analyzing unit 302.
  • the data transmission unit 304 is for outputting the encoded data of the video image transmitted by the image processing unit 303.
  • FIG. 6 is a schematic flowchart diagram of a method for implementing video communication according to Embodiment 2 of the present invention. As shown in FIG. 6, the method includes: Step 601: The terminal collects a video sequence by using a camera or the like.
  • Step 602 The video image in the video sequence is divided into a first part and a second part by a foreground and a background recognition manner.
  • the foreground portion and the background portion are extracted from the above video image.
  • the first portion is the foreground portion of the video image and the second portion is the background portion of the video image.
  • the foreground and background portions in the video image can be obtained by comparing the front and rear image frames (usually bitmaps) in the video sequence to analyze the pixel changes in the image frames before and after.
  • any existing foreground and background recognition methods can be used to obtain the foreground and background portions of the video image.
  • Step 603 Encode the first part and the second part of the video image.
  • Step 604 Output the above encoded data of the video image.
  • the terminal and the method for realizing video communication in this embodiment segment the video image by using foreground and background recognition modes, which satisfies the problem of individualization in the video communication process and solves the problem of resolution processing.
  • the terminal for implementing video communication in Embodiment 3 of the present invention includes: a video capture unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.
  • the video collection unit 301 is the same as the first embodiment.
  • the image analyzing unit 302 is configured to obtain a video image from the video sequence acquired by the video capturing unit 301, and divide the video image into the first portion and the second portion by manual selection.
  • FIG. 7 is a schematic structural diagram of an image analyzing unit according to Embodiment 3 of the present invention.
  • the image analyzing unit 302 includes an image selecting unit 702.
  • the image selection unit 702 is configured to select a partial region from the video image of the video sequence, for example, a rectangular, circular, or irregularly shaped region by a mouse frame. And, for example, the selected area is taken as the first part, and the unselected area is taken as the second part.
  • the image processing unit 303 is for encoding the first portion and the second portion of the video image transmitted by the image analyzing unit 302.
  • the data transmission unit 304 is for outputting the encoded data of the video image transmitted by the image processing unit 303.
  • FIG. 8 is a schematic flowchart of a method for implementing video communication according to Embodiment 3 of the present invention. As shown in Figure 8, the method includes:
  • Step 801 The terminal collects a video sequence by using a camera or the like.
  • Step 802 The area selected in the video image is taken as the first part, and the unselected area is taken as the second part. That is, a partial region is selected from the video images of the above video sequence, and the selected region can be taken as the first portion and the unselected region as the second portion.
  • Step 803 Encode the first part and the second part of the video image.
  • Step 804 Output the above encoded data of the video image.
  • the terminal and method for realizing video communication in this embodiment solves the existing problem by segmenting the video image by manual selection, which satisfies the individualized requirements in the video communication process.
  • the terminal for implementing video communication in Embodiment 4 of the present invention includes: a video collection unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.
  • the video collection unit 301 is the same as the first embodiment.
  • the image analyzing unit 302 is configured to obtain a video image from the video sequence acquired by the video capturing unit 301, and divide the video image into a first portion and a second portion.
  • the image processing unit 303 is for encoding the first portion and the second portion of the video image transmitted by the image analyzing unit 302.
  • FIG. 9 is a schematic structural diagram of an image processing unit according to Embodiment 4 of the present invention.
  • the image processing unit 303 includes an image encoding unit 903.
  • the image encoding unit 903 is configured to encode the video image divided by the image analyzing unit 302: encoding the first portion of the video image using the first quality parameter, and using the second quality parameter for the second portion of the video image coding.
  • the first quality parameter and the second quality parameter are used to control the compression ratio at the time of encoding.
  • the quality parameter strategy can be processed by setting a default value, or by the user within a certain range, that is, the user himself controls the clarity of the foreground and the background separately. Therefore, the image encoding unit 903 may further include: a parameter setting unit (not shown) for setting the first quality parameter and the second quality parameter.
  • the image coding unit 903 performs video coding of different quality levels on the foreground and the background according to different quality parameters.
  • the corresponding code stream of each area carries the quality level parameter corresponding to the area.
  • the code stream corresponding to the area without the quality level carries the quality level information of the local area, so only the corresponding processing parameters are required according to the general processing flow. Just restore the image.
  • FIG. 10 is a schematic flowchart diagram of a method for implementing video communication according to Embodiment 4 of the present invention. As shown in FIG. 10, the method includes:
  • Step 1001 The terminal collects a video sequence by using a camera or the like.
  • Step 1002 The video image in the video sequence is divided into a first part and a second part.
  • Step 1003 Encode the first portion of the video image using the first quality parameter and encode the second portion of the video image using the second quality parameter.
  • the quality parameter strategy can be processed with default values or by the user within a certain range. Therefore, preferably, the step further includes: setting the first quality parameter and the second quality parameter.
  • Step 1004 Output the above encoded data of the video image.
  • the terminal and method for realizing video communication in this embodiment for the characteristics of the existing video communication, identify and segment the video sequence collected by the camera, for example, and perform different quality coding on different parts, thereby satisfying the video communication user.
  • Personalized requirements solve the problem that the prior art cannot handle different resolutions for different areas in the video.
  • the terminal for implementing video communication in Embodiment 5 of the present invention includes: a video capture unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.
  • the video collection unit 301 is the same as the first embodiment.
  • the image analyzing unit 302 is configured to obtain a video image from the video sequence acquired by the video capturing unit 301, and divide the video image into a first portion and a second portion.
  • the image processing unit 303 is for encoding the first portion and the second portion of the video image transmitted by the image analyzing unit 302.
  • FIG. 11 is a schematic structural diagram of an image processing unit according to Embodiment 5 of the present invention; Figure.
  • the image processing unit 303 includes: a pre-processing unit 11031, and an image encoding unit 11032.
  • the pre-processing unit 11031 is configured to perform fuzzy pre-processing on the second part of the video image provided by the image analyzing unit 302. That is, after reading the video image, the pre-processing unit 11031 performs blurring processing on the second portion before encoding, and sends the processed image to the image encoding unit 11032 for encoding at a uniform quality level.
  • the image encoding unit 11032 is configured to encode the first portion of the video image supplied through the image analyzing unit 302 and the second portion subjected to the blurring processing by the pre-processing unit 11031 to perform uniform quality level (same quality parameter).
  • the video image is encoded according to the quality parameter carried in the encoded data according to a general operation procedure.
  • the data transmission unit 304 is for outputting the encoded data of the video image transmitted by the image processing unit 303.
  • FIG. 12 is a schematic flowchart of a method for implementing video communication according to Embodiment 5 of the present invention. As shown in Figure 12, the method includes:
  • Step 1201 The terminal collects a video sequence by using a camera or the like.
  • Step 1202 Divide the video image in the video sequence into a first part and a second part.
  • Step 1203 Perform blur preprocessing on the second part of the video image, and encode the first part of the video image and the second part of the video image after the blur preprocessing using the same quality parameter.
  • fuzzy preprocessing is a prior art, similar to the common "mosaic" technology, that is, the part that needs to be processed is processed by a pre-selected algorithm, and the most simple solution is to put all the pixels and surrounding pixels in the image. Weighted mean to achieve fuzzy processing Effect.
  • the present invention is described by way of example only, and is not intended to limit the scope of the present invention.
  • the same quality parameter is used to encode the first part of the video image and the second part of the video image after blur preprocessing.
  • Step 1204 Output the above encoded data of the video image.
  • the terminal and method for implementing video communication in this embodiment for the characteristics of the existing video communication, identify and segment the video sequence collected by the camera, for example, and perform blur preprocessing on some parts, and then perform video sequence on the video sequence.
  • the same quality coding thereby satisfying the individualized requirements of the video communication user, solves the problem that the prior art cannot perform different resolution processing for different areas in the video.
  • IM video communication adopts the traditional video coding mode, and uses the same quality parameters for a single frame (single image) or even the entire video sequence in the video sequence, and presents with the same definition, without different regions. deal with. This way, when users are not willing to expose their environment to each other and want to use the video function, there is no suitable solution.
  • FIG. 13 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention.
  • the terminal includes: a video collection unit 1301, an image analysis unit 1302, an image processing unit 1303, and a data transmission unit 1304.
  • the video collection unit 1301 is configured to collect a video sequence.
  • Video capture unit 1301 may include a video capture device (eg, a camera coupled to a computer, etc.) and corresponding video image processing software.
  • the image analyzing unit 1302 is for dividing the video image in the video sequence into a foreground portion and a background portion, wherein the foreground portion is the first portion and the background portion is the second portion.
  • the technical solution for dividing the video image in the video sequence into the foreground portion and the background portion can be divided into two parts: automatic implementation according to settings or interaction between users in the video process.
  • This program provides the option "Automatic Blur Background" in the video settings.
  • the basic idea is to highlight only the foreground information (persons) in the user's video process, and to blur the background information.
  • the software automatically divides the foreground and background, and handles the foreground and background differently.
  • This program provides the option "Manual Blur Background" in the video settings.
  • the basic idea is to receive real-time participation of users during the interaction process, and adjust the foreground and background regions according to user participation.
  • the image analyzing unit 1302 detects whether the user selects the "automatic blur background” setting or the “manual blur background” setting. If the user does not check the two options, the same quality level is followed according to the general operation flow.
  • the video image is encoded; the quality parameters are transmitted along with the code stream.
  • the image analysis unit 1302 can analyze the pixel changes in the image frames before and after, by comparing the front and rear image frames (usually bitmaps) in the video sequence to obtain the foreground portion of the video image. And the background section.
  • the image analyzing unit 1302 compares the first 3 - 5 image frames of the current image frame in the video sequence.
  • the position is a stationary point; if the image in the same position in the preceding and succeeding frames If the elements are different, the position is a moving point; the image analyzing unit 1302 will be more The moving points and the still points surrounded by the plurality of moving points are the foreground portions, and the other portions are the background portions.
  • image analysis unit 1302 can also use any existing foreground and background recognition methods to obtain foreground and background portions of the video image.
  • the user can select the background area by using the selection tool provided by the image analysis unit 1302 (frame selection, rectangle, circle, or custom shape), video image The rest of the portion is the foreground portion.
  • the image analysis unit 1302 may further provide a reset option for canceling the background area selected by the user and restoring the default video output.
  • the image processing unit 1303 is for encoding the video image divided by the image analyzing unit 1302.
  • the encoding method is either of the following two:
  • the image processing unit 1303 can determine different quality parameters for the foreground and the background according to a pre-defined strategy, and perform different processing on different regions. That is, the first portion of the video image is encoded using the first quality parameter and the second portion of the video image is encoded using the second quality parameter.
  • the first quality parameter and the second quality parameter are used to control the compression ratio at the time of encoding. The larger the quality parameter value is, the smaller the data amount is, and the corresponding image quality is degraded.
  • the resolution of the decoded video sequence is consistent; when the first quality parameter value is lower than the second quality parameter value, the first part of the decoded video sequence is clear The degree is higher than the sharpness of the second part; when the first quality parameter value is higher than the second quality parameter value, the resolution of the first part in the decoded video sequence is lower than the sharpness of the second part.
  • the first quality parameter is lower than the second quality parameter.
  • the background portion ie, the corresponding portion of the second quality parameter value
  • Hidden view The surrounding environment when chatting.
  • the above-mentioned quality parameter strategy can be processed by default values.
  • the user can also select within a certain range, that is, the user himself and the user can separately control the clarity of the foreground and the background, that is, in the above system, a parameter setting module can also be included.
  • the values of the first quality parameter and the second quality parameter are set.
  • the image processing unit 1303 may also pre-process the background area after reading the video image, that is, perform blur processing before encoding, and perform uniform quality level encoding on the processed background image and the unprocessed foreground image.
  • any other existing encoding methods may be used to separately encode the foreground portion and the background portion of the video image.
  • the area corresponding to the code stream does not carry the quality level information of the area, it is only necessary to restore the image according to the corresponding parameters of each area according to the general processing flow.
  • the receiving end can record whether the quality parameters carried in different areas are different, and the transition can be appropriately performed at the boundary between the foreground area and the background area. deal with.
  • This over-processing can be smoothed to avoid the visually inconspicuous effect of the blurred area boundaries being too obvious.
  • the data transmission unit 1304 is for outputting the encoded data of the video image transmitted by the image processing unit 1303.
  • FIG. 14 is a schematic structural diagram of a method for implementing video communication according to an embodiment of the present invention. As shown in FIG. 14, it is a flowchart of an embodiment of a video communication method according to the present invention. The method is used to implement video sequence transmission between terminals, and specifically includes the following steps:
  • Step 1401 The terminal collects a video sequence through a camera or the like.
  • Step 1402 In step 1401, by automatic identification mode or manual selection mode The video image in the acquired video sequence is divided into a foreground portion and a background portion, wherein the foreground portion is the first portion and the background portion is the second portion.
  • Step 1403 Code the foreground portion and the background portion of the video image separately.
  • the encoding method can be either of the following two:
  • this step may further include a parameter setting step for setting specific values of the first quality parameter and the second quality parameter.
  • Pre-processing the background area that is, performing blurring processing before encoding, and performing uniform quality level encoding on the processed background image and the unprocessed foreground image.
  • Step 1404 Send the encoded video image encoded data to the receiving end to decode the broadcast.
  • the resolution of the decoded video sequence is consistent; when the first quality parameter value is lower than the second quality parameter value or the second part (background part)
  • the resolution of the first part of the decoded video sequence is higher than the resolution of the second part; when the first quality parameter value is higher than the second quality parameter value or the first part (foreground part) is performed
  • the resolution of the first part of the decoded video sequence is lower than the resolution of the second part.
  • a boundary equalization step may be further included for the foreground part and the background of the video image when outputting the video image. Part of the transition process. Therefore, when decoding the video sequence, the boundary between the first part and the second part is smoothed, and the boundary of the blurred area is prevented from being too obvious.
  • the terminal and method for realizing video communication in the embodiment according to the characteristics of the existing video communication, by automatically identifying or manually selecting a video image collected by a camera, and separately encoding different parts, thereby satisfying Video communication
  • the user's personalized needs solve the problem that the prior art cannot handle different resolutions for different areas in the video.
  • the present invention also provides a system for implementing video communication.
  • the system will be described in detail by way of examples.
  • FIG. 15 is a schematic structural diagram of a system for implementing video communication according to an embodiment of the present invention. As can be seen from Fig. 15, the system includes: a transmitting terminal 1501 and a receiving terminal 1502.
  • the transmitting terminal 1501 is configured to divide the video image in the video sequence into at least two parts; separately encode each of the at least two parts of the video image, and output the encoded data of the video image.
  • the sending terminal 1501 is further configured to collect a video sequence.
  • the receiving terminal 1502 is configured to receive the encoded data of the video image from the transmitting terminal 1501 and decode the playing video sequence.
  • the video transmission described above can be bidirectional.
  • the transmitting terminal 1501 is configured to collect a video sequence by means of a camera or the like, and encode the video sequence and transmit the video sequence, and the receiving terminal 1502 can receive the video sequence from the transmitting terminal 1501 through the network, for example, and decode the playing video sequence.
  • the video transmission described above can be bidirectional.
  • the receiving terminal 1502 may further include a boundary equalization unit (not shown) for outputting the video image to the video image. At least two parts of the transition process. Therefore, when decoding the video sequence, the boundary of at least two parts is smoothed, and the boundary of the blurred area is prevented from being too obvious.
  • FIG. 16 is a schematic flowchart diagram of a system method for implementing video communication according to an embodiment of the present invention. As shown in Figure 16, the method includes:
  • Step 1602 dividing the video image in the video sequence into at least two parts.
  • Step 1603 encoding each of the at least two portions of the video image separately.
  • Step 1604 Output encoded data of the video image.
  • Step 1605 Decode and play the above video image.
  • Step 1601 Acquire a video sequence.
  • step 1605 the method further includes: performing a transition processing on at least two portions of the video image when the video image is output. Therefore, when decoding the video sequence, the boundary of at least two parts is smoothed, and the blurred area is prevented from being too obvious.
  • the receiving terminal needs to record whether the quality parameters carried in different areas are different when decoding, and appropriately perform the transition processing at the boundary between the foreground area and the background area.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A terminal for realizing video communication is provided, it includes: an image analyzing unit for dividing the video images of video sequence into at least two portions (202), an image processing unit for encoding each portion of the at least two portions of the video images respectively (203); a data transmitting unit for outputting the encoded data of the video images (204). A method and system for realizing video communication are also provided.

Description

一种实现视频通信的终端、 方法及系统 技术领域  Terminal, method and system for realizing video communication
本发明涉及计算机图形技术领域, 更具体地说, 涉及一种实现视 频通信的终端、 方法及系统。 发明背景  The present invention relates to the field of computer graphics technology, and more particularly to a terminal, method and system for implementing video communication. Background of the invention
随着视频编码技术和数字图像处理技术的发展, 目前已经能够对 视频序列进行一些自动或者半自动的分区域处理,来满足不同应用的 不同需求。  With the development of video coding technology and digital image processing technology, it is now possible to perform some automatic or semi-automatic sub-regional processing on video sequences to meet the different needs of different applications.
上述的数字图像处理技术包括图像分割、 图像描述以及识别等技 术。 图像分割是将图像中有意义的特征部分提取出来, 其中有意义的 特征包括图像中的边缘、 区域等, 这是进一步进行图像识别、 分析和 理解的基础。 图像描述是图像识别和理解的必要前提。 作为最筒单的 二值图像可采用其几何特性描述物体的特性,一般图像的描述方法采 用二维形状描述, 它有边界描述和区域描述两类方法。 图像分类(识 别)属于模式识别的范畴,其主要内容是图像经过某些预处理(增强、 复原、 压缩)后, 进行图像分割和特征提取, 或者经由某些先验的特 征进行匹配识别, 从而进行判决分类。  The above digital image processing techniques include techniques such as image segmentation, image description, and recognition. Image segmentation is the extraction of meaningful features from images. The meaningful features include edges, regions, etc. in the image, which is the basis for further image recognition, analysis and understanding. Image description is a necessary prerequisite for image recognition and understanding. As the most compact binary image, its geometric characteristics can be used to describe the characteristics of the object. The general image description method uses two-dimensional shape description, which has two kinds of methods: boundary description and region description. Image classification (recognition) belongs to the category of pattern recognition. Its main content is image segmentation and feature extraction after some preprocessing (enhancement, restoration, compression), or matching identification through some a priori features. Conduct a classification of judgments.
现有的多数视频通信(例如即时通信工具( Instant Messenger, IM ) 中的视频通信)都采用传统的视频编码模式。 在使用现有的视频通信 系统进行视频通信时, 通常包括以下过程: 发送终端通过摄像头摄取 视频图像; 对摄取的视频图像进行编码为视频数据; 将编码后的视频 数据传送到接收终端; 接收终端解码并播放接收到的编码数据。  Most existing video communications, such as video communications in Instant Messenger (IM), use traditional video encoding modes. When performing video communication using an existing video communication system, the following process is generally included: the transmitting terminal picks up a video image through a camera; encodes the taken video image into video data; transmits the encoded video data to a receiving terminal; and receives the terminal Decode and play the received encoded data.
然而, 上述视频通信中, 对视频序列中的单个帧 (单幅图像)甚 至整个视频序列均采用同样的清晰度呈现,没有针对不同区域的不同 处理。 这样就存在一个问题: 当用户不愿将视频中某一区域, 例如自 己所处的环境暴露给对方, 同时又想使用视频功能时, 没有比较合适 的解决方案。 发明内容 However, in the above video communication, a single frame (single image) in the video sequence is The entire video sequence is rendered in the same resolution, with no different processing for different regions. There is a problem with this: When users are reluctant to expose an area of the video, such as their own environment, to the other party, and they want to use the video function, there is no suitable solution. Summary of the invention
有鉴于此, 本发明的实施例提供了一种实现视频通信的终端、 方 法及系统,解决了现有技术中不能针对视频中不同区域进行不同清晰 度处理的问题。  In view of this, the embodiments of the present invention provide a terminal, a method, and a system for implementing video communication, which solve the problem that the prior art cannot perform different resolution processing for different areas in the video.
根据本发明实施例的一种实现视频通信的终端, 包括:  A terminal for implementing video communication according to an embodiment of the present invention includes:
图像分析单元, 用于将视频序列中的视频图像分割为至少两个部 分;  An image analyzing unit, configured to divide the video image in the video sequence into at least two parts;
图像处理单元, 用于分别对视频图像的所述至少两个部分中的每 个部分进行编码;  An image processing unit, configured to respectively encode each of the at least two portions of the video image;
数据传输单元, 用于输出所述视频图像的编码数据。  a data transmission unit, configured to output encoded data of the video image.
根据本发明实施例的一种实现视频通信的方法, 包括:  A method for implementing video communication according to an embodiment of the present invention includes:
将视频序列中的视频图像分割为至少两个部分;  Segmenting the video image in the video sequence into at least two parts;
分别对视频图像的所述至少两个部分中的每个部分进行编码; 输出所述视频图像的编码数据。  Each of the at least two portions of the video image is encoded separately; the encoded data of the video image is output.
根据本发明实施例的一种实现视频通信的系统, 包括:  A system for implementing video communication according to an embodiment of the present invention includes:
发送终端, 用于将视频序列中的视频图像分割为至少两个部分; 分别对视频图像的所述至少两个部分中的每个部分进行编码,并输出 所述视频图像的编码数据;  a transmitting terminal, configured to divide a video image in the video sequence into at least two parts; respectively encode each of the at least two portions of the video image, and output encoded data of the video image;
接收终端, 用于从发送终端接收所述视频图像的编码数据, 并解 码、 播放所述视频图像。 由上述技术方案可见, 相比现有技术, 本发明实施例提供的一种 实现视频通信的终端、 方法及系统, 针对现有视频通信的特点, 通过 对视频图像划分不同区域, 并对不同区域采用不同的清晰度处理方 式, 满足了视频通信过程中的个性化需求, 例如视频通信过程中的隐 私保护,解决了现有技术中不能针对视频中不同区域进行不同清晰度 处理的问题。 附图简要说明 And a receiving terminal, configured to receive encoded data of the video image from the transmitting terminal, and decode and play the video image. According to the foregoing technical solution, a terminal, a method, and a system for implementing video communication according to an embodiment of the present invention provide a method for dividing a video image into different regions and different regions according to characteristics of the existing video communication. The use of different resolution processing methods satisfies the individualized requirements in the video communication process, such as privacy protection in the video communication process, and solves the problem that the prior art cannot perform different resolution processing for different areas in the video. BRIEF DESCRIPTION OF THE DRAWINGS
图 1是本发明实施例中实现视频通信的终端的结构示意图; 图 2为本发明实施例中实现视频通信的方法的流程示意图; 图 3为本发明实施例一中实现视频通信的终端的结构示意图; 图 4为本发明实施例一中实现视频通信的方法的流程示意图; 图 5为本发明实施例二中图像分析单元的结构示意图;  1 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention; FIG. 2 is a schematic flowchart of a method for implementing video communication according to an embodiment of the present invention; FIG. 3 is a schematic structural diagram of a terminal for implementing video communication according to Embodiment 1 of the present invention; FIG. 4 is a schematic flowchart of a method for implementing video communication according to Embodiment 1 of the present invention; FIG. 5 is a schematic structural diagram of an image analyzing unit according to Embodiment 2 of the present invention;
图 6为本发明实施例二中实现视频通信的方法的流程示意图; 图 7为本发明实施例三中图像分析单元的结构示意图;  6 is a schematic flowchart of a method for implementing video communication according to Embodiment 2 of the present invention; FIG. 7 is a schematic structural diagram of an image analyzing unit according to Embodiment 3 of the present invention;
图 8为本发明实施例三中实现视频通信的方法的流程示意图; 图 9为本发明实施例四中图像处理单元的结构示意图;  8 is a schematic flowchart of a method for implementing video communication according to Embodiment 3 of the present invention; FIG. 9 is a schematic structural diagram of an image processing unit according to Embodiment 4 of the present invention;
图 10为本发明实施例四中实现视频通信的方法的流程示意图; 图 11为本发明实施例五中图像处理单元的结构示意图; 图 12为本发明实施例五中实现视频通信的方法的流程示意图; 图 13为本发明实施例中实现视频通信的终端的结构示意图; 图 14为本发明实施例中实现视频通信的方法的结构示意图; 图 15为本发明实施例中实现视频通信的系统的结构示意图; 图 16为本发明实施例中实现视频通信的系统方法的流程示意图。 实施本发明的方式 10 is a schematic flowchart of a method for implementing video communication according to Embodiment 4 of the present invention; FIG. 11 is a schematic structural diagram of an image processing unit according to Embodiment 5 of the present invention; FIG. 12 is a flowchart of a method for implementing video communication according to Embodiment 5 of the present invention; FIG. 13 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention; FIG. 14 is a schematic structural diagram of a method for implementing video communication according to an embodiment of the present invention; FIG. 16 is a schematic flowchart diagram of a system method for implementing video communication according to an embodiment of the present invention. Mode for carrying out the invention
为了使本发明的目的、 技术方案及优点更加清楚明白, 以下结合附 图及实施例, 对本发明进行进一步详细说明。 应当理解, 此处所描述的 具体实施例仅仅用以解释本发明, 并不用于限定本发明。  In order to make the objects, the technical solutions and the advantages of the present invention more comprehensible, the present invention will be further described in detail below with reference to the accompanying drawings. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
本发明实施例提供的实现视频通信的终端、 方法及系统, 在采集 到视频序列后, 将视频序列中的视频图像分割为至少两个部分; 为视 频图像的至少两个部分中的每个部分分别进行编码,并输出编码后的 视频图像数据。  The terminal, the method and the system for implementing video communication provided by the embodiment of the present invention, after acquiring the video sequence, dividing the video image in the video sequence into at least two parts; each part of at least two parts of the video image The encoding is performed separately, and the encoded video image data is output.
参见图 1 , 图 1为本发明实施例中实现视频通信的终端的结构示 意图。 如图 1所示, 该终端包括: 图像分析单元 102、 图像处理单元 103、 数据传输单元 104。  Referring to FIG. 1, FIG. 1 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention. As shown in FIG. 1, the terminal includes: an image analyzing unit 102, an image processing unit 103, and a data transfer unit 104.
其中, 图像分析单元 102用于将视频序列中的视频图像分割为至 少两个部分。  The image analyzing unit 102 is configured to divide the video image in the video sequence into at least two parts.
图像处理单元 103用于分别对视频图像的上述至少两个部分中的 每个部分进行编码。  The image processing unit 103 is for encoding each of the at least two portions of the video image, respectively.
数据传输单元 104用于输出上述视频图像的编码数据。  The data transmission unit 104 is for outputting encoded data of the above video image.
参见图 2, 图 2为本发明实施例中实现视频通信的方法的流程示 意图。 如图 2所示, 该方法包括:  Referring to FIG. 2, FIG. 2 is a schematic flowchart of a method for implementing video communication in an embodiment of the present invention. As shown in Figure 2, the method includes:
步骤 202、 将视频序列中的视频图像分割为至少两个部分。  Step 202: Divide the video image in the video sequence into at least two parts.
步骤 203、 分别对视频图像的上述至少两个部分中的每个部分进 行编码。  Step 203: Encode each of the at least two parts of the video image separately.
步骤 204、 输出上述视频图像的编码数据。 方法及系统。  Step 204: Output encoded data of the video image. Method and system.
实施例一 参见图 3 , 图 3为本发明实施例一中实现视频通信的终端的结构 示意图。 如图 3所示, 该终端包括: 图像分析单元 302、 图像处理单 元 303、 数据传输单元 304。 该终端进一步包括: 视频采集单元 301。 Embodiment 1 Referring to FIG. 3, FIG. 3 is a schematic structural diagram of a terminal for implementing video communication according to Embodiment 1 of the present invention. As shown in FIG. 3, the terminal includes: an image analysis unit 302, an image processing unit 303, and a data transmission unit 304. The terminal further includes: a video collection unit 301.
其中, 视频采集单元 301用于采集视频序列, 将上述视频序列发 送给图像分析单元 302。 序列中的视频图像分割为至少两个部分。  The video collection unit 301 is configured to collect a video sequence, and send the video sequence to the image analysis unit 302. The video image in the sequence is split into at least two parts.
图像处理单元 303用于分别对视频图像的上述至少两个部分中的 每个部分进行编码。  The image processing unit 303 is for encoding each of the at least two portions of the video image, respectively.
数据传输单元 304用于输出上述视频图像的编码数据。  The data transmission unit 304 is for outputting encoded data of the above video image.
参见图 4, 图 4为本发明实施例一中实现视频通信的方法的流程 示意图。 如图 4所示, 该方法包括:  Referring to FIG. 4, FIG. 4 is a schematic flowchart of a method for implementing video communication according to Embodiment 1 of the present invention. As shown in Figure 4, the method includes:
步骤 401、 采集视频序列。  Step 401: Collect a video sequence.
步骤 402、 分析上述视频序列中的视频图像, 将上述视频图像分 割为至少两个部分。  Step 402: Analyze a video image in the video sequence, and divide the video image into at least two parts.
步骤 403、 分别对视频图像的上述至少两个部分中的每个部分进 行编码。  Step 403: Encode each of the at least two parts of the video image separately.
步骤 404、 输出上述视频图像的编码数据。  Step 404: Output encoded data of the video image.
实施例二 Embodiment 2
本实施例在实施例一的基础上对图像分析单元进行详细说明。 如 图 3所示, 本发明实施例二中实现视频通信的终端包括: 视频采集单 元 301、 图像分析单元 302、 图像处理单元 303、 数据传输单元 304。  This embodiment describes the image analysis unit in detail based on the first embodiment. As shown in FIG. 3, the terminal for implementing video communication in the second embodiment of the present invention includes: a video capture unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.
其中, 视频采集单元 301用于采集视频序列。 在本实施例中, 视 频采集单元 301 可包括视频摄取装置 (例如与计算机连接的摄像头 等) 以及对应的视频图像处理软件。 图像分析单元 302用于从视频采集单元 301采集的视频序列中获 取视频图像, 对视频图像进行分析识别, 并将视频图像分割为第一部 分和第二部分。 The video collection unit 301 is configured to collect a video sequence. In the present embodiment, the video capture unit 301 can include a video capture device (eg, a camera connected to a computer, etc.) and corresponding video image processing software. The image analyzing unit 302 is configured to acquire a video image from the video sequence collected by the video capturing unit 301, analyze and identify the video image, and divide the video image into a first portion and a second portion.
参见图 5 ,图 5为本发明实施例二中图像分析单元的结构示意图。 如图 5所示, 在本发明的第二实施例中, 图像分析单元 302包括有图 像识别单元 502, 用于分析上述视频序列中的视频图像中, 从上述视 频图像中提取出前景部分和背景部分,其中例如前景部分即为第一部 分, 而背景部分则为第二部分。  Referring to FIG. 5, FIG. 5 is a schematic structural diagram of an image analyzing unit according to Embodiment 2 of the present invention. As shown in FIG. 5, in the second embodiment of the present invention, the image analyzing unit 302 includes an image recognizing unit 502, configured to analyze a video image in the video sequence, and extract a foreground portion and a background from the video image. Part, where for example the foreground part is the first part and the background part is the second part.
在具体实现时, 图像识别单元 502可通过对比视频序列中前后图 像帧 (通常为位图) , 从而分析前后图像帧中象素变化, 获取视频图 像中的前景部分和背景部分。图像分析单元 302对比当前图像帧在视 频序列中的前 3 - 5个图像帧, 若前后图像帧中同一位置的象素未变 化的, 则该位置为静止点; 若前后帧中同一位置的象素不同, 则该位 置为移动点; 图像分析单元 302将多个移动点以及由上述多个移动点 包围的静止点作为前景部分, 其它部分为背景部分。  In a specific implementation, the image recognition unit 502 can analyze the pixel changes in the before and after image frames by comparing the front and rear image frames (usually bitmaps) in the video sequence to obtain the foreground portion and the background portion in the video image. The image analyzing unit 302 compares the first 3 - 5 image frames of the current image frame in the video sequence. If the pixels in the same position in the preceding and following image frames are unchanged, the position is a stationary point; if the image in the same position in the preceding and succeeding frames If the elements are different, the position is a moving point; the image analyzing unit 302 takes a plurality of moving points and a still point surrounded by the plurality of moving points as a foreground portion, and the other portion is a background portion.
当然, 本领域技术人员可以意识到, 以上述前景和背景识别方式 为例对本发明进行说明仅仅是示范性的,并不用于对本发明的保护范 围进行限定。图像分析单元 302也可使用任意现有的前景和背景识别 方式来获取视频图像中的前景部分和背景部分。  Of course, those skilled in the art will appreciate that the present invention is exemplified by the above-described foreground and background recognition manners, and is not intended to limit the scope of the present invention. Image analysis unit 302 can also use any of the existing foreground and background recognition methods to obtain the foreground portion and the background portion of the video image.
图像处理单元 303用于为图像分析单元 302发送的视频图像的第 一部分和第二部分编码。  The image processing unit 303 is for encoding the first portion and the second portion of the video image transmitted by the image analyzing unit 302.
数据传输单元 304用于输出图像处理单元 303发送的视频图像的 编码数据。  The data transmission unit 304 is for outputting the encoded data of the video image transmitted by the image processing unit 303.
参见图 6, 图 6为本发明实施例二中实现视频通信的方法的流程 示意图。 如图 6所示, 该方法包括: 步骤 601、 终端通过摄像头或类似装置采集视频序列。 Referring to FIG. 6, FIG. 6 is a schematic flowchart diagram of a method for implementing video communication according to Embodiment 2 of the present invention. As shown in FIG. 6, the method includes: Step 601: The terminal collects a video sequence by using a camera or the like.
步骤 602、 通过前景和背景识别方式将视频序列中的视频图像分 割为第一部分和第二部分。  Step 602: The video image in the video sequence is divided into a first part and a second part by a foreground and a background recognition manner.
分析上述视频序列中的视频图像中, 从上述视频图像中提取出前 景部分和背景部分。 在本发明的第二实施例中, 第一部分为视频图像 的前景部分, 而第二部分则为视频图像的背景部分。 在具体实现时, 可通过对比视频序列中前后图像帧(通常为位图), 从而分析前后图 像帧中象素变化, 获取视频图像中的前景部分和背景部分。  In analyzing the video image in the above video sequence, the foreground portion and the background portion are extracted from the above video image. In a second embodiment of the invention, the first portion is the foreground portion of the video image and the second portion is the background portion of the video image. In the specific implementation, the foreground and background portions in the video image can be obtained by comparing the front and rear image frames (usually bitmaps) in the video sequence to analyze the pixel changes in the image frames before and after.
当然, 也可使用任意现有的前景和背景识别方式来获取视频图像 中的前景部分和背景部分。  Of course, any existing foreground and background recognition methods can be used to obtain the foreground and background portions of the video image.
步骤 603、 为视频图像的上述第一部分和第二部分编码。  Step 603: Encode the first part and the second part of the video image.
步骤 604、 输出视频图像的上述编码数据。  Step 604: Output the above encoded data of the video image.
本实施例的实现视频通信的终端及方法, 通过对视频图像采用前景 和背景识别方式进行分割, 满足了视频通信过程中的个性化需求, 解决 度处理的问题。  The terminal and the method for realizing video communication in this embodiment segment the video image by using foreground and background recognition modes, which satisfies the problem of individualization in the video communication process and solves the problem of resolution processing.
实施例三 Embodiment 3
本实施例在实施例一的基础上对图像分析单元进行详细说明。 如 图 3所示, 本发明实施例三中实现视频通信的终端包括: 视频采集单 元 301、 图像分析单元 302、 图像处理单元 303、 数据传输单元 304。  This embodiment describes the image analysis unit in detail based on the first embodiment. As shown in FIG. 3, the terminal for implementing video communication in Embodiment 3 of the present invention includes: a video capture unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.
其中, 视频采集单元 301同实施例一。  The video collection unit 301 is the same as the first embodiment.
图像分析单元 302用于从视频采集单元 301采集的视频序列中获 取视频图像,通过手动选择方式将视频图像分割为第一部分和第二部 分。  The image analyzing unit 302 is configured to obtain a video image from the video sequence acquired by the video capturing unit 301, and divide the video image into the first portion and the second portion by manual selection.
参见图 7 ,图 7为本发明实施例三中图像分析单元的结构示意图。 如图 7所示, 在本发明的第三实施例中, 图像分析单元 302包括有图 像选择单元 702。 该图像选择单元 702用于从上述视频序列的视频图 像中选取部分区域, 例如通过鼠标框选矩形、 圆形或不规则形状的区 域。 并可以例如将选中的区域作为第一部分, 未选中区域作为第二部 分。 Referring to FIG. 7, FIG. 7 is a schematic structural diagram of an image analyzing unit according to Embodiment 3 of the present invention. As shown in FIG. 7, in the third embodiment of the present invention, the image analyzing unit 302 includes an image selecting unit 702. The image selection unit 702 is configured to select a partial region from the video image of the video sequence, for example, a rectangular, circular, or irregularly shaped region by a mouse frame. And, for example, the selected area is taken as the first part, and the unselected area is taken as the second part.
图像处理单元 303用于为图像分析单元 302发送的视频图像的第 一部分和第二部分编码。  The image processing unit 303 is for encoding the first portion and the second portion of the video image transmitted by the image analyzing unit 302.
数据传输单元 304用于输出图像处理单元 303发送的视频图像的 编码数据。  The data transmission unit 304 is for outputting the encoded data of the video image transmitted by the image processing unit 303.
参见图 8 , 图 8为本发明实施例三中实现视频通信的方法的流程 示意图。 如图 8所示, 该方法包括:  Referring to FIG. 8, FIG. 8 is a schematic flowchart of a method for implementing video communication according to Embodiment 3 of the present invention. As shown in Figure 8, the method includes:
步骤 801、 终端通过摄像头或类似装置采集视频序列。  Step 801: The terminal collects a video sequence by using a camera or the like.
步骤 802、 将在视频图像中选中的区域作为第一部分, 未选中区 域作为第二部分。 即, 从上述视频序列的视频图像中选择部分区域, 并可以将选中区域作为第一部分, 将未选中区域作为第二部分。  Step 802: The area selected in the video image is taken as the first part, and the unselected area is taken as the second part. That is, a partial region is selected from the video images of the above video sequence, and the selected region can be taken as the first portion and the unselected region as the second portion.
步骤 803、 为视频图像的上述第一部分和第二部分编码。  Step 803: Encode the first part and the second part of the video image.
步骤 804、 输出视频图像的上述编码数据。  Step 804: Output the above encoded data of the video image.
本实施例的实现视频通信的终端及方法, 通过对视频图像采用手动 选择方式进行分割, 满足了视频通信过程中的个性化需求, 解决了现有 的问题。  The terminal and method for realizing video communication in this embodiment solves the existing problem by segmenting the video image by manual selection, which satisfies the individualized requirements in the video communication process.
实施例四 Embodiment 4
本实施例在实施例一的基础上对图像处理单元进行详细说明。 如 图 3所示, 本发明实施例四中实现视频通信的终端包括: 视频采集单 元 301、 图像分析单元 302、 图像处理单元 303、 数据传输单元 304。 其中, 视频采集单元 301同实施例一。 This embodiment describes the image processing unit in detail based on the first embodiment. As shown in FIG. 3, the terminal for implementing video communication in Embodiment 4 of the present invention includes: a video collection unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304. The video collection unit 301 is the same as the first embodiment.
图像分析单元 302用于从视频采集单元 301采集的视频序列中获 取视频图像, 并将视频图像分割为第一部分和第二部分。  The image analyzing unit 302 is configured to obtain a video image from the video sequence acquired by the video capturing unit 301, and divide the video image into a first portion and a second portion.
图像处理单元 303用于为图像分析单元 302发送的视频图像的第 一部分和第二部分编码。  The image processing unit 303 is for encoding the first portion and the second portion of the video image transmitted by the image analyzing unit 302.
参见图 9,图 9为本发明实施例四中图像处理单元的结构示意图。 如图 9所示, 在本发明的第四实施例中, 图像处理单元 303包括有图 像编码单元 903。 该图像编码单元 903用于将经过所述图像分析单元 302分割后的视频图像进行编码: 使用第一质量参数为视频图像的第 一部分编码, 并使用第二质量参数为视频图像的第二部分进行编码。 上述第一质量参数和第二质量参数用于控制编码时的压缩比例,例如 可以规定:质量参数值越大,数据量越小,相应图像质量下降也越大。 当然, 在实际应用中, 也可以规定: 质量参数值越小, 数据量越小, 相应图像质量下降也越大。  Referring to FIG. 9, FIG. 9 is a schematic structural diagram of an image processing unit according to Embodiment 4 of the present invention. As shown in Fig. 9, in the fourth embodiment of the present invention, the image processing unit 303 includes an image encoding unit 903. The image encoding unit 903 is configured to encode the video image divided by the image analyzing unit 302: encoding the first portion of the video image using the first quality parameter, and using the second quality parameter for the second portion of the video image coding. The first quality parameter and the second quality parameter are used to control the compression ratio at the time of encoding. For example, it may be specified that the larger the quality parameter value is, the smaller the data amount is, and the corresponding image quality is degraded. Of course, in practical applications, it can also be specified that the smaller the quality parameter value is, the smaller the data amount is, and the corresponding image quality is degraded.
质量参数的策略可由设定一默认值进行处理, 也可由用户在一定 范围内选择, 即由用户自己分别控制前景和背景的清晰程度。 因此较 佳地, 图像编码单元 903还可以进一步包括: 参数设置单元(图中未 示出), 用于设置上述第一质量参数和所述第二质量参数。 图像编码 单元 903 根据不同质量参数对前景和背景进行不同质量级别的视频 编码。 各区域的对应码流中携带本区域所对应的质量级别参数。  The quality parameter strategy can be processed by setting a default value, or by the user within a certain range, that is, the user himself controls the clarity of the foreground and the background separately. Therefore, the image encoding unit 903 may further include: a parameter setting unit (not shown) for setting the first quality parameter and the second quality parameter. The image coding unit 903 performs video coding of different quality levels on the foreground and the background according to different quality parameters. The corresponding code stream of each area carries the quality level parameter corresponding to the area.
对于接收端, 在收到上述终端发送的视频图像的编码数据后, 由 于不用质量级别的区域所对应码流均携带有本区域的质量级别信息, 所以只需要按照通用处理流程根据各区域对应参数恢复图像即可。  For the receiving end, after receiving the encoded data of the video image sent by the terminal, the code stream corresponding to the area without the quality level carries the quality level information of the local area, so only the corresponding processing parameters are required according to the general processing flow. Just restore the image.
数据传输单元用于输出图像处理单元发送的视频图像的编码数 据。 参见图 10, 图 10为本发明实施例四中实现视频通信的方法的流 程示意图。 如图 10所示, 该方法包括: The data transmission unit is configured to output encoded data of the video image transmitted by the image processing unit. Referring to FIG. 10, FIG. 10 is a schematic flowchart diagram of a method for implementing video communication according to Embodiment 4 of the present invention. As shown in FIG. 10, the method includes:
步骤 1001、 终端通过摄像头或类似装置采集视频序列。  Step 1001: The terminal collects a video sequence by using a camera or the like.
步骤 1002、 将视频序列中的视频图像分割为第一部分和第二部 分。  Step 1002: The video image in the video sequence is divided into a first part and a second part.
步骤 1003、使用第一质量参数为视频图像的第一部分编码, 并使 用第二质量参数为视频图像的第二部分编码。  Step 1003: Encode the first portion of the video image using the first quality parameter and encode the second portion of the video image using the second quality parameter.
质量参数的策略可采用默认值进行处理, 也可由用户在一定范围 内选择。 因此较佳地, 本步骤还可以进一步包括: 设置上述第一质量 参数和所述第二质量参数。  The quality parameter strategy can be processed with default values or by the user within a certain range. Therefore, preferably, the step further includes: setting the first quality parameter and the second quality parameter.
步骤 1004、 输出视频图像的上述编码数据。  Step 1004: Output the above encoded data of the video image.
本实施例的实现视频通信的终端及方法, 针对现有视频通信的特 点, 将例如由摄像头采集到的视频序列进行识别和分割, 并对不同部 分进行不同质量编码, 从而满足了视频通信用户的个性化需求, 解决 了现有技术中不能针对视频中不同区域进行不同清晰度处理的问题。 实施例五  The terminal and method for realizing video communication in this embodiment, for the characteristics of the existing video communication, identify and segment the video sequence collected by the camera, for example, and perform different quality coding on different parts, thereby satisfying the video communication user. Personalized requirements solve the problem that the prior art cannot handle different resolutions for different areas in the video. Embodiment 5
本实施例在实施例一的基础上对图像处理单元进行详细说明。 如 图 3所示, 本发明实施例五中实现视频通信的终端包括: 视频采集单 元 301、 图像分析单元 302、 图像处理单元 303、 数据传输单元 304。  This embodiment describes the image processing unit in detail based on the first embodiment. As shown in FIG. 3, the terminal for implementing video communication in Embodiment 5 of the present invention includes: a video capture unit 301, an image analysis unit 302, an image processing unit 303, and a data transmission unit 304.
其中, 视频采集单元 301同实施例一。  The video collection unit 301 is the same as the first embodiment.
图像分析单元 302用于从视频采集单元 301采集的视频序列中获 取视频图像, 并将视频图像分割为第一部分和第二部分。  The image analyzing unit 302 is configured to obtain a video image from the video sequence acquired by the video capturing unit 301, and divide the video image into a first portion and a second portion.
图像处理单元 303用于为图像分析单元 302发送的视频图像的第 一部分和第二部分编码。  The image processing unit 303 is for encoding the first portion and the second portion of the video image transmitted by the image analyzing unit 302.
参见图 11 , 图 11为本发明实施例五中图像处理单元的结构示意 图。 如图 11所示, 在本发明的第五实施例中, 图像处理单元 303包 括: 预处理单元 11031、 图像编码单元 11032。 Referring to FIG. 11, FIG. 11 is a schematic structural diagram of an image processing unit according to Embodiment 5 of the present invention; Figure. As shown in FIG. 11, in the fifth embodiment of the present invention, the image processing unit 303 includes: a pre-processing unit 11031, and an image encoding unit 11032.
其中, 预处理单元 11031用于对所述图像分析单元 302提供的视 频图像中的第二部分进行模糊预处理。 即读入视频图像后, 预处理单 元 11031在编码之前对第二部分进行模糊处理,将处理后的图像送入 图像编码单元 11032进行统一质量级别的编码。  The pre-processing unit 11031 is configured to perform fuzzy pre-processing on the second part of the video image provided by the image analyzing unit 302. That is, after reading the video image, the pre-processing unit 11031 performs blurring processing on the second portion before encoding, and sends the processed image to the image encoding unit 11032 for encoding at a uniform quality level.
图像编码单元 11032用于将经过所述图像分析单元 302提供的视 频图像的第一部分和经过预处理单元 11031 进行模糊处理后的第二 部分进行统一质量级别 (相同质量参数) 的编码。  The image encoding unit 11032 is configured to encode the first portion of the video image supplied through the image analyzing unit 302 and the second portion subjected to the blurring processing by the pre-processing unit 11031 to perform uniform quality level (same quality parameter).
对于接收端, 在收到上述终端发送的视频图像的编码数据后, 按 照一般操作流程, 以编码数据中携带的质量参数对视频图像进行编 码。  For the receiving end, after receiving the encoded data of the video image sent by the terminal, the video image is encoded according to the quality parameter carried in the encoded data according to a general operation procedure.
数据传输单元 304用于输出图像处理单元 303发送的视频图像的 编码数据。  The data transmission unit 304 is for outputting the encoded data of the video image transmitted by the image processing unit 303.
参见图 12, 图 12为本发明实施例五中实现视频通信的方法的流 程示意图。 如图 12所示, 该方法包括:  Referring to FIG. 12, FIG. 12 is a schematic flowchart of a method for implementing video communication according to Embodiment 5 of the present invention. As shown in Figure 12, the method includes:
步骤 1201、 终端通过摄像头或类似装置采集视频序列。  Step 1201: The terminal collects a video sequence by using a camera or the like.
步骤 1202、 将视频序列中的视频图像分割为第一部分和第二部 分。  Step 1202: Divide the video image in the video sequence into a first part and a second part.
步骤 1203、对视频图像的第二部分进行模糊预处理,使用相同质 量参数为视频图像的第一部分和经过模糊预处理后的视频图像的第 二部分编码。  Step 1203: Perform blur preprocessing on the second part of the video image, and encode the first part of the video image and the second part of the video image after the blur preprocessing using the same quality parameter.
其中, 模糊预处理是现有技术, 类似于常见的 "马赛克" 技术, 即对需要处理的部分利用一种事先选定的算法进行处理,最筒单的方 案就是将图像中所有像素和周围像素进行加权均值,达到模糊处理的 效果。 但是, 本领域技术人员可以意识到, 以上述模糊预处理方式为 例对本发明进行说明仅仅是示范性的,并不用于对本发明的保护范围 进行限定。 Among them, fuzzy preprocessing is a prior art, similar to the common "mosaic" technology, that is, the part that needs to be processed is processed by a pre-selected algorithm, and the most simple solution is to put all the pixels and surrounding pixels in the image. Weighted mean to achieve fuzzy processing Effect. However, those skilled in the art will appreciate that the present invention is described by way of example only, and is not intended to limit the scope of the present invention.
对视频图像的第二部分进行模糊预处理后, 使用相同的质量参数 为视频图像中的第一部分和经过模糊预处理后的视频图像的第二部 分编码。  After the second part of the video image is subjected to blur preprocessing, the same quality parameter is used to encode the first part of the video image and the second part of the video image after blur preprocessing.
步骤 1204、 输出视频图像的上述编码数据。  Step 1204: Output the above encoded data of the video image.
本实施例的实现视频通信的终端及方法, 针对现有视频通信的特 点, 将例如由摄像头采集到的视频序列进行识别和分割, 并对某些部 分进行模糊预处理后, 再对视频序列进行相同质量编码, 从而满足了 视频通信用户的个性化需求, 解决了现有技术中不能针对视频中不同 区域进行不同清晰度处理的问题。  The terminal and method for implementing video communication in this embodiment, for the characteristics of the existing video communication, identify and segment the video sequence collected by the camera, for example, and perform blur preprocessing on some parts, and then perform video sequence on the video sequence. The same quality coding, thereby satisfying the individualized requirements of the video communication user, solves the problem that the prior art cannot perform different resolution processing for different areas in the video.
下面, 以一个具体实施例对本发明进行更加详细的说明。 不过, 本领域技术人员可以意识到, 以本具体实施例为例对本发明进行说明 仅仅是示范性的, 并不用于对本发明的保护范围进行限定。  Hereinafter, the present invention will be described in more detail with reference to a specific embodiment. However, those skilled in the art should understand that the present invention is not limited by the scope of the present invention.
目前 IM视频通信的采用的都是传统的视频编码模式, 对视频序 列中的单个帧 (单幅图像) 甚至整个视频序列采用相同的质量参数, 以同样的清晰度呈现, 没有针对不同区域的不同处理。 这样当用户不 愿将自己所处的环境暴露给对方, 同时又想使用视频功能时, 没有比 较合适的解决方案。  At present, IM video communication adopts the traditional video coding mode, and uses the same quality parameters for a single frame (single image) or even the entire video sequence in the video sequence, and presents with the same definition, without different regions. deal with. This way, when users are not willing to expose their environment to each other and want to use the video function, there is no suitable solution.
因此, 本发明提供了一种实现视频通信的终端和方法用于解决上 述技术问题。 参见图 13 , 图 13为本发明实施例中实现视频通信的终 端的结构示意图。 如图 13所示, 该终端包括: 视频采集单元 1301、 图像分析单元 1302、 图像处理单元 1303、 数据传输单元 1304。  Accordingly, the present invention provides a terminal and method for implementing video communication for solving the above technical problems. Referring to FIG. 13, FIG. 13 is a schematic structural diagram of a terminal for implementing video communication according to an embodiment of the present invention. As shown in FIG. 13, the terminal includes: a video collection unit 1301, an image analysis unit 1302, an image processing unit 1303, and a data transmission unit 1304.
其中, 视频采集单元 1301 用于采集视频序列。 在本实施例中, 视频采集单元 1301可包括视频摄取装置 (例如与计算机连接的摄像 头等) 以及对应的视频图像处理软件。 The video collection unit 1301 is configured to collect a video sequence. In this embodiment, Video capture unit 1301 may include a video capture device (eg, a camera coupled to a computer, etc.) and corresponding video image processing software.
图像分析单元 1302用于将视频序列中的视频图像分割为前景部 分和背景部分, 其中前景部分即为第一部分, 而背景部分则为第二部 分。  The image analyzing unit 1302 is for dividing the video image in the video sequence into a foreground portion and a background portion, wherein the foreground portion is the first portion and the background portion is the second portion.
将视频序列中的视频图像分割为前景部分和背景部分的技术方 案可筒单分为:根据设置自动实现或需要用户在视频过程中交互两种 方式实现。  The technical solution for dividing the video image in the video sequence into the foreground portion and the background portion can be divided into two parts: automatic implementation according to settings or interaction between users in the video process.
1 ) 根据用户设置自动实现: 本方案在视频设置中提供选项 "自 动模糊背景,,,基本思想是只突出用户视频过程中的前景信息(人物 ) , 模糊背景信息, 用户勾选此设置后由软件自动划分出前景和背景, 对 前景和背景进行不同的处理。  1) Automatically implemented according to user settings: This program provides the option "Automatic Blur Background" in the video settings. The basic idea is to highlight only the foreground information (persons) in the user's video process, and to blur the background information. The software automatically divides the foreground and background, and handles the foreground and background differently.
2 ) 需要用户在视频过程中交互: 本方案在视频设置中提供选项 "手动模糊背景" , 基本思想是在交互过程中接收用户的实时参与, 根据用户参与随时调整前景和背景区域。  2) Users need to interact in the video process: This program provides the option "Manual Blur Background" in the video settings. The basic idea is to receive real-time participation of users during the interaction process, and adjust the foreground and background regions according to user participation.
在具体实现时, 图像分析单元 1302检测用户是否勾选了 "自动 模糊背景" 设置或 "手动模糊背景" 设置, 如果用户没有勾选这两个 选项,则按照一般操作流程,以相同的质量级别对视频图像进行编码; 质量参数随码流一起传送。  In a specific implementation, the image analyzing unit 1302 detects whether the user selects the "automatic blur background" setting or the "manual blur background" setting. If the user does not check the two options, the same quality level is followed according to the general operation flow. The video image is encoded; the quality parameters are transmitted along with the code stream.
如果用户勾选了 "自动模糊背景" 设置时, 图像分析单元 1302 可通过对比视频序列中前后图像帧(通常为位图), 从而分析前后图 像帧中象素变化, 获取视频图像中的前景部分和背景部分。 图像分析 单元 1302对比当前图像帧在视频序列中的前 3 - 5个图像帧,若前后 图像帧中同一位置的象素未变化的, 则该位置为静止点; 若前后帧中 同一位置的象素不同, 则该位置为移动点; 图像分析单元 1302将多 个移动点以及由上述多个移动点包围的静止点作为前景部分,其它部 分为背景部分。 If the user selects the "Auto Blur Background" setting, the image analysis unit 1302 can analyze the pixel changes in the image frames before and after, by comparing the front and rear image frames (usually bitmaps) in the video sequence to obtain the foreground portion of the video image. And the background section. The image analyzing unit 1302 compares the first 3 - 5 image frames of the current image frame in the video sequence. If the pixels in the same position in the preceding and following image frames are unchanged, the position is a stationary point; if the image in the same position in the preceding and succeeding frames If the elements are different, the position is a moving point; the image analyzing unit 1302 will be more The moving points and the still points surrounded by the plurality of moving points are the foreground portions, and the other portions are the background portions.
当然, 图像分析单元 1302也可使用任意现有的前景和背景识别 方式来获取视频图像中的前景部分和背景部分。  Of course, image analysis unit 1302 can also use any existing foreground and background recognition methods to obtain foreground and background portions of the video image.
如果用户勾选了 "手动模糊背景,, 设置时, 用户可以通过图像分 析单元 1302提供的选择工具 (用鼠标进行框选, 矩形, 圆形, 或者 自定义形状)对背景区域进行选取, 视频图像中的其余部分为前景部 分。 图像分析单元 1302还可以进一步提供一个复位选项, 用于取消 用户选择的背景区域, 恢复默认的视频输出。  If the user selects "Manually Blur Background," the user can select the background area by using the selection tool provided by the image analysis unit 1302 (frame selection, rectangle, circle, or custom shape), video image The rest of the portion is the foreground portion. The image analysis unit 1302 may further provide a reset option for canceling the background area selected by the user and restoring the default video output.
图像处理单元 1303用于将经过所述图像分析单元 1302分割后的 视频图像进行编码。 编码的方式采用以下两种中任意一种:  The image processing unit 1303 is for encoding the video image divided by the image analyzing unit 1302. The encoding method is either of the following two:
1 )图像处理单元 1303可以根据预先制定的策略分别为前景和背 景决定不同的质量参数, 对不同区域进行不同的处理。 即使用第一质 量参数为视频图像的第一部分编码,并使用第二质量参数为视频图像 的第二部分进行编码。上述第一质量参数和第二质量参数用于控制编 码时的压缩比例, 质量参数值越大, 数据量越小, 相应图像质量下降 也越大。  1) The image processing unit 1303 can determine different quality parameters for the foreground and the background according to a pre-defined strategy, and perform different processing on different regions. That is, the first portion of the video image is encoded using the first quality parameter and the second portion of the video image is encoded using the second quality parameter. The first quality parameter and the second quality parameter are used to control the compression ratio at the time of encoding. The larger the quality parameter value is, the smaller the data amount is, and the corresponding image quality is degraded.
当第一质量参数值等于第二质量参数值时, 解码播放的视频序列 的清晰度一致; 当第一质量参数值低于第二质量参数值时, 解码播放 的视频序列中, 第一部分的清晰度高于第二部分的清晰度; 当第一质 量参数值高于第二质量参数值时, 解码播放的视频序列中, 第一部分 的清晰度低于第二部分的清晰度。  When the first quality parameter value is equal to the second quality parameter value, the resolution of the decoded video sequence is consistent; when the first quality parameter value is lower than the second quality parameter value, the first part of the decoded video sequence is clear The degree is higher than the sharpness of the second part; when the first quality parameter value is higher than the second quality parameter value, the resolution of the first part in the decoded video sequence is lower than the sharpness of the second part.
例如第一质量参数低于第二质量参数。 此时, 接收端播放的视频 序列中, 背景部分(即第二质量参数值对应部分 )的清晰度较差(类 似马赛克效果), 从而可以隐藏视频序列中的部分内容。 例如隐藏视 频聊天时的周围环境。 For example, the first quality parameter is lower than the second quality parameter. At this time, in the video sequence played by the receiving end, the background portion (ie, the corresponding portion of the second quality parameter value) has poor definition (similar to the mosaic effect), so that part of the content in the video sequence can be hidden. Hidden view The surrounding environment when chatting.
上述质量参数的策略可采用默认值处理, 此外, 也可由用户在一 定范围内选择, 即由用户自己分别控制前景和背景的清晰程度, 即在 上述系统中, 还可包括一个参数设置模块, 用于设置第一质量参数和 第二质量参数的值。  The above-mentioned quality parameter strategy can be processed by default values. In addition, the user can also select within a certain range, that is, the user himself and the user can separately control the clarity of the foreground and the background, that is, in the above system, a parameter setting module can also be included. The values of the first quality parameter and the second quality parameter are set.
2 )图像处理单元 1303也可以读入视频图像后先对背景区域进行 预处理, 即在编码之前进行模糊处理, 将处理后的背景图像和未处理 的前景图像进行统一质量级别的编码。  2) The image processing unit 1303 may also pre-process the background area after reading the video image, that is, perform blur processing before encoding, and perform uniform quality level encoding on the processed background image and the unprocessed foreground image.
当然, 在实际应用中, 也可使用任意现有的其他编码方式来分别 编码视频图像中的前景部分和背景部分。  Of course, in practical applications, any other existing encoding methods may be used to separately encode the foreground portion and the background portion of the video image.
对于接收端, 由于不用区域对应码流均携带有本区域的质量级别 信息,所以只需要按照通用处理流程根据各区域对应参数恢复图像即 可。  For the receiving end, since the area corresponding to the code stream does not carry the quality level information of the area, it is only necessary to restore the image according to the corresponding parameters of each area according to the general processing flow.
较佳地, 对于恢复出的图像, 为了使画面整体的质量差别不至于 显得过于悬殊, 在接收端可以记录不同区域携带的质量参数是否不 同, 可以在前景区域和背景区域的交界处适当进行过渡处理。 这种过 度处理可以是平滑处理,用于避免被模糊的区域边界过于明显造成视 觉体验效果较差。  Preferably, for the restored image, in order to make the quality difference of the whole picture not appear too disparate, the receiving end can record whether the quality parameters carried in different areas are different, and the transition can be appropriately performed at the boundary between the foreground area and the background area. deal with. This over-processing can be smoothed to avoid the visually inconspicuous effect of the blurred area boundaries being too obvious.
数据传输单元 1304用于输出图像处理单元 1303发送的视频图像 的编码数据。  The data transmission unit 1304 is for outputting the encoded data of the video image transmitted by the image processing unit 1303.
参见图 14, 图 14为本发明实施例中实现视频通信的方法的结构 示意图。 如图 14所示, 是本发明一种视频通信方法实施例的流程图。 该方法用于实现终端之间的视频序列传输, 具体包括以下步骤:  Referring to FIG. 14, FIG. 14 is a schematic structural diagram of a method for implementing video communication according to an embodiment of the present invention. As shown in FIG. 14, it is a flowchart of an embodiment of a video communication method according to the present invention. The method is used to implement video sequence transmission between terminals, and specifically includes the following steps:
步骤 1401 : 终端通过摄像头或类似装置采集视频序列。  Step 1401: The terminal collects a video sequence through a camera or the like.
步骤 1402: 通过自动识别方式或手动选择方式, 将步骤 1401中 采集的视频序列中的视频图像分割为前景部分和背景部分,其中前景 部分即为第一部分, 而背景部分则为第二部分。 Step 1402: In step 1401, by automatic identification mode or manual selection mode The video image in the acquired video sequence is divided into a foreground portion and a background portion, wherein the foreground portion is the first portion and the background portion is the second portion.
步骤 1403: 分别对视频图像的上述前景部分和背景部分进行编 码。 编码方式可采用以下两种中任意一种:  Step 1403: Code the foreground portion and the background portion of the video image separately. The encoding method can be either of the following two:
1 ) 使用第一质量参数为视频图像中的第一部分编码并使用第二 质量参数为视频图像中的第二部分编码。  1) encoding the first portion of the video image using the first quality parameter and encoding the second portion of the video image using the second quality parameter.
此时, 本步骤中还可以进一步包括一个参数设置步骤, 用于设置 第一质量参数和第二质量参数的具体数值。  At this time, this step may further include a parameter setting step for setting specific values of the first quality parameter and the second quality parameter.
2 ) 对背景区域进行预处理, 即在编码之前进行模糊处理, 将处 理后的背景图像和未处理的前景图像进行统一质量级别的编码。  2) Pre-processing the background area, that is, performing blurring processing before encoding, and performing uniform quality level encoding on the processed background image and the unprocessed foreground image.
步骤 1404:将编码完成的视频图像编码数据发送到接收端解码播 放。 当第一质量参数值等于第二质量参数值或未进行预处理时, 解码 播放的视频序列的清晰度一致; 当第一质量参数值低于第二质量参数 值或对第二部分(背景部分)进行模糊预处理时, 解码播放的视频序 列中, 第一部分的清晰度高于第二部分的清晰度; 当第一质量参数值 高于第二质量参数值或对第一部分(前景部分) 进行模糊预处理时, 解码播放的视频序列中, 第一部分的清晰度低于第二部分的清晰度。  Step 1404: Send the encoded video image encoded data to the receiving end to decode the broadcast. When the first quality parameter value is equal to the second quality parameter value or not pre-processed, the resolution of the decoded video sequence is consistent; when the first quality parameter value is lower than the second quality parameter value or the second part (background part) When performing fuzzy preprocessing, the resolution of the first part of the decoded video sequence is higher than the resolution of the second part; when the first quality parameter value is higher than the second quality parameter value or the first part (foreground part) is performed In the fuzzy preprocessing, the resolution of the first part of the decoded video sequence is lower than the resolution of the second part.
较佳地, 为了改善视频序列解码播放的视觉效果, 在接收端对视 频图像的编码数据进行解码时, 还可以包括一个边界均衡步骤, 用于 在输出视频图像时对视频图像的前景部分和背景部分做过渡处理。从 而在解码播放视频序列时, 使第一部分和第二部分的边界平滑, 避免 被模糊的区域边界过于明显。  Preferably, in order to improve the visual effect of the video sequence decoding and playing, when the receiving end decodes the encoded data of the video image, a boundary equalization step may be further included for the foreground part and the background of the video image when outputting the video image. Part of the transition process. Therefore, when decoding the video sequence, the boundary between the first part and the second part is smoothed, and the boundary of the blurred area is prevented from being too obvious.
本实施例的实现视频通信的终端及方法, 针对现有视频通信的特 点, 通过对例如由摄像头采集到的视频图像采用自动识别或者手动选 择方式进行分割, 并对不同部分分别进行编码, 从而满足了视频通信 用户的个性化需求,解决了现有技术中不能针对视频中不同区域进行不 同清晰度处理的问题。 The terminal and method for realizing video communication in the embodiment, according to the characteristics of the existing video communication, by automatically identifying or manually selecting a video image collected by a camera, and separately encoding different parts, thereby satisfying Video communication The user's personalized needs solve the problem that the prior art cannot handle different resolutions for different areas in the video.
本发明还提供了一种实现视频通信的系统。 下面, 通过实施例对 该系统进行详细说明。  The present invention also provides a system for implementing video communication. Hereinafter, the system will be described in detail by way of examples.
参见图 15 , 图 15为本发明实施例中实现视频通信的系统的结构 示意图。 由图 15 可见, 该系统包括: 发送终端 1501 以及接收终端 1502。  Referring to FIG. 15, FIG. 15 is a schematic structural diagram of a system for implementing video communication according to an embodiment of the present invention. As can be seen from Fig. 15, the system includes: a transmitting terminal 1501 and a receiving terminal 1502.
其中, 发送终端 1501 用于将视频序列中的视频图像分割为至少 两个部分;为视频图像的上述至少两个部分中的每个部分分别进行编 码, 并输出上述视频图像的编码数据。 其中, 发送终端 1501进一步 用于采集视频序列。  The transmitting terminal 1501 is configured to divide the video image in the video sequence into at least two parts; separately encode each of the at least two parts of the video image, and output the encoded data of the video image. The sending terminal 1501 is further configured to collect a video sequence.
接收终端 1502用于从发送终端 1501接收所述视频图像的编码数 据, 并解码播放视频序列。 当然, 上述的视频传输可以是双向的。  The receiving terminal 1502 is configured to receive the encoded data of the video image from the transmitting terminal 1501 and decode the playing video sequence. Of course, the video transmission described above can be bidirectional.
具体来说, 发送终端 1501用于通过摄像头等装置采集视频序列, 并将视频序列编码后发送出去, 而接收终端 1502则例如可以通过网 络从发送终端 1501接收视频序列, 并解码播放视频序列。 当然, 上 述的视频传输可以是双向的。 在实际应用中, 上述实现视频通信的系 为了改善视频序列解码播放的视觉效果, 接收终端 1502还可以 进一步包括一个边界均衡单元(图中未示出) , 用于在输出视频图像 时对视频图像的至少两个部分做过渡处理。从而在解码播放视频序列 时, 使至少两个部分的边界平滑, 避免被模糊的区域边界过于明显。  Specifically, the transmitting terminal 1501 is configured to collect a video sequence by means of a camera or the like, and encode the video sequence and transmit the video sequence, and the receiving terminal 1502 can receive the video sequence from the transmitting terminal 1501 through the network, for example, and decode the playing video sequence. Of course, the video transmission described above can be bidirectional. In an actual application, in order to improve the visual effect of video sequence decoding and playback, the receiving terminal 1502 may further include a boundary equalization unit (not shown) for outputting the video image to the video image. At least two parts of the transition process. Therefore, when decoding the video sequence, the boundary of at least two parts is smoothed, and the boundary of the blurred area is prevented from being too obvious.
参见图 16, 图 16为本发明实施例中实现视频通信的系统方法的 流程示意图。 如图 16所示, 该方法包括:  Referring to FIG. 16, FIG. 16 is a schematic flowchart diagram of a system method for implementing video communication according to an embodiment of the present invention. As shown in Figure 16, the method includes:
步骤 1602、 将视频序列中的视频图像分割为至少两个部分。 步骤 1603、分别对视频图像的上述至少两个部分中的每个部分进 行编码。 Step 1602, dividing the video image in the video sequence into at least two parts. Step 1603, encoding each of the at least two portions of the video image separately.
步骤 1604、 输出上述视频图像的编码数据。  Step 1604: Output encoded data of the video image.
步骤 1605、 解码并播放上述视频图像。  Step 1605: Decode and play the above video image.
当然, 在实际应用中, 在步骤 1602前, 还可以进一步包括: 步 骤 1601、 采集视频序列。  Of course, in the actual application, before step 1602, the method further includes: Step 1601: Acquire a video sequence.
在步骤 1605 中, 还可以进一步包括: 在输出视频图像时, 对视 频图像的至少两个部分做过渡处理。 从而在解码播放视频序列时, 使 至少两个部分的边界平滑, 避免被模糊的区域过于明显。  In step 1605, the method further includes: performing a transition processing on at least two portions of the video image when the video image is output. Therefore, when decoding the video sequence, the boundary of at least two parts is smoothed, and the blurred area is prevented from being too obvious.
对于恢复出的图像为了使画面整体的质量差别不至于显得过于 悬殊, 接收终端在解码时需要记录不同区域携带的质量参数是否不 同, 在前景区域和背景区域的交界处适当进行过渡处理。  For the restored image, in order to make the quality difference of the whole picture not appear too disparate, the receiving terminal needs to record whether the quality parameters carried in different areas are different when decoding, and appropriately perform the transition processing at the boundary between the foreground area and the background area.
以上所述, 仅为本发明较佳的具体实施方式, 但本发明的保护范 围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技 术范围内, 可轻易想到的变化或替换, 都应涵盖在本发明的保护范围 之内。 因此, 本发明的保护范围应该以权利要求的保护范围为准。  The above description is only a preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or within the technical scope disclosed by the present invention. Alternatives are intended to be covered by the scope of the present invention. Therefore, the scope of protection of the present invention should be determined by the scope of the claims.
以上所述仅为本发明的较佳实施例而已, 并不用以限制本发明, 凡在本发明的精神和原则之内所作的任何修改、 等同替换和改进等, 均应包含在本发明的保护范围之内。  The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the protection of the present invention. Within the scope.

Claims

权利要求书 Claim
1、 一种实现视频通信的终端, 其特征在于, 包括:  A terminal for implementing video communication, comprising:
图像分析单元, 用于将视频序列中的视频图像分割为至少两个部 分;  An image analyzing unit, configured to divide the video image in the video sequence into at least two parts;
图像处理单元, 用于分别对视频图像的所述至少两个部分中的每 个部分进行编码;  An image processing unit, configured to respectively encode each of the at least two portions of the video image;
数据传输单元, 用于输出所述视频图像的编码数据。  a data transmission unit, configured to output encoded data of the video image.
2、 根据权利要求 1所述的终端, 其特征在于, 进一步包括: 视频采集单元, 用于采集视频序列, 将所述视频序列发送给所述 图像分析单元。  2. The terminal according to claim 1, further comprising: a video collection unit, configured to collect a video sequence, and send the video sequence to the image analysis unit.
3、 根据权利要求 2所述的终端, 其特征在于, 所述图像分析单 元包括:  3. The terminal according to claim 2, wherein the image analysis unit comprises:
图像识别单元, 用于分析所述视频序列中的视频图像, 从所述视 频图像中提取出前景部分作为第一部分,从所述视频图像中提取出背 景部分作为第二部分。  And an image recognition unit, configured to analyze a video image in the video sequence, extract a foreground portion from the video image as a first portion, and extract a background portion from the video image as a second portion.
4、 根据权利要求 2所述的终端, 其特征在于, 所述图像分析单 元包括:  4. The terminal according to claim 2, wherein the image analysis unit comprises:
图像选择单元, 用于从所述视频序列的视频图像中选择部分区 域, 并将选中区域作为第一部分, 将未选中区域作为第二部分。  And an image selecting unit, configured to select a partial region from the video image of the video sequence, and select the selected region as the first portion and the unselected region as the second portion.
5、 根据权利要求 3或 4所述的终端, 其特征在于, 所述图像处 理单元包括:  The terminal according to claim 3 or 4, wherein the image processing unit comprises:
图像编码单元, 用于使用第一质量参数为视频图像的所述第一部 分编码并使用第二质量参数为视频图像中的所述第二部分编码。  An image encoding unit for encoding the first portion of the video image using the first quality parameter and encoding the second portion in the video image using the second quality parameter.
6、 根据权利要求 5所述的终端, 其特征在于, 进一步包括: 参 数设置单元, 用于设置所述第一质量参数和第二质量参数。 The terminal according to claim 5, further comprising: a number setting unit, configured to set the first quality parameter and the second quality parameter.
7、 根据权利要求 3或 4所述的终端, 其特征在于, 所述图像处 理单元包括:  The terminal according to claim 3 or 4, wherein the image processing unit comprises:
预处理单元,用于对所述图像分析单元提供的视频图像中的第二 部分进行模糊预处理;  a pre-processing unit, configured to perform fuzzy pre-processing on the second part of the video image provided by the image analyzing unit;
图像编码单元, 用于使用相同质量参数为所述视频图像中由所述 图像分析单元提供的第一部分和经过模糊预处理后的所述第二部分 进行编码。  An image encoding unit for encoding the first portion of the video image provided by the image analyzing unit and the second portion subjected to blur preprocessing using the same quality parameter.
8、 一种实现视频通信的方法, 其特征在于, 包括:  8. A method for implementing video communication, comprising:
将视频序列中的视频图像分割为至少两个部分;  Segmenting the video image in the video sequence into at least two parts;
分别对视频图像的所述至少两个部分中的每个部分进行编码; 输出所述视频图像的编码数据。  Each of the at least two portions of the video image is encoded separately; the encoded data of the video image is output.
9、 根据权利要求 8所述的方法, 其特征在于, 进一步包括: 采 集视频序列。  9. The method of claim 8 further comprising: acquiring a video sequence.
10、 根据权利要求 9所述的方法, 其特征在于, 所述将视频序列 中的视频图像分割为至少两个部分包括:  10. The method according to claim 9, wherein the dividing the video image in the video sequence into at least two parts comprises:
分析所述视频序列中的视频图像,从所述视频图像中提取出前景 部分作为第一部分, 从所述视频图像中提取出背景部分作为第二部 分。  A video image in the video sequence is analyzed, a foreground portion is extracted from the video image as a first portion, and a background portion is extracted from the video image as a second portion.
11、 根据权利要求 9所述的方法, 其特征在于, 所述将视频序列 中的视频图像分割为至少两个部分包括:  The method according to claim 9, wherein the dividing the video image in the video sequence into at least two parts comprises:
从所述视频序列的视频图像中选择部分区域;  Selecting a partial region from a video image of the video sequence;
将选中区域作为第一部分, 并将未选中区域作为第二部分。 The selected area is taken as the first part, and the unselected area is taken as the second part.
12、 根据权利要求 10或 11所述的方法, 其特征在于, 所述分别 对视频图像的所述至少两个部分中的每个部分进行编码包括: 图像编码单元使用第一质量参数为视频图像中的所述第一部分 编码, 并使用第二质量参数为视频图像中的所述第二部分编码。 12. The method according to claim 10 or 11, wherein the encoding each of the at least two portions of the video image separately comprises: The image encoding unit encodes the first portion of the video image using the first quality parameter and encodes the second portion of the video image using the second quality parameter.
13、 根据权利要求 12所述的方法, 其特征在于, 进一步包括: 参数设置单元设置所述第一质量参数和所述第二质量参数。 13. The method according to claim 12, further comprising: a parameter setting unit setting the first quality parameter and the second quality parameter.
14、 根据权利要求 10或 11所述的方法, 其特征在于, 所述分别 对视频图像的所述至少两个部分中的每个部分进行编码包括: 14. The method of claim 10 or 11, wherein the encoding each of the at least two portions of the video image separately comprises:
预处理单元对视频图像中的第二部分进行模糊预处理;  The pre-processing unit performs fuzzy pre-processing on the second part of the video image;
图像编码单元使用相同质量参数为视频图像的所述第一部分和 经过模糊预处理后的视频图像的所述第二部分编码。  The image encoding unit encodes the first portion of the video image and the second portion of the blurred preprocessed video image using the same quality parameter.
15、 根据权利要求 8所述的方法, 其特征在于, 进一步包括: 解 码并播放所述视频图像。  15. The method of claim 8 further comprising: decoding and playing the video image.
16、 根据权利要求 15所述的方法, 其特征在于, 进一步包括: 在解码所述视频图像时, 对所述视频图像的至少两个部分做过渡处 理。  16. The method of claim 15, further comprising: performing a transition process on at least two portions of the video image when the video image is decoded.
17、 一种实现视频通信的系统, 其特征在于, 包括:  17. A system for implementing video communication, comprising:
发送终端, 用于将视频序列中的视频图像分割为至少两个部分; 分别对视频图像的所述至少两个部分中的每个部分进行编码,并输出 所述视频图像的编码数据;  a transmitting terminal, configured to divide a video image in the video sequence into at least two parts; respectively encode each of the at least two portions of the video image, and output encoded data of the video image;
接收终端, 用于从发送终端接收所述视频图像的编码数据, 并解 码、 播放所述视频图像。  And a receiving terminal, configured to receive encoded data of the video image from the transmitting terminal, and decode and play the video image.
18、 根据权利要求 17所述的系统, 其特征在于, 所述接收终端 进一步用于在解码所述视频图像时,对所述视频图像的至少两个部分 #文过渡处理。  The system according to claim 17, wherein the receiving terminal is further configured to perform transition processing on at least two parts of the video image when the video image is decoded.
19、 根据权利要求 17或 18所述的系统, 其特征在于, 所述发送 终端包括: 根据权利要求 1至 7任一项所述的终端。  The system according to claim 17 or 18, wherein the transmitting terminal comprises: the terminal according to any one of claims 1 to 7.
PCT/CN2008/070237 2007-03-28 2008-02-01 A terminal, method and system for realizing video communication WO2008116400A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2007100737198A CN101193261B (en) 2007-03-28 2007-03-28 Video communication system and method
CN200710073719.8 2007-03-28

Publications (1)

Publication Number Publication Date
WO2008116400A1 true WO2008116400A1 (en) 2008-10-02

Family

ID=39487965

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2008/070237 WO2008116400A1 (en) 2007-03-28 2008-02-01 A terminal, method and system for realizing video communication

Country Status (2)

Country Link
CN (1) CN101193261B (en)
WO (1) WO2008116400A1 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101998104B (en) * 2009-08-31 2013-05-29 中国移动通信集团公司 Generating method for video telephone and substitute video thereof
CN101668157B (en) 2009-09-24 2011-09-21 中兴通讯股份有限公司 Method used for privacy protection in video call, application server and system
JP5375490B2 (en) * 2009-09-29 2013-12-25 ソニー株式会社 Transmitting apparatus, receiving apparatus, communication system, and program
CN102630043B (en) * 2012-04-01 2014-11-12 北京捷成世纪科技股份有限公司 Object-based video transcoding method and device
CN103428483B (en) * 2012-05-16 2017-10-17 华为技术有限公司 A kind of media data processing method and equipment
CN105100671A (en) * 2014-05-20 2015-11-25 西安中兴新软件有限责任公司 Image processing method and device based on video call
CN107295360B (en) * 2016-04-13 2020-08-18 成都鼎桥通信技术有限公司 Video transmission method and device
CN105872448A (en) * 2016-05-31 2016-08-17 宇龙计算机通信科技(深圳)有限公司 Display method and device of video images in video calls
CN106550243A (en) * 2016-12-09 2017-03-29 武汉斗鱼网络科技有限公司 Live video processing method, device and electronic equipment
CN106851171A (en) * 2017-02-21 2017-06-13 福建江夏学院 Intimacy protection system and method are realized in video calling
CN107054937A (en) * 2017-03-23 2017-08-18 广东数相智能科技有限公司 A kind of refuse classification suggestion device and system based on image recognition
CN107493440A (en) * 2017-09-14 2017-12-19 光锐恒宇(北京)科技有限公司 A kind of method and apparatus of display image in the application
CN109862365B (en) * 2019-01-30 2022-01-11 西安万像电子科技有限公司 Image data processing method and device
CN111416939A (en) * 2020-03-30 2020-07-14 咪咕视讯科技有限公司 Video processing method, video processing equipment and computer readable storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1672164A (en) * 2002-06-26 2005-09-21 摩托罗拉公司 Method and apparatus for limiting storage or transmission of visual information
CN1717058A (en) * 2004-06-29 2006-01-04 三洋电机株式会社 Method and apparatus for coding images and method and apparatus for decoding the images
CN1875636A (en) * 2003-11-04 2006-12-06 松下电器产业株式会社 Video transmitting apparatus and video receiving apparatus

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11506296A (en) * 1996-03-28 1999-06-02 フィリップス エレクトロニクス ネムローゼ フェンノートシャップ Image encoding and decoding method and apparatus
US5764803A (en) * 1996-04-03 1998-06-09 Lucent Technologies Inc. Motion-adaptive modelling of scene content for very low bit rate model-assisted coding of video sequences
CN100369488C (en) * 1998-05-22 2008-02-13 松下电器产业株式会社 Data block noise detector and data block noise eliminator
EP1311124A1 (en) * 2001-11-13 2003-05-14 Matsushita Electric Industrial Co., Ltd. Selective protection method for images transmission
US7167519B2 (en) * 2001-12-20 2007-01-23 Siemens Corporate Research, Inc. Real-time video object generation for smart cameras
KR100608810B1 (en) * 2004-07-09 2006-08-08 엘지전자 주식회사 A method and a apparatus of improvement image quality at multimedia communication for mobile phone
CN100469132C (en) * 2004-07-28 2009-03-11 C&S技术有限公司 Method for secrete communication between visual telephones
CN100414997C (en) * 2004-09-29 2008-08-27 腾讯科技(深圳)有限公司 Quantization method for video data compression
CN1816149A (en) * 2005-02-06 2006-08-09 腾讯科技(深圳)有限公司 Filtering-wave method of removing blocking-effect in video-frequency picture and loop filter

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1672164A (en) * 2002-06-26 2005-09-21 摩托罗拉公司 Method and apparatus for limiting storage or transmission of visual information
CN1875636A (en) * 2003-11-04 2006-12-06 松下电器产业株式会社 Video transmitting apparatus and video receiving apparatus
CN1717058A (en) * 2004-06-29 2006-01-04 三洋电机株式会社 Method and apparatus for coding images and method and apparatus for decoding the images

Also Published As

Publication number Publication date
CN101193261A (en) 2008-06-04
CN101193261B (en) 2010-07-21

Similar Documents

Publication Publication Date Title
WO2008116400A1 (en) A terminal, method and system for realizing video communication
CN105959700B (en) Video image coding method, device, storage medium and terminal equipment
CN101141608B (en) Video instant communication system and method
EP1680928B1 (en) Methods of processing digital image and/or video data including luminance filtering based on chrominance data
JP2002531020A (en) Foreground information extraction method in stereoscopic image coding
US6539099B1 (en) System and method for visual chat
US20030058939A1 (en) Video telecommunication system
WO2005025219A2 (en) Video communications method and system
CN110139147B (en) Video processing method, system, mobile terminal, server and storage medium
CN109640169B (en) Video enhancement control method and device and electronic equipment
TWI620149B (en) Method, device, and system for pre-processing a video stream for subsequent motion detection processing
CN111476866B (en) Video optimization and playing method, system, electronic equipment and storage medium
US7388966B2 (en) System and method for visual chat
WO2009030597A1 (en) Method and apparatus for processing video data using cartoonization
CN115665477A (en) Video enhancement system, method and device
Loh et al. Quality assessment for natural and screen visual contents
JP2919236B2 (en) Image coding device
CN116366852A (en) Video coding and decoding method, device, equipment and medium for machine vision task
Zaghetto et al. Iterative pre-and post-processing for MRC layers of scanned documents
Tran et al. Spherical LSB Data Hiding in 360° Videos Using Morphological Operations
CN110784716B (en) Media data processing method, device and medium
Strutz Improved probability modelling for exception handling in lossless screen content coding
WO2023051705A1 (en) Video communication method and apparatus, electronic device, and computer readable medium
Watanabe et al. Traffic reduction in video call and chat using dnn-based image reconstruction
KR20230077602A (en) Method and apparatus for adaptive image pre-processing and reconstruction

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08706614

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC, EPO FORM 1205A DATED 01.02.2010

122 Ep: pct application non-entry in european phase

Ref document number: 08706614

Country of ref document: EP

Kind code of ref document: A1