Li et al., 2022 - Google Patents
FAIVconf: Face enhancement for AI-based video conference with low bit-rateLi et al., 2022
View PDF- Document ID
- 4967047199422630932
- Author
- Li Z
- Lin S
- Liu S
- Li S
- Lin X
- Wang W
- Jiang W
- Publication year
- Publication venue
- 2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
External Links
Snippet
Recently, high-quality video conferencing with fewer transmission bits becomes a very hot and challenging problem. We propose FAIVConf, a specially designed video compression framework for video conferencing, based on the effective neural human face generation …
- 230000001815 facial 0 abstract description 20
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00228—Detection; Localisation; Normalisation
- G06K9/00248—Detection; Localisation; Normalisation using facial parts and geometric relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00275—Holistic features and representations, i.e. based on the facial image taken as a whole
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding, e.g. from bit-mapped to non bit-mapped
- G06T9/001—Model-based coding, e.g. wire frame
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11290682B1 (en) | Background modification in video conferencing | |
Wang et al. | One-shot free-view neural talking-head synthesis for video conferencing | |
CN109376582B (en) | An Interactive Face Cartoon Method Based on Generative Adversarial Networks | |
US9232189B2 (en) | Background modification in video conferencing | |
Pearson | Developments in model-based video coding | |
Zhang et al. | Dinet: Deformation inpainting network for realistic face visually dubbing on high resolution video | |
Sinha et al. | Emotion-controllable generalized talking face generation | |
CN102271241A (en) | Image communication method and system based on facial expression/action recognition | |
Zhao et al. | Sparse to dense motion transfer for face image animation | |
Chen et al. | Compressed domain deep video super-resolution | |
Wang et al. | One-shot free-view neural talking-head synthesis for video conferencing | |
Stoffels et al. | Object‐oriented image analysis for very‐low‐bitrate video‐coding systems using the CNN universal machine | |
CN117896552B (en) | Video conference processing method, video conference system and related device | |
Lin et al. | SMNet: Synchronous multi-scale low light enhancement network with local and global concern | |
Wang et al. | Emotional talking head generation based on memory-sharing and attention-augmented networks | |
Isikdogan et al. | Eye contact correction using deep neural networks | |
Du et al. | Optical flow-based spatiotemporal sketch for video representation: A novel framework | |
Li et al. | FAIVconf: Face enhancement for AI-based video conference with low bit-rate | |
CN112200816B (en) | Method, device and equipment for region segmentation and hair replacement of video images | |
Agnolucci et al. | Perceptual quality improvement in videoconferencing using keyframes-based gan | |
Nijhawan et al. | 3DFlowRenderer: One-shot Face Re-enactment via Dense 3D Facial Flow Estimation | |
LU et al. | Ultra-lightweight face animation method for ultra-low bitrate video conferencing | |
Hegde et al. | Extreme-scale talking-face video upsampling with audio-visual priors | |
US20250029346A1 (en) | Method, system, and medium for enhancing a 3d image during electronic communication | |
Chang et al. | Motion-based convolutional neural networks for super-resolution from compressed videos |