Li et al., 2022 - Google Patents

FAIVconf: Face enhancement for AI-based video conference with low bit-rate

Li et al., 2022

Document ID: 4967047199422630932
Author: Li Z; Lin S; Liu S; Li S; Lin X; Wang W; Jiang W
Publication year: 2022
Publication venue: 2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)

External Links

Cited by

Snippet

Recently, high-quality video conferencing with fewer transmission bits becomes a very hot and challenging problem. We propose FAIVConf, a specially designed video compression framework for video conferencing, based on the effective neural human face generation …

Continue reading at arxiv.org (PDF) (other versions)

230000001815 facial 0 abstract description 20

Classifications

- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00228—Detection; Localisation; Normalisation
- G06K9/00248—Detection; Localisation; Normalisation using facial parts and geometric relationships
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00275—Holistic features and representations, i.e. based on the facial image taken as a whole
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding, e.g. from bit-mapped to non bit-mapped
- G06T9/001—Model-based coding, e.g. wire frame

Similar Documents

Publication	Publication Date	Title
US11290682B1 (en)	2022-03-29	Background modification in video conferencing
Wang et al.	2021	One-shot free-view neural talking-head synthesis for video conferencing
CN109376582B (en)	2022-07-29	An Interactive Face Cartoon Method Based on Generative Adversarial Networks
US9232189B2 (en)	2016-01-05	Background modification in video conferencing
Pearson	1995	Developments in model-based video coding
Zhang et al.	2023	Dinet: Deformation inpainting network for realistic face visually dubbing on high resolution video
Sinha et al.	2022	Emotion-controllable generalized talking face generation
CN102271241A (en)	2011-12-07	Image communication method and system based on facial expression/action recognition
Zhao et al.	2021	Sparse to dense motion transfer for face image animation
Chen et al.	2021	Compressed domain deep video super-resolution
Wang et al.	2020	One-shot free-view neural talking-head synthesis for video conferencing
Stoffels et al.	1997	Object‐oriented image analysis for very‐low‐bitrate video‐coding systems using the CNN universal machine
CN117896552B (en)	2024-07-12	Video conference processing method, video conference system and related device
Lin et al.	2023	SMNet: Synchronous multi-scale low light enhancement network with local and global concern
Wang et al.	2023	Emotional talking head generation based on memory-sharing and attention-augmented networks
Isikdogan et al.	2020	Eye contact correction using deep neural networks
Du et al.	2024	Optical flow-based spatiotemporal sketch for video representation: A novel framework
Li et al.	2022	FAIVconf: Face enhancement for AI-based video conference with low bit-rate
CN112200816B (en)	2025-01-24	Method, device and equipment for region segmentation and hair replacement of video images
Agnolucci et al.	2023	Perceptual quality improvement in videoconferencing using keyframes-based gan
Nijhawan et al.	2024	3DFlowRenderer: One-shot Face Re-enactment via Dense 3D Facial Flow Estimation
LU et al.	2023	Ultra-lightweight face animation method for ultra-low bitrate video conferencing
Hegde et al.	2022	Extreme-scale talking-face video upsampling with audio-visual priors
US20250029346A1 (en)	2025-01-23	Method, system, and medium for enhancing a 3d image during electronic communication
Chang et al.	2023	Motion-based convolutional neural networks for super-resolution from compressed videos