[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Li et al., 2022 - Google Patents

FAIVconf: Face enhancement for AI-based video conference with low bit-rate

Li et al., 2022

View PDF
Document ID
4967047199422630932
Author
Li Z
Lin S
Liu S
Li S
Lin X
Wang W
Jiang W
Publication year
Publication venue
2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)

External Links

Snippet

Recently, high-quality video conferencing with fewer transmission bits becomes a very hot and challenging problem. We propose FAIVConf, a specially designed video compression framework for video conferencing, based on the effective neural human face generation …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • G06K9/00268Feature extraction; Face representation
    • G06K9/00281Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • G06K9/00228Detection; Localisation; Normalisation
    • G06K9/00248Detection; Localisation; Normalisation using facial parts and geometric relationships
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • G06K9/00268Feature extraction; Face representation
    • G06K9/00275Holistic features and representations, i.e. based on the facial image taken as a whole
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding, e.g. from bit-mapped to non bit-mapped
    • G06T9/001Model-based coding, e.g. wire frame

Similar Documents

Publication Publication Date Title
US11290682B1 (en) Background modification in video conferencing
Wang et al. One-shot free-view neural talking-head synthesis for video conferencing
CN109376582B (en) An Interactive Face Cartoon Method Based on Generative Adversarial Networks
US9232189B2 (en) Background modification in video conferencing
Pearson Developments in model-based video coding
Zhang et al. Dinet: Deformation inpainting network for realistic face visually dubbing on high resolution video
Sinha et al. Emotion-controllable generalized talking face generation
CN102271241A (en) Image communication method and system based on facial expression/action recognition
Zhao et al. Sparse to dense motion transfer for face image animation
Chen et al. Compressed domain deep video super-resolution
Wang et al. One-shot free-view neural talking-head synthesis for video conferencing
Stoffels et al. Object‐oriented image analysis for very‐low‐bitrate video‐coding systems using the CNN universal machine
CN117896552B (en) Video conference processing method, video conference system and related device
Lin et al. SMNet: Synchronous multi-scale low light enhancement network with local and global concern
Wang et al. Emotional talking head generation based on memory-sharing and attention-augmented networks
Isikdogan et al. Eye contact correction using deep neural networks
Du et al. Optical flow-based spatiotemporal sketch for video representation: A novel framework
Li et al. FAIVconf: Face enhancement for AI-based video conference with low bit-rate
CN112200816B (en) Method, device and equipment for region segmentation and hair replacement of video images
Agnolucci et al. Perceptual quality improvement in videoconferencing using keyframes-based gan
Nijhawan et al. 3DFlowRenderer: One-shot Face Re-enactment via Dense 3D Facial Flow Estimation
LU et al. Ultra-lightweight face animation method for ultra-low bitrate video conferencing
Hegde et al. Extreme-scale talking-face video upsampling with audio-visual priors
US20250029346A1 (en) Method, system, and medium for enhancing a 3d image during electronic communication
Chang et al. Motion-based convolutional neural networks for super-resolution from compressed videos