Author: Han, Xiaoguang

research-article

Controllable Shape Modeling with Neural Generalized Cylinder

SA '24: SIGGRAPH Asia 2024 Conference Papers, Article No.: 80, Pages 1–11, https://doi.org/10.1145/3680528.3687617

Neural shape representation, such as neural signed distance field (NSDF), becomes more and more popular in shape modeling as its ability to deal with complex topology and arbitrary resolution. Due to the implicit manner to use features for shape ...

research-article

Towards Unified 3D Hair Reconstruction from Single-View Portraits

SA '24: SIGGRAPH Asia 2024 Conference Papers, Article No.: 114, Pages 1–11, https://doi.org/10.1145/3680528.3687597

Single-view 3D hair reconstruction is challenging, due to the wide range of shape variations among diverse hairstyles. Current state-of-the-art methods are specialized in recovering un-braided 3D hairs and often take braided styles as their failure cases, ...

research-article

MVImgNet2.0: A Larger-scale Dataset of Multi-view Images

ACM Transactions on Graphics (TOG), Volume 43, Issue 6, Article No.: 173, Pages 1–16, https://doi.org/10.1145/3687973

MVImgNet is a large-scale dataset that contains multi-view images of ~220k real-world objects in 238 classes. As a counterpart of ImageNet, it introduces 3D visual signals via multi-view shooting, making a soft bridge between 2D and 3D vision. This paper ...

research-article

StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

ACM Transactions on Graphics (TOG), Volume 43, Issue 6, Article No.: 250, Pages 1–18, https://doi.org/10.1145/3687971

This work addresses the challenge of high-quality surface normal estimation from monocular colored inputs (i.e., images and videos), a field which has recently been revolutionized by repurposing diffusion priors. However, previous attempts still struggle ...

research-article

GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details

ACM Transactions on Graphics (TOG), Volume 43, Issue 6, Article No.: 204, Pages 1–12, https://doi.org/10.1145/3687921

Neural implicit functions have brought impressive advances to the state-of-the-art of clothed human digitization from multiple or even single images. However, despite the progress, current arts still have difficulty generalizing to unseen images with ...

Article

DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors

Computer Vision – ECCV 2024, Pages 124–141, https://doi.org/10.1007/978-3-031-73254-6_8

Abstract

Text-to-3D generation has recently seen significant progress. To enhance its practicality in real-world applications, it is crucial to generate multiple independent objects with interactions, similar to layer-compositing in 2D image editing. ...

Article

GaussReg: Fast 3D Registration with Gaussian Splatting

Computer Vision – ECCV 2024, Pages 407–423, https://doi.org/10.1007/978-3-031-72633-0_23

Abstract

Point cloud registration is a fundamental problem for large-scale 3D scene scanning and reconstruction. With the help of deep learning, registration methods have evolved significantly, reaching a nearly-mature stage. As the introduction of Neural ...

Article

Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images

Computer Vision – ECCV 2024, Pages 465–482, https://doi.org/10.1007/978-3-031-73661-2_26

Abstract

This paper studies visual representation learning with diffusion-generated synthetic images. We start by uncovering that diffusion models’ cross-attention layers inherently provide annotation-free attention masks aligned with corresponding text ...

Article

SphereHead: Stable 3D Full-Head Synthesis with Spherical Tri-Plane Representation

Computer Vision – ECCV 2024, Pages 324–341, https://doi.org/10.1007/978-3-031-73226-3_19

Abstract

While recent advances in 3D-aware Generative Adversarial Networks (GANs) have aided the development of near-frontal view human face synthesis, the challenge of comprehensively synthesizing a full 3D head viewable from all angles still persists. ...

Article

Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement

Computer Vision – ECCV 2024, Pages 204–221, https://doi.org/10.1007/978-3-031-72667-5_12

Abstract

Previous low-light image enhancement (LLIE) approaches, while employing frequency decomposition techniques to address the intertwined challenges of low frequency (e.g., illumination recovery) and high frequency (e.g., noise reduction), primarily ...

Article

PIFu for the Real World: A Self-supervised Framework to Reconstruct Dressed Human from Single-View Images

Computational Visual Media, Pages 3–23, https://doi.org/10.1007/978-981-97-2095-8_1

Abstract

It is very challenging to accurately reconstruct sophisticated human geometry caused by various poses and garments from a single image. Recently, works based on pixel-aligned implicit function (PIFu) have made a big step and achieved state-of-the-...

research-article

Open Access

Transfer force perception skills to robot‐assisted laminectomy via imitation learning from human demonstrations

CAAI Transactions on Intelligence Technology (CIT2), Volume 9, Issue 4, Pages 903–916, https://doi.org/10.1049/cit2.12331

Abstract

A comparative study of two force perception skill learning approaches for robot‐assisted spinal surgery, the impedance model method and the imitation learning (IL) method, is presented. The impedance model method develops separate models for the ...

research-article

Contrastive Open-Set Active Learning-Based Sample Selection for Image Classification

IEEE Transactions on Image Processing (TIP), Volume 33, Pages 5525–5537, https://doi.org/10.1109/TIP.2024.3451928

In this paper, we address a complex but practical scenario in Active Learning (AL) known as open-set AL, where the unlabeled data consists of both in-distribution (ID) and out-of-distribution (OOD) samples. Standard AL methods will fail in this scenario ...

research-article

From NeRFLiX to NeRFLiX++: A General NeRF-Agnostic Restorer Paradigm

IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 5, Pages 3422–3437, https://doi.org/10.1109/TPAMI.2023.3343395

Neural radiance fields (NeRF) have shown great success in novel view synthesis. However, recovering high-quality details from real-world scenes is still challenging for the existing NeRF-based approaches, due to the potential imperfect calibration ...

research-article

A comprehensive benchmark for neural human radiance fields

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing Systems, Article No.: 1525, Pages 35107–35120

The past two years have witnessed a significant increase in interest concerning NeRF-based human body rendering. While this surge has propelled considerable advancements, it has also led to an influx of methods and datasets. This explosion complicates ...

research-article

CODA: generalizing to open and unseen domains with compaction and disambiguation

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing Systems, Article No.: 560, Pages 12746–12759

The generalization capability of machine learning systems degenerates notably when the test distribution drifts from the training distribution. Recently, Domain Generalization (DG) has been gaining momentum in enabling machine learning models to ...

research-article

SeamlessNeRF: Stitching Part NeRFs with Gradient Propagation

SA '23: SIGGRAPH Asia 2023 Conference Papers, Article No.: 33, Pages 1–10, https://doi.org/10.1145/3610548.3618238

Neural Radiance Fields (NeRFs) have emerged as promising digital mediums of 3D objects and scenes, sparking a surge in research to extend the editing capabilities in this domain. The task of seamless editing and merging of multiple NeRFs, resembling the ...

research-article

EMS: 3D Eyebrow Modeling from Single-View Images

ACM Transactions on Graphics (TOG), Volume 42, Issue 6, Article No.: 269, Pages 1–19, https://doi.org/10.1145/3618323

Eyebrows play a critical role in facial expression and appearance. Although the 3D digitization of faces is well explored, less attention has been drawn to 3D eyebrow modeling. In this work, we propose EMS, the first learning-based framework for single-...

research-article

PointMatch: A consistency training framework for weakly supervised semantic segmentation of 3D point clouds

Computers and Graphics (CGRS), Volume 116, Issue C, Pages 427–436, https://doi.org/10.1016/j.cag.2023.09.006

Abstract

Semantic segmentation of point cloud usually relies on dense annotation that is exhausting and costly, so it attracts wide attention to investigate solutions for the weakly supervised scheme with only sparse points annotated. Existing works start ...

Highlights

A consistency training framework for efficient learning from sparse semantic labels.
Propose to probe and well exploit super-points to promote the pseudo-label quality.
Extensive experiments validate the effectiveness and ...

research-article

RecolorNeRF: Layer Decomposed Radiance Fields for Efficient Color Editing of 3D Scenes

MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 8004–8015, https://doi.org/10.1145/3581783.3611957

Radiance fields have gradually become a main representation of media. Although its appearance editing has been studied, how to achieve view-consistent recoloring in an efficient manner is still under explored. We present RecolorNeRF, a novel user-...
