- research-article, December 2024
Controllable Shape Modeling with Neural Generalized Cylinder
SA '24: SIGGRAPH Asia 2024 Conference Papers, Article No.: 80, Pages 1–11, https://doi.org/10.1145/3680528.3687617
Neural shape representation, such as neural signed distance field (NSDF), becomes more and more popular in shape modeling as its ability to deal with complex topology and arbitrary resolution. Due to the implicit manner to use features for shape ...
- research-article, December 2024
Towards Unified 3D Hair Reconstruction from Single-View Portraits
SA '24: SIGGRAPH Asia 2024 Conference Papers, Article No.: 114, Pages 1–11, https://doi.org/10.1145/3680528.3687597
Single-view 3D hair reconstruction is challenging, due to the wide range of shape variations among diverse hairstyles. Current state-of-the-art methods are specialized in recovering un-braided 3D hairs and often take braided styles as their failure cases, ...
- research-article, November 2024
MVImgNet2.0: A Larger-scale Dataset of Multi-view Images
- Yushuang Wu,
- Luyue Shi,
- Haolin Liu,
- Hongjie Liao,
- Lingteng Qiu,
- Weihao Yuan,
- Xiaodong Gu,
- Zilong Dong,
- Shuguang Cui,
- Xiaoguang Han
ACM Transactions on Graphics (TOG), Volume 43, Issue 6, Article No.: 173, Pages 1–16, https://doi.org/10.1145/3687973
MVImgNet is a large-scale dataset that contains multi-view images of ~220k real-world objects in 238 classes. As a counterpart of ImageNet, it introduces 3D visual signals via multi-view shooting, making a soft bridge between 2D and 3D vision. This paper ...
- research-article, November 2024
StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal
- Chongjie Ye,
- Lingteng Qiu,
- Xiaodong Gu,
- Qi Zuo,
- Yushuang Wu,
- Zilong Dong,
- Liefeng Bo,
- Yuliang Xiu,
- Xiaoguang Han
ACM Transactions on Graphics (TOG), Volume 43, Issue 6, Article No.: 250, Pages 1–18, https://doi.org/10.1145/3687971
This work addresses the challenge of high-quality surface normal estimation from monocular colored inputs (i.e., images and videos), a field which has recently been revolutionized by repurposing diffusion priors. However, previous attempts still struggle ...
- research-article, November 2024
GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details
- Zhongjin Luo,
- Haolin Liu,
- Chenghong Li,
- Wanghao Du,
- Zirong Jin,
- Wanhu Sun,
- Yinyu Nie,
- Weikai Chen,
- Xiaoguang Han
ACM Transactions on Graphics (TOG), Volume 43, Issue 6, Article No.: 204, Pages 1–12, https://doi.org/10.1145/3687921
Neural implicit functions have brought impressive advances to the state-of-the-art of clothed human digitization from multiple or even single images. However, despite the progress, current arts still have difficulty generalizing to unseen images with ...
- Article, November 2024
DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors
- Zizheng Yan,
- Jiapeng Zhou,
- Fanpeng Meng,
- Yushuang Wu,
- Lingteng Qiu,
- Zisheng Ye,
- Shuguang Cui,
- Guanying Chen,
- Xiaoguang Han
Abstract: Text-to-3D generation has recently seen significant progress. To enhance its practicality in real-world applications, it is crucial to generate multiple independent objects with interactions, similar to layer-compositing in 2D image editing. ...
- Article, November 2024
Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images
- David Junhao Zhang,
- Mutian Xu,
- Jay Zhangjie Wu,
- Chuhui Xue,
- Wenqing Zhang,
- Xiaoguang Han,
- Song Bai,
- Mike Zheng Shou
Abstract: This paper studies visual representation learning with diffusion-generated synthetic images. We start by uncovering that diffusion models’ cross-attention layers inherently provide annotation-free attention masks aligned with corresponding text ...
- Article, November 2024
SphereHead: Stable 3D Full-Head Synthesis with Spherical Tri-Plane Representation
Abstract: While recent advances in 3D-aware Generative Adversarial Networks (GANs) have aided the development of near-frontal view human face synthesis, the challenge of comprehensively synthesizing a full 3D head viewable from all angles still persists. ...
- Article, September 2024
Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement
Abstract: Previous low-light image enhancement (LLIE) approaches, while employing frequency decomposition techniques to address the intertwined challenges of low frequency (e.g., illumination recovery) and high frequency (e.g., noise reduction), primarily ...
- Article, April 2024
PIFu for the Real World: A Self-supervised Framework to Reconstruct Dressed Human from Single-View Images
Abstract: It is very challenging to accurately reconstruct sophisticated human geometry caused by various poses and garments from a single image. Recently, works based on pixel-aligned implicit function (PIFu) have made a big step and achieved state-of-the-...
- research-article, March 2024
Transfer force perception skills to robot‐assisted laminectomy via imitation learning from human demonstrations
CAAI Transactions on Intelligence Technology (CIT2), Volume 9, Issue 4, Pages 903–916, https://doi.org/10.1049/cit2.12331
Abstract: A comparative study of two force perception skill learning approaches for robot‐assisted spinal surgery, the impedance model method and the imitation learning (IL) method, is presented. The impedance model method develops separate models for the ...
- research-article, September 2024
Contrastive Open-Set Active Learning-Based Sample Selection for Image Classification
IEEE Transactions on Image Processing (TIP), Volume 33, Pages 5525–5537, https://doi.org/10.1109/TIP.2024.3451928
In this paper, we address a complex but practical scenario in Active Learning (AL) known as open-set AL, where the unlabeled data consists of both in-distribution (ID) and out-of-distribution (OOD) samples. Standard AL methods will fail in this scenario ...
- research-article, December 2023
From NeRFLiX to NeRFLiX++: A General NeRF-Agnostic Restorer Paradigm
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 5, Pages 3422–3437, https://doi.org/10.1109/TPAMI.2023.3343395
Neural radiance fields (NeRF) have shown great success in novel view synthesis. However, recovering high-quality details from real-world scenes is still challenging for the existing NeRF-based approaches, due to the potential imperfect calibration ...
- research-article, May 2024
A comprehensive benchmark for neural human radiance fields
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing Systems, Article No.: 1525, Pages 35107–35120
The past two years have witnessed a significant increase in interest concerning NeRF-based human body rendering. While this surge has propelled considerable advancements, it has also led to an influx of methods and datasets. This explosion complicates ...
- research-article, May 2024
CODA: generalizing to open and unseen domains with compaction and disambiguation
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing Systems, Article No.: 560, Pages 12746–12759
The generalization capability of machine learning systems degenerates notably when the test distribution drifts from the training distribution. Recently, Domain Generalization (DG) has been gaining momentum in enabling machine learning models to ...
- research-article, December 2023
SeamlessNeRF: Stitching Part NeRFs with Gradient Propagation
SA '23: SIGGRAPH Asia 2023 Conference Papers, Article No.: 33, Pages 1–10, https://doi.org/10.1145/3610548.3618238
Neural Radiance Fields (NeRFs) have emerged as promising digital mediums of 3D objects and scenes, sparking a surge in research to extend the editing capabilities in this domain. The task of seamless editing and merging of multiple NeRFs, resembling the ...
- research-article, December 2023
EMS: 3D Eyebrow Modeling from Single-View Images
ACM Transactions on Graphics (TOG), Volume 42, Issue 6, Article No.: 269, Pages 1–19, https://doi.org/10.1145/3618323
Eyebrows play a critical role in facial expression and appearance. Although the 3D digitization of faces is well explored, less attention has been drawn to 3D eyebrow modeling. In this work, we propose EMS, the first learning-based framework for single-...
- research-article, March 2024
PointMatch: A consistency training framework for weakly supervised semantic segmentation of 3D point clouds
Computers and Graphics (CGRS), Volume 116, Issue C, Pages 427–436, https://doi.org/10.1016/j.cag.2023.09.006
Abstract: Semantic segmentation of point cloud usually relies on dense annotation that is exhausting and costly, so it attracts wide attention to investigate solutions for the weakly supervised scheme with only sparse points annotated. Existing works start ...
Highlights:
- A consistency training framework for efficient learning from sparse semantic labels.
- Propose to probe and well exploit super-points to promote the pseudo-label quality.
- Extensive experiments validate the effectiveness and ...
- research-article, October 2023
RecolorNeRF: Layer Decomposed Radiance Fields for Efficient Color Editing of 3D Scenes
MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 8004–8015, https://doi.org/10.1145/3581783.3611957
Radiance fields have gradually become a main representation of media. Although its appearance editing has been studied, how to achieve view-consistent recoloring in an efficient manner is still under explored. We present RecolorNeRF, a novel user-...