-
University of California, San Diego
- La Jolla, CA
- https://xzhang.dev
Highlights
- Pro
Stars
Constrained decoding utilities for text generation using Huggingface seq2seq models
C++ libraries and programs demonstrating mesh processing research published in ACM SIGGRAPH (1992-2003)
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View, 3DV2025
Refine high-quality datasets and visual AI models
A lightweight web dashboard for monitoring GPU usage
repository of "Towards High-Fidelity Single-view Holistic Reconstruction of Indoor Scenes" ECCV2022
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
The code of "Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting"
Spring Framework Kotlin APIs, the functional way
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Large scale K-means and K-nn implementation on NVIDIA GPU / CUDA
NGINX and NGINX Plus Ingress Controllers for Kubernetes
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
Node.js library for creating Guacamole-compatible servers. Guacamole is a RDP/VNC/SSH/Telnet client for HTML5 browsers.
Fully featured implementation of Routing Transformer
Long-Term Feature Banks for Detailed Video Understanding
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Implementation of the paper Video Action Transformer Network
Learning to Discriminate Information for Online Action Detection, CVPR 2020
DSNet: A Flexible Detect-to-Summarize Network for Video Summarization
Evaluation code for the MPII human pose dataset
Adapter for using HPE FlexibleLOM cards in full height PCIe slots
End-to-End Object Detection with Transformers
UPSNet: A Unified Panoptic Segmentation Network
Animation course with Manim