- shanghai
Stars
Witness the aha moment of VLM with less than $3.
Fully open reproduction of DeepSeek-R1
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
A toolkit for developing and comparing reinforcement learning algorithms.
Machine learning compiler based on MLIR for Sophgo TPU.
Unofficial implementation of LSQ-Net, a neural network quantization framework
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
Official Pytorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
A Unified Library for Parameter-Efficient and Modular Transfer Learning
bert-base-chinese example
Implementation of Nougat Neural Optical Understanding for Academic Documents
ICCV2023 | Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation
Official implementation of Deep Factorized Metric Learning.
CVPR2023 - Rethinking Federated Learning with Domain Shift: A Prototype View
[CVPR 2023] Adaptive Sparse Pairwise Loss for Object Re-Identification
SnP: Large-Scale Training Data Search for Object Re-Identification (CVPR 2023)
[CVPR2023] Twins Contrastive Search of Multi-Scale Interaction for Object Re-Identification
Hierarchical Fine-Grained Image Forgery Detection and Localization (CVPR2023 and IJCV2024)
[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥