Stars
Tabular Deep Learning Library for PyTorch
A comprehensive toolkit and benchmark for tabular data learning, featuring 30+ deep methods, more than 10 classical methods, and 300 diverse tabular datasets.
LLM Inference with Deep Learning Accelerator.
[ICLR 2025] TabDiff: a Mixed-type Diffusion Model for Tabular Data Generation
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)
This github repository will record interesting articles on the arXiv website every week (based on my own opinions only).
A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
High-Resolution Image Synthesis with Latent Diffusion Models
StyleShot: A SnapShot on Any Style. 一款可以迁移任意风格到任意内容的模型,无需针对图片微调,即能生成高质量的个性风格化图片!
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
[CSUR] A Survey on Video Diffusion Models
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
A curated list of recent diffusion models for video generation, editing, and various other applications.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
A Library for Advanced Deep Time Series Models.
Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper
An implementation of the BERT model and its related downstream tasks based on the PyTorch framework. @月来客栈
Kaggle Humpback whale identification: 2xGPU Data augmentation + FP16 mixed precision training
Tracking and collecting papers/projects/others related to Segment Anything.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Reading list for research topics in Masked Image Modeling
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…