Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
SEED-Voken: A Series of Powerful Visual Tokenizers
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simpl…
Official inference repo for FLUX.1 models
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
High-Resolution Image Synthesis with Latent Diffusion Models
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
A natural language interface for computers
A feature-rich command-line audio/video downloader
Chat with MLX is a high-performance macOS application that connects your local documents to a personalized large language model (LLM).
Building a semantic search engine for Gmail using OpenAI embedding's model + Pinecone vector storage
Master programming by recreating your favorite technologies from scratch.
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Improved AnimateAnyone implementation that allows you to use the opse image sequence and reference image to generate stylized video
Better CPUs for Super Smash Bros Melee built in C
A web application created with Flask, Python, HTML, JavaScript, and CSS that uses machine learning to analyze NBA players' contracts and determine whether players are underpaid or overpaid.
An iOS app developed for runners to log run details and share with others!