Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
The simplest, fastest repository for training/finetuning small-sized VLMs.
Free MLOps course from DataTalks.Club
Official Pytorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
[CVPR 2024 Highlight] Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
A collection of papers and codes for human pose transfer
[ICCV 2023] The official implementation of paper "HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation"
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
An LLM playground you can run on your laptop
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
Learn AI/ML for beginners with a roadmap and free resources.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Minimal reproduction of DeepSeek R1-Zero
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Fully open reproduction of DeepSeek-R1
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Document to Markdown OCR library with Llama 3.2 vision
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Implements VAR+CLIP for text-to-image (T2I) generation
A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.