nanoVLM-LLM Public
Forked from huggingface/nanoVLM
The simplest, fastest repository for training/finetuning small-sized VLMs.
Python · Updated Jun 24, 2025
VLM-OD-smolvlm Public
Forked from shreydan/VLM-OD
experimental: finetune smolVLM on COCO (without any special <locXYZ> tokens)
Jupyter Notebook · Updated May 19, 2025
SmolVLM2_Test-VLM-LLM Public
Forked from Tokymin/SmolVLM2_Test
Python · Apache License 2.0 · Updated Apr 27, 2025
UNO-Diffusion-VTON Public
Forked from bytedance/UNO
🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
Python · Apache License 2.0 · Updated Apr 16, 2025
smol-captioner-smolvlm-llm Public
Forked from mustaphouni04/smol-captioner
SmolVLM adapted for captioning in Food Images
Python · Updated Apr 12, 2025
SmolVLM-Finetune-LLM Public
Forked from 2U1/SmolVLM-Finetune
An open-source implementation for fine-tuning SmolVLM.
Python · Apache License 2.0 · Updated Mar 31, 2025
VAR-Diffusion Public
Forked from FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Jupyter Notebook · MIT License · Updated Mar 22, 2025
transfusion-pytorch-VLM Public
Forked from lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Python · MIT License · Updated Mar 18, 2025
VLM2Vec Public
Forked from TIGER-AI-Lab/VLM2Vec
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25]
Python · Apache License 2.0 · Updated Mar 17, 2025
MusePose-human-video Public
Forked from TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
Python · Other · Updated Mar 5, 2025
VTON-project-catvton-train Public
Forked from ColdTbrew/VTON-project
Jupyter Notebook · Updated Mar 2, 2025
clip_finetune2 Public
Forked from Aorunfa/clip_finetune
CLIP fine-tuning
Jupyter Notebook · Updated Feb 19, 2025
catvton-flux-diffusion-try-on Public
Forked from nftblackmagic/catvton-flux
Python · MIT License · Updated Feb 7, 2025
try-off-anyone-diffusion-vton Public
Forked from ixarchakos/try-off-anyone
Official repository of "TryOffAnyone: Tiled Cloth Generation from a Dressed Person"
Python · Updated Jan 17, 2025
TPD-try-on-VTON-VITON-diffusion Public
Forked from Gal4way/TPD
This is the official repository for the paper "Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On". CVPR 2024
Python · Updated Jan 17, 2025
PromptDresser-VTON-diffusion Public
Forked from rlawjdghek/PromptDresser
PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask
Python · Updated Jan 11, 2025
OminiControl-diffusion-VTON Public
Forked from Yuanshi9815/OminiControl
A minimal and universal controller for FLUX.1.
Python · Apache License 2.0 · Updated Jan 7, 2025
WGF-VITON Public
Forked from soonchanpark/WGF-VITON
Official Implementation of WGF-VITON (Full-body Virtual Try-on using Top and Bottom Garments with Wearing Style Control, Computer Vision and Image Understanding 2024)
Python · Updated Dec 27, 2024
MV-VTON-VTON-VITON-Diffusion Public
Forked from hywang2002/MV-VTON
[AAAI 2025] MV-VTON: Multi-View Virtual Try-On with Diffusion Models
Python · Other · Updated Dec 10, 2024
tryoffdiff-diffusion-fashion-vton Public
Forked from rizavelioglu/tryoffdiff
Official repository of "TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models".