suchot

🎈

life

suchot

🎈

life

18 followers · 63 following

Achievements

Highlights

Lists (3)

Sort

Starred repositories

bilibili / Index-anisora

Python 1,245 52 Updated May 27, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,126 41 Updated May 21, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 3,049 247 Updated May 30, 2025

modelscope / ImagePulse

Open Image Curation Tools

Python 31 1 Updated Apr 22, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,315 2,240 Updated Feb 1, 2025

lodestone-rock / flow

Python 97 13 Updated May 5, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 34,287 4,915 Updated May 30, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 8,729 788 Updated May 19, 2025

Yuanshi9815 / OminiControl

A minimal and universal controller for FLUX.1.

Python 1,598 112 Updated May 13, 2025

hao-ai-lab / FastVideo

FastVideo is a unified framework for accelerated video generation.

Python 1,468 96 Updated May 31, 2025

christopher-beckham / k-diffusing

Python 4 Updated Feb 3, 2025

chenditc / investment_data

Scripts and doc for https://www.dolthub.com/repositories/chenditc/investment_data

Python 459 70 Updated May 30, 2025

deepseek-ai / DeepSeek-R1

89,744 11,592 Updated Apr 9, 2025

bytedance / Valley

Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.

Python 234 14 Updated Feb 27, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 3,865 379 Updated May 30, 2025

microsoft / TRELLIS

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 9,615 804 Updated May 30, 2025

JunyaoHu / common_metrics_on_video_quality

You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.

Python 379 14 Updated Jan 6, 2025

Francis-Rings / StableAnimator

[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…

Python 1,307 82 Updated Apr 24, 2025