8000 suchot / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View suchot's full-sized avatar
🎈
life
🎈
life

Highlights

  • Pro

Block or report suchot

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 1,245 52 Updated May 27, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,126 41 Updated May 21, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 3,049 247 Updated May 30, 2025

Open Image Curation Tools

Python 31 1 Updated Apr 22, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,315 2,240 Updated Feb 1, 2025
Python 97 13 Updated May 5, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 34,287 4,915 Updated May 30, 2025

Enjoy the magic of Diffusion models!

Python 8,729 788 Updated May 19, 2025

A minimal and universal controller for FLUX.1.

Python 1,598 112 Updated May 13, 2025

FastVideo is a unified framework for accelerated video generation.

Python 1,468 96 Updated May 31, 2025
Python 4 Updated Feb 3, 2025

Scripts and doc for https://www.dolthub.com/repositories/chenditc/investment_data

Python 459 70 Updated May 30, 2025

Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.

Python 234 14 Updated Feb 27, 2025

A PyTorch native platform for training generative AI models

Python 3,865 379 Updated May 30, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 9,615 804 Updated May 30, 2025

You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.

Python 379 14 Updated Jan 6, 2025

[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…

Python 1,307 82 Updated Apr 24, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,860 1,750 Updated Feb 26, 2025

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Python 2,285 166 Updated May 30, 2025

🎬 人人影视 机器人和网站,包含人人影视全部资源以及众多网友的网盘分享

Python 15,290 1,848 Updated May 23, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 10,182 898 Updated May 23, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 13,850 1,688 Updated May 30, 2025

InstantIR: Blind Image Restoration with Instant Generative Reference 🔥

Python 515 53 Updated Nov 14, 2024

Example models using DeepSpeed

Python 6,511 1,092 Updated May 23, 2025

experimental implementation of Consistory

Python 20 3 Updated Jul 15, 2024

Unofficial PyTorch Implementation for paper FlashFace

Python 15 3 Updated Apr 9, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 11,478 1,106 Updated May 14, 2025

A general fine-tuning kit geared toward diffusion models.

Python 2,347 221 Updated May 19, 2025
Jupyter Notebook 184 17 Updated Jul 12, 2024
Next
0