8000 xinyu1205 (Xinyu Huang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View xinyu1205's full-sized avatar

Block or report xinyu1205

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 573 14 Updated May 19, 2025

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 14,000 1,150 Updated May 19, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 940 26 Updated May 19, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,343 1,406 Updated May 16, 2025

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,049 40 Updated Apr 21, 2025

Official inference repo for FLUX.1 models

Python 21,709 1,541 Updated Feb 6, 2025

This repo contains the python code as well as the webpage html files for the Spice-E project from VAILab at TAU.

Jupyter Notebook 19 1 Updated Dec 9, 2024

Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enou…

Python 1,391 80 Updated May 16, 2025

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…

JavaScript 5,952 536 Updated May 18, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,847 1,617 Updated Feb 29, 2024

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 504 14 Updated May 13, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 603 23 Updated May 19, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,551 87 Updated Sep 27, 2024

Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.

Python 221 23 Updated Mar 15, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,189 5,987 Updated May 19, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 934 43 Updated Apr 15, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,244 51 Updated May 11, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 45,780 7,971 Updated May 14, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,184 969 Updated May 19, 2025

Explore the Multimodal “Aha Moment” on 2B Model

Python 585 20 Updated Mar 18, 2025

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,661 77 Updated Apr 18, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,773 295 Updated Mar 10, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,958 306 Updated May 11, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,380 170 Updated May 15, 2025

Code, Data and Red Teaming for ZeroBench

46 3 Updated May 3, 2025

Code for the Molmo Vision-Language Model

Python 421 36 Updated Dec 12, 2024

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,049 86 Updated Apr 3, 2025

Simple RL training for reasoning

Python 3,564 265 Updated Apr 10, 2025

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,700 201 Updated May 17, 2025
Next
0