xinyu1205

Xinyu Huang xinyu1205

Ph.D. Student at Fudan University, homepage: xinyu1205.github.io

128 followers · 50 following

Fudan University
Shanghai, China
https://xinyu1205.github.io

Achievements

Stars

JiuhaiChen / BLIP3o

Python 573 14 Updated May 19, 2025

bytedance / UI-TARS-desktop

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 14,000 1,150 Updated May 19, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 940 26 Updated May 19, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,343 1,406 Updated May 16, 2025

IDEA-Research / DINO-X-API

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,049 40 Updated Apr 21, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 21,709 1,541 Updated Feb 6, 2025

TAU-VAILab / Spice-E

This repo contains the python code as well as the webpage html files for the Spice-E project from VAILab at TAU.

Jupyter Notebook 19 1 Updated Dec 9, 2024

River-Zhang / ICEdit

Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enou…

Python 1,391 80 Updated May 16, 2025

jamez-bondos / awesome-gpt4o-images

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…

JavaScript 5,952 536 Updated May 18, 2025

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,847 1,617 Updated Feb 29, 2024

baaivision / NOVA

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 504 14 Updated May 13, 2025

ModalMinds / MM-EUREKA

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 603 23 Updated May 19, 2025

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,551 87 Updated Sep 27, 2024

niki-amini-naieni / CountGD

Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.

Python 221 23 Updated Mar 15, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,189 5,987 Updated May 19, 2025

sail-sg / understand-r1-zero

Understanding R1-Zero-Like Training: A Critical Perspective

Python 934 43 Updated Apr 15, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,244 51 Updated May 11, 2025

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 45,780 7,971 Updated May 14, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,184 969 Updated May 19, 2025

turningpoint-ai / VisualThinker-R1-Zero

Explore the Multimodal “Aha Moment” on 2B Model

Python 585 20 Updated Mar 18, 2025

Liuziyu77 / Visual-RFT

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,661 77 Updated Apr 18, 2025

deepseek-ai / DualPipe

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,773 295 Updated Mar 10, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 4,958 306 Updated May 11, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,380 170 Updated May 15, 2025

jonathan-roberts1 / zerobench

Code, Data and Red Teaming for ZeroBench

46 3 Updated May 3, 2025

allenai / molmo

Code for the Molmo Vision-Language Model

Python 421 36 Updated Dec 12, 2024

lsdefine / simple_GRPO

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,049 86 Updated Apr 3, 2025

deepseek-ai / DeepSeek-R1

89,335 11,547 Updated Apr 9, 2025

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,564 265 Updated Apr 10, 2025

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,700 201 Updated May 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xinyu Huang xinyu1205

Achievements

Achievements

Block or report xinyu1205

Stars

JiuhaiChen / BLIP3o

bytedance / UI-TARS-desktop

ByteDance-Seed / Seed1.5-VL

QwenLM / Qwen3

IDEA-Research / DINO-X-API

black-forest-labs / flux

TAU-VAILab / Spice-E

River-Zhang / ICEdit

jamez-bondos / awesome-gpt4o-images

CompVis / latent-diffusion

baaivision / NOVA

ModalMinds / MM-EUREKA

LTH14 / mar

niki-amini-naieni / CountGD

hiyouga / LLaMA-Factory

sail-sg / understand-r1-zero

BytedTsinghua-SIA / DAPO

FoundationAgents / OpenManus

volcengine / verl

turningpoint-ai / VisualThinker-R1-Zero

Liuziyu77 / Visual-RFT

deepseek-ai / DualPipe

om-ai-lab / VLM-R1

hiyouga / EasyR1

jonathan-roberts1 / zerobench

allenai / molmo

lsdefine / simple_GRPO

deepseek-ai / DeepSeek-R1

hkust-nlp / simpleRL-reason

argilla-io / distilabel