8000 XavierCHEN34 (Xi CHEN) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View XavierCHEN34's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report XavierCHEN34

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 2,081 197 Updated Apr 28, 2025

Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning

Python 167 5 Updated Apr 19, 2025

[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

Python 181 5 Updated Sep 26, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,184 5,987 Updated May 19, 2025

Gemma open-weight LLM library, from Google DeepMind

Jupyter Notebook 3,278 445 Updated May 19, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,377 363 Updated May 17, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 384 18 Updated May 17, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,283 161 Updated May 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,180 967 Updated May 19, 2025

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 758 44 Updated Aug 5, 2024

MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants

Python 33 4 Updated May 13, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,999 113 Updated Jul 29, 2024

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,457 1,300 Updated May 17, 2025

Official Repo for Open-Reasoner-Zero

Python 1,920 98 Updated Apr 8, 2025

R1-onevision, a visual language model capable of deep CoT reasoning.

Python 518 14 Updated Apr 13, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,957 306 Updated May 11, 2025

Fully open reproduction of DeepSeek-R1

Python 24,462 2,252 Updated May 19, 2025

Code release for "LLMs can see and hear without any training"

Python 436 36 Updated May 8, 2025

[CVPR 2025 Highlight] Official implementation of "MangaNinja: Line Art Colorization with Precise Reference Following"

Python 602 48 Updated Mar 2, 2025

The ultimate training toolkit for finetuning diffusion models

Python 4,723 537 Updated May 17, 2025
Python 74 7 Updated Sep 29, 2024

ReNeg: Learning Negative Embedding with Reward Guidance

Python 32 Updated Jan 2, 2025

Official implementation of "DepthLab: From Partial to Complete"

Python 482 28 Updated Feb 14, 2025
Python 2,049 156 Updated Nov 8, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 10,045 877 Updated Apr 27, 2025

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation

143 4 Updated Mar 8, 2025

The best OSS video generation models

Python 3,159 350 Updated Jan 8, 2025

Inference script for Oasis 500M

Python 1,826 157 Updated Nov 8, 2024
Python 9 Updated Apr 1, 2025
Next
0