8000 HYPJUDY (Yupan Huang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View HYPJUDY's full-sized avatar
🌌
(๑>◡<๑)
🌌
(๑>◡<๑)

Highlights

  • Pro

Organizations

@researchmm

Block or report HYPJUDY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official inference repo for FLUX.1 models

Python 22,524 1,600 Updated Jun 5, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,764 206 Updated Jun 23, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,827 1,616 Updated Jun 23, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,417 189 Updated Jun 4, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,926 1,489 Updated Apr 24, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 26,745 2,603 Updated Apr 30, 2025

A fork to add multimodal model training to open-r1

Python 1,309 63 Updated Feb 8, 2025

Fully open reproduction of DeepSeek-R1

Python 24,859 2,305 Updated Jun 23, 2025

[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation

Python 735 21 Updated May 23, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,541 274 Updated Jun 19, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 50,505 8,261 Updated Jun 23, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 83,882 61,093 Updated Jun 23, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 25,329 2,280 Updated Jun 22, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,606 161 Updated May 21, 2025

Playing Pokemon Red with Reinforcement Learning

Jupyter Notebook 7,432 714 Updated May 30, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,637 78 Updated Feb 11, 2025

Get your documents ready for gen AI

Python 32,570 2,105 Updated Jun 23, 2025

This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.

381 15 Updated Jun 4, 2025

Official inference framework for 1-bit LLMs

Python 20,262 1,518 Updated Jun 3, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,017 113 Updated Jul 29, 2024

VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning

Python 116 13 Updated Sep 17, 2024

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 712 534 Updated Jul 4, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,475 714 Updated Jun 20, 2025

Deezer source separation library including pretrained models.

Python 27,011 2,961 Updated Apr 2, 2025

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Jupyter Notebook 436 53 Updated Jun 11, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,673 671 Updated Feb 10, 2025

Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pai 4C75 rs data (about 600k including English/Chinese)

Python 84 4 Updated Sep 21, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 4,651 428 Updated Apr 22, 2025

Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.

Python 118 19 Updated Mar 18, 2023

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,750 354 Updated Dec 7, 2024
Next
0