YuzaChongyi / Starred · GitHub

AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.

Python 723 70 Updated May 21, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,158 57 Updated May 27, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,600 1,069 Updated May 28, 2025

A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.

56 Updated Jan 13, 2025

A PyTorch native platform for training generative AI models

Python 3,849 379 Updated May 28, 2025

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 1,505 103 Updated Mar 7, 2025

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 890 104 Updated May 16, 2025

Tools for merging pretrained large language models.

Python 5,750 553 Updated May 20, 2025

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Python 84 8 Updated Oct 23, 2024

[ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 353 6 Updated May 5, 2025

Official repository for CoMM Dataset

Python 35 1 Updated Dec 31, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

816 19 Updated Jul 31, 2024

Ongoing research training transformer models at scale

Python 12,442 2,797 Updated May 27, 2025

Stable Diffusion web UI

Python 152,932 28,444 Updated May 3, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,431 375 Updated May 28, 2025

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,365 465 Updated Nov 6, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,492 1,406 Updated May 27, 2025

A curated collection of ChatGPT learning resources, continuously updated...

4,148 388 Updated Apr 27, 2025

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,981 379 Updated Jun 28, 2024

A state-of-the-art open visual language model | multimodal pretrained model

Python 6,562 429 Updated May 29, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,376 1,508 Updated Apr 29, 2025

A series of large language models developed by Baichuan Intelligent Technology

Python 4,124 295 Updated Nov 8, 2024

DataComp: In search of the next generation of multimodal datasets

Python 710 60 Updated Apr 28, 2025

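《代码随想录》 (Code Thoughts) LeetCode problem-solving guide: a recommended order for 200 classic problems, over 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer confusing! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀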

Shell 56,375 12,006 Updated May 19, 2025

Stable diffusion webui based on diffusers.

Python 978 68 Updated Sep 29, 2023

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,970 238 Updated Sep 6, 2023

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A Chinese-English bilingual multimodal large model series based on CPM foundation models

Python 1,056 93 Updated Jun 13, 2024

✨✨Latest Advances on Multimodal Large Language Models

15,348 991 Updated May 15, 2025

A Chinese-English bilingual foundation large language model at the ten-billion-parameter scale

Python 2,434 191 Updated Jul 28, 2023

Research Trends in LLM-guided Multimodal Learning.

357 16 Updated Oct 17, 2023