Starred repositories
LiveBench: A Challenging, Contamination-Free LLM Benchmark
Use PEFT or full-parameter training to run CPT/SFT/DPO/GRPO on 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
Production-ready platform for agentic workflow development.
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
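A minimal sketch of the "change a single line" idea: point the standard OpenAI Python client at a locally running Xinference server instead of api.openai.com. The endpoint URL and model name are assumptions for illustration.

```python
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:9997/v1",  # assumed local Xinference endpoint
    api_key="not-needed-locally",         # placeholder; a local server typically ignores it
)

response = client.chat.completions.create(
    model="qwen2.5-instruct",             # assumed name of a model launched in Xinference
    messages=[{"role": "user", "content": "Summarize what Xinference does in one sentence."}],
)
print(response.choices[0].message.content)
```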
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Summary of the specs of commonly used GPUs for training and inference of LLMs
No fortress, purely open ground. OpenManus is Coming.
FlashInfer: Kernel Library for LLM Serving
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
The "赋范大模型技术社区" (Fufan Large Model Technology Community) is an end-to-end guide tailored for large-model learners at every level, covering a wide range of models and skills including environment setup, local deployment, efficient fine-tuning, and hands-on development.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
A Framework of Small-scale Large Multimodal Models
A Simple Framework of Small-scale LMMs for Video Understanding
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
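A hedged sketch of loading a model with Unsloth's FastLanguageModel and attaching LoRA adapters for fine-tuning; the model id and hyperparameters are illustrative assumptions, not values recommended by the project.

```python
from unsloth import FastLanguageModel

# Load a base model in 4-bit; 4-bit loading is one of the ways Unsloth reduces VRAM use.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Llama-3.2-1B-Instruct",  # assumed model id for illustration
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters so only a small set of parameters is trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,                # LoRA rank (illustrative)
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
# The returned model can then be passed to a TRL SFTTrainer or a similar training loop.
```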
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Deep Learning 500 Questions explains commonly used probability, linear algebra, machine learning, deep learning, computer vision, and other hot topics in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any mistakes. To be continued... For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06
Question and Answer based on Anything.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
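A minimal sketch using the ollama Python client against a locally running Ollama server; it assumes the model has already been pulled (e.g. via `ollama pull llama3.3`).

```python
import ollama

reply = ollama.chat(
    model="llama3.3",  # any locally available model tag works here
    messages=[{"role": "user", "content": "Why is the sky blue?"}],
)
print(reply["message"]["content"])
```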
A collection of advanced RAG applications.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
A high-throughput and memory-efficient inference and serving engine for LLMs
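A short sketch of vLLM's offline batched generation API; the model id and sampling settings are assumptions for illustration.

```python
from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen2.5-0.5B-Instruct")        # assumed Hugging Face model id
params = SamplingParams(temperature=0.7, max_tokens=64)

# generate() accepts a batch of prompts and returns one RequestOutput per prompt.
outputs = llm.generate(["Explain PagedAttention in one sentence."], params)
for out in outputs:
    print(out.outputs[0].text)
```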
🐙 Guides, papers, lectures, notebooks, and resources for prompt engineering