8000 ocss884 (Junrong Lin) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View ocss884's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@camel-ai

Block or report ocss884

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Expert Parallelism Load Balancer

Python 1,200 191 Updated Mar 24, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,709 777 Updated May 23, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,698 1,439 Updated May 22, 2025

Allow torch tensor memory to be released and resumed later

C++ 33 5 Updated May 19, 2025

GitHub's official MCP Server

Go 14,469 930 Updated May 27, 2025

Distributed RL System for LLM Reasoning

Python 1,280 61 Updated May 26, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,112 381 Updated May 28, 2025

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,574 309 Updated Oct 19, 2024

Scalable RL solution for advanced reasoning of language models

Python 1,575 91 Updated Mar 18, 2025

My learning notes/codes for ML SYS.

Python 2,283 143 Updated May 27, 2025

Python tool for converting files and office documents to Markdown.

Python 58,014 2,986 Updated May 23, 2025

NVR with realtime local object detection for IP cameras

TypeScript 22,777 2,130 Updated May 27, 2025

Composable building blocks to build Llama Apps

Python 7,816 1,042 Updated May 27, 2025

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 99,553 27,690 Updated May 27, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,674 1,858 Updated May 27, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.

Python 4,050 278 Updated May 27, 2025

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 64,718 6,590 Updated May 27, 2025

The easiest way to run WireGuard VPN + Web-based Admin UI.

TypeScript 19,349 1,836 Updated May 27, 2025

A guidance language for controlling large language models.

Jupyter Notebook 20,242 1,108 Updated May 27, 2025

VPS 融合怪服务器测评项目 更推荐使用无环境依赖的Go版本 VPS Fusion Monster Server Test Script – More recommended to use the Go version with no environment dependencies: https://github.com/oneclickvirt/ecs

Shell 5,331 450 Updated Apr 12, 2025

⛅️ 精选的 Cloudflare 工具、开源项目、指南、博客和其他资源列表。/ ⛅️ A curated list of Cloudflare tools, open source projects, guides, blogs and other resources.

11,006 868 Updated May 16, 2025

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,309 75 Updated Dec 3, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,576 1,449 Updated May 27, 2025
Python 319 16 Updated Jul 16, 2024
Python 3,549 347 Updated May 13, 2025

FastAPI Best Practices and Conventions we used at our startup

11,922 869 Updated Apr 11, 2025

Ongoing research training transformer models at scale

Python 12,437 2,797 Updated May 27, 2025

💯 Curated coding interview preparation materials for busy software engineers

TypeScript 126,535 15,437 Updated Apr 28, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 48,263 7,632 Updated May 27, 2025

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org 32D3

Python 12,670 1,347 Updated May 27, 2025
Next
0