8000 Belyenochi (JasonZhu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Belyenochi's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Belyenochi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,893 333 Updated Jan 8, 2025

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,214 131 Updated Jun 26, 2025

A curated list of awesome papers on dataset distillation and related applications.

HTML 1,699 152 Updated Jun 27, 2025

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 46,508 7,054 Updated Jun 26, 2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,026 98 Updated Jun 12, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 53,027 6,493 Updated Jun 26, 2025

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

Go 1,005 92 Updated Jun 20, 2025
Python 567 39 Updated Apr 12, 2025

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

Python 222 26 Updated Mar 13, 2025

An Open Source Toolkit For LLM Distillation

Python 657 80 Updated Jun 1, 2025

LangChain for Go, the easiest way to write LLM-based programs in Go

Go 6,983 870 Updated Jun 21, 2025

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,301 311 Updated Jun 27, 2025
JavaScript 116 23 Updated May 15, 2025

AutoMQ is a stateless/diskless Kafka on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.

Java 6,740 463 Updated Jun 26, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,467 290 Updated Jun 25, 2025

A generative AI extension for JupyterLab

Python 3,668 416 Updated Jun 26, 2025

Ollama Python library

Python 7,903 725 Updated Jun 18, 2025

NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes

Go 2,175 355 Updated Jun 26, 2025

An open protocol enabling communication and interoperability between opaque agentic applications.

TypeScript 17,724 1,757 Updated Jun 26, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,408 312 Updated May 13, 2025

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 112,443 32,668 Updated Jun 26, 2025

Development repository for the Triton language and compiler

MLIR 15,962 2,069 Updated Jun 27, 2025

CUDA Learning guide

Cuda 398 46 Updated Jun 20, 2024

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,349 447 Updated Jun 27, 2025

AI 基础知识 - GPU 架构、CUDA 编程以及大模型基础知识

Jupyter Notebook 146 14 Updated Jun 24, 2025

A container runtime written in Rust

Rust 6,768 370 Updated Jun 24, 2025

Hyperlight is a lightweight Virtual Machine Manager (VMM) designed to be embedded within applications. It enables safe execution of untrusted code within micro virtual machines with very low latenc…

Rust 3,685 133 Updated Jun 26, 2025

A Go implementation of the Model Context Protocol (MCP), enabling seamless integration between LLM applications and external data sources and tools.

Go 6,068 595 Updated Jun 21, 2025

Heterogeneous AI Computing Virtualization Middleware(Project under CNCF)

Go 1,828 320 Updated Jun 26, 2025

WasmEdge is a lightweight, high-performance, and extensible WebAssembly runtime for cloud native, edge, and decentralized applications. It powers serverless apps, embedded functions, microservices,…

C++ 9,549 861 Updated Jun 25, 2025
Next
0