Stars
A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search…
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Muon optimizer: +>30% sample efficiency with <3% wallclock overhead
No fortress, purely open ground. OpenManus is Coming.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use.
AI computer use powered by open source LLMs and E2B Desktop Sandbox
Making large AI models cheaper, faster and more accessible
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
The Triton TensorRT-LLM Backend
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Fully open reproduction of DeepSeek-R1
The official implementation of Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
[EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!
Short examples illustrating AVX2 intrinsics for simple tasks.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model
zengyijie / memsniff
Forked from box/memsniffA tool for recording and displaying statistics on memcached traffic written in golang.