Stars
A MCP (Model Context Protocol) server for interacting with dbt.
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
Notebooks and examples on how to onboard and use various features of Amazon Personalize
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
antimatter15 / alpaca.cpp
Forked from ggml-org/llama.cppLocally run an Instruction-Tuned Chat-Style LLM
Repo for my video course on prompt engineering
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.
[ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
[ICLR Workshop 2025] An official source code for paper "GuardReasoner: Towards Reasoning-based LLM Safeguards".
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
🤗 smolagents: a barebones library for agents that think in code.
LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.
Deploy your agentic worfklows to production
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…
Agentic components of the Llama Stack APIs
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
An end-to-end demo of using BigQuery continuous queries to address abandoned ecommerce shopping carts.
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
DSPy: The framework for programming—not prompting—language models
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)