Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 17,441 2,506 Updated Jun 6, 2025

agiresearch / A-mem

A-MEM: Agentic Memory for LLM Agents

Python 389 48 Updated May 18, 2025

chiphuyen / aie-book

[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)

Jupyter Notebook 4,483 555 Updated Feb 12, 2025

bytedance / MTVQA

MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingual text perception and comprehension capabilities across nine…

Python 59 3 Updated May 15, 2025

open-compass / VLMEvalKit

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 2,489 383 Updated Jun 8, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,130 1,182 Updated Jun 8, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,873 1,750 Updated Feb 26, 2025

sail-sg / oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 374 28 Updated Jun 3, 2025

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 5,475 593 Updated Jun 6, 2025

sail-sg / sailcompass

🧭 SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages

Python 13 1 Updated Feb 12, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 50,800 7,393 Updated Apr 20, 2025

karpathy / nn-zero-to-hero

Neural Networks: Zero to Hero

Jupyter Notebook 13,963 1,940 Updated Aug 18, 2024

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,850 779 Updated May 15, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

15,475 1,002 Updated Jun 6, 2025

sail-sg / sailcraft

🚢 Data Toolkit for Sailor Language Models

Python 91 10 Updated Feb 24, 2025

algorithmzuo / algorithm-journey

Zuo Chengyun's complete course on algorithms and data structures

Java 2,141 555 Updated Jun 8, 2025

Zhen-Tan-dmml / LLM4Annotation

568 23 Updated Jun 3, 2025

unslothai / unsloth

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 40,209 3,187 Updated Jun 6, 2025

prometheus-eval / prometheus-eval

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 950 54 Updated Apr 25, 2025

facebookresearch / llm-transparency-tool

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing the internal workings of Transformer-based language models. *Check out the demo at* https://huggingface.co/spaces/facebook/l…

Python 821 63 Updated Dec 3, 2024

huggingface / evaluation-guidebook

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 1,410 85 Updated Jan 7, 2025