Highlights
- Pro
Stars
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
verl: Volcano Engine Reinforcement Learning for LLMs
Witness the aha moment of VLM with less than $3.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
The Open Cookbook for Top-Tier Code Large Language Model
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
A framework for few-shot evaluation of language models.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A high-throughput and memory-efficient inference and serving engine for LLMs
Large scale K-means and K-nn implementation on NVIDIA GPU / CUDA
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
A multi-lingual program repair benchmark set based on the Quixey Challenge
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
The interactive graphing library for Python ✨
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
DeepSeek Coder: Let the Code Write Itself
Integrate cutting-edge LLM technology quickly and easily into your apps
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset