Stars
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
NVIDIA Resiliency Extension is a Python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…
wolfecameron / nanoMoE
Forked from karpathy/nanoGPT
An extension of the nanoGPT repository for training small MOE models.
A curated list of CTF frameworks, libraries, resources, and software
Algorithm and data structure articles for https://cp-algorithms.com (based on http://e-maxx.ru)
Democratizing Reinforcement Learning for LLMs
OI / ACM-ICPC essays and learning materials
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Welcome to How to CAD Almost Anything! Fusion 360 edition. In this repository, you'll find the workshop's slides, recordings and Fusion 360 files.
What would you do with 1000 H100s...
This repository contains the Hugging Face Agents Course.
Structured state space sequence models
Some nicely designed problems that I solved in Codeforces contests.
Ongoing research training transformer models at scale
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
DeepSeek LLM: Let there be answers
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Papers & presentation materials from Hugging Face's internal science day
An open-source NLP research library, built on PyTorch.
Assignment solutions for CS224n: Natural Language Processing with Deep Learning (Winter 2019)
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)