Stars
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and i…
Typescript tool to translate any video with AI. Voice cloning, subtitles and LipSync. 35 languages available. Used by https://www.voicecheap.ai/.
Awesome Reasoning LLM Tutorial/Survey/Guide
Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
Large Language Model (LLM) Systems Paper List
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
aider is AI pair programming in your terminal
leimingyu / fast.cu
Forked from pranjalssh/fast.cuFastest kernels written from scratch
🙌 OpenHands: Code Less, Make More
A plugin for Jupyter Notebook to run CUDA C/C++ code
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
Code for the paper "Evaluating Large Language Models Trained on Code"
PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)
CUDA integration for Python, plus shiny features
NVIDIA curated collection of educational resources related to general purpose GPU programming.
Efficient implementations of Merge Sort and Bitonic Sort algorithms using CUDA for GPU parallel processing, resulting in accelerated sorting of large arrays. Includes both CPU and GPU versions, alo…
A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Processors”). Features six capstone projects to solidify GPU par…
minimal GRPO implementation from scratch
Universal and Transferable Attacks on Aligned Language Models