Lists (1)
Sort Name ascending (A-Z)
Stars
Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"
A guidance language for controlling large language models.
Deep learning models for contextual multi-armed bandit setting
Train transformer language models with reinforcement learning.
This is a repository where I track and share the knowledge I acquire on my journey to reach my dream data position
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Vector (and Scalar) Quantization, in Pytorch
A workflow for reproducible and open scientific articles
Single-file pytorch implementation of hybrid-SAC
CREATE Environment for long-horizon physics-puzzle tasks with diverse tools
Toy meta-RL environments for testing algorithms implementations
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Massively parallel rigidbody physics simulation on accelerator hardware.
A playbook for systematically maximizing the performance of deep learning models.
A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
A Python toolbox for performing gradient-free optimization
corl-team / katakomba
Forked from tinkoff-ai/katakombaData-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
corl-team / CORL
Forked from tinkoff-ai/CORLHigh-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC