-
CS PhD Student
- Nairobi, Kenya
-
05:06
(UTC -12:00) - #
Starred repositories
This is the official implementation for the Retinoscopy project.
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
opendilab / LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
💯 Curated coding interview preparation materials for busy software engineers
Video+code lecture on building nanoGPT from scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Collection of leetcode company tag problems. Periodically updating.
Source code and dataset for 2022 Publication "Applying Deep Reinforcement Learning to the HP Model for Protein Structure Prediction"
Multi-Objective Molecule Generation using Interpretable Substructures (ICML 2020)
Python Implementation of Reinforcement Learning: An Introduction
Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)
Code for the paper "A Deep Generative Model for Fragment-Based Molecule Generation" (AISTATS 2020)
Regression Transformer (2023; Nature Machine Intelligence)
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)
[NeurIPS 2023] Online Fine-Tuning Game Solver
Understanding Deep Learning - Simon J.D. Prince
Platform for designing and evaluating Graph Neural Networks (GNN)
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
In Generative AI with Large Language Models (LLMs), you’ll learn the fundamentals of how generative AI works, and how to deploy it in real-world applications.
Hackers' Guide to Language Models
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
Collection of reinforcement learning algorithms
Protein structure datasets for machine learning.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.