-
A*STAR
- Singapore
- https://flint-xf-fan.github.io/
- in/flintxffan
Highlights
- Pro
-
Rethinking-Privacy-in-RL Public
[IJCNN 2025 Position Paper]: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs
privacy reinforcement-learning differential-privacy privacy-protection privacy-preserving-machine-learning privacy-in-reinforcement-learningUpdatedApr 27, 2025 -
Federated-RLHF Public
[AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple instances of GPT-2 for personalized sentiment aligned text ge…
-
Byzantine-Federated-RL Public
[NeurIPS2021] Federated Reinforcement Learning with Theoretical Guarantees. The repo contains code and experiments for our Federated Policy Gradient with Byzantine Resilience framework for improvin…
-
Cloud-Free-Tier-Comparison Public
Forked from cloudcommunity/Cloud-Free-Tier-ComparisonComparing the free tier offers of the major cloud providers like AWS, Azure, GCP, Oracle etc.
MIT License UpdatedOct 31, 2024 -
flower Public
Forked from adap/flowerFlower: A Friendly Federated Learning Framework
-
FedHQL Public
Project page for paper FedHQL: Federated Heterogeneous Q-Learning, AAMAS 2023 (extended abstract)
UpdatedJun 1, 2023 -
sacred Public
Forked from oxwhirl/sacredSacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
Python MIT License UpdatedMay 31, 2022 -
pymarl Public
Forked from oxwhirl/pymarlPython Multi-Agent Reinforcement Learning framework
Python Apache License 2.0 UpdatedMar 11, 2022 -
pysekiro_with_RL Public
Forked from analoganddigital/pysekiro_with_RLPython GNU General Public License v3.0 UpdatedMar 21, 2021 -
mimic-iv Public
Forked from MIT-LCP/mimic-ivCode and discussion around the MIMIC-IV database
Python MIT License UpdatedDec 17, 2020 -
-
tensor2tensor Public
Forked from tensorflow/tensor2tensorLibrary of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Python Apache License 2.0 UpdatedAug 4, 2020 -
MIMIC_RL_COACH Public
Forked from asjad99/MIMIC_RL_COACHApplications of Batch Reinforcement Learning to MIMIC dataset
HTML UpdatedJul 26, 2020 -
-
svrg_for_policy_evaluation_with_fewer_gradients Public
Forked from zilunpeng/svrg_for_policy_evaluation_with_fewer_gradientsPython MIT License UpdatedFeb 17, 2020 -
world-models Public
Forked from ctallec/world-modelsReimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch
Python MIT License UpdatedJan 23, 2020 -
Deep-Reinforcement-Learning-Algorithms-with-PyTorch Public
Forked from p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorchPyTorch implementations of deep reinforcement learning algorithms and environments
Python UpdatedOct 17, 2019 -
rl-baselines-zoo Public
Forked from araffin/rl-baselines-zooA collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Python MIT License UpdatedOct 16, 2019 -
stable-baselines Public
Forked from hill-a/stable-baselinesA fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Python MIT License UpdatedOct 15, 2019 -
Rainbow Public
Forked from Kaixhin/RainbowRainbow: Combining Improvements in Deep Reinforcement Learning
Python MIT License UpdatedSep 16, 2019 -
WeChatExtension-ForMac Public
Forked from MustangYM/WeChatExtension-ForMacMac版微信的功能拓展
Objective-C MIT License UpdatedAug 12, 2019 -
pg_travel Public
Forked from reinforcement-learning-kr/pg_travelPolicy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
Python MIT License UpdatedAug 1, 2019 -
reinforcement-learning-an-introduction Public
Forked from ShangtongZhang/reinforcement-learning-an-introductionPython Implementation of Reinforcement Learning: An Introduction
Python MIT License UpdatedJul 12, 2019 -
probabilistic-federated-neural-matching Public
Forked from IBM/probabilistic-federated-neural-matchingBayesian Nonparametric Federated Learning of Neural Networks
Python Apache License 2.0 UpdatedMay 29, 2019 -
mimic-preprocess Public
Forked from zzzace2000/mimic-preprocessMIMIC preprocessing for the ICML paper "Dynamic Measurement Scheduling for Event Forecasting Using Deep RL"
Jupyter Notebook MIT License UpdatedMay 16, 2019 -
blackbox-fusion Public
Forked from hqminh/blackbox-fusionExperimental code for ICML 2019 paper Collective Model Fusion for Multiple Black-Box Experts
Python UpdatedMay 13, 2019 -
cups-rl Public
Forked from TheMTank/cups-rlCustomisable Unified Physical Simulations (CUPS) for Reinforcement Learning. Experiments run on the ai2thor environment (http://ai2thor.allenai.org/) e.g. using A3C, RainbowDQN and A3C_GA (Gated At…
Python MIT License UpdatedMay 4, 2019 -
-
MLDA-Workshop Public
ML/DL training workshops for EEE undergrads
-