flint-xf-fan

Flint Xiaofeng Fan flint-xf-fan

PhD in AI from NUS; A*STAR CIS Scholar, class 2019; A*STAR International Fellow, class 2024; Postdoc at ETH Zurich.

37 followers · 11 following

Rethinking-Privacy-in-RL Public

[IJCNN 2025 Position Paper]: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs

privacy reinforcement-learning differential-privacy privacy-protection privacy-preserving-machine-learning privacy-in-reinforcement-learning

Updated Apr 27, 2025
Federated-RLHF Public

[AAMAS 2025] Privacy-preserving and personalized RLHF, with convergence guarantees. The code contains experiments for training multiple instances of GPT-2 for personalized sentiment-aligned text ge…

rft federated-reinforcement-learning llms rlhf reinforcement-learning-from-human-feedback fedrl personalized-rlhf

Python 6 Updated Apr 16, 2025
Byzantine-Federated-RL Public

[NeurIPS2021] Federated Reinforcement Learning with Theoretical Guarantees. The repo contains code and experiments for our Federated Policy Gradient with Byzantine Resilience framework for improvin…

reinforcement-learning policy-gradient federated-learning federated-reinforcement-learning sample-efficient-rl byzantine-reinforcement-learning fedrl

Python 95 12 Updated Apr 16, 2025
Cloud-Free-Tier-Comparison Public
Forked from cloudcommunity/Cloud-Free-Tier-Comparison

Comparing the free tier offers of the major cloud providers like AWS, Azure, GCP, Oracle etc.

MIT License Updated Oct 31, 2024
flower Public
Forked from adap/flower

Flower: A Friendly Federated Learning Framework

Python 1 Apache License 2.0 Updated Sep 16, 2023
FedHQL Public

Project page for paper FedHQL: Federated Heterogeneous Q-Learning, AAMAS 2023 (extended abstract)

Updated Jun 1, 2023
sacred Public
Forked from oxwhirl/sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python MIT License Updated May 31, 2022
pymarl Public
Forked from oxwhirl/pymarl

Python Multi-Agent Reinforcement Learning framework

Python Apache License 2.0 Updated Mar 11, 2022
pysekiro_with_RL Public
Forked from analoganddigital/pysekiro_with_RL

Python GNU General Public License v3.0 Updated Mar 21, 2021
mimic-iv Public
Forked from MIT-LCP/mimic-iv

Code and discussion around the MIMIC-IV database

Python MIT License Updated Dec 17, 2020
meta_irl Public

Clone repo of https://github.com/ermongroup/MetaIRL

Python 1 Updated Nov 19, 2020
tensor2tensor Public
Forked from tensorflow/tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python Apache License 2.0 Updated Aug 4, 2020
MIMIC_RL_COACH Public
Forked from asjad99/MIMIC_RL_COACH

Applications of Batch Reinforcement Learning to MIMIC dataset

HTML Updated Jul 26, 2020
RLMA Public
Forked from Miaowshroom/RLMA

Python Updated Apr 18, 2020
svrg_for_policy_evaluation_with_fewer_gradients Public
Forked from zilunpeng/svrg_for_policy_evaluation_with_fewer_gradients

Python MIT License Updated Feb 17, 2020
world-models Public
Forked from ctallec/world-models

Reimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch

Python MIT License Updated Jan 23, 2020
Deep-Reinforcement-Learning-Algorithms-with-PyTorch Public
Forked from p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Python Updated Oct 17, 2019
rl-baselines-zoo Public
Forked from araffin/rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Python MIT License Updated Oct 16, 2019
stable-baselines Public
Forked from hill-a/stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python MIT License Updated Oct 15, 2019
Rainbow Public
Forked from Kaixhin/Rainbow

Rainbow: Combining Improvements in Deep Reinforcement Learning

Python MIT License Updated Sep 16, 2019
WeChatExtension-ForMac Public
Forked from MustangYM/WeChatExtension-ForMac

Feature extensions for WeChat on Mac

Objective-C MIT License Updated Aug 12, 2019
pg_travel Public
Forked from reinforcement-learning-kr/pg_travel

Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)

Python MIT License Updated Aug 1, 2019
reinforcement-learning-an-introduction Public
Forked from ShangtongZhang/reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python MIT License Updated Jul 12, 2019
probabilistic-federated-neural-matching Public
Forked from IBM/probabilistic-federated-neural-matching

Bayesian Nonparametric Federated Learning of Neural Networks

Python Apache License 2.0 Updated May 29, 2019
mimic-preprocess Public
Forked from zzzace2000/mimic-preprocess

MIMIC preprocessing for the ICML paper "Dynamic Measurement Scheduling for Event Forecasting Using Deep RL"

Jupyter Notebook MIT License Updated May 16, 2019
blackbox-fusion Public
Forked from hqminh/blackbox-fusion

Experimental code for ICML 2019 paper Collective Model Fusion for Multiple Black-Box Experts

Python Updated May 13, 2019
cups-rl Public
Forked from TheMTank/cups-rl

Customisable Unified Physical Simulations (CUPS) for Reinforcement Learning. Experiments run on the ai2thor environment (http://ai2thor.allenai.org/) e.g. using A3C, RainbowDQN and A3C_GA (Gated At…

Python MIT License Updated May 4, 2019
Vessel-Segmentation Public

Jupyter Notebook Updated Jan 23, 2019
MLDA-Workshop Public

ML/DL training workshops for EEE undergrads

Jupyter Notebook 13 9 Updated Jan 16, 2019
SDCND-Behavioral-Cloning Public

Python Updated Jan 15, 2019
