-
DeepCTR Public
Forked from shenweichen/DeepCTREasy-to-use,Modular and Extendible package of deep-learning based CTR models .
Python Apache License 2.0 UpdatedApr 24, 2023 -
-
-
-
-
-
-
-
Entropy-Regularized-RL Public
soft q learning and soft actor critic
-
-
AlphaZero_Gomoku Public
Forked from junxiaosong/AlphaZero_GomokuAn implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Python MIT License UpdatedJul 9, 2018 -
-
scalable_agent Public
Forked from google-deepmind/scalable_agentA TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.
Python Apache License 2.0 UpdatedJun 28, 2018 -
baselines-rudder Public
Forked from ml-jku/baselines-rudderRUDDER for ATARI games with delayed rewards in OpenAI Baselines package
Python MIT License UpdatedJun 24, 2018 -
-
Faster-RCNN_TF Public
Forked from smallcorgi/Faster-RCNN_TFFaster-RCNN in Tensorflow
Python MIT License UpdatedMay 26, 2018 -
tensorflow-rl Public
Forked from steveKapturowski/tensorflow-rlImplementations of deep RL papers and random experimentation
Python Apache License 2.0 UpdatedApr 7, 2018 -
models Public
Forked from tensorflow/modelsModels and examples built with TensorFlow
Python Apache License 2.0 UpdatedNov 12, 2017