8000 GitHub - scascin0/alphazero: A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing too much.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing too much.

Notifications You must be signed in to change notification settings

scascin0/alphazero

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

AlphaZero

This repo aims to provide a working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing too much.

The connect2.py example gives a rough idea of what's needed to train an agent. Being this basic, the game does not require everything a complete AlphaZero training loop does but the main things should be there.

The main reference for this implementation can be found at this link, which is the preprint of the Nature paper published by DeepMind in 2017.

Requirements:

  • torch
  • matplotlib (for connect2.py example)
  • tqdm (for progress bar)

About

A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing too much.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

0