Policy iteration by StoneT2000 · Pull Request #8 · StoneT2000/rl-ts · GitHub

Policy iteration #8


Merged: 5 commits, May 23, 2021

Conversation

StoneT2000 (Owner)

closes half of #6

StoneT2000 merged commit c6948c6 into 0.0.5 on May 23, 2021
StoneT2000 deleted the policy-iteration branch on May 23, 2021, 07:54
StoneT2000 added a commit that referenced this pull request on May 28, 2021
* Policy iteration (#8)

* init

* Update policyIteration.ts

* Finish policy iteration, add new types

* cleanup

* codecov

* Circleci (#9)

* test

* Update config.yml

* Update config.yml

* Update config.yml

* Update config.yml

* new

* json

* codecov?

* ??

* Update config.yml

* Update config.yml

* lint

* @typescript-eslint/parser

* New functionality for environments to make it easier for tabular RL algorithms and in general DP approaches

* reactivate old tests, change generic type orderings to SAR (State Action Reward), use new features of envs in policy iteration

* Tfjs (#12)

* act

* prettier and dqn init

* tensors and ndarrays

* np tests

* np push

* discount cumulative sum

* Update np.ts

* rename to primitive as we use ndarray now

* box space and some helpers

* Update box.ts

* using numjs, some utilities for random numbers, dqn init

* fix box....

* Discrete

* init cartpole

* dqn test

* working dqn

* Update index.ts

* lint and prettier and test

* Update index.ts

* sanity test for dqn

* increases timeout val

* fixed cumsum function

* improve exports

* Testing using library

* fix

* prettier

* tests

* Update box.ts

* Canvas render (#16)

* Update np.spec.ts

* init

* tests, readme

* cartpole gif

* Update cartpole.gif

* stf

* Update README.md