BiSUNA

Train Binary Neural Networks from scratch using neuroevolution as the base technique (gradient-descent free), then apply the results to reinforcement learning environments tested in the OpenAI Gym.

This project extends the original SUNA paper with binary operations and connections. It can be compiled on macOS using Xcode or on Linux using the commands in script/RecreateEnvironment.sh; note that the Linux route requires the Zweifel library to be compiled first.

Contents

Agents:

  • Spectrum-diverse Unified Neuron Evolution Architecture (SUNA)
  • Binary SUNA (BiSUNA)

To compile either agent, toggle the directive "CONTINUOUS_PARAM" inside "parameters.h", as sketched below.
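
A minimal sketch of the toggle (the defined/commented convention here is an assumption; check the comments in parameters.h for the authoritative meaning):

// parameters.h
// Assumption: defined -> SUNA (continuous parameters);
// commented out -> BiSUNA (binary parameters)
#define CONTINUOUS_PARAM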

Environments (i.e., Problems):

  • Mountain Car
  • Double Cart Pole (with and without velocities)
  • Function Approximation
  • Multiplexer
  • Single Cart Pole
  • OpenAI Gym Interface

To run the gRPC server that connects to the OpenAI Gym environment, check the gym-uds-api project; the two work together. To work with a different environment, (un)comment lines 76 - 82 in main.cpp (see "Changing Environments" below).

First SUNA implementation

This project is an extension of the original SUNA implementation. Follow the link to recreate the original environment.

Install

If you are testing on a Linux environment, follow the commands in the file "script/RecreateEnvironment.sh"; it eases the process, which can be summarized as follows:

  • Get all dependencies
  • Install gRPC (Python and C++)
  • Compile gym-uds-api project (OpenAI Gym server interface)
  • Compile the Zweifel library
  • Compile BiSUNA project

Changing Environments (same as SUNA)

The environment can be changed in main.cpp, for example by commenting out the line where the Reinforcement_Environment is defined and uncommenting the line with:

Reinforcement_Environment* env = new Double_Cart_Pole(random);
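
More fully, a sketch of what the selection block looks like (class names assumed to match the environment list above; check main.cpp for the exact names):

// main.cpp (sketch): keep exactly one environment uncommented
// Reinforcement_Environment* env = new Mountain_Car(random);
Reinforcement_Environment* env = new Double_Cart_Pole(random);
// Reinforcement_Environment* env = new Single_Cart_Pole(random);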

If the environment should terminate when the maximum number of steps is reached, uncomment the following in parameters.h:

#define TERMINATE_IF_MAX_STEPS_REACHED

Do not forget to comment it out when surpassing the maximum number of steps is not a termination condition! For example, Mountain Car does not need it, while Double Cart Pole does.

Changing Parameters (same as SUNA)

Many parameters of both the environment and the agent can be changed by modifying definitions in parameters.h.
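
A minimal sketch of the kind of edits involved (the first two directives appear elsewhere in this README; MAX_STEPS is a hypothetical name used purely for illustration, check parameters.h for the actual definitions):

// parameters.h (sketch)
#define CONTINUOUS_PARAM                // agent selection, see Contents above
#define TERMINATE_IF_MAX_STEPS_REACHED  // end an episode at the step limit
// #define MAX_STEPS 1000               // hypothetical parameter, for illustration only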

Running Experiments (same as SUNA)

To run trials up to the maximum number defined in main.cpp, run:

./rl

To test the best individual, run:

./rl_live dna_best_individual

A series of trials can be run using the script mean_curve.sh.

Adding Agents or Problems (same as SUNA)

An agent needs to implement the interface in Reinforcement_Agent.h, while a problem needs to implement the interface in Reinforcement_Environment.h. There are simple examples of agents and problems inside the agents/ and environments/ directories, respectively. Most of the examples were built with general reinforcement learning in mind; however, they can be applied to supervised as well as unsupervised learning (e.g., treat the reward from the system as an error signal).
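
As a rough sketch of what a new problem might look like (the method names below are hypothetical illustrations, not the actual interface; consult Reinforcement_Environment.h and the examples in environments/ for the real signatures):

// My_Problem.h (sketch with hypothetical method names)
class My_Problem : public Reinforcement_Environment
{
public:
    // hypothetical: report observation/action sizes and initialize state
    void start(int& number_of_observations, int& number_of_action_vars);
    // hypothetical: apply an action, update observations, return the reward
    double step(double* action);
    // hypothetical: reset the episode and return the initial reward
    double restart();
};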

License

Apache License Version 2.0
