-
PufferLib Public
Forked from PufferAI/PufferLib
Simplifying reinforcement learning for complex game environments
C · MIT License · Updated May 30, 2025
-
sentiment_analysis Public
A 109-million-parameter generative language model trained on Yelp data for next-token prediction and review-rating prediction.
-
tictactoe-rl Public
Teaching reinforcement learning agents to master tic-tac-toe through self-play.