Apply REINFORCE algorithm
-
Updated
Feb 28, 2023 - Jupyter Notebook
Apply REINFORCE algorithm
Tensorflow implementation of Proximal Policy Optimization (Reinforcement Learning) and its common optimizations. Features Tensorboard integration and lots of sample runs on custom, classical and robotics oriented environments.
Apply REINFORCE algorithm
Add a description, image, and links to the policy-gradient-algorithm topic page so that developers can more easily learn about it.
To associate your repository with the policy-gradient-algorithm topic, visit your repo's landing page and select "manage topics."