Reinforcement learning gridworld

Suppose that an agent is situated in the 4×3 environment shown in the figure below. Beginning in the start state, it must choose an action at each time step. The interaction with the environment terminates when the agent reaches one of the goal states, marked +1 or –1. Aavailable actions are Up, Down, Left, and Right. We assume, this gridworld is deterministic, meaning the agent will go where it intends to go. For example, when the agent decides to take action up at (0, 1), it will land in (0, 2) rather than elsewhere.

We apply reinforcement learning to find best traveling path for the agent.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Gridworld_Reinfocrement_Learning.ipynb		Gridworld_Reinfocrement_Learning.ipynb
README.md		README.md
gridworld.png		gridworld.png
gridworld_coded.png		gridworld_coded.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement learning gridworld

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Reinforcement learning gridworld

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages