REINFORCEMENT LEARNING FOR SNAKE

We've all played it. But did you make a computer play it?

Feel like snaking?

python3 snake.py

Feel like learning the snaking?

python3 qlearn.py Q.npy

Feel like watching the learned snaking?

python3 autoplay.py Q.npy

Snake it don't break it.

Model

There are approximately as many states in a five-by-five game of snake as there are ants on Earth -- about 100 000 000 000 000 000. [1] Problematic. How to deal? Well, we reduce the state space to the eight cells surrounding the ant. I mean snake. Then we use plain Q learning. Probably will try some other function estimator in the future. That will be on-policy though (i.e. uses the same policy to estimate value as it does to choose actions) as opposed to the Q learning which simply takes the next state's best action as the value.

[1] https://www.quora.com/How-many-ants-are-there-in-the-world

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
README.md		README.md
autoenc.py		autoenc.py
autoplay.py		autoplay.py
cdqn.py		cdqn.py
cnake.py		cnake.py
dqn.py		dqn.py
pg.py		pg.py
qlearn.py		qlearn.py
snake.py		snake.py
termin.py		termin.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

REINFORCEMENT LEARNING FOR SNAKE

Model

About

Releases

Packages

Languages

lericson/snake

Folders and files

Latest commit

History

Repository files navigation

REINFORCEMENT LEARNING FOR SNAKE

Model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages