GitHub - lumi-a/tictactoe-qlearning: A super-basic implementation of Q-learning for TicTacToe in Rust

A super-basic implementation of Q-learning for TicTacToe in Rust, so that I improve my understanding of the lecture-material of a Reinforcement-Learning-class.

Run with cargo run -r. This first trains the quality-function-policy, and then plays a tournament against a random player. Results will be around:

Q-Learning: 98.95%
Random:     00.78%
Draws:      00.27%

I used Rust for speed and type-safety, though the code does look clunky for something as simple as TicTacToe. If rewritten, it probably should have some abstract game-traits (having states, transition-functions, rewards, etc.).

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

lumi-a/tictactoe-qlearning

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages