vs
Mode
Human v Computer
Human v Human
Algorithm
Minimax
Q-Learning
The algorithm used is
Q-learning
.
The agent has trained for 500000 games to populate the Q-Table.
The Q-table dictates the agent function of the agent.
Depth
1
2
3
4
Unlimited
Starting Player
Human
Computer
New Game
View on Github
Press "New Game" after changing options
Press to get a hint
Hint