RL Cambridge Kaggle Competition Project

This is a repository for members of the Cambridge (UK) Reinforcement Learning Meetup to create agents and compete in games of connect-X, an environment introduced by Kaggle. See for more details: https://www.kaggle.com/c/connectx

Setup

Environment

A conda environment is available which includes the Kaggle Connect-X RL environment and all modules that were available in the original competition:

Python standard libraries (python 3.6)
pytorch
numpy
scipy
gym
kaggle-environments==0.1.6 (more recent versions threw an error when importing kaggle_environments

To get the conda environment that allows you to use the kaggle environment, follow these steps. Check out README.md in conda_envs for more information on downloading anaconda before doing the below, if you don't have it already.

cd conda_env
conda create -f conda_env.yml
conda activate common
cd ..

Getting started

After setting up the environment, try:

source setup_python_path.sh
python check_run.py

The check_run.py script checks that your environment is functioning with the environment correctly, and gives key examples on how to interact with the python code and get started. It is largely a copy-paste of the codeblocks available in the notebook: https://www.kaggle.com/ajeffries/connectx-getting-started

Competing

As per the competition rules below, it's recommended you don't share your code if you're planning to compete in the Kaggle Competition "unless it's made available at no cost via the official competition forums" (see link below). I'm not totally sure that's the meaning of the rule, but that seems lik the safest interpretation.

https://www.kaggle.com/c/connectx/rules

Now the rules bit is over... We can use this environment to compete between our own agents! If you have an agent that is in submittable format, we can compete. Submittable format means to have an entirely encapsulated function called my_agent(observation, configuration). This is verified with scripts/agent_runner my_agent_file.py, or by jumping straight in and trying the below:

scripts/evaluate_agents.py agent1_file.py [agent2_file.py negamax random]

example_submission/ shows the example agent and the example code used to create the submittable file (reproduced in scripts/submit_agent.py), as-provided by Kaggle.

q_learning/ has an example written by the repo author that is runnable with the evaluate_agents.py script, but I have not yet verified that it will be runnable in a proper kaggle submission (the main uncertainty is whether the model state dict .pt file is submittable, as I have seen some examples where the state dict is written out as a variable within the function. I hope this won't be necessary, as it seems like a big pain!

Leaderboard

Negamax (Kaggle) vs q_learning (j-bernardi): 0.0 : 1.0

^ Should be easy to beat me!

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
conda_env		conda_env
example_submission		example_submission
q_learning		q_learning
scripts		scripts
utils		utils
.gitignore		.gitignore
LICENCE.txt		LICENCE.txt
README.md		README.md
check_run.py		check_run.py
set_python_path.sh		set_python_path.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL Cambridge Kaggle Competition Project

Setup

Environment

Getting started

Competing

Leaderboard

About

Releases

Packages

Languages

License

j-bernardi/rl-camb-kaggle-connect-x

Folders and files

Latest commit

History

Repository files navigation

RL Cambridge Kaggle Competition Project

Setup

Environment

Getting started

Competing

Leaderboard

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages