MARL for patrolling agents

This repository provides an environment for a predator/prey game and explores two methods: a simple DQN architecture, and a true multi-agent architecture based on a policy-gradient approach, Multi-Agent Deep Deterministic Policy Gradient (MADDPG) (Lowe, R., Wu, Y., Tamar, A., Harb, J., Abbeel, O. P., & Mordatch, I. (2017). Multi-agent actor-critic for mixed cooperative-competitive environments. In Advances in Neural Information Processing Systems (pp. 6379-6390)).
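In MADDPG as described by Lowe et al., each agent's critic is trained on the observations and actions of all agents, while each actor sees only its own observation. The sketch below illustrates only how such a centralized critic input could be assembled. The function names and the one-hot encoding of discrete actions are assumptions made for illustration and are not taken from this repository's code.

```python
from typing import List, Sequence


def one_hot(index: int, size: int) -> List[float]:
    """Encode a discrete action index as a one-hot vector (assumed encoding)."""
    if not 0 <= index < size:
        raise ValueError("action index out of range")
    return [1.0 if i == index else 0.0 for i in range(size)]


def critic_input(
    observations: Sequence[Sequence[float]],
    actions: Sequence[int],
    n_actions: int = 5,
) -> List[float]:
    """Concatenate every agent's observation, then every agent's action.

    Hypothetical helper: a centralized critic would take this vector as input.
    """
    if len(observations) != len(actions):
        raise ValueError("need exactly one action per agent")
    features: List[float] = []
    for obs in observations:
        features.extend(float(v) for v in obs)
    for action in actions:
        features.extend(one_hot(action, n_actions))
    return features


# Two agents with toy 2-value observations, taking actions 1 and 4.
example = critic_input([[0.0, 1.0], [2.0, 3.0]], [1, 4])
```

Only the critic needs this joint vector during training; at execution time each actor can still act from its own observation.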

Some results

Results after 1400 episodes of training (animations): DDQN 2vs2, MADDPG 2vs2, DDQN 2v1 Magic Switch.

Environment

Blue dots represent prey and orange dots represent predators.

Action space

The action space is discrete. At each step, every agent chooses one of five actions: none, left, right, top, or bottom.
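As a minimal sketch of such a discrete action space, the five actions can be enumerated and mapped to displacements. The integer codes, the sign convention (top increases y), the unit step size, and the choice to leave z unchanged are all assumptions made for illustration; the repository's actual encoding may differ.

```python
from enum import IntEnum
from typing import Tuple


class Action(IntEnum):
    """The five actions named above; the integer codes are assumed."""
    NONE = 0
    LEFT = 1
    RIGHT = 2
    TOP = 3
    BOTTOM = 4


# Hypothetical (dx, dy) displacement per action.
MOVES = {
    Action.NONE: (0.0, 0.0),
    Action.LEFT: (-1.0, 0.0),
    Action.RIGHT: (1.0, 0.0),
    Action.TOP: (0.0, 1.0),
    Action.BOTTOM: (0.0, -1.0),
}


def apply_action(
    position: Tuple[float, float, float], action: int, step: float = 1.0
) -> Tuple[float, float, float]:
    """Return the new (x, y, z) after one action; z is left unchanged (assumption)."""
    dx, dy = MOVES[Action(action)]
    x, y, z = position
    return (x + step * dx, y + step * dy, z)


new_position = apply_action((0.0, 0.0, 0.0), Action.LEFT)
```

A network with one output per `Action` member can then select an action by index, which is what makes a DQN-style method directly applicable.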

State space

The state is fully observable: every agent knows it exactly.

The state consists of the 3D coordinates (x, y, z) of every agent.
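To make the layout concrete, the sketch below flattens the per-agent coordinates into a single state vector and hands the same vector to every agent. The function name, the ordering of agents, and the example coordinates are hypothetical and not taken from the repository.

```python
from typing import List, Sequence, Tuple

Position = Tuple[float, float, float]


def build_state(positions: Sequence[Position]) -> List[float]:
    """Flatten one (x, y, z) triple per agent into a single list."""
    state: List[float] = []
    for position in positions:
        if len(position) != 3:
            raise ValueError("expected one (x, y, z) triple per agent")
        state.extend(float(c) for c in position)
    return state


# Example with four agents (e.g. a 2vs2 game); the values are made up.
positions = [(0.1, 0.2, 0.0), (0.5, 0.5, 0.0), (0.9, 0.1, 0.0), (0.3, 0.7, 0.0)]
state = build_state(positions)

# Full observability: every agent receives the same state vector.
observations = [list(state) for _ in positions]
```

With N agents the state has 3N entries, so a 2vs2 game yields a vector of length 12.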

