Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination
by Shauharda Khadka, Somdeb Majumdar, Santiago Miret, Stephen McAleer, and Kagan Tumer
2019
Abstract
A key challenge for multiagent reinforcement learning (RL) is the design of agent-specific, local rewards that are aligned with sparse global objectives. In this paper, we introduce Multiagent Evolutionary RL (MERL), a hybrid algorithm that does not require an explicit alignment between local and global objectives. MERL uses fast, policy-gradient-based learning for each agent by utilizing its dense local rewards. Concurrently, an evolutionary algorithm is used to recruit agents into a team by directly optimizing the sparser global objective. We explore problems that require coupling (a minimum number of agents coordinating for success), where the degree of coupling is not known to the agents. We demonstrate that MERL's integrated approach is more sample-efficient and retains performance better with increasing coupling orders compared to MADDPG, the state-of-the-art policy-gradient algorithm for multiagent coordination.
Archived Files and Locations
application/pdf 1.8 MB
arxiv.org (repository) | web.archive.org (webarchive)
arXiv: 1906.07315v1