Department of Computer Science Faculty Scholarship and Creative Works

Adaptive Policy Gradient in Multiagent Learning

Bikramjit Banerjee, Tulane UniversityFollow
Jing Peng, Montclair State UniversityFollow

Document Type

Paper

Publication Date

12-1-2003

Abstract

Inspired by the recent results in policy gradient learning in a general-sum game scenario, in the form of two algorithms, IGA and WoLF-IGA, we explore an alternative version of WoLF. We show that our new WoLF criterion (PDWoLF) is also accurate in 2 × 2 games, while being accurately computable even in more than 2-action games, unlike WoLF that relies on estimation. In particular, we show that this difference in accuracy in more than 2-action games translates to faster convergence (to Nash equilibrium policies in self-play) for PDWoLF in conjunction with the general Policy Hill Climbing algorithm. Interestingly, this expedience gets more pronounced with increasing learning rate ratio, for which we also delve into an explanation, We also show experimentally that learning faster with PDWoLF could also entail learning better policies earlier in self play. Finally we present the scalable version of PDWoLF and show that even in such domains requiring generalizations and approximations, PDWoLF could dominate WoLF in performance.

Montclair State University Digital Commons Citation

Banerjee, Bikramjit and Peng, Jing, "Adaptive Policy Gradient in Multiagent Learning" (2003). Department of Computer Science Faculty Scholarship and Creative Works. 83.
https://digitalcommons.montclair.edu/compusci-facpubs/83

This document is currently not available here.

COinS

Department of Computer Science Faculty Scholarship and Creative Works

Adaptive Policy Gradient in Multiagent Learning

Document Type

Publication Date

Abstract

Montclair State University Digital Commons Citation

Search

Browse

Author Corner

Links

Department of Computer Science Faculty Scholarship and Creative Works

Adaptive Policy Gradient in Multiagent Learning

Authors

Document Type

Publication Date

Abstract

Montclair State University Digital Commons Citation

Share

Search

Browse

Author Corner

Links

//<![CDATA[ document.write("<a href='mailto:" + "digitalcommons" + "@" + "mail.montclair.edu" + "'>" + "Contact Us" + "<\/a>") //]]>