Department of Computer Science Faculty Scholarship and Creative Works

Fast Concurrent Reinforcement Learners

Bikramjit Banerjee, University of Tulsa
Sandip Sen, University of Tulsa
Jing Peng, Montclair State University

Document Type

Conference Proceeding

Publication Date

12-1-2001

Abstract

When several agents learn concurrently, the payoff each agent receives depends on the behavior of the others. As the other agents learn, the reward of any one agent becomes non-stationary, which makes learning in multiagent systems more difficult than single-agent learning. A few methods, however, are known to guarantee convergence to equilibrium in the limit in such systems. In this paper we experimentally study one such technique, minimax-Q, in a competitive domain and prove its equivalence with another well-known method for competitive domains. We study the rate of convergence of minimax-Q and investigate possible ways of increasing it. We also present a variant of the algorithm, minimax-SARSA, and prove its convergence to minimax-Q values under appropriate conditions. Finally, we show that this new algorithm performs better than simple minimax-Q in a general-sum domain as well.

Montclair State University Digital Commons Citation

Banerjee, Bikramjit; Sen, Sandip; and Peng, Jing, "Fast Concurrent Reinforcement Learners" (2001). Department of Computer Science Faculty Scholarship and Creative Works. 283.
https://digitalcommons.montclair.edu/compusci-facpubs/283

This document is currently not available here.
