Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Finding equilibrium points in continuous minmax games has become a key
problem within machine learning, in part due to its connection to the training
of generative adversarial networks and reinforcement learning. Because of
existence and robustness issues, recent developments have shifted from pure
equilibria to focusing on mixed equilibrium points. In this work we consider a
method for finding mixed equilibria in two-layer zero-sum games based on
entropic regularisation, where the two competing strategies are represented by
two sets of interacting particles. We show that the sequence of empirical
measures of the particle system satisfies a large deviation principle as the
number of particles grows to infinity, and how this implies convergence of the
empirical measure and the associated Nikaid\^o-Isoda error, complementing
existing law of large numbers results.
Authors (2)
Viktor Nilsson
Pierre Nyquist