Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ml 85% Match Research Paper Statisticians,Machine Learning Researchers,Data Scientists 3 weeks ago

Learning to sample fibers for goodness-of-fit testing

reinforcement-learning β€Ί robotics-rl
πŸ“„ Abstract

Abstract: We consider the problem of constructing exact goodness-of-fit tests for discrete exponential family models. This classical problem remains practically unsolved for many types of structured or sparse data, as it rests on a computationally difficult core task: to produce a reliable sample from lattice points in a high-dimensional polytope. We translate the problem into a Markov decision process and demonstrate a reinforcement learning approach for learning `good moves' for sampling. We illustrate the approach on data sets and models for which traditional MCMC samplers converge too slowly due to problem size, sparsity structure, and the requirement to use prohibitive non-linear algebra computations in the process. The differentiating factor is the use of scalable tools from \emph{linear} algebra in the context of theoretical guarantees provided by \emph{non-linear} algebra. Our algorithm is based on an actor-critic sampling scheme, with provable convergence. The discovered moves can be used to efficiently obtain an exchangeable sample, significantly cutting computational times with regards to statistical testing.
Authors (2)
Ivan Gvozdanović
Sonja Petrović
Submitted
May 22, 2024
arXiv Category
stat.ML
arXiv PDF

Key Contributions

Develops a reinforcement learning approach, specifically an actor-critic sampling scheme, to learn efficient sampling strategies for goodness-of-fit testing in discrete exponential families. This method addresses the computational difficulty of sampling from high-dimensional lattice points in polytopes, outperforming traditional MCMC samplers in challenging cases.

Business Value

Enables more reliable and efficient statistical inference for complex models, leading to better model validation and decision-making in fields relying on statistical testing.