Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 85% Match theoretical research paper theoretical ML researchers,deep learning theorists,researchers interested in interpretability 2 weeks ago

From Universal Approximation Theorem to Tropical Geometry of Multi-Layer Perceptrons

large-language-models › model-architecture
📄 Abstract

Abstract: We revisit the Universal Approximation Theorem(UAT) through the lens of the tropical geometry of neural networks and introduce a constructive, geometry-aware initialization for sigmoidal multi-layer perceptrons (MLPs). Tropical geometry shows that Rectified Linear Unit (ReLU) networks admit decision functions with a combinatorial structure often described as a tropical rational, namely a difference of tropical polynomials. Focusing on planar binary classification, we design purely sigmoidal MLPs that adhere to the finite-sum format of UAT: a finite linear combination of shifted and scaled sigmoids of affine functions. The resulting models yield decision boundaries that already align with prescribed shapes at initialization and can be refined by standard training if desired. This provides a practical bridge between the tropical perspective and smooth MLPs, enabling interpretable, shape-driven initialization without resorting to ReLU architectures. We focus on the construction and empirical demonstrations in two dimensions; theoretical analysis and higher-dimensional extensions are left for future work.
Authors (2)
Yi-Shan Chu
Yueh-Cheng Kuo
Submitted
October 16, 2025
arXiv Category
stat.ML
arXiv PDF

Key Contributions

Revisits the Universal Approximation Theorem using tropical geometry to introduce a geometry-aware initialization for sigmoidal MLPs. This approach allows for the construction of MLPs whose decision boundaries align with prescribed shapes at initialization, offering an interpretable alternative to ReLU networks without sacrificing approximation power.

Business Value

Provides a deeper theoretical understanding of neural networks, potentially leading to more stable and interpretable models in applications where decision boundary shapes are critical.