Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ml 80% Match Theoretical Research Paper Causal Inference Researchers,Machine Learning Theorists,Network Scientists,Statisticians 20 hours ago

Theoretical Guarantees for Causal Discovery on Large Random Graphs

graph-neural-networks › knowledge-graphs
📄 Abstract

Abstract: We investigate theoretical guarantees for the false-negative rate (FNR) -- the fraction of true causal edges whose orientation is not recovered, under single-variable random interventions and an $\epsilon$-interventional faithfulness assumption that accommodates latent confounding. For sparse Erd\H{o}s--R\'enyi directed acyclic graphs, where the edge probability scales as $p_e = \Theta(1/d)$, we show that the FNR concentrates around its mean at rate $O(\frac{\log d}{\sqrt d})$, implying that large deviations above the expected error become exponentially unlikely as dimensionality increases. This concentration ensures that derived upper bounds hold with high probability in large-scale settings. Extending the analysis to generalized Barab\'asi--Albert graphs reveals an even stronger phenomenon: when the degree exponent satisfies $\gamma > 3$, the deviation width scales as $O(d^{\beta - \frac{1}{2}})$ with $\beta = 1/(\gamma - 1) < \frac{1}{2}$, and hence vanishes in the limit. This demonstrates that realistic scale-free topologies intrinsically regularize causal discovery, reducing variability in orientation error. These finite-dimension results provide the first dimension-adaptive, faithfulness-robust guarantees for causal structure recovery, and challenge the intuition that high dimensionality and network heterogeneity necessarily hinder accurate discovery. Our simulation results corroborate these theoretical predictions, showing that the FNR indeed concentrates and often vanishes in practice as dimensionality grows.

Key Contributions

Provides theoretical guarantees for the false-negative rate in causal discovery on large random graphs (Erdos-Renyi and Barabasi-Albert) under an epsilon-interventional faithfulness assumption. It shows that the FNR concentrates around its mean with high probability in large-scale settings, ensuring derived upper bounds hold.

Business Value

Establishes foundational theoretical understanding for building more reliable causal inference systems, which can lead to better decision-making in complex systems like biological networks or social interactions.