Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Despite the remarkable success, recent reconstruction-based anomaly detection
(AD) methods via diffusion modeling still involve fine-grained noise-strength
tuning and computationally expensive multi-step denoising, leading to a
fundamental tension between fidelity and efficiency. In this paper, we propose
a novel inversion-based AD approach - detection via noising in latent space -
which circumvents explicit reconstruction. Importantly, we contend that the
limitations in prior reconstruction-based methods originate from the prevailing
detection via denoising in RGB space paradigm. To address this, we model AD
under a reconstruction-free formulation, which directly infers the final latent
variable corresponding to the input image via DDIM inversion, and then measures
the deviation based on the known prior distribution for anomaly scoring.
Specifically, in approximating the original probability flow ODE using the
Euler method, we only enforce very few inversion steps to noise the clean image
to pursue inference efficiency. As the added noise is adaptively derived with
the learned diffusion model, the original features for the clean testing image
can still be leveraged to yield high detection accuracy. We perform extensive
experiments and detailed analysis across three widely used image AD datasets
under the unsupervised unified setting to demonstrate the effectiveness of our
model, regarding state-of-the-art AD performance, and about 2 times inference
time speedup without diffusion distillation.