Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ahsan Mahmood

Anomaly Detection via Gumbel Noise Score Matching

Apr 06, 2023
Ahsan Mahmood, Junier Oliva, Martin Styner

Figure 1 for Anomaly Detection via Gumbel Noise Score Matching

Figure 2 for Anomaly Detection via Gumbel Noise Score Matching

Figure 3 for Anomaly Detection via Gumbel Noise Score Matching

Figure 4 for Anomaly Detection via Gumbel Noise Score Matching

We propose Gumbel Noise Score Matching (GNSM), a novel unsupervised method to detect anomalies in categorical data. GNSM accomplishes this by estimating the scores, i.e. the gradients of log likelihoods w.r.t.~inputs, of continuously relaxed categorical distributions. We test our method on a suite of anomaly detection tabular datasets. GNSM achieves a consistently high performance across all experiments. We further demonstrate the flexibility of GNSM by applying it to image data where the model is tasked to detect poor segmentation predictions. Images ranked anomalous by GNSM show clear segmentation failures, with the outputs of GNSM strongly correlating with segmentation metrics computed on ground-truth. We outline the score matching training objective utilized by GNSM and provide an open-source implementation of our work.

Via

Access Paper or Ask Questions

Multiscale Score Matching for Out-of-Distribution Detection

Oct 27, 2020
Ahsan Mahmood, Junier Oliva, Martin Styner

Figure 1 for Multiscale Score Matching for Out-of-Distribution Detection

Figure 2 for Multiscale Score Matching for Out-of-Distribution Detection

Figure 3 for Multiscale Score Matching for Out-of-Distribution Detection

Figure 4 for Multiscale Score Matching for Out-of-Distribution Detection

We present a new methodology for detecting out-of-distribution (OOD) images by utilizing norms of the score estimates at multiple noise scales. A score is defined to be the gradient of the log density with respect to the input data. Our methodology is completely unsupervised and follows a straight forward training scheme. First, we train a deep network to estimate scores for levels of noise. Once trained, we calculate the noisy score estimates for N in-distribution samples and take the L2-norms across the input dimensions (resulting in an NxL matrix). Then we train an auxiliary model (such as a Gaussian Mixture Model) to learn the in-distribution spatial regions in this L-dimensional space. This auxiliary model can now be used to identify points that reside outside the learned space. Despite its simplicity, our experiments show that this methodology significantly outperforms the state-of-the-art in detecting out-of-distribution images. For example, our method can effectively separate CIFAR-10 (inlier) and SVHN (OOD) images, a setting which has been previously shown to be difficult for deep likelihood models.

Via

Access Paper or Ask Questions