Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

M. Maruf

Beyond Discriminative Regions: Saliency Maps as Alternatives to CAMs for Weakly Supervised Semantic Segmentation

Aug 21, 2023
M. Maruf, Arka Daw, Amartya Dutta, Jie Bu, Anuj Karpatne

Figure 1 for Beyond Discriminative Regions: Saliency Maps as Alternatives to CAMs for Weakly Supervised Semantic Segmentation

Figure 2 for Beyond Discriminative Regions: Saliency Maps as Alternatives to CAMs for Weakly Supervised Semantic Segmentation

Figure 3 for Beyond Discriminative Regions: Saliency Maps as Alternatives to CAMs for Weakly Supervised Semantic Segmentation

Figure 4 for Beyond Discriminative Regions: Saliency Maps as Alternatives to CAMs for Weakly Supervised Semantic Segmentation

In recent years, several Weakly Supervised Semantic Segmentation (WS3) methods have been proposed that use class activation maps (CAMs) generated by a classifier to produce pseudo-ground truths for training segmentation models. While CAMs are good at highlighting discriminative regions (DR) of an image, they are known to disregard regions of the object that do not contribute to the classifier's prediction, termed non-discriminative regions (NDR). In contrast, attribution methods such as saliency maps provide an alternative approach for assigning a score to every pixel based on its contribution to the classification prediction. This paper provides a comprehensive comparison between saliencies and CAMs for WS3. Our study includes multiple perspectives on understanding their similarities and dissimilarities. Moreover, we provide new evaluation metrics that perform a comprehensive assessment of WS3 performance of alternative methods w.r.t. CAMs. We demonstrate the effectiveness of saliencies in addressing the limitation of CAMs through our empirical studies on benchmark datasets. Furthermore, we propose random cropping as a stochastic aggregation technique that improves the performance of saliency, making it a strong alternative to CAM for WS3.

* 24 pages, 13 figures, 4 tables

Via

Access Paper or Ask Questions

Learning Compact Representations of Neural Networks using DiscriminAtive Masking (DAM)

Oct 01, 2021
Jie Bu, Arka Daw, M. Maruf, Anuj Karpatne

Figure 1 for Learning Compact Representations of Neural Networks using DiscriminAtive Masking (DAM)

Figure 2 for Learning Compact Representations of Neural Networks using DiscriminAtive Masking (DAM)

Figure 3 for Learning Compact Representations of Neural Networks using DiscriminAtive Masking (DAM)

Figure 4 for Learning Compact Representations of Neural Networks using DiscriminAtive Masking (DAM)

A central goal in deep learning is to learn compact representations of features at every layer of a neural network, which is useful for both unsupervised representation learning and structured network pruning. While there is a growing body of work in structured pruning, current state-of-the-art methods suffer from two key limitations: (i) instability during training, and (ii) need for an additional step of fine-tuning, which is resource-intensive. At the core of these limitations is the lack of a systematic approach that jointly prunes and refines weights during training in a single stage, and does not require any fine-tuning upon convergence to achieve state-of-the-art performance. We present a novel single-stage structured pruning method termed DiscriminAtive Masking (DAM). The key intuition behind DAM is to discriminatively prefer some of the neurons to be refined during the training process, while gradually masking out other neurons. We show that our proposed DAM approach has remarkably good performance over various applications, including dimensionality reduction, recommendation system, graph representation learning, and structured pruning for image classification. We also theoretically show that the learning objective of DAM is directly related to minimizing the L0 norm of the masking layer.

* 25 pages, 11 figures, 7 tables, Accepted to NeurIPS 2021

Via

Access Paper or Ask Questions

PID-GAN: A GAN Framework based on a Physics-informed Discriminator for Uncertainty Quantification with Physics

Jun 06, 2021
Arka Daw, M. Maruf, Anuj Karpatne

Figure 1 for PID-GAN: A GAN Framework based on a Physics-informed Discriminator for Uncertainty Quantification with Physics

Figure 2 for PID-GAN: A GAN Framework based on a Physics-informed Discriminator for Uncertainty Quantification with Physics

Figure 3 for PID-GAN: A GAN Framework based on a Physics-informed Discriminator for Uncertainty Quantification with Physics

Figure 4 for PID-GAN: A GAN Framework based on a Physics-informed Discriminator for Uncertainty Quantification with Physics

As applications of deep learning (DL) continue to seep into critical scientific use-cases, the importance of performing uncertainty quantification (UQ) with DL has become more pressing than ever before. In scientific applications, it is also important to inform the learning of DL models with knowledge of physics of the problem to produce physically consistent and generalized solutions. This is referred to as the emerging field of physics-informed deep learning (PIDL). We consider the problem of developing PIDL formulations that can also perform UQ. To this end, we propose a novel physics-informed GAN architecture, termed PID-GAN, where the knowledge of physics is used to inform the learning of both the generator and discriminator models, making ample use of unlabeled data instances. We show that our proposed PID-GAN framework does not suffer from imbalance of generator gradients from multiple loss terms as compared to state-of-the-art. We also empirically demonstrate the efficacy of our proposed framework on a variety of case studies involving benchmark physics-based PDEs as well as imperfect physics. All the code and datasets used in this study have been made available on this link : https://github.com/arkadaw9/PID-GAN.

* 11 pages, 11 figures, 2 tables, Published at KDD 2021

Via

Access Paper or Ask Questions

Beyond Observed Connections : Link Injection

Sep 02, 2020
Jie Bu, M. Maruf, Arka Daw

Figure 1 for Beyond Observed Connections : Link Injection

Figure 2 for Beyond Observed Connections : Link Injection

Figure 3 for Beyond Observed Connections : Link Injection

Figure 4 for Beyond Observed Connections : Link Injection

In this paper, we proposed the \textit{link injection}, a novel method that helps any differentiable graph machine learning models to go beyond observed connections from the input data in an end-to-end learning fashion. It finds out (weak) connections in favor of the current task that is not present in the input data via a parametric link injection layer. We evaluate our method on both node classification and link prediction tasks using a series of state-of-the-art graph convolution networks. Results show that the link injection helps a variety of models to achieve better performances on both applications. Further empirical analysis shows a great potential of this method in efficiently exploiting unseen connections from the injected links.

Via

Access Paper or Ask Questions

Maximizing Cohesion and Separation in Graph Representation Learning: A Distance-aware Negative Sampling Approach

Jul 02, 2020
M. Maruf, Anuj Karpatne

Figure 1 for Maximizing Cohesion and Separation in Graph Representation Learning: A Distance-aware Negative Sampling Approach

Figure 2 for Maximizing Cohesion and Separation in Graph Representation Learning: A Distance-aware Negative Sampling Approach

Figure 3 for Maximizing Cohesion and Separation in Graph Representation Learning: A Distance-aware Negative Sampling Approach

Figure 4 for Maximizing Cohesion and Separation in Graph Representation Learning: A Distance-aware Negative Sampling Approach

The objective of unsupervised graph representation learning (GRL) is to learn a low-dimensional space of node embeddings that reflect the structure of a given unlabeled graph. Existing algorithms for this task rely on negative sampling objectives that maximize the similarity in node embeddings at nearby nodes (referred to as "cohesion") by maintaining positive and negative corpus of node pairs. While positive samples are drawn from node pairs that co-occur in short random walks, conventional approaches construct negative corpus by uniformly sampling random pairs, thus ignoring valuable information about structural dissimilarity among distant node pairs (referred to as "separation"). In this paper, we present a novel Distance-aware Negative Sampling (DNS) which maximizes the separation of distant node-pairs while maximizing cohesion at nearby node-pairs by setting the negative sampling probability proportional to the pair-wise shortest distances. Our approach can be used in conjunction with any GRL algorithm and we demonstrate the efficacy of our approach over baseline negative sampling methods over downstream node classification tasks on a number of benchmark datasets and GRL algorithms. All our codes and datasets are available at \url{https://github.com/Distance-awareNS/DNS/}.

* 18 pages, 6 figures, 2 tables, preprint paper, under review on NeurIPS2020 conference

Via

Access Paper or Ask Questions