Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mingzhang Yin

Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation

Apr 05, 2024
Mingyuan Zhou, Huangjie Zheng, Zhendong Wang, Mingzhang Yin, Hai Huang

We introduce Score identity Distillation (SiD), an innovative data-free method that distills the generative capabilities of pretrained diffusion models into a single-step generator. SiD not only facilitates an exponentially fast reduction in Fr\'echet inception distance (FID) during distillation but also approaches or even exceeds the FID performance of the original teacher diffusion models. By reformulating forward diffusion processes as semi-implicit distributions, we leverage three score-related identities to create an innovative loss mechanism. This mechanism achieves rapid FID reduction by training the generator using its own synthesized images, eliminating the need for real data or reverse-diffusion-based generation, all accomplished within significantly shortened generation time. Upon evaluation across four benchmark datasets, the SiD algorithm demonstrates high iteration efficiency during distillation and surpasses competing distillation approaches, whether they are one-step or few-step, data-free, or dependent on training data, in terms of generation quality. This achievement not only redefines the benchmarks for efficiency and effectiveness in diffusion distillation but also in the broader field of diffusion-based generation. Our PyTorch implementation will be publicly accessible on GitHub.

Via

Access Paper or Ask Questions

Nonparametric Discrete Choice Experiments with Machine Learning Guided Adaptive Design

Oct 18, 2023
Mingzhang Yin, Ruijiang Gao, Weiran Lin, Steven M. Shugan

Figure 1 for Nonparametric Discrete Choice Experiments with Machine Learning Guided Adaptive Design

Figure 2 for Nonparametric Discrete Choice Experiments with Machine Learning Guided Adaptive Design

Figure 3 for Nonparametric Discrete Choice Experiments with Machine Learning Guided Adaptive Design

Figure 4 for Nonparametric Discrete Choice Experiments with Machine Learning Guided Adaptive Design

Designing products to meet consumers' preferences is essential for a business's success. We propose the Gradient-based Survey (GBS), a discrete choice experiment for multiattribute product design. The experiment elicits consumer preferences through a sequence of paired comparisons for partial profiles. GBS adaptively constructs paired comparison questions based on the respondents' previous choices. Unlike the traditional random utility maximization paradigm, GBS is robust to model misspecification by not requiring a parametric utility model. Cross-pollinating the machine learning and experiment design, GBS is scalable to products with hundreds of attributes and can design personalized products for heterogeneous consumers. We demonstrate the advantage of GBS in accuracy and sample efficiency compared to the existing parametric and nonparametric methods in simulations.

Via

Access Paper or Ask Questions

Confounding-Robust Policy Improvement with Human-AI Teams

Oct 13, 2023
Ruijiang Gao, Mingzhang Yin

Figure 1 for Confounding-Robust Policy Improvement with Human-AI Teams

Figure 2 for Confounding-Robust Policy Improvement with Human-AI Teams

Figure 3 for Confounding-Robust Policy Improvement with Human-AI Teams

Figure 4 for Confounding-Robust Policy Improvement with Human-AI Teams

Human-AI collaboration has the potential to transform various domains by leveraging the complementary strengths of human experts and Artificial Intelligence (AI) systems. However, unobserved confounding can undermine the effectiveness of this collaboration, leading to biased and unreliable outcomes. In this paper, we propose a novel solution to address unobserved confounding in human-AI collaboration by employing the marginal sensitivity model (MSM). Our approach combines domain expertise with AI-driven statistical modeling to account for potential confounders that may otherwise remain hidden. We present a deferral collaboration framework for incorporating the MSM into policy learning from observational data, enabling the system to control for the influence of unobserved confounding factors. In addition, we propose a personalized deferral collaboration system to leverage the diverse expertise of different human decision-makers. By adjusting for potential biases, our proposed solution enhances the robustness and reliability of collaborative outcomes. The empirical and theoretical analyses demonstrate the efficacy of our approach in mitigating unobserved confounding and improving the overall performance of human-AI collaborations.

* 24 pages

Via

Access Paper or Ask Questions

Gradient Estimation for Binary Latent Variables via Gradient Variance Clipping

Aug 12, 2022
Russell Z. Kunes, Mingzhang Yin, Max Land, Doron Haviv, Dana Pe'er, Simon Tavaré

Figure 1 for Gradient Estimation for Binary Latent Variables via Gradient Variance Clipping

Figure 2 for Gradient Estimation for Binary Latent Variables via Gradient Variance Clipping

Figure 3 for Gradient Estimation for Binary Latent Variables via Gradient Variance Clipping

Figure 4 for Gradient Estimation for Binary Latent Variables via Gradient Variance Clipping

Gradient estimation is often necessary for fitting generative models with discrete latent variables, in contexts such as reinforcement learning and variational autoencoder (VAE) training. The DisARM estimator (Yin et al. 2020; Dong, Mnih, and Tucker 2020) achieves state of the art gradient variance for Bernoulli latent variable models in many contexts. However, DisARM and other estimators have potentially exploding variance near the boundary of the parameter space, where solutions tend to lie. To ameliorate this issue, we propose a new gradient estimator \textit{bitflip}-1 that has lower variance at the boundaries of the parameter space. As bitflip-1 has complementary properties to existing estimators, we introduce an aggregated estimator, \textit{unbiased gradient variance clipping} (UGC) that uses either a bitflip-1 or a DisARM gradient update for each coordinate. We theoretically prove that UGC has uniformly lower variance than DisARM. Empirically, we observe that UGC achieves the optimal value of the optimization objectives in toy experiments, discrete VAE training, and in a best subset selection problem.

Via

Access Paper or Ask Questions

Probabilistic Conformal Prediction Using Conditional Random Samples

Jun 20, 2022
Zhendong Wang, Ruijiang Gao, Mingzhang Yin, Mingyuan Zhou, David M. Blei

Figure 1 for Probabilistic Conformal Prediction Using Conditional Random Samples

Figure 2 for Probabilistic Conformal Prediction Using Conditional Random Samples

Figure 3 for Probabilistic Conformal Prediction Using Conditional Random Samples

Figure 4 for Probabilistic Conformal Prediction Using Conditional Random Samples

This paper proposes probabilistic conformal prediction (PCP), a predictive inference algorithm that estimates a target variable by a discontinuous predictive set. Given inputs, PCP construct the predictive set based on random samples from an estimated generative model. It is efficient and compatible with either explicit or implicit conditional generative models. Theoretically, we show that PCP guarantees correct marginal coverage with finite samples. Empirically, we study PCP on a variety of simulated and real datasets. Compared to existing methods for conformal inference, PCP provides sharper predictive sets.

Via

Access Paper or Ask Questions

Partial Identification with Noisy Covariates: A Robust Optimization Approach

Feb 22, 2022
Wenshuo Guo, Mingzhang Yin, Yixin Wang, Michael I. Jordan

Figure 1 for Partial Identification with Noisy Covariates: A Robust Optimization Approach

Figure 2 for Partial Identification with Noisy Covariates: A Robust Optimization Approach

Figure 3 for Partial Identification with Noisy Covariates: A Robust Optimization Approach

Figure 4 for Partial Identification with Noisy Covariates: A Robust Optimization Approach

Causal inference from observational datasets often relies on measuring and adjusting for covariates. In practice, measurements of the covariates can often be noisy and/or biased, or only measurements of their proxies may be available. Directly adjusting for these imperfect measurements of the covariates can lead to biased causal estimates. Moreover, without additional assumptions, the causal effects are not point-identifiable due to the noise in these measurements. To this end, we study the partial identification of causal effects given noisy covariates, under a user-specified assumption on the noise level. The key observation is that we can formulate the identification of the average treatment effects (ATE) as a robust optimization problem. This formulation leads to an efficient robust optimization algorithm that bounds the ATE with noisy covariates. We show that this robust optimization approach can extend a wide range of causal adjustment methods to perform partial identification, including backdoor adjustment, inverse propensity score weighting, double machine learning, and front door adjustment. Across synthetic and real datasets, we find that this approach provides ATE bounds with a higher coverage probability than existing methods.

* Proceedings of Conference on Causal Learning and Reasoning (CLeaR) 2022

Via

Access Paper or Ask Questions

Optimization-based Causal Estimation from Heterogenous Environments

Sep 24, 2021
Mingzhang Yin, Yixin Wang, David M. Blei

Figure 1 for Optimization-based Causal Estimation from Heterogenous Environments

Figure 2 for Optimization-based Causal Estimation from Heterogenous Environments

Figure 3 for Optimization-based Causal Estimation from Heterogenous Environments

Figure 4 for Optimization-based Causal Estimation from Heterogenous Environments

This paper presents a new optimization approach to causal estimation. Given data that contains covariates and an outcome, which covariates are causes of the outcome, and what is the strength of the causality? In classical machine learning (ML), the goal of optimization is to maximize predictive accuracy. However, some covariates might exhibit a non-causal association to the outcome. Such spurious associations provide predictive power for classical ML, but they prevent us from causally interpreting the result. This paper proposes CoCo, an optimization algorithm that bridges the gap between pure prediction and causal inference. CoCo leverages the recently-proposed idea of environments, datasets of covariates/response where the causal relationships remain invariant but where the distribution of the covariates changes from environment to environment. Given datasets from multiple environments -- and ones that exhibit sufficient heterogeneity -- CoCo maximizes an objective for which the only solution is the causal solution. We describe the theoretical foundations of this approach and demonstrate its effectiveness on simulated and real datasets. Compared to classical ML and existing methods, CoCo provides more accurate estimates of the causal model.

Via

Access Paper or Ask Questions

Pairwise Supervised Hashing with Bernoulli Variational Auto-Encoder and Self-Control Gradient Estimator

May 21, 2020
Siamak Zamani Dadaneh, Shahin Boluki, Mingzhang Yin, Mingyuan Zhou, Xiaoning Qian

Figure 1 for Pairwise Supervised Hashing with Bernoulli Variational Auto-Encoder and Self-Control Gradient Estimator

Figure 2 for Pairwise Supervised Hashing with Bernoulli Variational Auto-Encoder and Self-Control Gradient Estimator

Figure 3 for Pairwise Supervised Hashing with Bernoulli Variational Auto-Encoder and Self-Control Gradient Estimator

Figure 4 for Pairwise Supervised Hashing with Bernoulli Variational Auto-Encoder and Self-Control Gradient Estimator

Semantic hashing has become a crucial component of fast similarity search in many large-scale information retrieval systems, in particular, for text data. Variational auto-encoders (VAEs) with binary latent variables as hashing codes provide state-of-the-art performance in terms of precision for document retrieval. We propose a pairwise loss function with discrete latent VAE to reward within-class similarity and between-class dissimilarity for supervised hashing. Instead of solving the optimization relying on existing biased gradient estimators, an unbiased low-variance gradient estimator is adopted to optimize the hashing function by evaluating the non-differentiable loss function over two correlated sets of binary hashing codes to control the variance of gradient estimates. This new semantic hashing framework achieves superior performance compared to the state-of-the-arts, as demonstrated by our comprehensive experiments.

* Uncertainty in Artificial Intelligence Conference (UAI) 2020
* To appear in UAI 2020

Via

Access Paper or Ask Questions

Discrete Action On-Policy Learning with Action-Value Critic

Feb 21, 2020
Yuguang Yue, Yunhao Tang, Mingzhang Yin, Mingyuan Zhou

Figure 1 for Discrete Action On-Policy Learning with Action-Value Critic

Figure 2 for Discrete Action On-Policy Learning with Action-Value Critic

Figure 3 for Discrete Action On-Policy Learning with Action-Value Critic

Figure 4 for Discrete Action On-Policy Learning with Action-Value Critic

Reinforcement learning (RL) in discrete action space is ubiquitous in real-world applications, but its complexity grows exponentially with the action-space dimension, making it challenging to apply existing on-policy gradient based deep RL algorithms efficiently. To effectively operate in multidimensional discrete action spaces, we construct a critic to estimate action-value functions, apply it on correlated actions, and combine these critic estimated action values to control the variance of gradient estimation. We follow rigorous statistical analysis to design how to generate and combine these correlated actions, and how to sparsify the gradients by shutting down the contributions from certain dimensions. These efforts result in a new discrete action on-policy RL algorithm that empirically outperforms related on-policy algorithms relying on variance control techniques. We demonstrate these properties on OpenAI Gym benchmark tasks, and illustrate how discretizing the action space could benefit the exploration phase and hence facilitate convergence to a better local optimal solution thanks to the flexibility of discrete policy.

Via

Access Paper or Ask Questions