Minimax Optimal Convergence Rates for Estimating Ground Truth from Crowdsourced Labels

May 30, 2016

Chao Gao, Dengyong Zhou

May 30, 2016

Chao Gao, Dengyong Zhou

**Click to Read Paper and Get Code**

Double or Nothing: Multiplicative Incentive Mechanisms for Crowdsourcing

Dec 16, 2015

Nihar B. Shah, Dengyong Zhou

Dec 16, 2015

Nihar B. Shah, Dengyong Zhou

**Click to Read Paper and Get Code**

On the Impossibility of Convex Inference in Human Computation

Nov 21, 2014

Nihar B. Shah, Dengyong Zhou

Nov 21, 2014

Nihar B. Shah, Dengyong Zhou

* AAAI 2015

**Click to Read Paper and Get Code**

Provably Optimal Algorithms for Generalized Linear Contextual Bandits

Jun 18, 2017

Lihong Li, Yu Lu, Dengyong Zhou

Contextual bandits are widely used in Internet services from news recommendation to advertising, and to Web search. Generalized linear models (logistical regression in particular) have demonstrated stronger performance than linear models in many applications where rewards are binary. However, most theoretical analyses on contextual bandits so far are on linear bandits. In this work, we propose an upper confidence bound based algorithm for generalized linear contextual bandits, which achieves an $\tilde{O}(\sqrt{dT})$ regret over $T$ rounds with $d$ dimensional feature vectors. This regret matches the minimax lower bound, up to logarithmic terms, and improves on the best previous result by a $\sqrt{d}$ factor, assuming the number of arms is fixed. A key component in our analysis is to establish a new, sharp finite-sample confidence bound for maximum-likelihood estimates in generalized linear models, which may be of independent interest. We also analyze a simpler upper confidence bound algorithm, which is useful in practice, and prove it to have optimal regret for certain cases.
Jun 18, 2017

Lihong Li, Yu Lu, Dengyong Zhou

* Published at ICML 2017

**Click to Read Paper and Get Code**

In many machine learning applications, crowdsourcing has become the primary means for label collection. In this paper, we study the optimal error rate for aggregating labels provided by a set of non-expert workers. Under the classic Dawid-Skene model, we establish matching upper and lower bounds with an exact exponent $mI(\pi)$ in which $m$ is the number of workers and $I(\pi)$ the average Chernoff information that characterizes the workers' collective ability. Such an exact characterization of the error exponent allows us to state a precise sample size requirement $m>\frac{1}{I(\pi)}\log\frac{1}{\epsilon}$ in order to achieve an $\epsilon$ misclassification error. In addition, our results imply the optimality of various EM algorithms for crowdsourcing initialized by consistent estimators.

* To appear in the Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 2016

* To appear in the Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 2016

**Click to Read Paper and Get Code**
Statistical Decision Making for Optimal Budget Allocation in Crowd Labeling

Apr 24, 2014

Xi Chen, Qihang Lin, Dengyong Zhou

Apr 24, 2014

Xi Chen, Qihang Lin, Dengyong Zhou

* 39 pages

**Click to Read Paper and Get Code**

* 13 pages, 3 figures, downloadable supplementary files

**Click to Read Paper and Get Code**

Approval Voting and Incentives in Crowdsourcing

Sep 07, 2015

Nihar B. Shah, Dengyong Zhou, Yuval Peres

Sep 07, 2015

Nihar B. Shah, Dengyong Zhou, Yuval Peres

**Click to Read Paper and Get Code**

Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation

Oct 29, 2018

Qiang Liu, Lihong Li, Ziyang Tang, Dengyong Zhou

Oct 29, 2018

Qiang Liu, Lihong Li, Ziyang Tang, Dengyong Zhou

* 21 pages, 5 figures, NIPS 2018 (spotlight)

**Click to Read Paper and Get Code**

Spectral Methods meet EM: A Provably Optimal Algorithm for Crowdsourcing

Nov 01, 2014

Yuchen Zhang, Xi Chen, Dengyong Zhou, Michael I. Jordan

Nov 01, 2014

Yuchen Zhang, Xi Chen, Dengyong Zhou, Michael I. Jordan

**Click to Read Paper and Get Code**

On the Discrimination-Generalization Tradeoff in GANs

Feb 23, 2018

Pengchuan Zhang, Qiang Liu, Dengyong Zhou, Tao Xu, Xiaodong He

Feb 23, 2018

Pengchuan Zhang, Qiang Liu, Dengyong Zhou, Tao Xu, Xiaodong He

* ICLR 2018

**Click to Read Paper and Get Code**

Towards Neural Phrase-based Machine Translation

Sep 24, 2018

Po-Sen Huang, Chong Wang, Sitao Huang, Dengyong Zhou, Li Deng

Sep 24, 2018

Po-Sen Huang, Chong Wang, Sitao Huang, Dengyong Zhou, Li Deng

* in International Conference on Learning Representations (ICLR) 2018

**Click to Read Paper and Get Code**

Action-depedent Control Variates for Policy Optimization via Stein's Identity

Feb 23, 2018

Hao Liu, Yihao Feng, Yi Mao, Dengyong Zhou, Jian Peng, Qiang Liu

Feb 23, 2018

Hao Liu, Yihao Feng, Yi Mao, Dengyong Zhou, Jian Peng, Qiang Liu

* The first two authors contributed equally. Author ordering determined by coin flip over a Google Hangout. Accepted by ICLR 2018

**Click to Read Paper and Get Code**

Stochastic Variance Reduction Methods for Policy Evaluation

Jun 09, 2017

Simon S. Du, Jianshu Chen, Lihong Li, Lin Xiao, Dengyong Zhou

Jun 09, 2017

Simon S. Du, Jianshu Chen, Lihong Li, Lin Xiao, Dengyong Zhou

* Accepted by ICML 2017

**Click to Read Paper and Get Code**

Sequence Modeling via Segmentations

Jul 18, 2018

Chong Wang, Yining Wang, Po-Sen Huang, Abdelrahman Mohamed, Dengyong Zhou, Li Deng

Jul 18, 2018

Chong Wang, Yining Wang, Po-Sen Huang, Abdelrahman Mohamed, Dengyong Zhou, Li Deng

* recurrent neural networks, dynamic programming, structured prediction

**Click to Read Paper and Get Code**

Regularized Minimax Conditional Entropy for Crowdsourcing

Mar 25, 2015

Dengyong Zhou, Qiang Liu, John C. Platt, Christopher Meek, Nihar B. Shah

Mar 25, 2015

Dengyong Zhou, Qiang Liu, John C. Platt, Christopher Meek, Nihar B. Shah

* 31 pages

**Click to Read Paper and Get Code**

Neuro-Symbolic Program Synthesis

Nov 06, 2016

Emilio Parisotto, Abdel-rahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, Pushmeet Kohli

Nov 06, 2016

Emilio Parisotto, Abdel-rahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, Pushmeet Kohli

**Click to Read Paper and Get Code**

Neural Phrase-to-Phrase Machine Translation

Nov 06, 2018

Jiangtao Feng, Lingpeng Kong, Po-Sen Huang, Chong Wang, Da Huang, Jiayuan Mao, Kan Qiao, Dengyong Zhou

Nov 06, 2018

Jiangtao Feng, Lingpeng Kong, Po-Sen Huang, Chong Wang, Da Huang, Jiayuan Mao, Kan Qiao, Dengyong Zhou

**Click to Read Paper and Get Code**