Alert button
Picture for Prashanth L. A

Prashanth L. A

Alert button

Generalized Simultaneous Perturbation Stochastic Approximation with Reduced Estimator Bias

Add code
Bookmark button
Alert button
Dec 20, 2022
Shalabh Bhatnagar, Prashanth L. A

Viaarxiv icon

Approximate gradient ascent methods for distortion risk measures

Add code
Bookmark button
Alert button
Feb 22, 2022
Nithia Vijayan, Prashanth L. A

Figure 1 for Approximate gradient ascent methods for distortion risk measures
Figure 2 for Approximate gradient ascent methods for distortion risk measures
Viaarxiv icon

Likelihood ratio-based policy gradient methods for distorted risk measures: A non-asymptotic analysis

Add code
Bookmark button
Alert button
Jul 14, 2021
Nithia Vijayan, Prashanth L. A

Figure 1 for Likelihood ratio-based policy gradient methods for distorted risk measures: A non-asymptotic analysis
Figure 2 for Likelihood ratio-based policy gradient methods for distorted risk measures: A non-asymptotic analysis
Viaarxiv icon

Smoothed functional-based gradient algorithms for off-policy reinforcement learning

Add code
Bookmark button
Alert button
Jan 06, 2021
Nithia Vijayan, Prashanth L. A

Figure 1 for Smoothed functional-based gradient algorithms for off-policy reinforcement learning
Figure 2 for Smoothed functional-based gradient algorithms for off-policy reinforcement learning
Viaarxiv icon

Improved Concentration Bounds for Conditional Value-at-Risk and Cumulative Prospect Theory using Wasserstein distance

Add code
Bookmark button
Alert button
Feb 27, 2019
Sanjay P. Bhat, Prashanth L. A

Figure 1 for Improved Concentration Bounds for Conditional Value-at-Risk and Cumulative Prospect Theory using Wasserstein distance
Viaarxiv icon

Correlated bandits or: How to minimize mean-squared error online

Add code
Bookmark button
Alert button
Feb 08, 2019
Vinay Praneeth Boda, Prashanth L. A

Figure 1 for Correlated bandits or: How to minimize mean-squared error online
Viaarxiv icon