Alert button
Picture for Reda Ouhamma

Reda Ouhamma

Alert button

CRIStAL

Learning Nash Equilibria in Zero-Sum Markov Games: A Single Time-scale Algorithm Under Weak Reachability

Add code
Bookmark button
Alert button
Dec 13, 2023
Reda Ouhamma, Maryam Kamgarpour

Viaarxiv icon

Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning

Add code
Bookmark button
Alert button
Oct 05, 2022
Reda Ouhamma, Debabrota Basu, Odalric-Ambrym Maillard

Figure 1 for Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning
Figure 2 for Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning
Viaarxiv icon

Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge

Add code
Bookmark button
Alert button
Nov 02, 2021
Reda Ouhamma, Odalric Maillard, Vianney Perchet

Figure 1 for Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge
Figure 2 for Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge
Figure 3 for Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge
Figure 4 for Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge
Viaarxiv icon

Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits

Add code
Bookmark button
Alert button
Oct 18, 2021
Reda Ouhamma, Rémy Degenne, Pierre Gaillard, Vianney Perchet

Figure 1 for Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits
Figure 2 for Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits
Figure 3 for Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits
Figure 4 for Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits
Viaarxiv icon

Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients

Add code
Bookmark button
Alert button
Oct 09, 2020
Yannis Flet-Berliac, Reda Ouhamma, Odalric-Ambrym Maillard, Philippe Preux

Figure 1 for Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients
Figure 2 for Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients
Figure 3 for Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients
Figure 4 for Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients
Viaarxiv icon