Kyle Wray

Entropy-regularized Point-based Value Iteration

Feb 14, 2024
Harrison Delecki, Marcell Vazquez-Chanlatte, Esen Yel, Kyle Wray, Tomer Arnon, Stefan Witwicki, Mykel J. Kochenderfer

Via arXiv

Decision Making in Non-Stationary Environments with Policy-Augmented Search

Jan 06, 2024
Ava Pettet, Yunuo Zhang, Baiting Luo, Kyle Wray, Hendrik Baier, Aron Laszka, Abhishek Dubey, Ayan Mukhopadhyay

Via arXiv

Active teacher selection for reinforcement learning from human feedback

Oct 23, 2023
Rachel Freedman, Justin Svegliato, Kyle Wray, Stuart Russell

Via arXiv