Alert button
Picture for Brett Daley

Brett Daley

Alert button

Compound Returns Reduce Variance in Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 06, 2024
Brett Daley, Martha White, Marlos C. Machado

Viaarxiv icon

Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 26, 2023
Brett Daley, Martha White, Christopher Amato, Marlos C. Machado

Figure 1 for Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Figure 2 for Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Figure 3 for Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Figure 4 for Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
Viaarxiv icon

Adaptive Tree Backup Algorithms for Temporal-Difference Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 04, 2022
Brett Daley, Isaac Chan

Figure 1 for Adaptive Tree Backup Algorithms for Temporal-Difference Reinforcement Learning
Viaarxiv icon

Improving the Efficiency of Off-Policy Reinforcement Learning by Accounting for Past Decisions

Add code
Bookmark button
Alert button
Dec 23, 2021
Brett Daley, Christopher Amato

Viaarxiv icon

Virtual Replay Cache

Add code
Bookmark button
Alert button
Dec 06, 2021
Brett Daley, Christopher Amato

Figure 1 for Virtual Replay Cache
Figure 2 for Virtual Replay Cache
Figure 3 for Virtual Replay Cache
Figure 4 for Virtual Replay Cache
Viaarxiv icon

Human-Level Control without Server-Grade Hardware

Add code
Bookmark button
Alert button
Nov 01, 2021
Brett Daley, Christopher Amato

Figure 1 for Human-Level Control without Server-Grade Hardware
Figure 2 for Human-Level Control without Server-Grade Hardware
Figure 3 for Human-Level Control without Server-Grade Hardware
Figure 4 for Human-Level Control without Server-Grade Hardware
Viaarxiv icon

Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods

Add code
Bookmark button
Alert button
Jun 10, 2021
Brett Daley, Christopher Amato

Figure 1 for Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods
Figure 2 for Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods
Figure 3 for Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods
Figure 4 for Investigating Alternatives to the Root Mean Square for Adaptive Gradient Methods
Viaarxiv icon

Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 22, 2021
Brett Daley, Cameron Hickert, Christopher Amato

Figure 1 for Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
Figure 2 for Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
Figure 3 for Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
Viaarxiv icon

Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 08, 2021
Xueguang Lyu, Yuchen Xiao, Brett Daley, Christopher Amato

Figure 1 for Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning
Figure 2 for Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning
Figure 3 for Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning
Figure 4 for Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning
Viaarxiv icon

Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability

Add code
Bookmark button
Alert button
Nov 05, 2020
Hai Nguyen, Brett Daley, Xinchao Song, Christopher Amato, Robert Platt

Figure 1 for Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability
Figure 2 for Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability
Figure 3 for Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability
Figure 4 for Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability
Viaarxiv icon