Picture for Mehdi Mirza

Mehdi Mirza

Retrieval-Augmented Reinforcement Learning

Add code
Mar 09, 2022
Figure 1 for Retrieval-Augmented Reinforcement Learning
Figure 2 for Retrieval-Augmented Reinforcement Learning
Figure 3 for Retrieval-Augmented Reinforcement Learning
Figure 4 for Retrieval-Augmented Reinforcement Learning
Viaarxiv icon

Evaluating model-based planning and planner amortization for continuous control

Oct 07, 2021
Figure 1 for Evaluating model-based planning and planner amortization for continuous control
Figure 2 for Evaluating model-based planning and planner amortization for continuous control
Figure 3 for Evaluating model-based planning and planner amortization for continuous control
Figure 4 for Evaluating model-based planning and planner amortization for continuous control
Viaarxiv icon

Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban

Oct 03, 2020
Figure 1 for Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban
Figure 2 for Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban
Figure 3 for Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban
Figure 4 for Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban
Viaarxiv icon

Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

Add code
Sep 11, 2020
Figure 1 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Figure 2 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Figure 3 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Figure 4 for Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Viaarxiv icon

An investigation of model-free planning

Add code
Jan 11, 2019
Figure 1 for An investigation of model-free planning
Figure 2 for An investigation of model-free planning
Figure 3 for An investigation of model-free planning
Figure 4 for An investigation of model-free planning
Viaarxiv icon

Optimizing Agent Behavior over Long Time Scales by Transporting Value

Add code
Oct 15, 2018
Figure 1 for Optimizing Agent Behavior over Long Time Scales by Transporting Value
Figure 2 for Optimizing Agent Behavior over Long Time Scales by Transporting Value
Figure 3 for Optimizing Agent Behavior over Long Time Scales by Transporting Value
Figure 4 for Optimizing Agent Behavior over Long Time Scales by Transporting Value
Viaarxiv icon

Probing Physics Knowledge Using Tools from Developmental Psychology

Apr 03, 2018
Figure 1 for Probing Physics Knowledge Using Tools from Developmental Psychology
Figure 2 for Probing Physics Knowledge Using Tools from Developmental Psychology
Figure 3 for Probing Physics Knowledge Using Tools from Developmental Psychology
Figure 4 for Probing Physics Knowledge Using Tools from Developmental Psychology
Viaarxiv icon

Unsupervised Predictive Memory in a Goal-Directed Agent

Add code
Mar 28, 2018
Figure 1 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 2 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 3 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 4 for Unsupervised Predictive Memory in a Goal-Directed Agent
Viaarxiv icon

Generalizable Features From Unsupervised Learning

Add code
Dec 12, 2016
Figure 1 for Generalizable Features From Unsupervised Learning
Figure 2 for Generalizable Features From Unsupervised Learning
Figure 3 for Generalizable Features From Unsupervised Learning
Figure 4 for Generalizable Features From Unsupervised Learning
Viaarxiv icon

Asynchronous Methods for Deep Reinforcement Learning

Add code
Jun 16, 2016
Figure 1 for Asynchronous Methods for Deep Reinforcement Learning
Figure 2 for Asynchronous Methods for Deep Reinforcement Learning
Figure 3 for Asynchronous Methods for Deep Reinforcement Learning
Figure 4 for Asynchronous Methods for Deep Reinforcement Learning
Viaarxiv icon