Alert button
Picture for Miguel Suau

Miguel Suau

Alert button

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL

Add code
Bookmark button
Alert button
Jun 04, 2023
Miguel Suau, Matthijs T. J. Spaan, Frans A. Oliehoek

Figure 1 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 2 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 3 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 4 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Viaarxiv icon

Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems

Add code
Bookmark button
Alert button
Jul 01, 2022
Miguel Suau, Jinke He, Mustafa Mert Çelikok, Matthijs T. J. Spaan, Frans A. Oliehoek

Figure 1 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 2 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 3 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 4 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Viaarxiv icon

Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems

Add code
Bookmark button
Alert button
Feb 03, 2022
Miguel Suau, Jinke He, Matthijs T. J. Spaan, Frans A. Oliehoek

Figure 1 for Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Figure 2 for Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Figure 3 for Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Figure 4 for Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Viaarxiv icon

Online Planning in POMDPs with Self-Improving Simulators

Add code
Bookmark button
Alert button
Jan 27, 2022
Jinke He, Miguel Suau, Hendrik Baier, Michael Kaisers, Frans A. Oliehoek

Figure 1 for Online Planning in POMDPs with Self-Improving Simulators
Figure 2 for Online Planning in POMDPs with Self-Improving Simulators
Figure 3 for Online Planning in POMDPs with Self-Improving Simulators
Figure 4 for Online Planning in POMDPs with Self-Improving Simulators
Viaarxiv icon

Offline Contextual Bandits for Wireless Network Optimization

Add code
Bookmark button
Alert button
Nov 11, 2021
Miguel Suau, Alexandros Agapitos, David Lynch, Derek Farrell, Mingqi Zhou, Aleksandar Milenovic

Figure 1 for Offline Contextual Bandits for Wireless Network Optimization
Figure 2 for Offline Contextual Bandits for Wireless Network Optimization
Figure 3 for Offline Contextual Bandits for Wireless Network Optimization
Viaarxiv icon

Influence-Augmented Online Planning for Complex Environments

Add code
Bookmark button
Alert button
Oct 21, 2020
Jinke He, Miguel Suau, Frans A. Oliehoek

Figure 1 for Influence-Augmented Online Planning for Complex Environments
Figure 2 for Influence-Augmented Online Planning for Complex Environments
Figure 3 for Influence-Augmented Online Planning for Complex Environments
Figure 4 for Influence-Augmented Online Planning for Complex Environments
Viaarxiv icon

Influence-aware Memory for Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 21, 2019
Miguel Suau, Elena Congeduti, Rolf Starre, Aleksander Czechowski, Frans Olihoek

Figure 1 for Influence-aware Memory for Deep Reinforcement Learning
Figure 2 for Influence-aware Memory for Deep Reinforcement Learning
Figure 3 for Influence-aware Memory for Deep Reinforcement Learning
Figure 4 for Influence-aware Memory for Deep Reinforcement Learning
Viaarxiv icon