Aaron Courville

The Curse of Diversity in Ensemble-Based Exploration

May 07, 2024

LOQA: Learning with Opponent Q-Learning Awareness

May 02, 2024

Modeling Caption Diversity in Contrastive Vision-Language Pretraining

Apr 30, 2024

SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision

Apr 24, 2024

Best Response Shaping

Apr 05, 2024

Scattered Mixture-of-Experts Implementation

Mar 13, 2024

In deep reinforcement learning, a pruned network is a good network

Feb 19, 2024

V-STaR: Training Verifiers for Self-Taught Reasoners

Feb 09, 2024

Language Model Alignment with Elastic Reset

Dec 06, 2023

Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy

Nov 21, 2023