Alert button
Picture for Arian Hosseini

Arian Hosseini

Alert button

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Add code
Bookmark button
Alert button
Mar 24, 2024
Shengyi Huang, Michael Noukhovitch, Arian Hosseini, Kashif Rasul, Weixun Wang, Lewis Tunstall

Viaarxiv icon

V-STaR: Training Verifiers for Self-Taught Reasoners

Add code
Bookmark button
Alert button
Feb 09, 2024
Arian Hosseini, Xingdi Yuan, Nikolay Malkin, Aaron Courville, Alessandro Sordoni, Rishabh Agarwal

Viaarxiv icon

Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference

Add code
Bookmark button
Alert button
Jun 21, 2023
Alessandro Sordoni, Xingdi Yuan, Marc-Alexandre Côté, Matheus Pereira, Adam Trischler, Ziang Xiao, Arian Hosseini, Friederike Niedtner, Nicolas Le Roux

Viaarxiv icon

On the Compositional Generalization Gap of In-Context Learning

Add code
Bookmark button
Alert button
Nov 15, 2022
Arian Hosseini, Ankit Vani, Dzmitry Bahdanau, Alessandro Sordoni, Aaron Courville

Figure 1 for On the Compositional Generalization Gap of In-Context Learning
Figure 2 for On the Compositional Generalization Gap of In-Context Learning
Figure 3 for On the Compositional Generalization Gap of In-Context Learning
Figure 4 for On the Compositional Generalization Gap of In-Context Learning
Viaarxiv icon

Understanding by Understanding Not: Modeling Negation in Language Models

Add code
Bookmark button
Alert button
May 07, 2021
Arian Hosseini, Siva Reddy, Dzmitry Bahdanau, R Devon Hjelm, Alessandro Sordoni, Aaron Courville

Figure 1 for Understanding by Understanding Not: Modeling Negation in Language Models
Figure 2 for Understanding by Understanding Not: Modeling Negation in Language Models
Figure 3 for Understanding by Understanding Not: Modeling Negation in Language Models
Figure 4 for Understanding by Understanding Not: Modeling Negation in Language Models
Viaarxiv icon

Ordered Memory

Add code
Bookmark button
Alert button
Nov 03, 2019
Yikang Shen, Shawn Tan, Arian Hosseini, Zhouhan Lin, Alessandro Sordoni, Aaron Courville

Figure 1 for Ordered Memory
Figure 2 for Ordered Memory
Figure 3 for Ordered Memory
Figure 4 for Ordered Memory
Viaarxiv icon