Kianté Brantley

REBEL: Reinforcement Learning via Regressing Relative Rewards

Apr 25, 2024
Zhaolin Gao, Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun

Dataset Reset Policy Optimization for RLHF

Apr 15, 2024
Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun

Adversarial Imitation Learning via Boosting

Apr 12, 2024
Jonathan D. Chang, Dhruv Sreenivas, Yingbing Huang, Kianté Brantley, Wen Sun

RL for Consistency Models: Faster Reward Guided Text-to-Image Generation

Mar 25, 2024
Owen Oertell, Jonathan D. Chang, Yiyi Zhang, Kianté Brantley, Wen Sun

A Surprising Failure? Multimodal LLMs and the NLVR Challenge

Feb 26, 2024
Anne Wu, Kianté Brantley, Yoav Artzi

Reviewer2: Optimizing Review Generation Through Prompt Generation

Feb 16, 2024
Zhaolin Gao, Kianté Brantley, Thorsten Joachims

Policy-Gradient Training of Language Models for Ranking

Oct 06, 2023
Ge Gao, Jonathan D. Chang, Claire Cardie, Kianté Brantley, Thorsten Joachims

Ranking with Long-Term Constraints

Jul 10, 2023
Kianté Brantley, Zhichong Fang, Sarah Dean, Thorsten Joachims

Interactive Text Generation

Mar 17, 2023
Felix Faltings, Michel Galley, Baolin Peng, Kianté Brantley, Weixin Cai, Yizhe Zhang, Jianfeng Gao, Bill Dolan

lilGym: Natural Language Visual Reasoning with Reinforcement Learning

Nov 03, 2022
Anne Wu, Kianté Brantley, Noriyuki Kojima, Yoav Artzi
