Alert button
Picture for Surya Kanoria

Surya Kanoria

Alert button

Soft Preference Optimization: Aligning Language Models to Expert Distributions

Add code
Bookmark button
Alert button
Apr 30, 2024
Arsalan Sharifnassab, Sina Ghiassian, Saber Salehkaleybar, Surya Kanoria, Dale Schuurmans

Viaarxiv icon

Automatic Music Playlist Generation via Simulation-based Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 13, 2023
Federico Tomasi, Joseph Cauteruccio, Surya Kanoria, Kamil Ciosek, Matteo Rinaldi, Zhenwen Dai

Figure 1 for Automatic Music Playlist Generation via Simulation-based Reinforcement Learning
Figure 2 for Automatic Music Playlist Generation via Simulation-based Reinforcement Learning
Figure 3 for Automatic Music Playlist Generation via Simulation-based Reinforcement Learning
Figure 4 for Automatic Music Playlist Generation via Simulation-based Reinforcement Learning
Viaarxiv icon

What to Learn, and How: Toward Effective Learning from Rationales

Add code
Bookmark button
Alert button
Nov 30, 2021
Samuel Carton, Surya Kanoria, Chenhao Tan

Figure 1 for What to Learn, and How: Toward Effective Learning from Rationales
Figure 2 for What to Learn, and How: Toward Effective Learning from Rationales
Figure 3 for What to Learn, and How: Toward Effective Learning from Rationales
Figure 4 for What to Learn, and How: Toward Effective Learning from Rationales
Viaarxiv icon