Alert button
Picture for Yuu Jinnai

Yuu Jinnai

Alert button

Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment

Add code
Bookmark button
Alert button
Apr 05, 2024
Yuu Jinnai, Tetsuro Morimura, Kaito Ariu, Kenshi Abe

Viaarxiv icon

On the True Distribution Approximation of Minimum Bayes-Risk Decoding

Add code
Bookmark button
Alert button
Mar 31, 2024
Atsumoto Ohashi, Ukyo Honda, Tetsuro Morimura, Yuu Jinnai

Viaarxiv icon

Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding

Add code
Bookmark button
Alert button
Jan 10, 2024
Yuu Jinnai, Ukyo Honda, Tetsuro Morimura, Peinan Zhang

Viaarxiv icon

Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding

Add code
Bookmark button
Alert button
Jan 05, 2024
Yuu Jinnai, Kaito Ariu

Viaarxiv icon

Model-Based Minimum Bayes Risk Decoding

Add code
Bookmark button
Alert button
Nov 09, 2023
Yuu Jinnai, Tetsuro Morimura, Ukyo Honda, Kaito Ariu, Kenshi Abe

Viaarxiv icon

On the Depth between Beam Search and Exhaustive Search for Text Generation

Add code
Bookmark button
Alert button
Aug 25, 2023
Yuu Jinnai, Tetsuro Morimura, Ukyo Honda

Figure 1 for On the Depth between Beam Search and Exhaustive Search for Text Generation
Figure 2 for On the Depth between Beam Search and Exhaustive Search for Text Generation
Figure 3 for On the Depth between Beam Search and Exhaustive Search for Text Generation
Figure 4 for On the Depth between Beam Search and Exhaustive Search for Text Generation
Viaarxiv icon

Blind Signal Separation for Fast Ultrasound Computed Tomography

Add code
Bookmark button
Alert button
Apr 27, 2023
Takumi Noda, Yuu Jinnai, Naoki Tomii, Takashi Azuma

Figure 1 for Blind Signal Separation for Fast Ultrasound Computed Tomography
Figure 2 for Blind Signal Separation for Fast Ultrasound Computed Tomography
Figure 3 for Blind Signal Separation for Fast Ultrasound Computed Tomography
Figure 4 for Blind Signal Separation for Fast Ultrasound Computed Tomography
Viaarxiv icon

Lipschitz Lifelong Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 17, 2020
Erwan Lecarpentier, David Abel, Kavosh Asadi, Yuu Jinnai, Emmanuel Rachelson, Michael L. Littman

Figure 1 for Lipschitz Lifelong Reinforcement Learning
Figure 2 for Lipschitz Lifelong Reinforcement Learning
Figure 3 for Lipschitz Lifelong Reinforcement Learning
Figure 4 for Lipschitz Lifelong Reinforcement Learning
Viaarxiv icon

AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search

Add code
Bookmark button
Alert button
Mar 26, 2019
Linnan Wang, Yiyang Zhao, Yuu Jinnai, Yuandong Tian, Rodrigo Fonseca

Figure 1 for AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
Figure 2 for AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
Figure 3 for AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
Figure 4 for AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
Viaarxiv icon

Discovering Options for Exploration by Minimizing Cover Time

Add code
Bookmark button
Alert button
Mar 16, 2019
Yuu Jinnai, Jee Won Park, David Abel, George Konidaris

Figure 1 for Discovering Options for Exploration by Minimizing Cover Time
Figure 2 for Discovering Options for Exploration by Minimizing Cover Time
Figure 3 for Discovering Options for Exploration by Minimizing Cover Time
Figure 4 for Discovering Options for Exploration by Minimizing Cover Time
Viaarxiv icon