Alert button
Picture for Rio Yokota

Rio Yokota

Alert button

Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities

Add code
Bookmark button
Alert button
Apr 27, 2024
Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Hirai Shota, Sakae Mizuki, Rio Yokota, Naoaki Okazaki

Viaarxiv icon

Building a Large Japanese Web Corpus for Large Language Models

Add code
Bookmark button
Alert button
Apr 27, 2024
Naoaki Okazaki, Kakeru Hattori, Hirai Shota, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, Sakae Mizuki

Viaarxiv icon

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Add code
Bookmark button
Alert button
Mar 30, 2024
Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, Jason T Stillerman, Felix Friedrich, Prateek Yadav, Tanmay Laud, Vu Minh Chien, Terry Yue Zhuo, Diganta Misra, Ben Bogin, Xuan-Son Vu, Marzena Karpinska, Arnav Varma Dantuluri, Wojciech Kusa, Tommaso Furlanello, Rio Yokota, Niklas Muennighoff, Suhas Pai, Tosin Adewumi, Veronika Laippala, Xiaozhe Yao, Adalberto Junior, Alpay Ariyak, Aleksandr Drozd, Jordan Clive, Kshitij Gupta, Liangyu Chen, Qi Sun, Ken Tsui, Noah Persaud, Nour Fahmy, Tianlong Chen, Mohit Bansal, Nicolo Monti, Tai Dang, Ziyang Luo, Tien-Tung Bui, Roberto Navigli, Virendra Mehta, Matthew Blumberg, Victor May, Huu Nguyen, Sampo Pyysalo

Viaarxiv icon

Variational Learning is Effective for Large Deep Networks

Add code
Bookmark button
Alert button
Feb 27, 2024
Yuesong Shen, Nico Daheim, Bai Cong, Peter Nickl, Gian Maria Marconi, Clement Bazan, Rio Yokota, Iryna Gurevych, Daniel Cremers, Mohammad Emtiyaz Khan, Thomas Möllenhoff

Viaarxiv icon

SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning

Add code
Bookmark button
Alert button
Sep 29, 2023
Risa Shinoda, Ryo Hayamizu, Kodai Nakashima, Nakamasa Inoue, Rio Yokota, Hirokatsu Kataoka

Viaarxiv icon

Pre-training Vision Transformers with Very Limited Synthesized Images

Add code
Bookmark button
Alert button
Jul 31, 2023
Ryo Nakamura, Hirokatsu Kataoka, Sora Takashima, Edgar Josafat Martinez Noriega, Rio Yokota, Nakamasa Inoue

Figure 1 for Pre-training Vision Transformers with Very Limited Synthesized Images
Figure 2 for Pre-training Vision Transformers with Very Limited Synthesized Images
Figure 3 for Pre-training Vision Transformers with Very Limited Synthesized Images
Figure 4 for Pre-training Vision Transformers with Very Limited Synthesized Images
Viaarxiv icon

ASDL: A Unified Interface for Gradient Preconditioning in PyTorch

Add code
Bookmark button
Alert button
May 08, 2023
Kazuki Osawa, Satoki Ishikawa, Rio Yokota, Shigang Li, Torsten Hoefler

Figure 1 for ASDL: A Unified Interface for Gradient Preconditioning in PyTorch
Figure 2 for ASDL: A Unified Interface for Gradient Preconditioning in PyTorch
Figure 3 for ASDL: A Unified Interface for Gradient Preconditioning in PyTorch
Figure 4 for ASDL: A Unified Interface for Gradient Preconditioning in PyTorch
Viaarxiv icon

Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves

Add code
Bookmark button
Alert button
Mar 02, 2023
Sora Takashima, Ryo Hayamizu, Nakamasa Inoue, Hirokatsu Kataoka, Rio Yokota

Figure 1 for Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves
Figure 2 for Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves
Figure 3 for Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves
Figure 4 for Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves
Viaarxiv icon

Informative Sample-Aware Proxy for Deep Metric Learning

Add code
Bookmark button
Alert button
Nov 18, 2022
Aoyu Li, Ikuro Sato, Kohta Ishikawa, Rei Kawakami, Rio Yokota

Figure 1 for Informative Sample-Aware Proxy for Deep Metric Learning
Figure 2 for Informative Sample-Aware Proxy for Deep Metric Learning
Figure 3 for Informative Sample-Aware Proxy for Deep Metric Learning
Figure 4 for Informative Sample-Aware Proxy for Deep Metric Learning
Viaarxiv icon

Empirical Study on Optimizer Selection for Out-of-Distribution Generalization

Add code
Bookmark button
Alert button
Nov 18, 2022
Hiroki Naganuma, Kartik Ahuja, Shiro Takagi, Tetsuya Motokawa, Rio Yokota, Kohta Ishikawa, Ikuro Sato, Ioannis Mitliagkas

Figure 1 for Empirical Study on Optimizer Selection for Out-of-Distribution Generalization
Figure 2 for Empirical Study on Optimizer Selection for Out-of-Distribution Generalization
Figure 3 for Empirical Study on Optimizer Selection for Out-of-Distribution Generalization
Figure 4 for Empirical Study on Optimizer Selection for Out-of-Distribution Generalization
Viaarxiv icon