Naoaki Okazaki

Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities

Apr 27, 2024
Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Hirai Shota, Sakae Mizuki, Rio Yokota, Naoaki Okazaki

Building a Large Japanese Web Corpus for Large Language Models

Apr 27, 2024
Naoaki Okazaki, Kakeru Hattori, Hirai Shota, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, Sakae Mizuki

Building a Japanese Document-Level Relation Extraction Dataset Assisted by Cross-Lingual Transfer

Apr 25, 2024
Youmi Ma, An Wang, Naoaki Okazaki

Sampling-based Pseudo-Likelihood for Membership Inference Attacks

Apr 17, 2024
Masahiro Kaneko, Youmi Ma, Yuki Wata, Naoaki Okazaki

An Analysis of BPE Vocabulary Trimming in Neural Machine Translation

Mar 30, 2024
Marco Cognetta, Tatsuya Hiraoka, Naoaki Okazaki, Rico Sennrich, Yuval Pinter

Likelihood-based Mitigation of Evaluation Bias in Large Language Models

Mar 01, 2024
Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki

Two Counterexamples to Tokenization and the Noiseless Channel

Feb 29, 2024
Marco Cognetta, Vilém Zouhar, Sangwhan Moon, Naoaki Okazaki

Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction

Feb 28, 2024
Koki Maeda, Shuhei Kurita, Taiki Miyanishi, Naoaki Okazaki

Knowledge of Pretrained Language Models on Surface Information of Tokens

Feb 22, 2024
Tatsuya Hiraoka, Naoaki Okazaki

Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting

Jan 28, 2024
Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki, Timothy Baldwin
