Alert button
Picture for Yuval Pinter

Yuval Pinter

Alert button

Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge

Add code
Bookmark button
Alert button
Apr 20, 2024
Khuyagbaatar Batsuren, Ekaterina Vylomova, Verna Dankers, Tsetsuukhei Delgerbaatar, Omri Uzan, Yuval Pinter, Gábor Bella

Viaarxiv icon

An Analysis of BPE Vocabulary Trimming in Neural Machine Translation

Add code
Bookmark button
Alert button
Mar 30, 2024
Marco Cognetta, Tatsuya Hiraoka, Naoaki Okazaki, Rico Sennrich, Yuval Pinter

Viaarxiv icon

BiVert: Bidirectional Vocabulary Evaluation using Relations for Machine Translation

Add code
Bookmark button
Alert button
Mar 06, 2024
Carinne Cherf, Yuval Pinter

Figure 1 for BiVert: Bidirectional Vocabulary Evaluation using Relations for Machine Translation
Figure 2 for BiVert: Bidirectional Vocabulary Evaluation using Relations for Machine Translation
Figure 3 for BiVert: Bidirectional Vocabulary Evaluation using Relations for Machine Translation
Figure 4 for BiVert: Bidirectional Vocabulary Evaluation using Relations for Machine Translation
Viaarxiv icon

Greed is All You Need: An Evaluation of Tokenizer Inference Methods

Add code
Bookmark button
Alert button
Mar 02, 2024
Omri Uzan, Craig W. Schmidt, Chris Tanner, Yuval Pinter

Figure 1 for Greed is All You Need: An Evaluation of Tokenizer Inference Methods
Figure 2 for Greed is All You Need: An Evaluation of Tokenizer Inference Methods
Figure 3 for Greed is All You Need: An Evaluation of Tokenizer Inference Methods
Figure 4 for Greed is All You Need: An Evaluation of Tokenizer Inference Methods
Viaarxiv icon

Tokenization Is More Than Compression

Add code
Bookmark button
Alert button
Feb 28, 2024
Craig W. Schmidt, Varshini Reddy, Haoran Zhang, Alec Alameddine, Omri Uzan, Yuval Pinter, Chris Tanner

Viaarxiv icon

MPIrigen: MPI Code Generation through Domain-Specific Language Models

Add code
Bookmark button
Alert button
Feb 14, 2024
Nadav Schneider, Niranjan Hasabnis, Vy A. Vo, Tal Kadosh, Neva Krien, Mihai Capotă, Abdul Wasay, Guy Tamir, Ted Willke, Nesreen Ahmed, Yuval Pinter, Timothy Mattson, Gal Oren

Viaarxiv icon

Domain-Specific Code Language Models: Unraveling the Potential for HPC Codes and Tasks

Add code
Bookmark button
Alert button
Dec 20, 2023
Tal Kadosh, Niranjan Hasabnis, Vy A. Vo, Nadav Schneider, Neva Krien, Mihai Capota, Abdul Wasay, Nesreen Ahmed, Ted Willke, Guy Tamir, Yuval Pinter, Timothy Mattson, Gal Oren

Viaarxiv icon

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark

Add code
Bookmark button
Alert button
Nov 15, 2023
Stephen Mayhew, Terra Blevins, Shuheng Liu, Marek Šuppa, Hila Gonen, Joseph Marvin Imperial, Börje F. Karlsson, Peiqin Lin, Nikola Ljubešić, LJ Miranda, Barbara Plank, Arij Riabi, Yuval Pinter

Viaarxiv icon

Analyzing Cognitive Plausibility of Subword Tokenization

Add code
Bookmark button
Alert button
Oct 20, 2023
Lisa Beinborn, Yuval Pinter

Viaarxiv icon

Emptying the Ocean with a Spoon: Should We Edit Models?

Add code
Bookmark button
Alert button
Oct 18, 2023
Yuval Pinter, Michael Elhadad

Viaarxiv icon