Alert button
Picture for Yulia Tsvetkov

Yulia Tsvetkov

Alert button

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

Add code
Bookmark button
Alert button
Apr 10, 2024
Yu Ying Chiu, Liwei Jiang, Maria Antoniak, Chan Young Park, Shuyue Stella Li, Mehar Bhatia, Sahithya Ravi, Yulia Tsvetkov, Vered Shwartz, Yejin Choi

Viaarxiv icon

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

Add code
Bookmark button
Alert button
Mar 16, 2024
Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos

Figure 1 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Figure 2 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Figure 3 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Figure 4 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Viaarxiv icon

Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs

Add code
Bookmark button
Alert button
Mar 05, 2024
Aly M. Kassem, Omar Mahmoud, Niloofar Mireshghallah, Hyunwoo Kim, Yulia Tsvetkov, Yejin Choi, Sherif Saad, Santu Rana

Figure 1 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 2 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 3 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 4 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Viaarxiv icon

Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers

Add code
Bookmark button
Alert button
Feb 27, 2024
Roy Xie, Orevaoghene Ahia, Yulia Tsvetkov, Antonios Anastasopoulos

Viaarxiv icon

Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

Add code
Bookmark button
Alert button
Feb 18, 2024
Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, Yulia Tsvetkov, Tianxing He

Viaarxiv icon

DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

Add code
Bookmark button
Alert button
Feb 16, 2024
Herun Wan, Shangbin Feng, Zhaoxuan Tan, Heng Wang, Yulia Tsvetkov, Minnan Luo

Viaarxiv icon

Do Membership Inference Attacks Work on Large Language Models?

Add code
Bookmark button
Alert button
Feb 12, 2024
Michael Duan, Anshuman Suri, Niloofar Mireshghallah, Sewon Min, Weijia Shi, Luke Zettlemoyer, Yulia Tsvetkov, Yejin Choi, David Evans, Hannaneh Hajishirzi

Viaarxiv icon

What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

Add code
Bookmark button
Alert button
Feb 01, 2024
Shangbin Feng, Herun Wan, Ningnan Wang, Zhaoxuan Tan, Minnan Luo, Yulia Tsvetkov

Viaarxiv icon

Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration

Add code
Bookmark button
Alert button
Feb 01, 2024
Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Vidhisha Balachandran, Yulia Tsvetkov

Viaarxiv icon

Fine-grained Hallucination Detection and Editing for Language Models

Add code
Bookmark button
Alert button
Jan 17, 2024
Abhika Mishra, Akari Asai, Vidhisha Balachandran, Yizhong Wang, Graham Neubig, Yulia Tsvetkov, Hannaneh Hajishirzi

Viaarxiv icon