Savvas Zannettou

UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images

May 06, 2024
Via arXiv

A Comprehensive View of the Biases of Toxicity and Sentiment Analysis Methods Towards Utterances with African American English Expressions

Jan 23, 2024
Via arXiv

You Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic Content

Aug 10, 2023
Via arXiv

Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models

May 23, 2023
Via arXiv

On the Evolution of (Hateful) Memes by Means of Multimodal Contrastive Learning

Dec 13, 2022
Via arXiv

Why So Toxic? Measuring and Triggering Toxic Behavior in Open-Domain Chatbots

Sep 09, 2022
Via arXiv

Feels Bad Man: Dissecting Automated Hateful Meme Detection Through the Lens of Facebook's Challenge

Feb 17, 2022
Via arXiv