Picture for Yonatan Belinkov

Yonatan Belinkov

DEPTH: Discourse Education through Pre-Training Hierarchically

Add code
May 13, 2024
Viaarxiv icon

Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs

Add code
Apr 15, 2024
Viaarxiv icon

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Add code
Mar 31, 2024
Figure 1 for Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Figure 2 for Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Figure 3 for Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Figure 4 for Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Viaarxiv icon

Jamba: A Hybrid Transformer-Mamba Language Model

Add code
Mar 28, 2024
Figure 1 for Jamba: A Hybrid Transformer-Mamba Language Model
Figure 2 for Jamba: A Hybrid Transformer-Mamba Language Model
Figure 3 for Jamba: A Hybrid Transformer-Mamba Language Model
Figure 4 for Jamba: A Hybrid Transformer-Mamba Language Model
Viaarxiv icon

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms

Add code
Mar 26, 2024
Figure 1 for Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms
Figure 2 for Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms
Figure 3 for Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms
Figure 4 for Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms
Viaarxiv icon

Concept-Best-Matching: Evaluating Compositionality in Emergent Communication

Add code
Mar 17, 2024
Viaarxiv icon

Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information

Add code
Mar 14, 2024
Figure 1 for Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
Figure 2 for Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
Figure 3 for Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
Figure 4 for Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
Viaarxiv icon

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines

Add code
Mar 09, 2024
Figure 1 for Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Figure 2 for Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Figure 3 for Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Figure 4 for Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Viaarxiv icon

A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry

Add code
Feb 27, 2024
Viaarxiv icon

Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking

Add code
Feb 22, 2024
Viaarxiv icon