Clement Neo

Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions

Feb 23, 2024
Via arXiv

Increasing Trust in Language Models through the Reuse of Verified Circuits

Feb 06, 2024
Via arXiv