Nandi Schoots

Extending Activation Steering to Broad Skills and Multiple Behaviours

Mar 09, 2024
Teun van der Weij, Massimo Poesio, Nandi Schoots

Via arXiv

Dissecting Language Models: Machine Unlearning via Selective Pruning

Mar 02, 2024
Nicholas Pochinkov, Nandi Schoots

Via arXiv

Improving Activation Steering in Language Models with Mean-Centring

Dec 06, 2023
Ole Jorgensen, Dylan Cope, Nandi Schoots, Murray Shanahan

Via arXiv

Comparing Optimization Targets for Contrast-Consistent Search

Nov 01, 2023
Hugo Fry, Seamus Fallows, Ian Fan, Jamie Wright, Nandi Schoots

Via arXiv

Any Deep ReLU Network is Shallow

Jun 20, 2023
Mattia Jacopo Villani, Nandi Schoots

Via arXiv

Low-Entropy Latent Variables Hurt Out-of-Distribution Performance

May 20, 2023
Nandi Schoots, Dylan Cope

Via arXiv

Learning to Communicate with Strangers via Channel Randomisation Methods

Apr 19, 2021
Dylan Cope, Nandi Schoots

Via arXiv