Alert button
Picture for Benjamin L. Edelman

Benjamin L. Edelman

Alert button

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Add code
Bookmark button
Alert button
Apr 15, 2024
Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob Foerster, Florian Tramer, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger

Viaarxiv icon

The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains

Add code
Bookmark button
Alert button
Feb 16, 2024
Benjamin L. Edelman, Ezra Edelman, Surbhi Goel, Eran Malach, Nikolaos Tsilivis

Viaarxiv icon

Distinguishing the Knowable from the Unknowable with Language Models

Add code
Bookmark button
Alert button
Feb 05, 2024
Gustaf Ahdritz, Tian Qin, Nikhil Vyas, Boaz Barak, Benjamin L. Edelman

Viaarxiv icon

Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models

Add code
Bookmark button
Alert button
Nov 15, 2023
Hanlin Zhang, Benjamin L. Edelman, Danilo Francati, Daniele Venturi, Giuseppe Ateniese, Boaz Barak

Viaarxiv icon

Feature emergence via margin maximization: case studies in algebraic tasks

Add code
Bookmark button
Alert button
Nov 13, 2023
Depen Morwani, Benjamin L. Edelman, Costin-Andrei Oncescu, Rosie Zhao, Sham Kakade

Figure 1 for Feature emergence via margin maximization: case studies in algebraic tasks
Figure 2 for Feature emergence via margin maximization: case studies in algebraic tasks
Figure 3 for Feature emergence via margin maximization: case studies in algebraic tasks
Figure 4 for Feature emergence via margin maximization: case studies in algebraic tasks
Viaarxiv icon

Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

Add code
Bookmark button
Alert button
Sep 07, 2023
Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang

Figure 1 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Figure 2 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Figure 3 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Figure 4 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Viaarxiv icon

Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit

Add code
Bookmark button
Alert button
Jul 18, 2022
Boaz Barak, Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang

Figure 1 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Figure 2 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Figure 3 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Figure 4 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Viaarxiv icon

Inductive Biases and Variable Creation in Self-Attention Mechanisms

Add code
Bookmark button
Alert button
Oct 19, 2021
Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Cyril Zhang

Figure 1 for Inductive Biases and Variable Creation in Self-Attention Mechanisms
Figure 2 for Inductive Biases and Variable Creation in Self-Attention Mechanisms
Figure 3 for Inductive Biases and Variable Creation in Self-Attention Mechanisms
Figure 4 for Inductive Biases and Variable Creation in Self-Attention Mechanisms
Viaarxiv icon

SGD on Neural Networks Learns Functions of Increasing Complexity

Add code
Bookmark button
Alert button
May 28, 2019
Preetum Nakkiran, Gal Kaplun, Dimitris Kalimeris, Tristan Yang, Benjamin L. Edelman, Fred Zhang, Boaz Barak

Figure 1 for SGD on Neural Networks Learns Functions of Increasing Complexity
Figure 2 for SGD on Neural Networks Learns Functions of Increasing Complexity
Figure 3 for SGD on Neural Networks Learns Functions of Increasing Complexity
Figure 4 for SGD on Neural Networks Learns Functions of Increasing Complexity
Viaarxiv icon