Alert button
Picture for Cyril Zhang

Cyril Zhang

Alert button

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Add code
Bookmark button
Alert button
Apr 23, 2024
Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Parul Chopra, Allie Del Giorno, Gustavo de Rosa, Matthew Dixon, Ronen Eldan, Dan Iter, Amit Garg, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Jamie Huynh, Mojan Javaheripi, Xin Jin, Piero Kauffmann, Nikos Karampatziakis, Dongwoo Kim, Mahoud Khademi, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Chen Liang, Weishung Liu, Eric Lin, Zeqi Lin, Piyush Madan, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Xia Song, Masahiro Tanaka, Xin Wang, Rachel Ward, Guanhua Wang, Philipp Witte, Michael Wyatt, Can Xu, Jiahang Xu, Sonali Yadav, Fan Yang, Ziyi Yang, Donghan Yu, Chengruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou

Viaarxiv icon

Can large language models explore in-context?

Add code
Bookmark button
Alert button
Mar 22, 2024
Akshay Krishnamurthy, Keegan Harris, Dylan J. Foster, Cyril Zhang, Aleksandrs Slivkins

Viaarxiv icon

Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression

Add code
Bookmark button
Alert button
Oct 17, 2023
Adam Block, Dylan J. Foster, Akshay Krishnamurthy, Max Simchowitz, Cyril Zhang

Viaarxiv icon

Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

Add code
Bookmark button
Alert button
Sep 07, 2023
Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang

Figure 1 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Figure 2 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Figure 3 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Figure 4 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Viaarxiv icon

Exposing Attention Glitches with Flip-Flop Language Modeling

Add code
Bookmark button
Alert button
Jun 01, 2023
Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang

Figure 1 for Exposing Attention Glitches with Flip-Flop Language Modeling
Figure 2 for Exposing Attention Glitches with Flip-Flop Language Modeling
Figure 3 for Exposing Attention Glitches with Flip-Flop Language Modeling
Figure 4 for Exposing Attention Glitches with Flip-Flop Language Modeling
Viaarxiv icon

Learning Hidden Markov Models Using Conditional Samples

Add code
Bookmark button
Alert button
Feb 28, 2023
Sham M. Kakade, Akshay Krishnamurthy, Gaurav Mahajan, Cyril Zhang

Figure 1 for Learning Hidden Markov Models Using Conditional Samples
Figure 2 for Learning Hidden Markov Models Using Conditional Samples
Viaarxiv icon

Neural Active Learning on Heteroskedastic Distributions

Add code
Bookmark button
Alert button
Nov 02, 2022
Savya Khosla, Chew Kin Whye, Jordan T. Ash, Cyril Zhang, Kenji Kawaguchi, Alex Lamb

Figure 1 for Neural Active Learning on Heteroskedastic Distributions
Figure 2 for Neural Active Learning on Heteroskedastic Distributions
Figure 3 for Neural Active Learning on Heteroskedastic Distributions
Figure 4 for Neural Active Learning on Heteroskedastic Distributions
Viaarxiv icon

Transformers Learn Shortcuts to Automata

Add code
Bookmark button
Alert button
Oct 19, 2022
Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang

Viaarxiv icon

Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms

Add code
Bookmark button
Alert button
Sep 01, 2022
Surbhi Goel, Sham Kakade, Adam Tauman Kalai, Cyril Zhang

Figure 1 for Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Figure 2 for Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Figure 3 for Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Viaarxiv icon

Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit

Add code
Bookmark button
Alert button
Jul 18, 2022
Boaz Barak, Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang

Figure 1 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Figure 2 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Figure 3 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Figure 4 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Viaarxiv icon