
Shital Shah


Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Apr 23, 2024
Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Parul Chopra, Allie Del Giorno, Gustavo de Rosa, Matthew Dixon, Ronen Eldan, Dan Iter, Amit Garg, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Jamie Huynh, Mojan Javaheripi, Xin Jin, Piero Kauffmann, Nikos Karampatziakis, Dongwoo Kim, Mahoud Khademi, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Chen Liang, Weishung Liu, Eric Lin, Zeqi Lin, Piyush Madan, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Xia Song, Masahiro Tanaka, Xin Wang, Rachel Ward, Guanhua Wang, Philipp Witte, Michael Wyatt, Can Xu, Jiahang Xu, Sonali Yadav, Fan Yang, Ziyi Yang, Donghan Yu, Chengruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou

Via arXiv

Textbooks Are All You Need

Jun 20, 2023
Suriya Gunasekar, Yi Zhang, Jyoti Aneja, Caio César Teodoro Mendes, Allie Del Giorno, Sivakanth Gopi, Mojan Javaheripi, Piero Kauffmann, Gustavo de Rosa, Olli Saarikivi, Adil Salim, Shital Shah, Harkirat Singh Behl, Xin Wang, Sébastien Bubeck, Ronen Eldan, Adam Tauman Kalai, Yin Tat Lee, Yuanzhi Li

Via arXiv

Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints

Oct 06, 2022
Ganesh Jawahar, Subhabrata Mukherjee, Debadeepta Dey, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Caio Cesar Teodoro Mendes, Gustavo Henrique de Rosa, Shital Shah

Via arXiv

One Network Doesn't Rule Them All: Moving Beyond Handcrafted Architectures in Self-Supervised Learning

Mar 15, 2022
Sharath Girish, Debadeepta Dey, Neel Joshi, Vibhav Vineet, Shital Shah, Caio Cesar Teodoro Mendes, Abhinav Shrivastava, Yale Song

Via arXiv

LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models

Mar 04, 2022
Mojan Javaheripi, Shital Shah, Subhabrata Mukherjee, Tomasz L. Religa, Caio C. T. Mendes, Gustavo H. de Rosa, Sebastien Bubeck, Farinaz Koushanfar, Debadeepta Dey

Via arXiv

FEAR: A Simple Lightweight Method to Rank Architectures

Jun 07, 2021
Debadeepta Dey, Shital Shah, Sebastien Bubeck

Via arXiv

Understanding Failures of Deep Networks via Robust Feature Extraction

Dec 03, 2020
Sahil Singla, Besmira Nushi, Shital Shah, Ece Kamar, Eric Horvitz

Via arXiv

An Empirical Analysis of Backward Compatibility in Machine Learning Systems

Aug 11, 2020
Megha Srivastava, Besmira Nushi, Ece Kamar, Shital Shah, Eric Horvitz

Via arXiv

Safe Reinforcement Learning via Curriculum Induction

Jun 22, 2020
Matteo Turchetta, Andrey Kolobov, Shital Shah, Andreas Krause, Alekh Agarwal

Via arXiv

A System for Real-Time Interactive Analysis of Deep Learning Training

Jan 07, 2020
Shital Shah, Roland Fernandez, Steven Drucker

Via arXiv