Alert button
Picture for Gu-Yeon Wei

Gu-Yeon Wei

Alert button

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

Add code
Bookmark button
Alert button
Dec 22, 2023
Alicia Golden, Samuel Hsia, Fei Sun, Bilge Acun, Basil Hosmer, Yejin Lee, Zachary DeVito, Jeff Johnson, Gu-Yeon Wei, David Brooks, Carole-Jean Wu

Viaarxiv icon

Hardware Resilience Properties of Text-Guided Image Classifiers

Add code
Bookmark button
Alert button
Dec 05, 2023
Syed Talal Wasim, Kabila Haile Soboka, Abdulrahman Mahmoud, Salman Khan, David Brooks, Gu-Yeon Wei

Figure 1 for Hardware Resilience Properties of Text-Guided Image Classifiers
Figure 2 for Hardware Resilience Properties of Text-Guided Image Classifiers
Figure 3 for Hardware Resilience Properties of Text-Guided Image Classifiers
Figure 4 for Hardware Resilience Properties of Text-Guided Image Classifiers
Viaarxiv icon

MAD Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems

Add code
Bookmark button
Alert button
Oct 18, 2023
Samuel Hsia, Alicia Golden, Bilge Acun, Newsha Ardalani, Zachary DeVito, Gu-Yeon Wei, David Brooks, Carole-Jean Wu

Viaarxiv icon

Guess & Sketch: Language Model Guided Transpilation

Add code
Bookmark button
Alert button
Sep 25, 2023
Celine Lee, Abdulrahman Mahmoud, Michal Kurek, Simone Campanoni, David Brooks, Stephen Chong, Gu-Yeon Wei, Alexander M. Rush

Figure 1 for Guess & Sketch: Language Model Guided Transpilation
Figure 2 for Guess & Sketch: Language Model Guided Transpilation
Figure 3 for Guess & Sketch: Language Model Guided Transpilation
Figure 4 for Guess & Sketch: Language Model Guided Transpilation
Viaarxiv icon

INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation

Add code
Bookmark button
Alert button
Jun 13, 2023
Yuji Chai, John Gkountouras, Glenn G. Ko, David Brooks, Gu-Yeon Wei

Figure 1 for INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation
Figure 2 for INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation
Figure 3 for INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation
Figure 4 for INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation
Viaarxiv icon

S$^{3}$: Increasing GPU Utilization during Generative Inference for Higher Throughput

Add code
Bookmark button
Alert button
Jun 09, 2023
Yunho Jin, Chun-Feng Wu, David Brooks, Gu-Yeon Wei

Figure 1 for S$^{3}$: Increasing GPU Utilization during Generative Inference for Higher Throughput
Figure 2 for S$^{3}$: Increasing GPU Utilization during Generative Inference for Higher Throughput
Figure 3 for S$^{3}$: Increasing GPU Utilization during Generative Inference for Higher Throughput
Figure 4 for S$^{3}$: Increasing GPU Utilization during Generative Inference for Higher Throughput
Viaarxiv icon

CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning

Add code
Bookmark button
Alert button
May 04, 2023
Sai Qian Zhang, Thierry Tambe, Nestor Cuevas, Gu-Yeon Wei, David Brooks

Figure 1 for CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
Figure 2 for CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
Figure 3 for CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
Figure 4 for CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
Viaarxiv icon

MP-Rec: Hardware-Software Co-Design to Enable Multi-Path Recommendation

Add code
Bookmark button
Alert button
Feb 21, 2023
Samuel Hsia, Udit Gupta, Bilge Acun, Newsha Ardalani, Pan Zhong, Gu-Yeon Wei, David Brooks, Carole-Jean Wu

Figure 1 for MP-Rec: Hardware-Software Co-Design to Enable Multi-Path Recommendation
Figure 2 for MP-Rec: Hardware-Software Co-Design to Enable Multi-Path Recommendation
Figure 3 for MP-Rec: Hardware-Software Co-Design to Enable Multi-Path Recommendation
Figure 4 for MP-Rec: Hardware-Software Co-Design to Enable Multi-Path Recommendation
Viaarxiv icon

GPU-based Private Information Retrieval for On-Device Machine Learning Inference

Add code
Bookmark button
Alert button
Jan 27, 2023
Maximilian Lam, Jeff Johnson, Wenjie Xiong, Kiwan Maeng, Udit Gupta, Minsoo Rhu, Hsien-Hsin S. Lee, Vijay Janapa Reddi, Gu-Yeon Wei, David Brooks, Edward Suh

Figure 1 for GPU-based Private Information Retrieval for On-Device Machine Learning Inference
Figure 2 for GPU-based Private Information Retrieval for On-Device Machine Learning Inference
Figure 3 for GPU-based Private Information Retrieval for On-Device Machine Learning Inference
Figure 4 for GPU-based Private Information Retrieval for On-Device Machine Learning Inference
Viaarxiv icon

PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices

Add code
Bookmark button
Alert button
Jan 26, 2023
Yuji Chai, Devashree Tripathy, Chuteng Zhou, Dibakar Gope, Igor Fedorov, Ramon Matas, David Brooks, Gu-Yeon Wei, Paul Whatmough

Figure 1 for PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices
Figure 2 for PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices
Figure 3 for PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices
Figure 4 for PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices
Viaarxiv icon