Picture for Steven Hoi

Steven Hoi

CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition

Add code
Feb 04, 2024
Figure 1 for CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Figure 2 for CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Figure 3 for CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Figure 4 for CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Viaarxiv icon

HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts

Add code
Dec 12, 2023
Figure 1 for HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
Figure 2 for HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
Figure 3 for HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
Figure 4 for HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
Viaarxiv icon

Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation

Add code
Oct 28, 2023
Viaarxiv icon

InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

Add code
May 11, 2023
Figure 1 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Figure 2 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Figure 3 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Figure 4 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Viaarxiv icon

LogAI: A Library for Log Analytics and Intelligence

Add code
Jan 31, 2023
Figure 1 for LogAI: A Library for Log Analytics and Intelligence
Figure 2 for LogAI: A Library for Log Analytics and Intelligence
Figure 3 for LogAI: A Library for Log Analytics and Intelligence
Figure 4 for LogAI: A Library for Log Analytics and Intelligence
Viaarxiv icon

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

Add code
Jan 30, 2023
Figure 1 for BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Figure 2 for BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Figure 3 for BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Figure 4 for BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Viaarxiv icon

Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5

Add code
Dec 22, 2022
Figure 1 for Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5
Figure 2 for Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5
Figure 3 for Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5
Figure 4 for Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5
Viaarxiv icon

BotSIM: An End-to-End Bot Simulation Toolkit for Commercial Task-Oriented Dialog Systems

Add code
Nov 30, 2022
Figure 1 for BotSIM: An End-to-End Bot Simulation Toolkit for Commercial Task-Oriented Dialog Systems
Figure 2 for BotSIM: An End-to-End Bot Simulation Toolkit for Commercial Task-Oriented Dialog Systems
Figure 3 for BotSIM: An End-to-End Bot Simulation Toolkit for Commercial Task-Oriented Dialog Systems
Figure 4 for BotSIM: An End-to-End Bot Simulation Toolkit for Commercial Task-Oriented Dialog Systems
Viaarxiv icon

BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems

Add code
Nov 30, 2022
Figure 1 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Figure 2 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Figure 3 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Figure 4 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Viaarxiv icon

DeepTIMe: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting

Add code
Jul 14, 2022
Figure 1 for DeepTIMe: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting
Figure 2 for DeepTIMe: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting
Figure 3 for DeepTIMe: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting
Figure 4 for DeepTIMe: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting
Viaarxiv icon