Picture for Giang Do

Giang Do

CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition

Add code
Feb 04, 2024
Figure 1 for CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Figure 2 for CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Figure 3 for CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Figure 4 for CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Viaarxiv icon

HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts

Add code
Dec 12, 2023
Figure 1 for HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
Figure 2 for HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
Figure 3 for HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
Figure 4 for HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
Viaarxiv icon