Picture for Yiqun Yao

Yiqun Yao

Tele-FLM Technical Report

Add code
Apr 25, 2024
Figure 1 for Tele-FLM Technical Report
Figure 2 for Tele-FLM Technical Report
Figure 3 for Tele-FLM Technical Report
Figure 4 for Tele-FLM Technical Report
Viaarxiv icon

CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text

Add code
Mar 04, 2024
Figure 1 for CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text
Figure 2 for CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text
Figure 3 for CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text
Figure 4 for CatCode: A Comprehensive Evaluation Framework for LLMs On the Mixture of Code and Text
Viaarxiv icon

FLM-101B: An Open LLM and How to Train It with $100K Budget

Add code
Sep 17, 2023
Figure 1 for FLM-101B: An Open LLM and How to Train It with $100K Budget
Figure 2 for FLM-101B: An Open LLM and How to Train It with $100K Budget
Figure 3 for FLM-101B: An Open LLM and How to Train It with $100K Budget
Figure 4 for FLM-101B: An Open LLM and How to Train It with $100K Budget
Viaarxiv icon

2x Faster Language Model Pre-training via Masked Structural Growth

May 04, 2023
Figure 1 for 2x Faster Language Model Pre-training via Masked Structural Growth
Figure 2 for 2x Faster Language Model Pre-training via Masked Structural Growth
Figure 3 for 2x Faster Language Model Pre-training via Masked Structural Growth
Figure 4 for 2x Faster Language Model Pre-training via Masked Structural Growth
Viaarxiv icon

Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales

Add code
Apr 29, 2023
Figure 1 for Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales
Figure 2 for Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales
Figure 3 for Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales
Figure 4 for Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales
Viaarxiv icon

MUSER: MUltimodal Stress Detection using Emotion Recognition as an Auxiliary Task

May 17, 2021
Figure 1 for MUSER: MUltimodal Stress Detection using Emotion Recognition as an Auxiliary Task
Figure 2 for MUSER: MUltimodal Stress Detection using Emotion Recognition as an Auxiliary Task
Figure 3 for MUSER: MUltimodal Stress Detection using Emotion Recognition as an Auxiliary Task
Figure 4 for MUSER: MUltimodal Stress Detection using Emotion Recognition as an Auxiliary Task
Viaarxiv icon

Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks

Nov 15, 2018
Figure 1 for Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks
Figure 2 for Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks
Figure 3 for Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks
Figure 4 for Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks
Viaarxiv icon

Cascaded Mutual Modulation for Visual Reasoning

Add code
Sep 06, 2018
Figure 1 for Cascaded Mutual Modulation for Visual Reasoning
Figure 2 for Cascaded Mutual Modulation for Visual Reasoning
Figure 3 for Cascaded Mutual Modulation for Visual Reasoning
Figure 4 for Cascaded Mutual Modulation for Visual Reasoning
Viaarxiv icon

Hierarchical Memory Networks for Answer Selection on Unknown Words

Add code
Sep 28, 2016
Figure 1 for Hierarchical Memory Networks for Answer Selection on Unknown Words
Figure 2 for Hierarchical Memory Networks for Answer Selection on Unknown Words
Figure 3 for Hierarchical Memory Networks for Answer Selection on Unknown Words
Figure 4 for Hierarchical Memory Networks for Answer Selection on Unknown Words
Viaarxiv icon