Alert button
Picture for Zhihua Wu

Zhihua Wu

Alert button

ChuXin: 1.6B Technical Report

Add code
Bookmark button
Alert button
May 08, 2024
Xiaomin Zhuang, Yufan Jiang, Qiaozhi He, Zhihua Wu

Viaarxiv icon

Efficient LLM Inference with Kcache

Add code
Bookmark button
Alert button
Apr 28, 2024
Qiaozhi He, Zhihua Wu

Viaarxiv icon

Code Comparison Tuning for Code Large Language Models

Add code
Bookmark button
Alert button
Mar 28, 2024
Yufan Jiang, Qiaozhi He, Xiaomin Zhuang, Zhihua Wu

Figure 1 for Code Comparison Tuning for Code Large Language Models
Figure 2 for Code Comparison Tuning for Code Large Language Models
Figure 3 for Code Comparison Tuning for Code Large Language Models
Figure 4 for Code Comparison Tuning for Code Large Language Models
Viaarxiv icon

RecycleGPT: An Autoregressive Language Model with Recyclable Module

Add code
Bookmark button
Alert button
Aug 08, 2023
Yufan Jiang, Qiaozhi He, Xiaomin Zhuang, Zhihua Wu, Kunpeng Wang, Wenlai Zhao, Guangwen Yang

Figure 1 for RecycleGPT: An Autoregressive Language Model with Recyclable Module
Figure 2 for RecycleGPT: An Autoregressive Language Model with Recyclable Module
Figure 3 for RecycleGPT: An Autoregressive Language Model with Recyclable Module
Figure 4 for RecycleGPT: An Autoregressive Language Model with Recyclable Module
Viaarxiv icon

TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training

Add code
Bookmark button
Alert button
Feb 20, 2023
Chang Chen, Min Li, Zhihua Wu, Dianhai Yu, Chao Yang

Figure 1 for TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training
Figure 2 for TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training
Figure 3 for TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training
Figure 4 for TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training
Viaarxiv icon

HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle

Add code
Bookmark button
Alert button
Jul 13, 2022
Guoxia Wang, Xiaomin Fang, Zhihua Wu, Yiqun Liu, Yang Xue, Yingfei Xiang, Dianhai Yu, Fan Wang, Yanjun Ma

Figure 1 for HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle
Figure 2 for HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle
Figure 3 for HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle
Figure 4 for HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle
Viaarxiv icon

SE-MoE: A Scalable and Efficient Mixture-of-Experts Distributed Training and Inference System

Add code
Bookmark button
Alert button
May 20, 2022
Liang Shen, Zhihua Wu, WeiBao Gong, Hongxiang Hao, Yangfan Bai, HuaChao Wu, Xinxuan Wu, Haoyi Xiong, Dianhai Yu, Yanjun Ma

Viaarxiv icon

Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters

Add code
Bookmark button
Alert button
May 19, 2022
Yang Xiang, Zhihua Wu, Weibao Gong, Siyu Ding, Xianjie Mo, Yuang Liu, Shuohuan Wang, Peng Liu, Yongshuai Hou, Long Li, Bin Wang, Shaohuai Shi, Yaqian Han, Yue Yu, Ge Li, Yu Sun, Yanjun Ma, Dianhai Yu

Figure 1 for Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Figure 2 for Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Figure 3 for Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Figure 4 for Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Viaarxiv icon

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

Add code
Bookmark button
Alert button
Dec 31, 2021
Han Zhang, Weichong Yin, Yewei Fang, Lanxin Li, Boqiang Duan, Zhihua Wu, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Figure 1 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 2 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 3 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 4 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Viaarxiv icon

ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation

Add code
Bookmark button
Alert button
Dec 23, 2021
Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang

Figure 1 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Figure 2 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Figure 3 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Figure 4 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Viaarxiv icon