Yijia Zhang

BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation

Feb 16, 2024
Via arXiv

CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

Nov 28, 2023
Via arXiv

AFPQ: Asymmetric Floating Point Quantization for LLMs

Nov 03, 2023
Via arXiv

A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations

Oct 31, 2023
Via arXiv

TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching

Aug 29, 2023
Via arXiv

Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training

May 31, 2023
Via arXiv

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models

May 21, 2023
Via arXiv

TC-GAT: Graph Attention Network for Temporal Causality Discovery

Apr 21, 2023
Via arXiv

Exploring Semi-supervised Variational Autoencoders for Biomedical Relation Extraction

Jan 18, 2019
Via arXiv