Picture for Yongbin Li

Yongbin Li

A Survey on Self-Evolution of Large Language Models

Add code
Apr 22, 2024
Figure 1 for A Survey on Self-Evolution of Large Language Models
Figure 2 for A Survey on Self-Evolution of Large Language Models
Figure 3 for A Survey on Self-Evolution of Large Language Models
Figure 4 for A Survey on Self-Evolution of Large Language Models
Viaarxiv icon

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment

Add code
Mar 30, 2024
Figure 1 for Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment
Figure 2 for Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment
Figure 3 for Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment
Figure 4 for Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment
Viaarxiv icon

Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning

Add code
Mar 29, 2024
Figure 1 for Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Figure 2 for Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Figure 3 for Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Figure 4 for Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Viaarxiv icon

Semantically-Shifted Incremental Adapter-Tuning is A Continual ViTransformer

Add code
Mar 29, 2024
Figure 1 for Semantically-Shifted Incremental Adapter-Tuning is A Continual ViTransformer
Figure 2 for Semantically-Shifted Incremental Adapter-Tuning is A Continual ViTransformer
Figure 3 for Semantically-Shifted Incremental Adapter-Tuning is A Continual ViTransformer
Figure 4 for Semantically-Shifted Incremental Adapter-Tuning is A Continual ViTransformer
Viaarxiv icon

Fine-Tuning Language Models with Reward Learning on Policy

Add code
Mar 28, 2024
Figure 1 for Fine-Tuning Language Models with Reward Learning on Policy
Figure 2 for Fine-Tuning Language Models with Reward Learning on Policy
Figure 3 for Fine-Tuning Language Models with Reward Learning on Policy
Figure 4 for Fine-Tuning Language Models with Reward Learning on Policy
Viaarxiv icon

Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models

Add code
Mar 04, 2024
Figure 1 for Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
Figure 2 for Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
Figure 3 for Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
Figure 4 for Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
Viaarxiv icon

SoFA: Shielded On-the-fly Alignment via Priority Rule Following

Add code
Feb 27, 2024
Figure 1 for SoFA: Shielded On-the-fly Alignment via Priority Rule Following
Figure 2 for SoFA: Shielded On-the-fly Alignment via Priority Rule Following
Figure 3 for SoFA: Shielded On-the-fly Alignment via Priority Rule Following
Figure 4 for SoFA: Shielded On-the-fly Alignment via Priority Rule Following
Viaarxiv icon

Self-Retrieval: Building an Information Retrieval System with One Large Language Model

Add code
Feb 23, 2024
Figure 1 for Self-Retrieval: Building an Information Retrieval System with One Large Language Model
Figure 2 for Self-Retrieval: Building an Information Retrieval System with One Large Language Model
Figure 3 for Self-Retrieval: Building an Information Retrieval System with One Large Language Model
Figure 4 for Self-Retrieval: Building an Information Retrieval System with One Large Language Model
Viaarxiv icon

One Shot Learning as Instruction Data Prospector for Large Language Models

Add code
Jan 04, 2024
Viaarxiv icon

DialCLIP: Empowering CLIP as Multi-Modal Dialog Retriever

Add code
Jan 03, 2024
Viaarxiv icon