Picture for Kang Xu

Kang Xu

Constrained Ensemble Exploration for Unsupervised Skill Discovery

Add code
May 25, 2024
Viaarxiv icon

Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning

Add code
May 10, 2024
Figure 1 for Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
Figure 2 for Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
Figure 3 for Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
Figure 4 for Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
Viaarxiv icon

Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey

Add code
Dec 22, 2023
Viaarxiv icon

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

Add code
May 29, 2023
Figure 1 for Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Figure 2 for Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Figure 3 for Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Figure 4 for Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Viaarxiv icon

Cross-Domain Policy Adaptation via Value-Guided Data Filtering

Add code
May 28, 2023
Figure 1 for Cross-Domain Policy Adaptation via Value-Guided Data Filtering
Figure 2 for Cross-Domain Policy Adaptation via Value-Guided Data Filtering
Figure 3 for Cross-Domain Policy Adaptation via Value-Guided Data Filtering
Figure 4 for Cross-Domain Policy Adaptation via Value-Guided Data Filtering
Viaarxiv icon

On the Value of Myopic Behavior in Policy Reuse

Add code
May 28, 2023
Figure 1 for On the Value of Myopic Behavior in Policy Reuse
Figure 2 for On the Value of Myopic Behavior in Policy Reuse
Figure 3 for On the Value of Myopic Behavior in Policy Reuse
Figure 4 for On the Value of Myopic Behavior in Policy Reuse
Viaarxiv icon

Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning

Add code
Sep 28, 2022
Figure 1 for Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Figure 2 for Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Figure 3 for Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Figure 4 for Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Viaarxiv icon

Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation

Add code
Sep 24, 2022
Figure 1 for Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Figure 2 for Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Figure 3 for Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Figure 4 for Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Viaarxiv icon

Neural Topic Modeling with Deep Mutual Information Estimation

Add code
Mar 12, 2022
Figure 1 for Neural Topic Modeling with Deep Mutual Information Estimation
Figure 2 for Neural Topic Modeling with Deep Mutual Information Estimation
Figure 3 for Neural Topic Modeling with Deep Mutual Information Estimation
Figure 4 for Neural Topic Modeling with Deep Mutual Information Estimation
Viaarxiv icon

Evolutionary Action Selection for Gradient-based Policy Learning

Add code
Jan 20, 2022
Figure 1 for Evolutionary Action Selection for Gradient-based Policy Learning
Figure 2 for Evolutionary Action Selection for Gradient-based Policy Learning
Figure 3 for Evolutionary Action Selection for Gradient-based Policy Learning
Figure 4 for Evolutionary Action Selection for Gradient-based Policy Learning
Viaarxiv icon