Yaodong Yang

INSIGHT: End-to-End Neuro-Symbolic Visual Reinforcement Learning with Language Explanations

Mar 19, 2024
Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, Qing Li

AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents

Mar 19, 2024
Jieming Cui, Tengyu Liu, Nian Liu, Yaodong Yang, Yixin Zhu, Siyuan Huang

UniDexFPM: Universal Dexterous Functional Pre-grasp Manipulation Via Diffusion Policy

Mar 19, 2024
Tianhao Wu, Yunchong Gan, Mingdong Wu, Jingbo Cheng, Yaodong Yang, Yixin Zhu, Hao Dong

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects

Mar 01, 2024
Zhaowei Zhang, Fengshuo Bai, Mingzhi Wang, Haoyang Ye, Chengdong Ma, Yaodong Yang

Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective

Feb 20, 2024
Tianyi Qiu, Fanzhi Zeng, Jiaming Ji, Dong Yan, Kaile Wang, Jiayi Zhou, Han Yang, Josef Dai, Xuehai Pan, Yaodong Yang

Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction

Feb 06, 2024
Jiaming Ji, Boyuan Chen, Hantao Lou, Donghai Hong, Borong Zhang, Xuehai Pan, Juntao Dai, Yaodong Yang

Panacea: Pareto Alignment via Preference Adaptation for LLMs

Feb 03, 2024
Yifan Zhong, Chengdong Ma, Xiaoyuan Zhang, Ziran Yang, Qingfu Zhang, Siyuan Qi, Yaodong Yang

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

Jan 19, 2024
Siyuan Qi, Shuo Chen, Yexin Li, Xiangyu Kong, Junqi Wang, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Wei Wang, Yaodong Yang, Song-Chun Zhu

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

Dec 12, 2023
Yinmin Zhang, Jie Liu, Chuming Li, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Nov 30, 2023
Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang
