Zhibo Gong

Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing

Mar 05, 2020
Hangyu Mao, Zhibo Gong, Zhen Xiao

Via arXiv

Learning Agent Communication under Limited Bandwidth by Message Pruning

Dec 03, 2019
Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong, Yan Ni

Via arXiv

Learning Multi-agent Communication under Limited-bandwidth Restriction for Internet Packet Routing

Feb 26, 2019
Hangyu Mao, Zhibo Gong, Zhengchao Zhang, Zhen Xiao, Yan Ni

Via arXiv

Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG

Nov 13, 2018
Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong

Via arXiv

ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with Deep Multi-agent Reinforcement Learning

Oct 29, 2017
Hangyu Mao, Zhibo Gong, Yan Ni, Zhen Xiao

Via arXiv