Picture for Boyi Li

Boyi Li

DiffuBox: Refining 3D Object Detection with Point Diffusion

Add code
May 25, 2024
Viaarxiv icon

Language-Image Models with 3D Understanding

Add code
May 06, 2024
Viaarxiv icon

Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition

Add code
Mar 21, 2024
Figure 1 for Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Figure 2 for Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Figure 3 for Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Figure 4 for Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Viaarxiv icon

Driving Everywhere with Large Language Model Policy Adaptation

Add code
Feb 08, 2024
Viaarxiv icon

Synthesizing Moving People with 3D Control

Add code
Jan 19, 2024
Viaarxiv icon

Self-correcting LLM-controlled Diffusion Models

Add code
Nov 27, 2023
Viaarxiv icon

From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation

Add code
Nov 21, 2023
Viaarxiv icon

EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

Add code
Nov 03, 2023
Viaarxiv icon

Interactive Task Planning with Language Models

Add code
Oct 16, 2023
Figure 1 for Interactive Task Planning with Language Models
Figure 2 for Interactive Task Planning with Language Models
Figure 3 for Interactive Task Planning with Language Models
Figure 4 for Interactive Task Planning with Language Models
Viaarxiv icon

LLM-grounded Video Diffusion Models

Add code
Oct 02, 2023
Figure 1 for LLM-grounded Video Diffusion Models
Figure 2 for LLM-grounded Video Diffusion Models
Figure 3 for LLM-grounded Video Diffusion Models
Figure 4 for LLM-grounded Video Diffusion Models
Viaarxiv icon