Alert button
Picture for Wenwen Tong

Wenwen Tong

Alert button

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

Add code
Bookmark button
Alert button
Apr 29, 2024
Zhe Chen, Weiyun Wang, Hao Tian, Shenglong Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Min Dou, Lewei Lu, Xizhou Zhu, Tong Lu, Dahua Lin, Yu Qiao, Jifeng Dai, Wenhai Wang

Viaarxiv icon

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

Add code
Bookmark button
Alert button
Dec 25, 2023
Wenhai Wang, Jiangwei Xie, ChuanYang Hu, Haoming Zou, Jianan Fan, Wenwen Tong, Yang Wen, Silei Wu, Hanming Deng, Zhiqi Li, Hao Tian, Lewei Lu, Xizhou Zhu, Xiaogang Wang, Yu Qiao, Jifeng Dai

Viaarxiv icon

Scene as Occupancy

Add code
Bookmark button
Alert button
Jun 06, 2023
Wenwen Tong, Chonghao Sima, Tai Wang, Silei Wu, Hanming Deng, Li Chen, Yi Gu, Lewei Lu, Ping Luo, Dahua Lin, Hongyang Li

Figure 1 for Scene as Occupancy
Figure 2 for Scene as Occupancy
Figure 3 for Scene as Occupancy
Figure 4 for Scene as Occupancy
Viaarxiv icon

3D Data Augmentation for Driving Scenes on Camera

Add code
Bookmark button
Alert button
Mar 18, 2023
Wenwen Tong, Jiangwei Xie, Tianyu Li, Hanming Deng, Xiangwei Geng, Ruoyi Zhou, Dingchen Yang, Bo Dai, Lewei Lu, Hongyang Li

Figure 1 for 3D Data Augmentation for Driving Scenes on Camera
Figure 2 for 3D Data Augmentation for Driving Scenes on Camera
Figure 3 for 3D Data Augmentation for Driving Scenes on Camera
Figure 4 for 3D Data Augmentation for Driving Scenes on Camera
Viaarxiv icon