Picture for Lin Ma

Lin Ma

AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning

Add code
Jun 01, 2024
Viaarxiv icon

TIE: Revolutionizing Text-based Image Editing for Complex-Prompt Following and High-Fidelity Editing

May 27, 2024
Viaarxiv icon

Integer Scale: A Free Lunch for Faster Fine-grained Quantization of LLMs

Add code
May 23, 2024
Viaarxiv icon

Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts

Add code
May 18, 2024
Viaarxiv icon

Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning

Add code
May 13, 2024
Viaarxiv icon

Aux-NAS: Exploiting Auxiliary Labels with Negligibly Extra Inference Cost

Add code
May 09, 2024
Viaarxiv icon

Matten: Video Generation with Mamba-Attention

May 05, 2024
Viaarxiv icon

LaSagnA: Language-based Segmentation Assistant for Complex Queries

Add code
Apr 12, 2024
Viaarxiv icon

UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection

Add code
Apr 07, 2024
Viaarxiv icon

Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models

Add code
Mar 12, 2024
Figure 1 for Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Figure 2 for Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Figure 3 for Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Figure 4 for Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Viaarxiv icon