
Zipeng Xu

SpectralCLIP: Preventing Artifacts in Text-Guided Style Transfer from a Spectral Perspective

Mar 16, 2023
Via arXiv

StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Model

Mar 16, 2023
Via arXiv

Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene

Mar 16, 2022
Via arXiv

Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model

Nov 26, 2021
Via arXiv

Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser

Sep 06, 2021
Via arXiv

Modeling Explicit Concerning States for Reinforcement Learning in Visual Dialogue

Jul 12, 2021
Via arXiv

Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue

Oct 01, 2020
Via arXiv