Junbo Zhang

Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion

Mar 12, 2024
Via arXiv

Deep Learning for Cross-Domain Data Fusion in Urban Computing: Taxonomy, Advances, and Outlook

Feb 29, 2024
Via arXiv

Federated Continual Learning via Knowledge Fusion: A Survey

Dec 27, 2023
Via arXiv

MLPST: MLP is All You Need for Spatio-Temporal Prediction

Sep 23, 2023
Via arXiv

Spatio-Temporal Contrastive Self-Supervised Learning for POI-level Crowd Flow Inference

Sep 12, 2023
Via arXiv

CED: Consistent ensemble distillation for audio tagging

Sep 08, 2023
Via arXiv

Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information

Jun 28, 2023
Via arXiv

AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction

Jun 25, 2023
Via arXiv

Understanding temporally weakly supervised training: A case study for keyword spotting

May 30, 2023
Via arXiv

Streaming Audio Transformers for Online Audio Tagging

May 29, 2023
Via arXiv