Alert button
Picture for Digbalay Bose

Digbalay Bose

Alert button

Can Text-to-image Model Assist Multi-modal Learning for Visual Recognition with Visual Modality Missing?

Add code
Bookmark button
Alert button
Feb 14, 2024
Tiantian Feng, Daniel Yang, Digbalay Bose, Shrikanth Narayanan

Viaarxiv icon

Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization

Add code
Bookmark button
Alert button
Sep 18, 2023
Yoonsoo Nam, Adam Lehavi, Daniel Yang, Digbalay Bose, Swabha Swayamdipta, Shrikanth Narayanan

Figure 1 for Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization
Figure 2 for Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization
Figure 3 for Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization
Viaarxiv icon

MM-AU:Towards Multimodal Understanding of Advertisement Videos

Add code
Bookmark button
Alert button
Aug 27, 2023
Digbalay Bose, Rajat Hebbar, Tiantian Feng, Krishna Somandepalli, Anfeng Xu, Shrikanth Narayanan

Figure 1 for MM-AU:Towards Multimodal Understanding of Advertisement Videos
Figure 2 for MM-AU:Towards Multimodal Understanding of Advertisement Videos
Figure 3 for MM-AU:Towards Multimodal Understanding of Advertisement Videos
Figure 4 for MM-AU:Towards Multimodal Understanding of Advertisement Videos
Viaarxiv icon

FedMultimodal: A Benchmark For Multimodal Federated Learning

Add code
Bookmark button
Alert button
Jun 20, 2023
Tiantian Feng, Digbalay Bose, Tuo Zhang, Rajat Hebbar, Anil Ramakrishna, Rahul Gupta, Mi Zhang, Salman Avestimehr, Shrikanth Narayanan

Figure 1 for FedMultimodal: A Benchmark For Multimodal Federated Learning
Figure 2 for FedMultimodal: A Benchmark For Multimodal Federated Learning
Figure 3 for FedMultimodal: A Benchmark For Multimodal Federated Learning
Figure 4 for FedMultimodal: A Benchmark For Multimodal Federated Learning
Viaarxiv icon

Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content

Add code
Bookmark button
Alert button
Jun 13, 2023
Tiantian Feng, Digbalay Bose, Xuan Shi, Shrikanth Narayanan

Figure 1 for Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content
Figure 2 for Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content
Figure 3 for Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content
Figure 4 for Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content
Viaarxiv icon

Signal Processing Grand Challenge 2023 -- e-Prevention: Sleep Behavior as an Indicator of Relapses in Psychotic Patients

Add code
Bookmark button
Alert button
Apr 17, 2023
Kleanthis Avramidis, Kranti Adsul, Digbalay Bose, Shrikanth Narayanan

Figure 1 for Signal Processing Grand Challenge 2023 -- e-Prevention: Sleep Behavior as an Indicator of Relapses in Psychotic Patients
Viaarxiv icon

Contextually-rich human affect perception using multimodal scene information

Add code
Bookmark button
Alert button
Mar 13, 2023
Digbalay Bose, Rajat Hebbar, Krishna Somandepalli, Shrikanth Narayanan

Figure 1 for Contextually-rich human affect perception using multimodal scene information
Figure 2 for Contextually-rich human affect perception using multimodal scene information
Figure 3 for Contextually-rich human affect perception using multimodal scene information
Figure 4 for Contextually-rich human affect perception using multimodal scene information
Viaarxiv icon

A dataset for Audio-Visual Sound Event Detection in Movies

Add code
Bookmark button
Alert button
Feb 14, 2023
Rajat Hebbar, Digbalay Bose, Krishna Somandepalli, Veena Vijai, Shrikanth Narayanan

Figure 1 for A dataset for Audio-Visual Sound Event Detection in Movies
Figure 2 for A dataset for Audio-Visual Sound Event Detection in Movies
Figure 3 for A dataset for Audio-Visual Sound Event Detection in Movies
Figure 4 for A dataset for Audio-Visual Sound Event Detection in Movies
Viaarxiv icon

Multimodal Estimation of Change Points of Physiological Arousal in Drivers

Add code
Bookmark button
Alert button
Oct 28, 2022
Kleanthis Avramidis, Tiantian Feng, Digbalay Bose, Shrikanth Narayanan

Figure 1 for Multimodal Estimation of Change Points of Physiological Arousal in Drivers
Figure 2 for Multimodal Estimation of Change Points of Physiological Arousal in Drivers
Figure 3 for Multimodal Estimation of Change Points of Physiological Arousal in Drivers
Figure 4 for Multimodal Estimation of Change Points of Physiological Arousal in Drivers
Viaarxiv icon

Understanding of Emotion Perception from Art

Add code
Bookmark button
Alert button
Oct 13, 2021
Digbalay Bose, Krishna Somandepalli, Souvik Kundu, Rimita Lahiri, Jonathan Gratch, Shrikanth Narayanan

Figure 1 for Understanding of Emotion Perception from Art
Figure 2 for Understanding of Emotion Perception from Art
Figure 3 for Understanding of Emotion Perception from Art
Figure 4 for Understanding of Emotion Perception from Art
Viaarxiv icon