Alert button
Picture for Sudheendra Vijayanarasimhan

Sudheendra Vijayanarasimhan

Alert button

$IC^3$: Image Captioning by Committee Consensus

Add code
Bookmark button
Alert button
Feb 16, 2023
David M. Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, John Canny

Figure 1 for $IC^3$: Image Captioning by Committee Consensus
Figure 2 for $IC^3$: Image Captioning by Committee Consensus
Figure 3 for $IC^3$: Image Captioning by Committee Consensus
Figure 4 for $IC^3$: Image Captioning by Committee Consensus
Viaarxiv icon

Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features

Add code
Bookmark button
Alert button
Dec 20, 2022
Vivek Rathod, Bryan Seybold, Sudheendra Vijayanarasimhan, Austin Myers, Xiuye Gu, Vighnesh Birodkar, David A. Ross

Figure 1 for Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features
Figure 2 for Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features
Figure 3 for Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features
Figure 4 for Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features
Viaarxiv icon

Distribution Aware Metrics for Conditional Natural Language Generation

Add code
Bookmark button
Alert button
Sep 29, 2022
David M Chan, Yiming Ni, David A Ross, Sudheendra Vijayanarasimhan, Austin Myers, John Canny

Figure 1 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 2 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 3 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 4 for Distribution Aware Metrics for Conditional Natural Language Generation
Viaarxiv icon

What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics

Add code
Bookmark button
Alert button
May 12, 2022
David M. Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, Bryan Seybold, John F. Canny

Figure 1 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Figure 2 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Figure 3 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Figure 4 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Viaarxiv icon

Active Learning for Video Description With Cluster-Regularized Ensemble Ranking

Add code
Bookmark button
Alert button
Jul 29, 2020
David M. Chan, Sudheendra Vijayanarasimhan, David A. Ross, John Canny

Figure 1 for Active Learning for Video Description With Cluster-Regularized Ensemble Ranking
Figure 2 for Active Learning for Video Description With Cluster-Regularized Ensemble Ranking
Figure 3 for Active Learning for Video Description With Cluster-Regularized Ensemble Ranking
Figure 4 for Active Learning for Video Description With Cluster-Regularized Ensemble Ranking
Viaarxiv icon

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

Add code
Bookmark button
Alert button
Apr 30, 2018
Chunhui Gu, Chen Sun, David A. Ross, Carl Vondrick, Caroline Pantofaru, Yeqing Li, Sudheendra Vijayanarasimhan, George Toderici, Susanna Ricco, Rahul Sukthankar, Cordelia Schmid, Jitendra Malik

Figure 1 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Figure 2 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Figure 3 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Figure 4 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Viaarxiv icon

Rethinking the Faster R-CNN Architecture for Temporal Action Localization

Add code
Bookmark button
Alert button
Apr 20, 2018
Yu-Wei Chao, Sudheendra Vijayanarasimhan, Bryan Seybold, David A. Ross, Jia Deng, Rahul Sukthankar

Figure 1 for Rethinking the Faster R-CNN Architecture for Temporal Action Localization
Figure 2 for Rethinking the Faster R-CNN Architecture for Temporal Action Localization
Figure 3 for Rethinking the Faster R-CNN Architecture for Temporal Action Localization
Figure 4 for Rethinking the Faster R-CNN Architecture for Temporal Action Localization
Viaarxiv icon

End-to-End Learning of Semantic Grasping

Add code
Bookmark button
Alert button
Nov 09, 2017
Eric Jang, Sudheendra Vijayanarasimhan, Peter Pastor, Julian Ibarz, Sergey Levine

Figure 1 for End-to-End Learning of Semantic Grasping
Figure 2 for End-to-End Learning of Semantic Grasping
Figure 3 for End-to-End Learning of Semantic Grasping
Figure 4 for End-to-End Learning of Semantic Grasping
Viaarxiv icon

The Kinetics Human Action Video Dataset

Add code
Bookmark button
Alert button
May 19, 2017
Will Kay, Joao Carreira, Karen Simonyan, Brian Zhang, Chloe Hillier, Sudheendra Vijayanarasimhan, Fabio Viola, Tim Green, Trevor Back, Paul Natsev, Mustafa Suleyman, Andrew Zisserman

Figure 1 for The Kinetics Human Action Video Dataset
Figure 2 for The Kinetics Human Action Video Dataset
Figure 3 for The Kinetics Human Action Video Dataset
Figure 4 for The Kinetics Human Action Video Dataset
Viaarxiv icon

Motion Prediction Under Multimodality with Conditional Stochastic Networks

Add code
Bookmark button
Alert button
May 05, 2017
Katerina Fragkiadaki, Jonathan Huang, Alex Alemi, Sudheendra Vijayanarasimhan, Susanna Ricco, Rahul Sukthankar

Figure 1 for Motion Prediction Under Multimodality with Conditional Stochastic Networks
Figure 2 for Motion Prediction Under Multimodality with Conditional Stochastic Networks
Figure 3 for Motion Prediction Under Multimodality with Conditional Stochastic Networks
Figure 4 for Motion Prediction Under Multimodality with Conditional Stochastic Networks
Viaarxiv icon