Structured Fusion Networks for Dialog S Mehri, T Srinivasan, M Eskenazi arXiv preprint arXiv:1907.10016, 2019 | 105 | 2019 |
CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks T Srinivasan, TY Chang, LLP Alva, G Chochlakis, M Rostami, J Thomason Advances in Neural Information Processing Systems, 2022 | 65 | 2022 |
Worst of Both Worlds: Biases Compound in Pre-trained Vision-and-Language Models T Srinivasan, Y Bisk 4th Workshop on Gender Bias in Natural Language Processing, NAACL 2022, 2021 | 64 | 2021 |
Looking Enhances Listening: Recovering Missing Speech Using Images T Srinivasan, R Sanabria, F Metze 45th International Conference on Acoustics, Speech, and Signal Processing, 2020 | 20 | 2020 |
Multimodal Speech Recognition with Unstructured Audio Masking T Srinivasan, R Sanabria, F Metze, D Elliott Workshop on Natural Language Processing Beyond Text 2020, 2020 | 16 | 2020 |
Fine-Grained Grounding for Multimodal Speech Recognition T Srinivasan, R Sanabria, F Metze, D Elliott Findings of EMNLP 2020, 2020 | 14 | 2020 |
Multitask Learning For Different Subword Segmentations In Neural Machine Translation T Srinivasan, R Sanabria, F Metze 16th International Workshop on Spoken Language Translation 2019, 2019 | 14* | 2019 |
VAuLT: Augmenting the Vision-and-Language Transformer with the Propagation of Deep Language Representations G Chochlakis, T Srinivasan, J Thomason, S Narayanan arXiv preprint arXiv:2208.09021, 2022 | 11 | 2022 |
Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions T Srinivasan, R Sanabria, F Metze The How2 Challenge: New Tasks for Vision & Language, ICML 2019, 2019 | 11 | 2019 |
Curriculum Learning for Data-Efficient Vision-Language Alignment T Srinivasan, X Ren, J Thomason Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 9 | 2023 |
I2I: Initializing Adapters with Improvised Knowledge T Srinivasan, F Jia, M Rostami, J Thomason Conference on Lifelong Learning Agents, 923-935, 2023 | 5 | 2023 |
Reasoning Over History: Context Aware Visual Dialog MA Shah, S Mehri, T Srinivasan arXiv preprint arXiv:2011.00669, 2020 | 5 | 2020 |
Multimodal Speech Recognition for Language-Guided Embodied Agents A Chang, X Zhu, A Monga, S Ahn, T Srinivasan, J Thomason Interspeech 2023, 2023 | 4 | 2023 |
Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning T Srinivasan, J Hessel, T Gupta, BY Lin, Y Choi, J Thomason, KR Chandu arXiv preprint arXiv:2402.15610, 2024 | 2 | 2024 |
Exploring Strategies for Modeling Sign Language Phonology L Kezar, R Carlin, T Srinivasan, Z Sehyr, N Caselli, J Thomason arXiv preprint arXiv:2310.00195, 2023 | 2 | 2023 |
Compare without Despair: Reliable Preference Evaluation with Generation Separability S Ghosh, T Srinivasan, S Swayamdipta arXiv preprint arXiv:2407.01878, 2024 | 1 | 2024 |
Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems M İnan, A Sicilia, S Dey, V Dongre, T Srinivasan, J Thomason, G Tür, ... arXiv preprint arXiv:2501.17348, 2025 | | 2025 |
WinoViz: Probing Visual Properties of Objects Under Different States W Jin, T Srinivasan, J Thomason, X Ren arXiv preprint arXiv:2402.13584, 2024 | | 2024 |
CMU’s Machine Translation System for IWSLT 2019 T Srinivasan, R Sanabria, F Metze Proceedings of the 16th International Conference on Spoken Language Translation, 2019 | | 2019 |