Kriti Aggarwal

Cited by

	All	Since 2019
Citations	1391	1391
h-index	8	8
i10-index	8	8

920

460

230

690

202220232024115 913 358

Co-authors

Li DongMicrosoft ResearchVerified email at microsoft.com
Subhojit SomSenior Applied Scientist, Microsoft CorporationVerified email at gatech.edu
Hangbo BaoMicrosoft ResearchVerified email at microsoft.com
Johan BjorckCornell UniversityVerified email at cornell.edu
Saksham SinghalMicrosoftVerified email at microsoft.com
Furu WeiPartner Research Manager, Microsoft ResearchVerified email at microsoft.com
Vishrav ChaudharyMicrosoft TuringVerified email at microsoft.com
Xia SongMicrosoftVerified email at microsoft.com

Kriti Aggarwal

Microsoft, University of California, San Diego

Verified email at microsoft.com

Deep learning Computer vision NLP Multimodality


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Subhojit Som, et al. Image as a foreign language: Beit pretraining for all vision and vision-language tasks W Wang, H Bao, L Dong, J Bjorck, Z Peng, Q Liu, K Aggarwal, ... arXiv preprint arXiv:2208.10442 2 (3), 15, 2022	368	2022
Vlmo: Unified vision-language pre-training with mixture-of-modality-experts H Bao, W Wang, L Dong, Q Liu, OK Mohammed, K Aggarwal, S Som, ... Advances in Neural Information Processing Systems 35, 32897-32912, 2022	320	2022
Image as a foreign language: Beit pretraining for all vision and vision-language tasks W Wang, H Bao, L Dong, J Bjorck, Z Peng, Q Liu, K Aggarwal, ... arXiv preprint arXiv:2208.10442, 2022	291*	2022
Language is not all you need: Aligning perception with language models S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ... Advances in Neural Information Processing Systems 36, 2024	259	2024
Subhojit Som, Xia Song, and Furu Wei S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ... Language is not all you need: Aligning perception with language models …, 2023	57	2023
Orca 2: Teaching small language models how to reason A Mitra, L Del Corro, S Mahajan, A Codas, C Simoes, S Agarwal, X Chen, ... arXiv preprint arXiv:2311.11045, 2023	33	2023
Subhojit Som, and Furu Wei. 2022b W Wang, H Bao, L Dong, J Bjorck, Z Peng, Q Liu, K Aggarwal, ... Image as a foreign language: Beit pretraining for all vision and …, 2022	33	2022
Subhojit Som, and Furu Wei. Vlmo: Unified vision-language pretraining with mixture-of-modality-experts H Bao, W Wang, L Dong, Q Liu, OK Mohammed, K Aggarwal arXiv preprint arXiv:2111.02358 3, 2021	27	2021
Bootstrapping a high quality multilingual multimodal dataset for Bletchley OK Mohammed, K Aggarwal, Q Liu, S Singhal, J Bjorck, S Som Asian Conference on Machine Learning, 738-753, 2023	2	2023
DUBLIN--Document Understanding By Language-Image Network K Aggarwal, A Khandelwal, K Tanmay, OM Khan, Q Liu, M Choudhury, ... arXiv preprint arXiv:2305.14218, 2023	1	2023
Polaris: A Safety-focused LLM Constellation Architecture for Healthcare S Mukherjee, P Gamble, MS Ausin, N Kant, K Aggarwal, N Manjunath, ... arXiv preprint arXiv:2403.13313, 2024		2024
ODIN: A Single Model for 2D and 3D Perception A Jain, P Katara, N Gkanatsios, AW Harley, G Sarch, K Aggarwal, ... arXiv preprint arXiv:2401.02416, 2024		2024
DUBLIN: Visual Document Understanding By Language-Image Network K Aggarwal, A Khandelwal, K Tanmay, OK Mohammed, Q Liu, ... Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–13

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors