Follow
Kriti Aggarwal
Kriti Aggarwal
Microsoft, University of California, San Diego
Verified email at microsoft.com
Title
Cited by
Cited by
Year
Subhojit Som, et al. Image as a foreign language: Beit pretraining for all vision and vision-language tasks
W Wang, H Bao, L Dong, J Bjorck, Z Peng, Q Liu, K Aggarwal, ...
arXiv preprint arXiv:2208.10442 2 (3), 15, 2022
3682022
Vlmo: Unified vision-language pre-training with mixture-of-modality-experts
H Bao, W Wang, L Dong, Q Liu, OK Mohammed, K Aggarwal, S Som, ...
Advances in Neural Information Processing Systems 35, 32897-32912, 2022
3202022
Image as a foreign language: Beit pretraining for all vision and vision-language tasks
W Wang, H Bao, L Dong, J Bjorck, Z Peng, Q Liu, K Aggarwal, ...
arXiv preprint arXiv:2208.10442, 2022
291*2022
Language is not all you need: Aligning perception with language models
S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ...
Advances in Neural Information Processing Systems 36, 2024
2592024
Subhojit Som, Xia Song, and Furu Wei
S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ...
Language is not all you need: Aligning perception with language models …, 2023
572023
Orca 2: Teaching small language models how to reason
A Mitra, L Del Corro, S Mahajan, A Codas, C Simoes, S Agarwal, X Chen, ...
arXiv preprint arXiv:2311.11045, 2023
332023
Subhojit Som, and Furu Wei. 2022b
W Wang, H Bao, L Dong, J Bjorck, Z Peng, Q Liu, K Aggarwal, ...
Image as a foreign language: Beit pretraining for all vision and …, 2022
332022
Subhojit Som, and Furu Wei. Vlmo: Unified vision-language pretraining with mixture-of-modality-experts
H Bao, W Wang, L Dong, Q Liu, OK Mohammed, K Aggarwal
arXiv preprint arXiv:2111.02358 3, 2021
272021
Bootstrapping a high quality multilingual multimodal dataset for Bletchley
OK Mohammed, K Aggarwal, Q Liu, S Singhal, J Bjorck, S Som
Asian Conference on Machine Learning, 738-753, 2023
22023
DUBLIN--Document Understanding By Language-Image Network
K Aggarwal, A Khandelwal, K Tanmay, OM Khan, Q Liu, M Choudhury, ...
arXiv preprint arXiv:2305.14218, 2023
12023
Polaris: A Safety-focused LLM Constellation Architecture for Healthcare
S Mukherjee, P Gamble, MS Ausin, N Kant, K Aggarwal, N Manjunath, ...
arXiv preprint arXiv:2403.13313, 2024
2024
ODIN: A Single Model for 2D and 3D Perception
A Jain, P Katara, N Gkanatsios, AW Harley, G Sarch, K Aggarwal, ...
arXiv preprint arXiv:2401.02416, 2024
2024
DUBLIN: Visual Document Understanding By Language-Image Network
K Aggarwal, A Khandelwal, K Tanmay, OK Mohammed, Q Liu, ...
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–13