Shashank Rajput

Cited by

	All	Since 2019
Citations	1199	1196
h-index	13	13
i10-index	13	13

440

220

110

330

20192020202120222023202410 50 139 240 433 323

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Dimitris PapailiopoulosAssociate Professor, University of Wisconsin-MadisonVerified email at papail.io
Jy-yong SohnYonsei UniversityVerified email at yonsei.ac.kr
Kangwook LeeUniversity of Wisconsin-MadisonVerified email at wisc.edu
Kartik SreenivasanGraduate Student, University of Wisconsin-MadisonVerified email at wisc.edu
Hongyi WangSenior Project Scientist, CMU; Incoming Assistant Professor, Rutgers UniversityVerified email at andrew.cmu.edu
Harit VishwakarmaGraduate Student, University of Wisconsin MadisonVerified email at cs.wisc.edu
Zachary CharlesResearch Scientist, GoogleVerified email at google.com
Angeliki GiannouUniversity of Wisconsin–MadisonVerified email at wisc.edu
Tuan DinhUniversity of California, San FranciscoVerified email at ucsf.edu
Ankit PensiaIBM ResearchVerified email at ibm.com
Alliot NaglePhD Student, UT AustinVerified email at utexas.edu
Michael GiraUniversity of Wisconsin–MadisonVerified email at wisc.edu
Yuchen ZengUniversity of Wisconsin-MadisonVerified email at wisc.edu
Ruisu ZhangUniversity of Wisconsin-MadisonVerified email at wisc.edu
Ziqian LinPh.D. student of UW-MadisonVerified email at wisc.edu
Maheswaran (Mahesh) SathiamoorthyGoogle DeepMindVerified email at google.com
Lichan HongGoogle DeepMindVerified email at google.com
Yi TayResearch Scientist, Google BrainVerified email at google.com
Ed H. ChiGoogle DeepMind / Google BrainVerified email at acm.org
Vinh Q. TranResearch Scientist, Google DeepMindVerified email at google.com

Shashank Rajput

Research Scientist, MosaicML (Databricks)

Verified email at databricks.com - Homepage

Large Language Models Machine Learning Optimization Distributed Systems


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Attack of the tails: Yes, you really can backdoor federated learning H Wang, K Sreenivasan, S Rajput, H Vishwakarma, S Agarwal, J Sohn, ... Advances in Neural Information Processing Systems (NeurIPS), 2020	554	2020
DETOX: A Redundancy-based Framework for Faster and More Robust Gradient Aggregation S Rajput, H Wang, Z Charles, D Papailiopoulos Advances in Neural Information Processing Systems (NeurIPS), 2019	125	2019
Optimal Lottery Tickets via Subset Sum: Logarithmic Over-Parameterization is Sufficient A Pensia, S Rajput, A Nagle, H Vishwakarma, D Papailiopoulos Advances in Neural Information Processing Systems (NeurIPS), 2020	100	2020
Lift: Language-interfaced fine-tuning for non-language machine learning tasks T Dinh, Y Zeng, R Zhang, Z Lin, M Gira, S Rajput, J Sohn, ... Advances in Neural Information Processing Systems (NeurIPS), 2022	82	2022
Recommender Systems with Generative Retrieval S Rajput, N Mehta, A Singh, R Keshavan, T Vu, L Heldt, L Hong, Y Tay, ... Advances in Neural Information Processing Systems (NeurIPS), 2023	62	2023
Closing the convergence gap of SGD without replacement S Rajput, A Gupta, D Papailiopoulos International Conference on Machine Learning (ICML), 2020	60	2020
Looped Transformers as Programmable Computers A Giannou, S Rajput, J Sohn, K Lee, JD Lee, D Papailiopoulos International Conference on Machine Learning (ICML), 2023	56	2023
Does data augmentation lead to positive margin? S Rajput, Z Feng, Z Charles, PL Loh, D Papailiopoulos International Conference on Machine Learning (ICML), 2019	42	2019
Minibatch vs Local SGD with Shuffling: Tight Convergence Bounds and Beyond C Yun, S Rajput, S Sra International Conference on Learning Representations (ICLR), 2022	36	2022
Convergence and Margin of Adversarial Training on Separable Data Z Charles, S Rajput, S Wright, D Papailiopoulos arXiv preprint arXiv:1905.09209, 2019	20	2019
An exponential improvement on the memorization capacity of deep threshold networks S Rajput, K Sreenivasan, D Papailiopoulos, A Karbasi Advances in Neural Information Processing Systems (NeurIPS), 2021	18	2021
Permutation-Based SGD: Is Random Optimal? S Rajput, K Lee, D Papailiopoulos International Conference on Learning Representations (ICLR), 2022	17	2022
Finding everything within random binary networks K Sreenivasan, S Rajput, J Sohn, D Papailiopoulos International Conference on Artificial Intelligence and Statistics (AISTATS), 2022	15*	2022
The Expressive Power of Tuning Only the Norm Layers A Giannou, S Rajput, D Papailiopoulos Conference on Learning Theory (COLT), 2023	5	2023
The Expressive Power of Tuning Only the Normalization Layers A Giannou, S Rajput, D Papailiopoulos The Thirty Sixth Annual Conference on Learning Theory, 4130-4131, 2023	4	2023
Maestro: Uncovering Low-Rank Structures via Trainable Decomposition S Horváth, S Laskaridis, S Rajput, H Wang arXiv preprint arXiv:2308.14929, 2023	2	2023
Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment T Dinh, J Sohn, S Rajput, T Ossowski, Y Ming, J Hu, D Papailiopoulos, ... EMNLP (Findings), 2022	1	2022
Large-Scale SGD Algorithms and the Expressive Power of Modern Neural Networks S Rajput The University of Wisconsin-Madison, 2023		2023
SUPER SEEDS: extreme model compression by trading off storage with compute N Lee, S Rajput, J Sohnw, H Wangc, A Naglew, EP Xingmp, K Leew, ...

The system can't perform the operation now. Try again later.

Articles 1–19

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors