Parallel restarted SGD with faster convergence and less communication: Demystifying why model averaging works for deep learning H Yu, S Yang, S Zhu Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 5693-5700, 2019 | 669* | 2019 |
On the linear speedup analysis of communication efficient momentum SGD for distributed non-convex optimization H Yu, R Jin, S Yang International Conference on Machine Learning, 7184-7193, 2019 | 390 | 2019 |
Sparse temporally dynamic resting-state functional connectivity networks for early MCI identification CY Wee, S Yang, PT Yap, D Shen Brain imaging and behavior 10 (2), 342-356, 2016 | 185 | 2016 |
Feature grouping and selection over an undirected graph S Yang, L Yuan, YC Lai, X Shen, P Wonka, J Ye Proceedings of the 18th ACM SIGKDD international conference on Knowledge …, 2012 | 146 | 2012 |
Fused multiple graphical lasso S Yang, Z Lu, X Shen, P Wonka, J Ye SIAM Journal on Optimization 25 (2), 916–943, 2015 | 108 | 2015 |
Unified Visual Transformer Compression S Yu, T Chen, J Shen, H Yuan, J Tan, S Yang, J Liu, Z Wang ICLR 2022, 2022 | 88 | 2022 |
An efficient ADMM algorithm for multidimensional anisotropic total variation regularization problems S Yang, J Wang, W Fan, X Zhang, P Wonka, J Ye Proceedings of the 19th ACM SIGKDD international conference on Knowledge …, 2013 | 60 | 2013 |
Multifeature, sparse-based approach for defects detection and classification in semiconductor units BM Haddad, S Yang, LJ Karam, J Ye, NS Patel, MW Braun IEEE Transactions on Automation Science and Engineering 15 (1), 145-159, 2016 | 57 | 2016 |
GDP: Stabilized Neural Network Pruning via Gates with Differentiable Polarization Y Guo, H Yuan, J Tan, Z Wang, S Yang, J Liu ICCV 2021, 2021 | 55 | 2021 |
Persia: An open, hybrid system scaling deep learning-based recommenders up to 100 trillion parameters X Lian, B Yuan, X Zhu, Y Wang, Y He, H Wu, L Sun, H Lyu, C Liu, X Dong, ... Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022 | 33* | 2022 |
Shifted Chunk Transformer for Spatio-Temporal Representational Learning X Zha, W Zhu, T Lv, S Yang, J Liu NeurIPS 2021, 2021 | 32 | 2021 |
TNASP: A Transformer-based NAS Predictor with a Self-evolution Framework JL S Lu, J Li, J Tan, S Yang NeurIPS 2021, 2021 | 31* | 2021 |
Learning with non-convex truncated losses by SGD Y Xu, S Zhu, S Yang, C Zhang, R Jin, T Yang Uncertainty in Artificial Intelligence, 701-711, 2020 | 28 | 2020 |
Structural Graphical Lasso for Learning Mouse Brain Connectivity S Yang, Q Sun, S Ji, P Wonka, I Davidson, J Ye Proceedings of the 21th ACM SIGKDD International Conference on Knowledge …, 2015 | 28 | 2015 |
BAGUA: Scaling up Distributed Learning with System Relaxations S Gan, X Lian, R Wang, J Chang, C Liu, H Shi, S Zhang, X Li, T Sun, ... VLDB 2022, 2022 | 27 | 2022 |
ProxyBO: Accelerating Neural Architecture Search via Bayesian Optimization with Zero-cost Proxies Y Shen, Y Li, J Zheng, W Zhang, P Yao, J Li, S Yang, J Liu, C Bin AAAI, 2023 | 23 | 2023 |
A highly scalable parallel algorithm for isotropic total variation models J Wang, Q Li, S Yang, W Fan, P Wonka, J Ye International Conference on Machine Learning, 235-243, 2014 | 16 | 2014 |
Shrinking the upper confidence bound: A dynamic product selection problem for urban warehouses R Jin, D Simchi-Levi, L Wang, X Wang, S Yang Management Science 67 (8), 4756-4771, 2021 | 12 | 2021 |
POSO: Personalized Cold Start Modules for Large-scale Recommender Systems S Dai, H Lin, Z Zhao, J Lin, H Wu, Z Wang, S Yang, J Liu arXiv preprint arXiv:2108.04690, 2021 | 11 | 2021 |
Multi-task vector field learning B Lin, S Yang, C Zhang, J Ye, X He Advances in neural information processing systems 25, 2012 | 10 | 2012 |