Követés
Rajib Nath
Cím
Hivatkozott rá
Hivatkozott rá
Év
Dense linear algebra solvers for multicore with GPU accelerators
S Tomov, R Nath, H Ltaief, J Dongarra
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
3032010
An improved MAGMA GEMM for Fermi graphics processing units
R Nath, S Tomov, J Dongarra
The International Journal of High Performance Computing Applications 24 (4 …, 2010
2722010
Accelerating GPU kernels for dense linear algebra
R Nath, S Tomov, J Dongarra
High Performance Computing for Computational Science–VECPAR 2010, 83-92, 2011
952011
Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing
S Tomov, R Nath, J Dongarra
Parallel Computing 36 (12), 645-654, 2010
812010
A scalable high performant Cholesky factorization for multicore with GPU accelerators
H Ltaief, S Tomov, R Nath, P Du, J Dongarra
International Conference on High Performance Computing for Computational …, 2010
732010
Optimizing Symmetric Dense Matrix-Vector Multiplication on GPUs
R Nath, S Tomov, J Dongarra
Super Computing (SC), 2011
702011
JETC: Joint energy thermal and cooling management for memory and CPU subsystems in servers
R Ayoub, R Nath, T Rosing
IEEE International Symposium on High-Performance Comp Architecture, 1-12, 2012
492012
MAGMA Users’ Guide
S Tomov, R Nath, P Du, J Dongarra
ICL, UTK (November 2009), 2011
452011
MAGMA version 0.2 User Guide
S Tomov, R Nath, P Du, J Dongarra
412009
The CRISP performance model for dynamic voltage and frequency scaling in a GPGPU
R Nath, D Tullsen
Proceedings of the 48th international symposium on microarchitecture, 281-293, 2015
332015
Hybrid multicore cholesky factorization with multiple gpu accelerators
H Ltaief, S Tomov, R Nath, J Dongarra
IEEE Transaction on Parallel and Distributed Systems 48, 2010
252010
An implementation of the tile QR factorization for a GPU and multiple CPUs
J Kurzak, R Nath, P Du, J Dongarra
PARA, 2010
242010
A fully empirical autotuned dense QR factorization for multicore architectures
E Agullo, J Dongarra, R Nath, S Tomov
Euro-Par 2011 Parallel Processing, 194-205, 2011
232011
BLAS for GPUs
R Nath, S Tomov, J Dongarra
102010
Temperature aware thread block scheduling in GPGPUs
R Nath, R Ayoub, TS Rosing
Proceedings of the 50th Annual Design Automation Conference, 1-6, 2013
92013
Accelerating ML recommendation with over a thousand RISC-V/tensor processors on Esperanto’s ET-SoC-1 chip
D Ditzel, R Espasa, N Aymerich, A Baum, T Berg, J Burr, E Hao, J Iyer, ...
2021 IEEE Hot Chips 33 Symposium (HCS), 1-23, 2021
82021
Cometc: Coordinated management of energy/thermal/cooling in servers
R Ayoub, R Nath, TS Rosing
ACM Transactions on Design Automation of Electronic Systems (TODAES) 19 (1 …, 2013
72013
Magma, matrix algebra on gpu and multicore architectures
A Tomov, R Nath, P Du, J Dongarra
72012
Power Modeling and Thermal Management Techniques for Manycores
R Nath, D Carmean, T Rosing
62013
Fully empirical autotuned qr factorization for multicore architectures
E Agullo, J Dongarra, R Nath, S Tomov
arXiv preprint arXiv:1102.5328, 2011
62011
A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.
Cikkek 1–20