Google's neural machine translation system: Bridging the gap between human and machine translation Y Wu, M Schuster, Z Chen, QV Le, M Norouzi, W Macherey, M Krikun, ...
arXiv preprint arXiv:1609.08144, 2016
9123 2016 In-datacenter performance analysis of a tensor processing unit NP Jouppi, C Young, N Patil, D Patterson, G Agrawal, R Bajwa, S Bates, ...
Proceedings of the 44th annual international symposium on computer …, 2017
5777 2017 Anton, a special-purpose machine for molecular dynamics simulation DE Shaw, MM Deneroff, RO Dror, JS Kuskin, RH Larson, JK Salmon, ...
Communications of the ACM 51 (7), 91-97, 2008
995 2008 Anton 2: raising the bar for performance and programmability in a special-purpose molecular dynamics supercomputer DE Shaw, JP Grossman, JA Bank, B Batson, JA Butts, JC Chao, ...
SC'14: Proceedings of the International Conference for High Performance …, 2014
743 2014 Millisecond-scale molecular dynamics simulations on Anton DE Shaw, RO Dror, JK Salmon, JP Grossman, KM Mackenzie, JA Bank, ...
Proceedings of the conference on high performance computing networking …, 2009
698 2009 Embedded computing: a VLIW approach to architecture, compilers and tools JA Fisher, P Faraboschi, C Young
Elsevier, 2005
528 2005 Mesh-tensorflow: Deep learning for supercomputers N Shazeer, Y Cheng, N Parmar, D Tran, A Vaswani, P Koanantakool, ...
Advances in neural information processing systems 31, 2018
425 2018 Ten lessons from three generations shaped google’s tpuv4i: Industrial product NP Jouppi, DH Yoon, M Ashcraft, M Gottscho, TB Jablin, G Kurian, ...
2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021
382 2021 Anton, a special-purpose machine for molecular dynamics simulation DE Shaw, MM Deneroff, RO Dror, JS Kuskin, RH Larson, JK Salmon, ...
ACM SIGARCH Computer Architecture News 35 (2), 1-12, 2007
368 2007 Mlperf training benchmark P Mattson, C Cheng, G Diamos, C Coleman, P Micikevicius, D Patterson, ...
Proceedings of Machine Learning and Systems 2, 336-349, 2020
339 2020 A domain-specific supercomputer for training deep neural networks NP Jouppi, DH Yoon, G Kurian, S Li, N Patil, J Laudon, C Young, ...
Communications of the ACM 63 (7), 67-78, 2020
317 2020 Motivation for and evaluation of the first tensor processing unit N Jouppi, C Young, N Patil, D Patterson
ieee Micro 38 (3), 10-19, 2018
311 2018 Measurements of differential cross-sections of highly boosted top quarks decaying to all-hadronic final states in collisions at using the ATLAS … M Aaboud, G Aad, B Abbott, O Abdinov, B Abeloos, SH Abidi, ...
Physical Review D 98 (1), 012003, 2018
274 2018 Tpu v4: An optically reconfigurable supercomputer for machine learning with hardware support for embeddings N Jouppi, G Kurian, S Li, P Ma, R Nagarajan, L Nai, N Patil, ...
Proceedings of the 50th Annual International Symposium on Computer …, 2023
267 2023 Sparse gpu kernels for deep learning T Gale, M Zaharia, C Young, E Elsen
SC20: International Conference for High Performance Computing, Networking …, 2020
258 2020 A new golden age in computer architecture: Empowering the machine-learning revolution J Dean, D Patterson, C Young
IEEE Micro 38 (2), 21-29, 2018
239 2018 A comparative analysis of schemes for correlated branch prediction C Young, N Gloy, MD Smith
ACM SIGARCH Computer Architecture News 23 (2), 276-286, 1995
221 1995 A domain-specific architecture for deep neural networks NP Jouppi, C Young, N Patil, D Patterson
Communications of the ACM 61 (9), 50-59, 2018
211 2018 Search for a heavy charged boson in events with a charged lepton and missing transverse momentum from collisions at with the ATLAS detector G Aad, B Abbott, DC Abbott, O Abdinov, A Abed Abud, K Abeling, ...
Physical review D 100 (5), 052013, 2019
195 2019 Improving the accuracy of static branch prediction using branch correlation C Young, MD Smith
ACM SIGOPS Operating Systems Review 28 (5), 232-241, 1994
172 1994