A view of cloud computing M Armbrust, A Fox, R Griffith, AD Joseph, R Katz, A Konwinski, G Lee, ... Communications of the ACM 53 (4), 50-58, 2010 | 11972 | 2010 |
Above the clouds: A Berkeley view of cloud computing A Fox, R Griffith, A Joseph, R Katz, A Konwinski, G Lee, D Patterson, ... Dept. Electrical Eng. and Comput. Sciences, University of California …, 2009 | 8650 | 2009 |
Spark: Cluster computing with working sets. M Zaharia, M Chowdhury, MJ Franklin, S Shenker, I Stoica HotCloud 10 (10-10), 95, 2010 | 5615 | 2010 |
Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing M Zaharia, M Chowdhury, T Das, A Dave, J Ma, M McCauley, MJ Franklin, ... Presented as part of the 9th USENIX Symposium on Networked Systems Design …, 2012 | 4790 | 2012 |
Improving MapReduce performance in heterogeneous environments. M Zaharia, A Konwinski, AD Joseph, RH Katz, I Stoica OSDI 8 (4), 7, 2008 | 2094 | 2008 |
Mesos: A platform for fine-grained resource sharing in the data center. B Hindman, A Konwinski, M Zaharia, A Ghodsi, AD Joseph, RH Katz, ... NSDI 11 (2011), 22-22, 2011 | 1934 | 2011 |
Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling M Zaharia, D Borthakur, J Sen Sarma, K Elmeleegy, S Shenker, I Stoica Proceedings of the 5th European conference on Computer systems, 265-278, 2010 | 1669 | 2010 |
MLlib: Machine learning in Apache Spark X Meng, J Bradley, B Yavuz, E Sparks, S Venkataraman, D Liu, ... The Journal of Machine Learning Research 17 (1), 1235-1241, 2016 | 1567 | 2016 |
Apache Spark: a unified engine for big data processing M Zaharia, RS Xin, P Wendell, T Das, M Armbrust, A Dave, X Meng, ... Communications of the ACM 59 (11), 56-65, 2016 | 1512 | 2016 |
Spark SQL: Relational data processing in Spark M Armbrust, RS Xin, C Lian, Y Huai, D Liu, JK Bradley, X Meng, T Kaftan, ... Proceedings of the 2015 ACM SIGMOD international conference on management of …, 2015 | 1184 | 2015 |
Dominant Resource Fairness: Fair Allocation of Multiple Resource Types. A Ghodsi, M Zaharia, B Hindman, A Konwinski, S Shenker, I Stoica NSDI 11 (2011), 24-24, 2011 | 1146 | 2011 |
Discretized streams: Fault-tolerant streaming computation at scale M Zaharia, T Das, H Li, T Hunter, S Shenker, I Stoica Proceedings of the twenty-fourth ACM symposium on operating systems …, 2013 | 1079 | 2013 |
Discretized streams: an efficient and fault-tolerant model for stream processing on large clusters M Zaharia, T Das, H Li, S Shenker, I Stoica Proceedings of the 4th USENIX conference on Hot Topics in Cloud Computing, 10-10, 2012 | 635 | 2012 |
Managing data transfers in computer clusters with Orchestra M Chowdhury, M Zaharia, J Ma, MI Jordan, I Stoica SIGCOMM 41 (4), 2011 | 618 | 2011 |
Sparrow: distributed, low latency scheduling K Ousterhout, P Wendell, M Zaharia, I Stoica Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems …, 2013 | 550 | 2013 |
Shark: SQL and rich analytics at scale RS Xin, J Rosen, M Zaharia, MJ Franklin, S Shenker, I Stoica Proceedings of the 2013 ACM SIGMOD International Conference on Management of …, 2013 | 533 | 2013 |
Learning Spark: lightning-fast big data analysis H Karau, A Konwinski, P Wendell, M Zaharia O'Reilly Media, Inc., 2015 | 525 | 2015 |
Job scheduling for multi-user MapReduce clusters M Zaharia, D Borthakur, JS Sarma, K Elmeleegy, S Shenker, I Stoica Technical Report UCB/EECS-2009-55, EECS Department, University of California …, 2009 | 441 | 2009 |
Tachyon: Reliable, memory speed storage for cluster computing frameworks H Li, A Ghodsi, M Zaharia, S Shenker, I Stoica Proceedings of the ACM Symposium on Cloud Computing, 1-15, 2014 | 375 | 2014 |
A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples SN Naccache, S Federman, N Veeraraghavan, M Zaharia, D Lee, ... Genome research 24 (7), 1180-1192, 2014 | 349 | 2014 |