From: Programming big data analysis: principles and solutions
System | Main companies using it | User community size questions (weekly) | GitHub stars | API support | GitHub commits |
---|---|---|---|---|---|
Hadoop | Yahoo!, IBM, Amazon | Large-43.3k (36) | 11.8k | Java, C, C++, Ruby, Groovy, Perl, Python | 25.1k |
Spark | eBay, Amazon, Alibaba | Very Large-69.5k (193) | 30.4k | Scala, Python, Java, R | 30.8k |
Storm | Twitter, Groupon, Spotify | Small-2.5k (2) | 6.3k | Clojure, Java, Python, Ruby, JavaScript | 10.4k |
Hama | Samsung Electronics, Korea Telecom, Sogou | Very Small-22 (< 1) | 129 | Java, Python, C, C++ | 1.6k |
MPI | Amazon WS, AMD, Cisco, Facebook | Medium-6.3k (13) | 1.3k | Java, Fortran, C, C++, Perl, Python | 31.8k |
Hive | Facebook, Netflix, Yahoo!, AirBnB | Large-20.2k (44) | 3.8k | HiveQL | 15.6k |
Pig | LinkedIn, PayPal, Mendeley | Small-5.2k (< 2) | 631 | PigLatin | 3.7k |