Skip to main content

Table 2 Diffusion and popularity of the systems

From: Programming big data analysis: principles and solutions

System

Main companies using it

User community size questions (weekly)

GitHub stars

API support

GitHub commits

Hadoop

Yahoo!, IBM, Amazon

Large-43.3k (36)

11.8k

Java, C, C++, Ruby, Groovy, Perl, Python

25.1k

Spark

eBay, Amazon, Alibaba

Very Large-69.5k (193)

30.4k

Scala, Python, Java, R

30.8k

Storm

Twitter, Groupon, Spotify

Small-2.5k (2)

6.3k

Clojure, Java, Python, Ruby, JavaScript

10.4k

Hama

Samsung Electronics, Korea Telecom, Sogou

Very Small-22 (< 1)

129

Java, Python, C, C++

1.6k

MPI

Amazon WS, AMD, Cisco, Facebook

Medium-6.3k (13)

1.3k

Java, Fortran, C, C++, Perl, Python

31.8k

Hive

Facebook, Netflix, Yahoo!, AirBnB

Large-20.2k (44)

3.8k

HiveQL

15.6k

Pig

LinkedIn, PayPal, Mendeley

Small-5.2k (< 2)

631

PigLatin

3.7k