Skip to main content

Table 4 The tools used in the respective data analysis pipelines of each paper

From: Manufacturing process data analysis pipelines: a requirements analysis and survey

Paper

Ingestion

Communication

Storage

Analysis

Visualization

[44]

Custom

HDFS, HBase, MongoDB Infinispan,

Hadoop, Hive, Pig, Elasticsearch

Custom

[45]

Custom

HDFS, MySQL

Hadoop

− (\(\sim \))

[46]

WSO2 BAM

WSO2 ESB

HDFS, RDB (\(\sim \)), Cassandra (\(\sim \))

Hadoop, WSO2 CEP

Custom (WSO2 UES)

[47]

Kafka

HDFS

Hadoop, Storm

[48]

HDFS, HBase, MongoDB Cassandra,

Hadoop, Hive

[49]

HDFS

Hadoop, Mahout, Jena Elephas

[50]

MySQL

Matlab, QuickCog

[51]

Custom

Microsoft SQL 2012

Custom

Custom

[52]

Custom

Kafka

HDFS, HBase

Hadoop, Storm, Hive, Radoop, Rapidminer

− (\(\sim \))

[53]

Sqoop

HDFS, HBase

Hadoop, Hive, Impala

[54]

Sqoop

Flume

HDFS, HBase, MySQL

Hadoop, Hive

Custom

[55]

Custom

Custom

MongoDB

Custom

Custom

[56]

Custom

MongoDB, PostgreSQL

RStudio, Watson Analytics, Qliksense

Custom

[57]

Flume (\(\sim \)), Sqoop (\(\sim \))

Custom

HDFS, HBase

Hadoop, Hive, Impala, Spark, Pig

Custom

[58]

Custom

Custom

Cassandra

Spark

Zeppelin (\(\sim \))

[59]

Kafka

Kafka

Cassandra, OntoQUAD

Spark

Custom, Jupyter, Ontos Eiger

[60]

Custom

HDFS

Hadoop, Hive, Spark

Custom

[61]

Storm

Kafka

MongoDB

Storm

Custom

[62]

Pig, Hive

Custom

HDFS

Hadoop, Hive, Pig

Flamingo, Custom

[63]

ODI, Talend, Sqoop

Kafka

HDFS, HBase

Hadoop, Spark, IPython

Tableau, Microsoft BI

[64]

Sqoop, Custom

Custom

HDFS, RDB \(\sim \)

Hadoop, Hive, Impala, Spark, Matlab

[65]

Custom

Custom (\(\sim \))

Custom

[66]

Custom

Kafka, RabbitMQ

HDFS, HBase, Cassandra, PostgreSQL

Hadoop, Spark, Storm

Custom

[67]

Custom

Custom

Microsoft SQL 2008R2

Custom

Custom

[68]

Custom

Cassandra

Spark

[69]

Sqoop

HDFS

Spark

Custom

[70]

Custom, Storm

Kafka

CouchDB

[71]

Sqoop

HDFS, HBase

Hadoop, Hive, Pig

Custom

[72]

Custom

MongoDB

Custom

[73]

WSO2 ESB

WSO2 ESB

Alfresco CMS, Neo4j

Apache UIMA, WEKA

Custom

[74]

− (\(\sim \))

HDFS, HBase

Hadoop, Hive

[75]

Spark

HDFS, HBase

R, Drools

Custom

[76]

Custom

MySQL

Custom

Custom

[77]

Custom

Microsoft SQL 2008R2

Custom

Custom

[78]

Flume, Sqoop

HDFS

Hadoop, Hive, Solr, RServe, Mahout

Custom

[79]

Flume

Kafka

HDFS, HBase, MySQL

Hadoop, Hive, Storm

Custom

[80]

Kafka

Cassandra

Spark

Custom

[81]

Custom

RabbitMQ

HDFS

Hadoop

  1. \(\sim \), implies that it is uncertain if the tool was used for this stage of the pipeline. −, implies that no tool was used for this stage of the pipeline
  2. ODI: Oracle Data Integrator, BAM: Business Activity Monitor, ESB: Enterprise Service Bus, CEP: Complex Event Processor, UES: User Engagement Server