1 |
Apache Accumulo |
NoSQL wide column store |
Data access |
Apache Accumulo – Hadoop Open Source Code |
2 |
Apache Ambari |
Distributed Computing |
Operations |
Apache Ambari – Hadoop Open Source Code |
3 |
Apache Atlas |
Metadata Management |
Data Management, Data Governance and Integration |
Apache atlas – Hadoop Open Source Code |
4 |
Apache Falcon |
The framework of Managing data |
Data Governance and Integration |
Apache Falcon – Hadoop Open Source Code |
5 |
Apache Flume |
Data transfer into HDFS |
Data Governance and Integration |
Apache Flume – Hadoop Open Source Code |
6 |
Apache Hadoop |
Batch Processing |
Data Management, Data access, Data Governance and Integration, Security, Operations |
Apache Hadoop – Hadoop Open Source Code |
7 |
Apache Hadoop HDFS |
Big Data Storage |
Data Management, Data Governance and Integration, Security |
Apache Hadoop HDFS – Hadoop Open Source Code |
8 |
Apache Hadoop MapReduce |
Batch Processing |
Data access |
Apache Hadoop MapReduce – Hadoop Open Source Code |
9 |
Apache Hadoop YARN |
Resource Management and Job Scheduling layer |
Data Management |
Apache Hadoop YARN – Hadoop Open Source Code |
10 |
Apache HBase |
NoSQL Database |
Data access |
Apache HBase – Hadoop Open Source Code |
11 |
Apache Hive |
Relational Database |
Data access |
Apache Hive – Hadoop Open Source Code |
12 |
Apache Kafka |
Stream Processing |
Data Governance and Integration |
Apache Kafka – Hadoop Open Source Code |
13 |
Apache Knox Gateway |
Security Entry point |
Security |
Apache Knox Gateway – Hadoop Open Source Code |
14 |
Apache OOZIE |
Work flow Scheduler |
Operations |
Apache OOZIE – Hadoop Open Source Code |
15 |
Apache Phoenix |
SQL Database |
Data access |
Apache Phoenix – Hadoop Open Source Code |
16 |
Apache Pig (High level Scripting Language used with Hadoop) |
High level Scripting Language |
Data access |
Apache Pig – Hadoop Open Source Code |
17 |
Apache Ranger |
Data Security Framework |
Security |
Apache Ranger – Hadoop Open Source Code |
18 |
Apache Slider |
Framework of YARN- based |
Data access |
Apache Slider – Hadoop Open Source Code |
19 |
Apache Solr |
Search Platform |
Data access |
Apache Solr – Hadoop Open Source Code |
20 |
Apache Spark |
Hybrid Framework(Batch and Stream) |
Data access |
Apache Spark – Hadoop Open Source Codee |
21 |
Apache Sqoop |
Data Transfer tool |
Data Governance and Integration |
Apache Sqoop – Hadoop Open Source Code |
22 |
Apache Storm |
Distributed Stream Processing |
Data access |
Apache Storm – Hadoop Open Source Code |
23 |
Apache Tez |
Framework of YARN- based |
Data access |
Apache Tez – Hadoop Open Source Code |
24 |
Apache Zeppelin |
Web Based Notebook |
Data Analytics |
Apache Zeppelin – Hadoop Open Source Code |
25 |
Apache Zookeeper |
Distributed Computing |
Operations |
Apache ZooKeeper – Hadoop Open Source Code |
26 |
Druid |
Data Store |
BI queries |
Druid – Hadoop Open Source Code |
27 |
Apache Samza |
Stream Processing |
Data Analytics |
Apache Samza – Hadoop Open Source Code |
28 |
Apache Flink |
Stream Processing |
Data Analytics, ML Algorithms |
Apache Flink – Hadoop Open Source Code |