[vc_row][vc_column][vc_column_text]

Apache Hadoop Open Source Tools for Big Data Projects

[/vc_column_text][vc_empty_space height=”20px”][vc_column_text]

S. No. Tools Type Used For Page Link.
1 Apache Accumulo NoSQL wide column store Data access Apache Accumulo – Hadoop Open Source Code
2 Apache Ambari Distributed Computing Operations Apache Ambari – Hadoop Open Source Code
3 Apache Atlas Metadata Management Data Management, Data Governance and Integration Apache atlas – Hadoop Open Source Code
4 Apache Falcon The framework of Managing data Data Governance and Integration Apache Falcon – Hadoop Open Source Code
5 Apache Flume Data transfer into HDFS Data Governance and Integration Apache Flume – Hadoop Open Source Code
6 Apache Hadoop Batch Processing Data Management, Data access, Data Governance and Integration, Security, Operations Apache Hadoop – Hadoop Open Source Code
7 Apache Hadoop HDFS Big Data Storage Data Management, Data Governance and Integration, Security Apache Hadoop HDFS – Hadoop Open Source Code
8 Apache Hadoop MapReduce Batch Processing Data access Apache Hadoop MapReduce – Hadoop Open Source Code
9 Apache Hadoop YARN Resource Management and Job Scheduling layer Data Management Apache Hadoop YARN – Hadoop Open Source Code
10 Apache HBase NoSQL Database Data access Apache HBase – Hadoop Open Source Code
11 Apache Hive Relational Database Data access Apache Hive – Hadoop Open Source Code
12 Apache Kafka Stream Processing Data Governance and Integration Apache Kafka – Hadoop Open Source Code
13 Apache Knox Gateway Security Entry point Security Apache Knox Gateway – Hadoop Open Source Code
14 Apache OOZIE Work flow Scheduler Operations Apache OOZIE – Hadoop Open Source Code
15 Apache Phoenix SQL Database Data access Apache Phoenix – Hadoop Open Source Code
16 Apache Pig (High level Scripting Language used with Hadoop) High level Scripting Language Data access Apache Pig – Hadoop Open Source Code
17 Apache Ranger Data Security Framework Security Apache Ranger – Hadoop Open Source Code
18 Apache Slider Framework of YARN- based Data access Apache Slider – Hadoop Open Source Code
19 Apache Solr Search Platform Data access Apache Solr – Hadoop Open Source Code
20 Apache Spark Hybrid Framework(Batch and Stream) Data access Apache Spark – Hadoop Open Source Codee
21 Apache Sqoop Data Transfer tool Data Governance and Integration Apache Sqoop – Hadoop Open Source Code
22 Apache Storm Distributed Stream Processing Data access Apache Storm – Hadoop Open Source Code
23 Apache Tez Framework of YARN- based Data access Apache Tez – Hadoop Open Source Code
24 Apache Zeppelin Web Based Notebook Data Analytics Apache Zeppelin – Hadoop Open Source Code
25 Apache Zookeeper Distributed Computing Operations Apache ZooKeeper – Hadoop Open Source Code
26 Druid Data Store BI queries Druid – Hadoop Open Source Code
27 Apache Samza Stream Processing Data Analytics Apache Samza – Hadoop Open Source Code
28 Apache Flink Stream Processing Data Analytics, ML Algorithms Apache Flink – Hadoop Open Source Code

[/vc_column_text][/vc_column][/vc_row]

Leave Comment

Your email address will not be published. Required fields are marked *

clear formSubmit