Saturday, 23 April 2016

Apache Hadoop Abbreviations/Terms

Hadoop Terms
HDFS - Hadoop Distributed File System
GFS - Google File System
NN - NameNode
DN - Data Node
SNN - Secondary NameNode
JT - Job Tracker
TT - Task Tracker
HA NN - Highly Available NameNode (or NN HA - NameNode Highly Available)
REST - Representational State Transfer
HiveQL - Hive SQL
HAR - Hadoop Archive
ORC - Optimized Row Columnar
JSON - Java Script Object Notation
CDH - Cloudera’s Distribution Including Apache Hadoop
ZKFC - ZooKeeper Failover Controller
FUSE - Filesystem In Userspace
YARN - Yet Another Resource Negotiator
Amazon EC2 - Amazon Elastic Compute Cloud
Amazon S3 - Amazon Simple Storage Service
WASB - Windows Azure Storage Blobs (WASB)
EMR - Elastic MapReduce
JAR - Java ARchive
RPC - Remote Procedure Call
UDFs - user-defined functions
ETL - Extract/Transform/Load 
Hadoop -1.0.4.tar.gz Directory Structure click here

No comments:

Post a Comment