Big Data and Hadoop Basics

This post provides an introduction to following concepts : Hadoop Basics What is HDFS ? What is YARN ?   Lets start with the simplest question first.     What is Big Data ?   Big data is a term coined for huge volume of data(in terrabytes or petabytes) that is difficult to manage using […]

Big Data and Hadoop Interview Questions

Q : What is a Namenode? A : Namenode is the master node on which Resource manager runs. It contains metadata about data present in datanodes. It maintains and manages the blocks which are present on the datanodes. It is a high-availability machine and single point of failure in HDFS.   Q : Is Secondary […]