In this post, we will write the Word count program in Java. We explained the logic of this program in MapReduce Hello World (Part 1). Before writing the program , here is the data type differences between Java and MapReduce: Equivalent of int in MapReduce is IntWritable Equivalent of String is Text Equivalent …
Continue reading MapReduce Hello World (Part 2)Big Data
In this post, we will : 1) Understand MapReduce basics 2) Write a word count program in Map Reduce This is also considered as the Hello World program in MapReduce programming. What is MapReduce ? MapReduce is the ‘heart‘ of Hadoop that consists of two parts – ‘map’ and ‘reduce’. Maps …
Continue reading MapReduce Hello World (Part 1)This post provides an introduction to following concepts : Hadoop Basics What is HDFS ? What is YARN ? Lets start with the simplest question first. What is Big Data ? Big data is a term coined for huge volume of data(in terrabytes or petabytes) that is difficult to manage using …
Continue reading Big Data and Hadoop BasicsIn this article, we will discuss what is Big data and why do enterprises care about Big data.we will learn: What is wrong with our traditional DWH solutions? When RDBMS could not help much Technical issues we face with RDBMS How Hadoop is different from RDBMS Core features of Hadoop What is wrong with our …
Continue reading What is Big Data and Why do Enterprises care about Big Data?Riak is an open source, distributed and NOSQL key-value database. What is Riak? Open Source database Highly Scalable Distributed Highly available Eventually/Strongly consistent Fault Tolerant Simple to operate Schema free Low Latency Where to use Riak? Huge volume of data Low Latency High velocity Read and writes If you need a highly available and consistent …
Continue reading Riak Tutorials