Introduction to Hadoop – understanding Big Data
This is my second post on my journey of understanding Big Data. My first post looked at the design of large-scale retrieval systems at Google. This post looks at Hadoop – a framework for processing massive data sets across multiple nodes, insipired by Google’s MapReduce and GFS architectures.
There are two nice video introductions to Hadoop – both sponsored by O’Reilly Media and both featuring Tom White, author of Hadoop: The Definitive Guide. The first webcast is from July 2009 titled An Introduction to Hadoop:
The second webcast is from September 2010 titled The State of Hadoop:
For a wonderful overview of the state of Big Data, please see Making Sense of Big Data, a PWC Technology forecast from 2010.
More to come on Hadoop and Big Data in future posts.