Introduction to Apache Hadoop, an open-source software framework for storage and large-scale processing of data sets on clusters of commodity hardware.