HADOOP ARCHITECTURE : A distributed file system

Paper Topic :

Computer Science and Its Applications

Author Name :

Ruchira A. Kulkarni

Abstract :

Hadoop is an open-source software framework that stores and processes big data in a distributed manner for cloud computing. Hadoop originated from Apache Nutch, an open-source search engine that is itself part of the Lucene project. Hadoop was created by Doug Cutting and Mike Cafarella in 2005 and is written in Java. It was developed mainly to support distribution for the Nutch search engine project. Yahoo has developed and contributed 80% of the core of Hadoop. All modules in Hadoop are designed on the basic assumption that hardware failures are common and should therefore be handled automatically in software by the framework.
