Hadoop 1 Vs Hadoop 2
Limitation of hadoop 1 ü No horizontal scalability of NameNode · As a cluster size grows there is a bottleneck for NameNode metadata ü Does not supports NameNode High-availability · If NameNode is single point of contact, if it fails then whole HDFS process is down ü Overburdened Job Tracker · As Schedule job, Monitor Job, if Task Tracker fails then reschedule, Resource manage are done by this process alone ü Does not support multi-tenancy · Means different(Map-reduce, Streaming job, Interactive job) jobs on same resource at a same time Hadoop 2 1. HDFS Federation · We can have more than one NameNode’s each NameNode will have namespace associated with it. · ...