kum5758 Posted November 10, 2015 Report Posted November 10, 2015 If any want to learn Hadoop from scratch..below topics would be helpful. 1. Basic Unix(preferred Linux. Currently Linux is only supported production platform. Others flavors like AIX doesnt work with Hadoop out of the box) 2. Basic shell Scripting 3. Understanding of traditional filesystem 4. Understanding of Cluster/Distributed Systems 5. Proficiency in any programming language.(preferred Java/Scala/Python. Understanding of JVM is required) 6. Understanding of RDBMS concepts. 7. Big Data Hadoop Stack. 1. Hadoop Architecture & Distributions(Name/Data/edge nodes, zookeeper etc) 2. HDFS 3. MapReduce v1,MR2,YARN(most of the companies do not use MapReduce unless it is necessary) 4. Spark(most happening alternative fr MapReduce) 5. Pig, Hive, NoSQL/columnar Database(Haase, Cassandra), Avro/Parquet 6. Scoop,Oozie,Flume 7.Advanced Concepts i. Streaming(Spark/kafka/Storm) ii. Mahout/Mlib iii.Solr 8.Impala n Hue if using Cloudera distribution. 9.Hadoop Security n Authentication basics oka NON-IT background nundi ochina vaalaki em em pre-requiesite bhayya hadoop start chese mundu?
Recommended Posts