Posted

If you know Java you can learn Hadoop, but what if you know .NET?


If you're really interested, there's HDInsight, Microsoft's Hadoop offering built in partnership with Hortonworks.

Posted

A few facts about Hadoop:
You should be well versed in Java for Hadoop.
You should know how to use Linux to run Java programs.
Hadoop is just a Java-based framework, and HDFS is its distributed file system. Once you learn how to use the Hadoop framework, the job is done. At a minimum you should also know one NoSQL database used alongside it (HBase, which runs on top of HDFS, or Cassandra).
Not many companies have Hadoop ecosystems, so openings are few. TCS, Wipro and other Indian companies hire Hadoop developers for $50-55 max. Leaving out the Indian IT companies, there are very few requirements, because the rest come from companies like Netflix, Amazon, Google and Apple. Even in these companies, each one has its own custom tools built on the Hadoop ecosystem, and not many people use MapReduce directly. So learning it is fine. If you get a job, great; if the billing rate is good, even better. But once the contract ends, nobody can tell when the next contract will come at a better rate than the previous one.


Add machine learning tools like Apache Mahout to that list; they are important too.
Posted

If you're really interested, there's HDInsight, Microsoft's Hadoop offering built in partnership with Hortonworks.

Please elaborate a bit. How is this related to .NET?

Posted

Hadoop is an ecosystem. It includes:

MapReduce: the programming model

HDFS: distributed file system

HBase: distributed database

Hive: data warehousing infrastructure for analysis on top of Hadoop/HDFS

Pig: a high-level platform for analyzing data; its scripting language compiles to MapReduce programs

Cassandra: a DBMS much like HBase

And so on; there are many more.

 

 

Hive and Pig are alternatives to writing MapReduce by hand, because most MapReduce programs are written to analyze data, and Hive and Pig can do that with SQL-like queries (HiveQL) and Pig Latin scripts. So if you can write some queries, you can use Hive and Pig. But if the company has a problem that has to be solved from scratch with its own custom solution, there is no option other than MapReduce.

Basically, some companies use only Hive and Pig, so Java and MapReduce are not needed for a Hadoop developer role (at those companies only).
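
To make the MapReduce vs. Hive/Pig point concrete, here is a rough sketch in plain Python, not real Hadoop code. The first half spells out the map, shuffle and reduce steps of a word count by hand. The second half gets the same answer from a single `GROUP BY` query. `sqlite3` is only a stand-in so the example runs anywhere; it is an assumption for illustration and not how Hive executes queries. The sample data and all function names are made up.

```python
from collections import defaultdict
import sqlite3

# Hypothetical input: each string plays the role of one line in a file on HDFS.
LINES = ["hadoop hive pig", "hive pig", "pig"]


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word, 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group the emitted values by key, as the framework would do."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: collapse all values for one key into a single count."""
    return key, sum(values)


def mapreduce_word_count(lines):
    """Hand-written pipeline: map every line, shuffle, then reduce each key."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


def sql_word_count(lines):
    """Same result from one declarative query (sqlite3 standing in for Hive)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE words (word TEXT)")
    conn.executemany(
        "INSERT INTO words VALUES (?)",
        [(word,) for line in lines for word in line.split()],
    )
    rows = conn.execute(
        "SELECT word, COUNT(*) FROM words GROUP BY word"
    ).fetchall()
    conn.close()
    return dict(rows)


if __name__ == "__main__":
    print(mapreduce_word_count(LINES))
    print(sql_word_count(LINES))
```

The query version is shorter because grouping and counting are left to the engine, which is roughly why analysis-only teams can get by with queries. The hand-written version is what you fall back to when the logic does not fit a query.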

Posted

Please elaborate a bit. How is this related to .NET?


I don't know much about it, but mostly Linux is replaced by Windows Server, .NET languages are used instead of Java, and so on.
Bottom line: everything is replaced by Microsoft tools and tech.
  • 3 weeks later...
Posted

If you know Java you can learn Hadoop, but what if you know .NET?

 

Hadoop has a flavor for .NET: HDInsight is available for Microsoft technologies.

Posted

GP

  • 1 year later...
Posted

Is this still a good idea to learn?

People are saying that Indian companies have come in and driven the billing rates down ;)
Posted

But bro, if you dedicate a full 4 months to it and find someone who gives real-time training, Hadoop will be easy. They say you need to know Java too.

 

Posted

If anyone wants to learn Hadoop from scratch, the topics below would be helpful.
1. Basic Unix (Linux preferred. Linux is currently the only supported production platform; other flavors like AIX don't work with Hadoop out of the box)
2. Basic shell scripting
3. Understanding of traditional filesystems
4. Understanding of cluster/distributed systems
5. Proficiency in any programming language (Java/Scala/Python preferred; an understanding of the JVM is required)
6. Understanding of RDBMS concepts
7. Big Data

Hadoop Stack
1. Hadoop architecture and distributions (name/data/edge nodes, ZooKeeper, etc.)
2. HDFS
3. MapReduce v1, MR2, YARN (most companies do not use MapReduce unless it is necessary)
4. Spark (the most popular alternative to MapReduce)
5. Pig, Hive, NoSQL/columnar databases (HBase, Cassandra), Avro/Parquet
6. Sqoop, Oozie, Flume
7. Advanced concepts
i. Streaming (Spark/Kafka/Storm)
ii. Mahout/MLlib
iii. Solr
8. Impala and Hue if using the Cloudera distribution
9. Hadoop security and authentication basics
