
What Is Big Data And What Is Hadoop



Posted

Can we also work on big data with some tool other than Hadoop?



hana on sap
Posted

Is big data a part of Hadoop? Or is big data something completely separate?

It's a good question, and a lot of people get confused about it.

 

Big data is the problem of companies, small and large alike, not being able to handle the data coming in from many sources (web logs, sensor data, etc.). Over the next 3 to 5 years, data is expected to grow something like 50 to 70% compared with the previous 5 years. Companies that cannot analyze this data will miss important information, so Hadoop is used to handle this type of data.

 

Hadoop is not a single tool; it is an open-source ecosystem that is used to handle big data and process it in a distributed way.
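
To give a feel for what distributed processing means here, below is a minimal single-machine sketch of the MapReduce pattern in Python. It is not Hadoop's API: the function names (map_phase, shuffle, reduce_phase, word_count) and the sample input are made up for illustration, and each inner list only stands in for a block of data that a separate node would process.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, the step a framework performs between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key."""
    return key, sum(values)


def word_count(splits):
    # Each split is a hypothetical stand-in for a data block handled by one node.
    mapped = chain.from_iterable(
        map_phase(line) for split in splits for line in split
    )
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    splits = [
        ["web log sensor log"],   # block that node 1 would read
        ["sensor data web log"],  # block that node 2 would read
    ]
    print(word_count(splits))
```

In a real cluster the map calls run in parallel on the nodes that hold the data, and the framework does the shuffle over the network; the sketch only shows the shape of the computation.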

 

If you want to know why Oracle, Teradata, etc. are not able to handle this type of data, please search Google for "RDBMS vs Hadoop".

 

Batch processing: MapReduce, Pig, Apache Spark

Real-time processing: Storm or Spark Streaming

SQL interface: Hive, Impala, Spark SQL, Apache Drill

Sqoop: used to move data from an RDBMS to HDFS or vice versa

Oozie: scheduling the jobs

NoSQL: HBase, MongoDB, Cassandra, etc.

Graph database: Neo4j

Dashboards to visualize this data: Tableau
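
As a rough illustration of the difference between the batch and real-time categories in the list above, here is a small plain-Python sketch. It is not Spark or Storm code; the names batch_count and StreamingCounter and the event format are assumptions made up for this example.

```python
from collections import Counter


def batch_count(events):
    """Batch style: the whole data set is already stored; compute the result once."""
    return Counter(event["source"] for event in events)


class StreamingCounter:
    """Streaming style: keep a running result and update it as each event arrives."""

    def __init__(self):
        self.counts = Counter()

    def on_event(self, event):
        self.counts[event["source"]] += 1
        # Return a snapshot so the caller sees the result as of this event.
        return dict(self.counts)


if __name__ == "__main__":
    events = [{"source": "weblog"}, {"source": "sensor"}, {"source": "weblog"}]

    # Batch: one answer after reading everything.
    print(batch_count(events))

    # Streaming: an updated answer after every event.
    stream = StreamingCounter()
    for event in events:
        print(stream.on_event(event))
```

Both approaches end up with the same totals; the difference is when the answer becomes available, which is why the ecosystem has separate tools for each.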

 

Important technologies to learn: Core Java, Python, Scala, R

Posted

GP

Posted

gp

Posted

So is Core Java the first step? Are there any excellent Core Java video tutorials for beginners who don't know Java?

Posted

GP.. Terrific Info..

Posted

Need to learn Java.

Posted

Well said, brother. All this time I didn't know what this tool was useful for; now I understand.

Posted

If you knew that, you could comfortably move into BE or BPM, right?

Posted

My manager says it's a 5-year project, so I'm not getting any interest in learning more.

Posted

Hmm, that's a good thing, isn't it? Settle in. The market is only so-so as it is. If there's another position, PM me and I'll try.
