Jump to content

Remote Data Transfer between multiple Data lakes


Recommended Posts

Posted

Context : As part of a big data project, we have multiples data lakes (one by region): America, Asia and Europe. And we need to transfer data between them. Our data lakes are built around Hortonworks.

Constraint: Limitations in the WAN network because it is also used to transfer
Higher priority data (our data are a little less).

Volumetry: tens of gigs of data per day.

Need: What are the bests practices to garnish this transfer of data between the plates?
Do you have returns exp. For this kind of
need.

All documentation, contact, link, information will be welcome.

Thank you for your help

 

 

@Spartan @k2s @yomama @tennisluvr @loveindia Help!

Posted

there are several crawl tools that you can use for data transfer... we are using ucm 

Posted
Just now, Katravelli said:

there are several crawl tools that you can use for data transfer... we are using ucm 

Can you please elaborate 

Posted
2 hours ago, Ruskey said:

Context : As part of a big data project, we have multiples data lakes (one by region): America, Asia and Europe. And we need to transfer data between them. Our data lakes are built around Hortonworks.

Constraint: Limitations in the WAN network because it is also used to transfer
Higher priority data (our data are a little less).

Volumetry: tens of gigs of data per day.

Need: What are the bests practices to garnish this transfer of data between the plates?
Do you have returns exp. For this kind of
need.

All documentation, contact, link, information will be welcome.

Thank you for your help

 

 

@Spartan @k2s @yomama @tennisluvr @loveindia Help!

Azcopy is there no.. 

within cloud to cloud transfer is pretty fast..

Posted
Just now, k2s said:

Azcopy is there no.. 

within cloud to cloud transfer is pretty fast..

Thx let me research on that 

Posted
25 minutes ago, Ruskey said:

Thx let me research on that 

Azure data lake factory tools are exclusively for data copying . 

Posted
On 1/26/2017 at 7:51 AM, Ruskey said:

Context : As part of a big data project, we have multiples data lakes (one by region): America, Asia and Europe. And we need to transfer data between them. Our data lakes are built around Hortonworks.

Constraint: Limitations in the WAN network because it is also used to transfer
Higher priority data (our data are a little less).

Volumetry: tens of gigs of data per day.

Need: What are the bests practices to garnish this transfer of data between the plates?
Do you have returns exp. For this kind of
need.

All documentation, contact, link, information will be welcome.

Thank you for your help

 

 

@Spartan @k2s @yomama @tennisluvr @loveindia Help!

shame on you man.. question kuda original kadu kada.. LinkedIN lo Hadoop Users lo Ahmed Cheriat 8 days back esina question idi.. as it is copy paste chesi ikkada esinav... question anna ardam ayinda vay neeku aada ? edo help help ani sfamming posts esinav *7*^ 

Posted
On 1/26/2017 at 10:41 AM, Ruskey said:

Thx let me research on that 

RT.gif panimalina pake pakodi 

Posted
1 minute ago, k2s said:

like this a HK.gif

aadu andariki cheppey answer adey andukey adey cheppa nenu kuda daddy man... diaper trash cheyakuna eeda em chestunnav... RT.gif

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...