Sqoop vs Talend capabilities

Six Stars

Sqoop vs Talend capabilities

Hi,

As per my analysis, for importing data from RDBMS to HDFS, we can do it either using tSqoop or tMSSql.

I.e. tSqoop can directly import the data to hdfs whereas using sql components we have to fetch data and then store in file and put the file on hdfs.

Now, I want to know. are there any other pro's and con's of these both approaches.

Ex. In sql approach I can manipulate the data before putting on HDFS, can we do this from sqoop as well?

 

So can someone please briefly elaborate this?

 

Thanks in advance!

Moderator

Re: Sqoop vs Talend capabilities

Hello,

In order to take advantage of MapReduce, you can use sqoop component to load data from RDBMS to HDFS directly.

Please have a look at  Apache's documentation about Sqoop.

Best regards

Sabrina

--
Don't forget to give kudos when a reply is helpful and click Accept the solution when you think you're good with it.

2019 GARNER MAGIC QUADRANT FOR DATA INTEGRATION TOOL

Talend named a Leader.

Get your copy

OPEN STUDIO FOR DATA INTEGRATION

Kickstart your first data integration and ETL projects.

Download now

What’s New for Talend Summer ’19

Watch the recorded webinar!

Watch Now

Put Massive Amounts of Data to Work

Learn how to make your data more available, reduce costs and cut your build time

Watch Now

How OTTO Utilizes Big Data to Deliver Personalized Experiences

Read about OTTO's experiences with Big Data and Personalized Experiences

Blog

Best Practices for Using Context Variables with Talend – Part 4

Pick up some tips and tricks with Context Variables

Blog