How to process AWS S3 data from EMR as a Hive External Table with Talend BigData Platform

Six Stars

Hi all, 

 

We are planning to process big data stored on S3 with Amazon EMR.

Basically, I assume that we can directly access and process the data on S3 as a Hive External Table.

S3 external image from EMR.PNG

 

As shown below, by creating an external table in Hive, we can access the S3 data directly from EMR.

 

S3 external image from EMR2.PNG
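
In text form, the kind of statement I mean looks roughly like the output of the sketch below. This is only an illustration, not the exact statement from the screenshot: the table name `web_logs`, the columns, the bucket `my-example-bucket`, and the helper `external_table_ddl` are all hypothetical, and I am assuming comma-delimited text files. The Python helper just assembles the HiveQL string so that it can be pasted into the Hive CLI on the EMR master node.

```python
def external_table_ddl(table, columns, s3_location, delimiter=","):
    """Build a HiveQL CREATE EXTERNAL TABLE statement for delimited text files on S3.

    All arguments are illustrative; adjust them to the real data layout.
    """
    # One "`name` TYPE" entry per column, one column per line.
    cols = ",\n  ".join(f"`{name}` {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS `{table}` (\n  {cols}\n)\n"
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}'\n"
        f"STORED AS TEXTFILE\n"
        # LOCATION points at the S3 prefix; Hive reads the files in place.
        f"LOCATION '{s3_location}'"
    )


# Hypothetical table, columns, and bucket, used only for illustration.
ddl = external_table_ddl(
    "web_logs",
    [("event_time", "STRING"), ("user_id", "STRING"), ("bytes_sent", "BIGINT")],
    "s3://my-example-bucket/logs/",
)
print(ddl)
```

Because the table is EXTERNAL, dropping it in Hive removes only the table definition and leaves the files on S3 untouched, which is why I would like to keep this approach.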

 

So, I'd like to use this procedure with Talend BigData Platform.

 

How should I define the components in the Talend job?

Moderator

Re: How to process AWS S3 data from EMR as a Hive External Table with Talend BigData Platform

Hello,

Sorry for our silence. We have redirected your issue to our PM and experts and will come back to you as soon as we can.

Best regards

Sabrina

 
