How to process the AWS S3 data from EMR as an Hive External Table with Talend BigData Platform

Six Stars

How to process the AWS S3 data from EMR as an Hive External Table with Talend BigData Platform

Hi all, 

 

We are going to process the big data on S3 with Amazon EMR.

 

Basically, I assume that we can directly access and process the data on S3  as a Hive External Table.

S3 external image from EMR.PNG

 

Like below, by creating the external table on Hive, we can access the S3 data through EMR directly.

 

S3 external image from EMR2.PNG

 

So, I'd like to use this procedure with Talend BigData Platform.

 

How can I define the components??

 

 

 

Moderator

Re: How to process the AWS S3 data from EMR as an Hive External Table with Talend BigData Platform

Hello,

Sorry for our silence. We have re-directed your issue to our PM and experts and then come back to you as soon as we can.

Best regards

Sabrina

 

--
Don't forget to give kudos when a reply is helpful and click Accept the solution when you think you're good with it.

2019 GARNER MAGIC QUADRANT FOR DATA INTEGRATION TOOL

Talend named a Leader.

Get your copy

OPEN STUDIO FOR DATA INTEGRATION

Kickstart your first data integration and ETL projects.

Download now

What’s New for Talend Summer ’19

Watch the recorded webinar!

Watch Now

Best Practices for Using Context Variables with Talend – Part 4

Pick up some tips and tricks with Context Variables

Blog

How Media Organizations Achieved Success with Data Integration

Learn how media organizations have achieved success with Data Integration

Read

Agile Data lakes & Analytics

Accelerate your data lake projects with an agile approach

Watch