We are going to process the big data on S3 with Amazon EMR.
Basically, I assume that we can directly access and process the data on S3 as a Hive External Table.
Like below, by creating the external table on Hive, we can access the S3 data through EMR directly.
So, I'd like to use this procedure with Talend BigData Platform.
How can I define the components??
Sorry for our silence. We have re-directed your issue to our PM and experts and then come back to you as soon as we can.
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Watch the recorded webinar!
Learn how to do cool things with Context Variables
Find out how to migrate from one database to another using the Dynamic schema
Pick up some tips and tricks with Context Variables