We are going to process the big data on S3 with Amazon EMR.
Basically, I assume that we can directly access and process the data on S3 as a Hive External Table.
Like below, by creating the external table on Hive, we can access the S3 data through EMR directly.
So, I'd like to use this procedure with Talend BigData Platform.
How can I define the components??
Sorry for our silence. We have re-directed your issue to our PM and experts and then come back to you as soon as we can.
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Watch the recorded webinar!
Pick up some tips and tricks with Context Variables
Learn how media organizations have achieved success with Data Integration
Accelerate your data lake projects with an agile approach