We are going to process large datasets stored on S3 with Amazon EMR.
My assumption is that we can access and process the data on S3 directly as a Hive external table.
By creating an external table in Hive whose location points at the S3 path, we can query the S3 data from EMR directly.
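To make the idea concrete, here is a minimal sketch of the kind of statement I mean. It only renders the HiveQL text in Python and does not connect to anything. The table name, columns, delimiter, and bucket path are all hypothetical placeholders, not our real schema.

```python
def external_table_ddl(table, columns, s3_location, delimiter=","):
    """Render a Hive CREATE EXTERNAL TABLE statement for delimited text files on S3."""
    # Each column is a (name, hive_type) pair, e.g. ("status", "INT").
    cols = ",\n  ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n"
        f"  {cols}\n"
        f")\n"
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}'\n"
        f"STORED AS TEXTFILE\n"
        f"LOCATION '{s3_location}'"
    )


# Hypothetical table, columns, and bucket, used for illustration only.
ddl = external_table_ddl(
    "access_logs",
    [("request_time", "STRING"), ("status", "INT")],
    "s3://my-example-bucket/logs/",
)
print(ddl)
```

Because the table is EXTERNAL, dropping it in Hive removes only the metadata and leaves the files on S3 untouched.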
I'd like to follow this procedure with Talend Big Data Platform.
How should I configure the components?
Sorry for our silence. We have redirected your issue to our PM and experts, and we will come back to you as soon as we can.