Guys, I've started my tests with Talend Big Data.
Now, specifically, I'm trying to read S3 csv file to a dataframe... I Want to try to merge this data with an existent parquet file on S3.
The talend is returning the following error:
I saw once some article or person saying that on Talend Big Data is necessary download the file from S3 to HDFS firstly and after with the file inside hdfs is possible then use a Big Data Batch job to process the data. Is it correct? Would be possible do the way I'm trying or Should I try the second approach.
I found really difficult to find answers to this through the internet...
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Learn how to make your data more available, reduce costs and cut your build time
Read about OTTO's experiences with Big Data and Personalized Experiences
Take a look at this video about Talend Integration with Databricks