I am trying to create a spark job to iterate through a file placed in hdfs and store it into an ArrayList. That ArrayList will be used in a java code for lookUp. But, the iterate operation is not supported as well as it is not populating the ArrayList too.
Many components are not available in the spark job which are there in a standard job, making it difficult to solve the problem statement.
Any suggestions are welcome
Watch the recorded webinar!
Introduction to Talend Open Studio for Data Integration.
Test drive Talend's enterprise products.
Practical steps to developing your data integration strategy.