I have a directory of files which I iterate through and output to 1 master file. I would like to sort all the rows in a particular order before outputting to a file but because I'm iterating through multiple files my output is automatically separated into batches based on the flow of rows through my job.
Is there a way to sort all the rows without having to read in the master file?
Solved! Go to Solution.
In this case would tFileList trigger SubJob Ok when all files have been read or after each file was read?
The subjob starts to work only when all files are read.
"on Subjob OK" means "when the subjob is finish with success", so in this case when all the files have been read.
Hope it will be helpful.
Watch the recorded webinar!
Accelerate your data lake projects with an agile approach
Create systems and workflow to manage clean data ingestion and data transformation.
Introduction to Talend Open Studio for Data Integration.