I have a directory of files which I iterate through and output to 1 master file. I would like to sort all the rows in a particular order before outputting to a file but because I'm iterating through multiple files my output is automatically separated into batches based on the flow of rows through my job.
Is there a way to sort all the rows without having to read in the master file?
Solved! Go to Solution.
In this case would tFileList trigger SubJob Ok when all files have been read or after each file was read?
The subjob starts to work only when all files are read.
"on Subjob OK" means "when the subjob is finish with success", so in this case when all the files have been read.
Hope it will be helpful.
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Part 2 of a series on Context Variables
Learn how to do cool things with Context Variables
Find out how to migrate from one database to another using the Dynamic schema