I have a directory of files which I iterate through and output to 1 master file. I would like to sort all the rows in a particular order before outputting to a file but because I'm iterating through multiple files my output is automatically separated into batches based on the flow of rows through my job.
Is there a way to sort all the rows without having to read in the master file?
Solved! Go to Solution.
In this case would tFileList trigger SubJob Ok when all files have been read or after each file was read?
The subjob starts to work only when all files are read.
"on Subjob OK" means "when the subjob is finish with success", so in this case when all the files have been read.
Hope it will be helpful.
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Watch the recorded webinar!
Pick up some tips and tricks with Context Variables
Learn how media organizations have achieved success with Data Integration
Create systems and workflow to manage clean data ingestion and data transformation.