I enabled the parallelization options on XML extract jobs that converts XML fields to pipe delimited text file. But it is causing data jumble. When I do partition and departition option jobs get stuck.
What is the correct way to set partition and departition? Could you please suggest. I am sharing the job screenshot below:
From your screenshot, you are using tExtractXMLField component in your work flow. Do you want to run synchronously your job for each XML file? Could you please elaborate your case with an example with input and expected output value?
For the performance concern I am trying to use this parallelization option. I want extract to be performed parallel so that extract would be fast. The other combination that I tried gets jobs running fine but data jumble issue is there. Suppose timestamp column captures data only for timestamp normally or without this option the way I have in screenshot. May be there is something wrong with the option I selected as screenshot, I don't know but I am getting data jumbled up. Data that doesn't belong to timestamp column is populating to timestamp column which is wrong. Could you please let me know, what is the issue?