Batch JsonInput from Folder

Five Stars

Batch JsonInput from Folder

Hi,

 

I am trying to extract some data in CSV format from around 40,000 Json files.

Assuming that all files contain the same fields, what would be the best/most effecient way to process these?

 

Thanks

K

 


Accepted Solutions
Highlighted
Sixteen Stars TRF
Sixteen Stars

Re: Batch JsonInput from Folder

Hi,

The genral design should look like this:

tFileList-->tFileInputJSON-->(tSomethingElseIfNeeded)-->tFileOutpoutDelimited

Click on the "iterate" link after tFileList then tick the option"Enable parallel execution" then enter the number of threads you want in parallel (maybe something like 4 or 5).


TRF

View solution in original post


All Replies
Highlighted
Sixteen Stars TRF
Sixteen Stars

Re: Batch JsonInput from Folder

Hi,

The genral design should look like this:

tFileList-->tFileInputJSON-->(tSomethingElseIfNeeded)-->tFileOutpoutDelimited

Click on the "iterate" link after tFileList then tick the option"Enable parallel execution" then enter the number of threads you want in parallel (maybe something like 4 or 5).


TRF

View solution in original post

Five Stars

Re: Batch JsonInput from Folder

You are brilliant, thank you.
it is worth noting that in the filename path for the jsoninput you must specify the variable from tFileList_1:

((String)globalMap.get("tFileList_1_CURRENT_FILEPATH"))

2019 GARNER MAGIC QUADRANT FOR DATA INTEGRATION TOOL

Talend named a Leader.

Get your copy

OPEN STUDIO FOR DATA INTEGRATION

Kickstart your first data integration and ETL projects.

Download now

What’s New for Talend Summer ’19

Watch the recorded webinar!

Watch Now

Best Practices for Using Context Variables with Talend – Part 2

Part 2 of a series on Context Variables

Blog

Best Practices for Using Context Variables with Talend – Part 1

Learn how to do cool things with Context Variables

Blog

Migrate Data from one Database to another with one Job using the Dynamic Schema

Find out how to migrate from one database to another using the Dynamic schema

Blog