Running Talend Spark subjobs in parallel Talend Big data version 6.4

Seven Stars

Is there a way to run Talend Spark subjobs in parallel?


Accepted Solutions
Employee

Re: Running Talend Spark subjobs in parallel Talend Big data version 6.4

Hi,

When you trigger a Spark Big Data subjob, it runs in an independent process.

Talend will not let you run a Spark subjob unless the option "Use an independent process to run subjob" is selected.

You can trigger the Spark jobs in parallel with the tParallelize component from a DI job, which is there only to handle the orchestration. The tParallelize component itself cannot be used in Talend Big Data jobs.

Warm Regards,
Nikhil Thampi



All Replies
Seven Stars

Re: Running Talend Spark subjobs in parallel Talend Big data version 6.4

So the solution is to put the two subjobs into separate Talend Spark jobs.

Call these two Spark jobs from one Talend Standard job using the tRunJob component.

Set the tRunJob property "Use an independent process to run subjob".
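
For anyone who wants to see the idea outside of the Studio, here is a minimal Python sketch of the same pattern: each job is started as its own OS process, and the caller waits for all of them. This is not what Talend generates. The function name `run_in_parallel` is made up for illustration, and the two commands are placeholders standing in for whatever launcher scripts your exported Spark jobs provide.

```python
import subprocess
import sys


def run_in_parallel(commands):
    """Start every command as its own OS process, then wait for all of them.

    Returns the exit codes in the same order as `commands`.
    """
    # Popen returns as soon as the process is started, so every job is
    # already running before we begin waiting on the first one.
    processes = [subprocess.Popen(cmd) for cmd in commands]
    return [p.wait() for p in processes]


if __name__ == "__main__":
    # Placeholder commands (assumption): replace these with the launcher
    # scripts of your own exported Spark jobs.
    commands = [
        [sys.executable, "-c", "print('spark job A finished')"],
        [sys.executable, "-c", "print('spark job B finished')"],
    ]
    exit_codes = run_in_parallel(commands)
    print("exit codes:", exit_codes)
```

A non-zero exit code in the returned list tells you which job failed, which is roughly the information the parent job needs in order to decide whether to continue.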
