Creation of lookup data in BD spark jobs

Hi,

I have created several Standard jobs as a POC. We are now dealing with large data volumes (terabytes).

I need to convert the Standard jobs to Big Data (BD) Spark jobs.

Most of the components from the Standard jobs also work in BD jobs, but the critical components I require are tHashOutput and tHashInput.

Can you please suggest what alternatives I have for creating such lookup data in Big Data jobs?

Thanks

Badri Nair 


Accepted Solutions
Moderator

Re: Creation of lookup data in BD spark jobs

Hello,

Is there a specific reason you need the tHashInput and tHashOutput components in your Standard jobs?

tCacheIn and tCacheOut are available in the Spark Batch and Spark Streaming Job frameworks.
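
For readers unfamiliar with the pattern, the idea behind a hash or cache lookup is to build the lookup data once and then reuse it from several flows. The plain-Python sketch below only illustrates that idea; it is not Talend or Spark code, and the names `build_lookup`, `enrich`, and the sample rows are hypothetical.

```python
from typing import Dict, Iterable, List, Optional, Tuple


def build_lookup(rows: Iterable[Tuple[str, str]]) -> Dict[str, str]:
    """Build the lookup once, keyed by the join column (hypothetical helper)."""
    return {key: value for key, value in rows}


def enrich(
    main_rows: Iterable[Tuple[int, str]], lookup: Dict[str, str]
) -> List[Tuple[int, str, Optional[str]]]:
    """Left-join each main row against the already-built lookup.

    Rows whose key is missing from the lookup get None as the looked-up value.
    """
    return [(row_id, key, lookup.get(key)) for row_id, key in main_rows]


# The lookup is built a single time...
country_lookup = build_lookup([("FR", "France"), ("IN", "India")])

# ...and then reused by two independent flows without being rebuilt.
orders = enrich([(1, "IN"), (2, "FR"), (3, "XX")], country_lookup)
customers = enrich([(10, "FR")], country_lookup)
```

In a Spark job the cached data would be distributed across the cluster rather than held in a single in-memory dictionary, so treat this purely as a picture of the build-once, read-many pattern.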

Best regards

Sabrina

