You can do the incremental load as follows :-
a) Extract the last run date from the DB and store in context variable. Also store the current time in another context variable.
b) Execute the SQL query from source database to extract the incremental data. Make sure your query's where clause is having both last run date and current system date stored in context variables.
c) Once the data extraction is complete, load the new system date from context variable to configuration table storing last run dates.
Another approach to extract the delta data is to implement the CDC (Change Data Capture) in source tables. Below link can give details about Talend CDC capabilities.
Note:- If the suggestion provided has helped to resolve your query, could you please mark the topic as solution provided? It will enrich the Talend community.
Unfortunately I do not have the backup of this job which I had created. But it is a very straightforward implementation.
Please feel free to get in touch if you are ever stuck during the job creation. We are always there to help
Apologies for the delay as I was on vacation.
The last run date will capture the time stamp which was used as the cut off time to fetch the delta records. Whenever you are running the delta job, the data fetch should happen between last run date+timestamp and current date+timestamp.
There are no PDF documents for this process but you can easily create a job flow during this concept. If you are stuck somewhere, please feel free to create a new post topic along with job flow and component error screenshots and we will be there to help you :-)
Introduction to Talend Open Studio for Data Integration.
Practical steps to developing your data integration strategy.
Create systems and workflow to manage clean data ingestion and data transformation.