We have source log file as below:
(Please note its not delimited file)
12/31/2017 23:38:59.335-06:00 DB_firstValue 11,196
Table Count Thread Oldest entry
---------- ---------- ---------- ------------ --------
Table1 6,437 1 12/31/17 03:02:04
Table2 1,548 1 12/31/17 03:24:10
Table 1,123 1 12/31/17 03:24:10
tbl 871 1 12/31/17 03:02:07
abc_tbl 99 1 12/31/17 03:12:37
expected output file is:
|Report Run Date||Report Run Time||Database Name||Table Name||Oldest Trans Date||Oldest Trans Timestamp||count|
Please suggest how we can achieve this by talend job.
Source file attached for reference.
You can use talend Stat&Log feature in your job work flow.
tStatCatcher, Statistics table records details like begin Time, End Time which is generated by any job.
tLogCatcher, Logs table records the messages from Java Exception, tDie and/or tWarn (prerequisite: there must be an error in your job).
tFlowMeterCatcher, Flow Meter table records the number of rows processed in the defined flow(prerequisite: you have to use tFlowMeter in your job).
Here are related online documents about:TalendHelpCenter:tStatCatcher, TalendHelpCenter:tFlowMeterCatcher and TalendHelpCenter:tFlowMeter
Let us know if it is what you are looking for.
These solution are for log and errors, i can use these component once file read is done but i am facing issue to read the file as well, as file is not delimited one.
Attached file for reference(source- file1.txt, target-out.xlsx), please let me know for further.
Please have a look at this component tFileInputRaw with the option 'Read the file as a string' which can read all data in a raw file and sends it to a single output column for subsequent processing by another component.
Watch the recorded webinar!
Create systems and workflow to manage clean data ingestion and data transformation.
Introduction to Talend Open Studio for Data Integration.
Test drive Talend's enterprise products.