I have a csv file which consists of several columns. There is also a column called count having values like 1,2,3,5, etc. specific to each record. Now at the output, there would be multiple csv files and the number of columns at each of the csv file would depend upon the values of 'count' column. Consider the below scenario for further details.
input csv -
output csv -
(Taking 1st row into consideration)
As the value of 'count' here is 2 therefore, output csv will have
ie., only the first 2 columns.
Please help me achieve this scenario.
Thanks in advance!
@dipanjan93,if i understand correctly, you just want to take the first two columns of the input file is that correct?
you can just filter the columns which you do not want to populate using tFilterColumns or tJavarow or tMap.
@manodwhb Well not really. Number of columns at the output csv file would be dependent upon the 'count'. As in the example you can see for the 1st record count is 2 therefore 2 columns are there at the output. Similarly, for the second row the value of count column is 1 so the output shall have 1 columns. So on and so forth.
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Read about OTTO's experiences with Big Data and Personalized Experiences
Pick up some tips and tricks with Context Variables
Take a look at this video about Talend Integration with Databricks