I'm trying to collect the output of the tHbaseInput component, which is returned as a JavaPairRDD<Nullable, row1Struct>.
row1Struct is a class generated by Talend.
Basically, I want to collect the JavaPairRDD and convert it to a DataFrame.
When I call the collect method on the JavaPairRDD, it returns a list of tuples of <Nullable, row1Struct>.
My problem is that I don't know what this row1Struct is. How can I get the row1Struct values as a List of String or something similar, so that I can convert them to a DataFrame?
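To make the question concrete, here is a rough sketch of the kind of conversion I have in mind, written in plain Java so it runs without Spark. Everything in it is an assumption for illustration: Row1Struct is a hypothetical stand-in for the generated class (I am assuming the generated struct exposes its columns as public fields), and java.util.Map.Entry stands in for the Tuple2 that collect returns.

```java
import java.lang.reflect.Field;
import java.lang.reflect.Modifier;
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class StructRows {
    // Read every public instance field of one row into a name -> string map.
    // A TreeMap is used because getFields() does not guarantee any order.
    static Map<String, String> toStringMap(Object row) {
        Map<String, String> values = new TreeMap<>();
        for (Field f : row.getClass().getFields()) {
            if (Modifier.isStatic(f.getModifiers())) {
                continue; // skip constants such as serialVersionUID
            }
            try {
                values.put(f.getName(), String.valueOf(f.get(row)));
            } catch (IllegalAccessException e) {
                throw new IllegalStateException("Cannot read field " + f.getName(), e);
            }
        }
        return values;
    }

    // Drop the key of each collected pair and convert only the value.
    static List<Map<String, String>> valuesOf(List<? extends Map.Entry<?, ?>> pairs) {
        List<Map<String, String>> rows = new ArrayList<>();
        for (Map.Entry<?, ?> pair : pairs) {
            rows.add(toStringMap(pair.getValue()));
        }
        return rows;
    }

    public static void main(String[] args) {
        // Stand-in for the result of collect(): pairs with a null key.
        List<Map.Entry<Object, Row1Struct>> collected = new ArrayList<>();
        collected.add(new SimpleEntry<>(null, new Row1Struct("k1", 5)));
        collected.add(new SimpleEntry<>(null, new Row1Struct("k2", null)));
        System.out.println(valuesOf(collected));
    }
}

// Hypothetical stand-in for the Talend-generated row1Struct; the real class
// has whatever columns the tHbaseInput schema defines.
class Row1Struct {
    public String id;
    public Integer amount;

    Row1Struct(String id, Integer amount) {
        this.id = id;
        this.amount = amount;
    }
}
```

In the real job, the same idea would apply to the value of each tuple (the second element). If the generated class happens to follow JavaBean conventions, Spark's createDataFrame overload that takes a JavaRDD and a bean class might avoid the string conversion entirely, but I have not confirmed that for row1Struct.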
The worst part of the Talend tJava component is that it doesn't tell you any data types.
We have to print everything to Sysout and then guess what the data type could be.
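Instead of guessing from printed values, one option is to ask the object itself through Java reflection. This is only a sketch: SampleRow is a made-up class standing in for whatever object tJava hands over, and TypeInspector is a helper name I invented, not something Talend provides.

```java
import java.lang.reflect.Field;
import java.lang.reflect.Modifier;
import java.util.Map;
import java.util.TreeMap;

class TypeInspector {
    // Map each declared instance field name to the simple name of its type.
    static Map<String, String> fieldTypes(Object obj) {
        Map<String, String> types = new TreeMap<>();
        for (Field f : obj.getClass().getDeclaredFields()) {
            if (Modifier.isStatic(f.getModifiers()) || f.isSynthetic()) {
                continue; // ignore constants and compiler-generated fields
            }
            types.put(f.getName(), f.getType().getSimpleName());
        }
        return types;
    }

    // One line with the runtime class name followed by its field types.
    static String describe(Object obj) {
        return obj.getClass().getName() + " " + fieldTypes(obj);
    }

    public static void main(String[] args) {
        System.out.println(describe(new SampleRow()));
    }
}

// Made-up class used only to demonstrate the helper.
class SampleRow {
    public String name;
    public Integer count;
}
```

Calling describe on an element of the collected list would show the actual class and the declared type of each column in one go, rather than printing values one by one.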
Talend's streaming edition is a subscription edition; why in the world would anyone charge money for such a poor tool?