I'm trying to collect the tHbaseInput component output , which is returned as JavaPairRDD<Nullable, row1Struct>.
My question is row1Struct is the class which is generated by Talend.
Basically i want to collect the JavaPairRDD and convert as DataFrame.
When i call the collect method on JavaPairRDD, it is returning the Tuple of <Nullable, row1Struct>
My problem is what is this row1Struct, how to get the list of row1Struct as List of String or something, so i can convert that to DataFrame.
The worst part of Talend tJava component is it doesnt say any data type.
We have to Print each and everything in Sysout then decide what is the datatype it could be.
Talend Streaming edition is subscription edition, why in the world some one gives this worst tool as a priced one.
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Watch the recorded webinar!
Learn how to make your data more available, reduce costs and cut your build time
Read about OTTO's experiences with Big Data and Personalized Experiences
Take a look at this video about Talend Integration with Databricks