Hi there. Which palette components should I use to:
1. Accept a PDF file
2. Convert this file to text (would tMap be a good option?)
3. Store the text file on HDFS
I have written a routine that converts the PDF file to a text file, but I am stuck on which components to use. Please help!
Hi, there is no official PDF component in Talend. A custom component, tPDFToText, written by a Talend community user and shared on Talend Exchange, converts a PDF to a text file. Best regards, Sabrina
Thank you for the reply. The tPDFToText component is quite static: it takes only one file at a time, and the output file name has to be specified in the component settings. Is there any way to make it dynamic?
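One possible approach, as a rough sketch only: derive each output file name from the input file name instead of hard-coding it. The routine below is hypothetical (the class name `PdfNames`, the method `toTextPath`, and the example directories are made up for illustration and are not part of tPDFToText). It assumes the component's output field accepts a Java expression, which has not been verified for this custom component. In a job that iterates over files with tFileList, the current path is usually available as `((String) globalMap.get("tFileList_1_CURRENT_FILEPATH"))`, and that value could be passed to a helper like this one.

```java
// Hypothetical routine: builds the output .txt path for a given input PDF path.
// It only manipulates strings; it does not read or convert the PDF itself.
final class PdfNames {
    private PdfNames() {}

    static String toTextPath(String pdfPath, String outputDir) {
        // Keep only the file name, accepting both '/' and '\' as separators.
        int slash = Math.max(pdfPath.lastIndexOf('/'), pdfPath.lastIndexOf('\\'));
        String fileName = pdfPath.substring(slash + 1);

        // Drop the last extension, if any (a leading dot is not treated as one).
        int dot = fileName.lastIndexOf('.');
        String base = dot > 0 ? fileName.substring(0, dot) : fileName;

        // Join with a forward slash, as HDFS paths use.
        String dir = outputDir.endsWith("/") ? outputDir : outputDir + "/";
        return dir + base + ".txt";
    }
}
```

For example, `PdfNames.toTextPath("/data/in/report.pdf", "/user/talend/out")` gives `/user/talend/out/report.txt`, so each PDF picked up in the loop gets its own text file without editing the component settings by hand.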