[resolved] Parsing data from HTML

Four Stars

[resolved] Parsing data from HTML

I have tried approaches, and to be honest, It's a pain in the ass.
I have tried the tHTMLParse custom component from the exchange but it does not help too much. 
Is there maybe a way to map the data from a html document with a XPath, like in the tfileInputXML component. maybe extracting the value of a attribute and some value of a tag. I'm supprised there is not anything like that or I'm just missing something.
also, I saw there are some other components used for this purposes, but nothing for version 5.5.1
Community Manager

Re: [resolved] Parsing data from HTML

Hi 
There is no a special component for extracting the value of an attribute or a tag from a html file. You can try to use tFileInputRegex to do this with regex expression.
Best regards
Shong
----------------------------------------------------------
Talend | Data Agility for Modern Business
Four Stars

Re: [resolved] Parsing data from HTML

thnx. I was hoping someone would give some other answer but was aware that this will probably be the case.
thnx once again

2019 GARNER MAGIC QUADRANT FOR DATA INTEGRATION TOOL

Talend named a Leader.

Get your copy

OPEN STUDIO FOR DATA INTEGRATION

Kickstart your first data integration and ETL projects.

Download now

What’s New for Talend Summer ’19

Watch the recorded webinar!

Watch Now

Best Practices for Using Context Variables with Talend – Part 4

Pick up some tips and tricks with Context Variables

Blog

How Media Organizations Achieved Success with Data Integration

Learn how media organizations have achieved success with Data Integration

Read

Why Companies Move to the Cloud: 7 Success Stories

Learn how and why companies are moving to the Cloud

Read Now