I want to know whether NLP like extraction is possible with Talend Data Integration tool.
My requirement is as follows:
In an excel sheet ,I have a column named 'Description'. It is a lengthy string giving details about a product.
I have a column in DB,named 'Product Type'. Product Type can have values like Product1,Product2,Product3 etc.
I need to analyse the 'Description' field in the excel sheet and find out whether a value like Product1 or Product2 or Product3 is available in that .If its present,extract the value and populate the 'Product Type' column in DB.
Please help me to figure out the feasibility of doing such a task using Talend DI.
IMHO, using just "equals" or "contains" methods may be a little restrictive compared from what you can expect from a NLP method.
I suggest you to include Stanford CoreNLP (or other NLP engine) in your project if you want to take advantage of a real natural language analysis tool.
Yes.It helped to get an idea about the steps to follow.I created a lookup table with possible values and merged it in tmap.
Join us at the Community Lounge.
Talend named a Leader.
Kickstart your first data integration and ETL projects.
Watch the recorded webinar!
Pick up some tips and tricks with Context Variables
Learn how media organizations have achieved success with Data Integration
Accelerate your data lake projects with an agile approach