Problem converting HTML to plain text using jsoup

Seven Stars

Problem converting HTML to plain text using jsoup

Hi

I have a base64 data coming in from one of a column from salesforce, after converting the data, its evenutally a html data with all the tags, the downstream doesn't want those tags, i tried converting the html data to plain text using "jsoup" in tjavaflex. but I keep getting the error  "Detail Message: whitelist cannot be resolved to a variable", can someone let me know what i'm doing wrong, i have even placed the jar files in the appropriate directory.

 

 

code to convert html to plain text in "tjavaflex"

 

import org.jsoup.Jsoup;
import org.jsoup.helper.Validate;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;
import java.io.IOException;

output_row.sf_input_col= Jsoup.clean(sf_input_col, whitelist);

 

 

 

error

 

Error Line: 7406
Detail Message: whitelist cannot be resolved to a variable

thanks

MJ

 

Tags (3)
Moderator

Re: Problem converting HTML to plain text using jsoup

Hello,

Where is your  "whitelist" from? Any double quotes missing? Could you please post the whole job setting screenshots here?

Best regards

Sabrina

--
Don't forget to give kudos when a reply is helpful and click Accept the solution when you think you're good with it.

2019 GARNER MAGIC QUADRANT FOR DATA INTEGRATION TOOL

Talend named a Leader.

Get your copy

OPEN STUDIO FOR DATA INTEGRATION

Kickstart your first data integration and ETL projects.

Download now

What’s New for Talend Summer ’19

Watch the recorded webinar!

Watch Now

Best Practices for Using Context Variables with Talend – Part 2

Part 2 of a series on Context Variables

Blog

Best Practices for Using Context Variables with Talend – Part 1

Learn how to do cool things with Context Variables

Blog

Migrate Data from one Database to another with one Job using the Dynamic Schema

Find out how to migrate from one database to another using the Dynamic schema

Blog