Questions about reusing Data Quality source code

One Star

Hi there
DQ provides very powerful data analysis functionality with customizable validation rules, so I would like to use Data Quality to validate the data in my project. For example, I need to use predefined (Java-based) rules and custom rules defined via regular expressions, such as checks for empty strings, invalid URLs and date formats. After that, I want to run the DQ analysis job to generate a report that highlights the invalid data/rows.
I do not need the DQ user interface in my project. I prefer to run the validation jobs via scripts or a console app, because I need to validate a large number of CSV data files. Does DQ provide an open source API or a standalone component that I can reuse to define my validation rules, read the source data and validate the data based on those rules? Any suggestions?
Thank you
Yukun
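
For illustration, here is a minimal sketch of the kind of script-driven validation described above: rules for empty strings, URLs and date formats applied to CSV rows, producing a list of invalid cells. It uses only the Java standard library and is not a Talend or DQ API. The class name CsvRuleValidator, the rule names, the URL pattern and the yyyy-MM-dd date format are all assumptions made for the example, and the naive comma split does not handle quoted CSV fields.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;
import java.time.format.ResolverStyle;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;
import java.util.regex.Pattern;

// Hypothetical example class; not part of Talend Data Quality.
public class CsvRuleValidator {

    // Rule: the value must contain at least one non-blank character.
    static final Predicate<String> NOT_EMPTY = v -> v != null && !v.trim().isEmpty();

    // Rule: a deliberately simple http/https pattern (assumption, not a full URL grammar).
    static final Pattern URL_PATTERN = Pattern.compile("^https?://[^\\s/$.?#][^\\s]*$");
    static final Predicate<String> VALID_URL =
            v -> v != null && URL_PATTERN.matcher(v).matches();

    // Rule: a strict calendar date in yyyy-MM-dd form (assumed format).
    static final DateTimeFormatter DATE_FORMAT =
            DateTimeFormatter.ofPattern("uuuu-MM-dd").withResolverStyle(ResolverStyle.STRICT);
    static final Predicate<String> VALID_DATE = v -> {
        if (v == null) {
            return false;
        }
        try {
            LocalDate.parse(v, DATE_FORMAT);
            return true;
        } catch (DateTimeParseException e) {
            return false;
        }
    };

    // Applies each rule to its column (0-based key) and reports every failing cell.
    static List<String> validate(List<String> lines, Map<Integer, Predicate<String>> rules) {
        List<String> report = new ArrayList<>();
        for (int row = 0; row < lines.size(); row++) {
            // Naive split: keeps trailing empty cells, ignores CSV quoting.
            String[] cells = lines.get(row).split(",", -1);
            for (Map.Entry<Integer, Predicate<String>> rule : rules.entrySet()) {
                int col = rule.getKey();
                String value = col < cells.length ? cells[col] : "";
                if (!rule.getValue().test(value)) {
                    report.add("row " + (row + 1) + ", column " + (col + 1)
                            + ": invalid value '" + value + "'");
                }
            }
        }
        return report;
    }

    public static void main(String[] args) {
        Map<Integer, Predicate<String>> rules = new LinkedHashMap<>();
        rules.put(0, NOT_EMPTY);
        rules.put(1, VALID_URL);
        rules.put(2, VALID_DATE);

        // Made-up sample rows; in practice these would be read from the CSV files.
        List<String> lines = Arrays.asList(
                "alice,http://example.com,2013-05-01",
                ",not-a-url,2013-02-30");

        validate(lines, rules).forEach(System.out::println);
    }
}
```

A console app built this way could loop over a directory of CSV files and write one report per file. Whether the DQ code base exposes something equivalent that can run outside the studio is the open question in this thread.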
Moderator

Re: Questions about reusing Data Quality source code

Hi,
Talend is an Eclipse-based RCP application, and Talend offers open source solutions. If you want to develop an application similar to Talend, launching the Talend application from its source code is the best way to learn both Eclipse and Talend. You can download the source code from here:
SVN: http://svn.talendforge.org/svn/tos/
WEB: http://svn.talendforge.org/svn/tos/
Best regards
Sabrina
One Star

Re: Questions about reusing Data Quality source code

Hi Sabrina
Thank you for your reply. To handle the data validation logic, I want to find out whether Talend provides an open source component or API that can be reused easily in my project.
SVN: http://svn.talendforge.org/svn/tos/ is the source code of Open Studio for Data Integration.
SVN: http://svn.talendforge.org/svn/top/ is the source code of Open Studio for Data Quality.
What is the relationship between the Data Integration and Data Quality code bases? Does Data Quality have to be built on top of Data Integration?
When I check out the source code from http://svn.talendforge.org/svn/tos/ it requires a user name and password. Is there a guest account with read-only permission?
Thanks
Yukun
Moderator

Re: Questions about reusing Data Quality source code

Hi,
What is the relationship between the Data Integration and Data Quality code bases? Does Data Quality have to be built on top of Data Integration?

For the underlying technology, I will refer your question to the DQ manager.
When I check out the source code from http://svn.talendforge.org/svn/tos/ it requires a user name and password. Is there a guest account with read-only permission?

These 2 URLs are public and should not ask for a user name or password. They are our SVN URLs.
Does it work when you use your forum account (your registered Talend Forge account) to download the source?
Which TortoiseSVN version are you using?
Best regards
Sabrina
Employee

Re: Questions about reusing Data Quality source code

Hi,
DQ does depend on some plugins from DI.
Have a look at this discussion http://www.talendforge.org/forum/viewtopic.php?id=5877 for the dependencies.
Moderator

Re: Questions about reusing Data Quality source code

Hi Yukun,
Is there any update on your issue with downloading the source code from the Talend SVN URL?
Best regards
Sabrina
One Star

Re: Questions about reusing Data Quality source code

Hi,
DQ does depend on some plugins from DI.
Have a look at this discussion http://www.talendforge.org/forum/viewtopic.php?id=5877 for the dependencies.

Sure, that makes sense. If I download only the DQ code base, Eclipse complains that some DI-related bundles are missing, which means I need to download DI as well.
Thanks
Yukun
One Star

Re: Questions about reusing Data Quality source code

Hi Sabrina
I still get the same problem when I use my talendforge username and password:
"Checkout operation for 'http://talendforge.org/svn/tos/trunk' failed.
Authentication failed
svn: No more credentials or we tried too many times.
Authentication failed"
I will try it on another computer and give you an update next week.
Thanks
Yukun