I have a requirement wherein i need to import different flat files data in orcale data base in 3NF data model which would be then used to export to a datawarehouse.
However before importing to 3NF data model, i need to do data analysis, cleaning, transforming, etc on the data
Could you please let me know if my understanding is correct reg different talend products that can be used in process (Note: I am limited to use only free Talend products)
also to note that this is monthly process, has to be done every month files.
1) Data Integration - Load the initial flat file data in oracle flat table (combine all the different flat files data into one oracle flat table) (IT Responsibility)
2) Data Quality - Analyse the data inconsistency and data mismatch in the oracle flat table (IT/Data Quality Analyst Responsibility)
3) Data preparation - After data quality, the data prep tool will help to cleanse data and import the data back in the oracle flat table or may be in similar table (based on features available in free open source tool) (doubt- can we update the data in same oracle flat table ) (IT/Data Quality Analyst Responsibility)
4) Data Integration - This cleansed data can be used to import in 3NF data model with applicable business rules and other transformations,etc (IT Responsibility)
Please do correct my understanding and help me with appropriate solution
Below link would give details about all different open source products of Talend in detail.
I would suggest to start the data quality verification work as the starting point as it will help to identify the source system related issues at the earliest possible time.Once the quality issues are addressed, you can do data preparation and move further to integrate the flow from source to target system.
If you would like to have a single software, Talend Data Fabric will be the one stop solution although you will have to purchase the license which is quite less compared to other competitors in the market.
Thanks for the update. If you are happy with the details provided about the details of different Talend Open source products, could you please close the topic with solution accepted flag?
Introduction to Talend Open Studio for Data Integration.
Practical steps to developing your data integration strategy.
Create systems and workflow to manage clean data ingestion and data transformation.