The Talend Data Lake on AWS Quick Start builds a data lake environment on the Amazon Web Services (AWS) Cloud by deploying Talend Big Data Platform components and AWS services such as Amazon EMR, Amazon Redshift, Amazon Simple Storage Service (Amazon S3), and Amazon Relational Database Service (Amazon RDS). The Quick Start is for users who are evaluating big data in the cloud or looking to accelerate their big data initiative through the adoption of best practices for big data integration.
The Quick Start also provides an optional sample dataset and Talend jobs developed by Cognizant Technology Solutions to illustrate big data practices for integrating Apache Spark, Apache Hadoop, Amazon EMR, Amazon Redshift, and Amazon S3 technologies into the data lake implementation.
To use the Quick Start, you need a Talend Platform for Big Data license. You can obtain an evaluation license from your Talend Account Executive, or you can register for a trial license. Read more about the Talend AWS Data Lake Quickstart Solution, and then follow the instructions in the Deployment Guide to provision your data lake. Once the data lake is up and running, walk through the sample jobs using the User Guide.