One Star

Unable to run steps on EMR using Talend Studio for Data Integration

I'm trying to use Talend Data Integration to pull a data set and a Pig script from Amazon S3 and run them on EMR. When I launch an EMR cluster from Talend, only core Hadoop is installed, so I wrote separate steps to install Pig and Hive, but those steps are failing. These are the parameters I gave for the steps in Talend Open Studio:
JAR: https://s3-us-west-2.amazonaws.com/us-west-2.elasticmapreduce/libs/script-runner/script-runner.jar
Arguments: https://s3-us-west-2.amazonaws.com/us-west-2.elasticmapreduce/libs/pig/pig-script,--base-path,https://s3-us-west-2.amazonaws.com/us-west-2.elasticmapreduce/libs/pig/,--install-pig,--pig-versions,latest
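
For reference, here is a rough sketch of the step these parameters would describe, written as the kind of step definition the EMR AddJobFlowSteps API accepts (a HadoopJarStep with a Jar and a list of Args). It is only an illustration: the build_step helper and the step name "Install Pig" are made up for this example, and it assumes the comma-separated Arguments field is split into one argument per entry, which may not be exactly what the Talend component does.

```python
def build_step(name, jar, arguments, action_on_failure="CONTINUE"):
    """Build a step dict in the shape used by the EMR AddJobFlowSteps API.

    Hypothetical helper: 'arguments' is the comma-separated string from the
    step's Arguments field; it is assumed to be split on commas.
    """
    args = [a.strip() for a in arguments.split(",") if a.strip()]
    return {
        "Name": name,
        "ActionOnFailure": action_on_failure,
        "HadoopJarStep": {"Jar": jar, "Args": args},
    }


# Values copied from the post; BASE is only a shorthand for the common prefix.
BASE = "https://s3-us-west-2.amazonaws.com/us-west-2.elasticmapreduce/libs"

step = build_step(
    "Install Pig",  # hypothetical step name
    BASE + "/script-runner/script-runner.jar",
    BASE + "/pig/pig-script,--base-path," + BASE + "/pig/,--install-pig,--pig-versions,latest",
)

# Print each argument on its own line, as script-runner would receive them
# under the splitting assumption above.
for arg in step["HadoopJarStep"]["Args"]:
    print(arg)
```

Listing the arguments one per line makes it easier to compare them against the stderr log of the failed step.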
2 REPLIES
Employee

Re: Unable to run steps on EMR using Talend Studio for Data Integration

Which version of Talend are you using? And how are you starting the cluster: with the Talend components or the AWS Console?
Thomas
Thomas Steinborn
VP Product Management
One Star

Re: Unable to run steps on EMR using Talend Studio for Data Integration

I'm using Talend Open Studio 6.3.0, and I'm launching the EMR cluster with the Talend tAmazonEMRManage component.