Hadoop client setup

Three Stars

Hadoop client setup

My cluster is on Linux and our ETL infrasturcture is on Windows.  What I am coming up against is that on Windows for Talend BigData edition to run, there needs to be a local hadoop client installed.  While there is no good documentation available, does anyone have steps on how to get going with setting up a Hadoop client on Windows and configuring it to connect to the cluster on Linux ?  Also the same configs that needs to be keyed in Talend itself (tHDFS).  I tried a few options, but does not seem to work.
Five Stars

Re: Hadoop client setup

Hi Ben,
I'm using both Hive and HDFS components with CDH 5.4 on windows 7 in GUI mode and deploying in standalone to Solaris.
Neither Windows nor Solaris have any hadoop client stuff installed and they both work fine (except for an exception on windows about an exe that is not a blocker).
For hdfs I am using the Cloudera and CDH 5.4(Yarn).  For Hive I am using the same in Stand Alone Mode.
If your circumstances are different and you really need the client installed it's possible.  One of our admins here did it to help testing kerberos, but I have not done it myself.  He used these two links to get it done.
Hope that helps.

What’s New for Talend Spring ’19

Watch the recorded webinar!

Watch Now

Agile Data lakes & Analytics

Accelerate your data lake projects with an agile approach


Definitive Guide to Data Quality

Create systems and workflow to manage clean data ingestion and data transformation.



Introduction to Talend Open Studio for Data Integration.