Problem with tHiveInput

One Star

Hello,
I have a problem that I cannot figure out.
When I connect a tHiveInput component to an output component (tLogRow in my case), it returns no data and the job does not even start running.
I have verified the connection to the cluster and to Hive, and the component metadata as well.
What could it be?
Thank you. 
Moderator

Re: Problem with tHiveInput

Hi,
Do you mean your MapReduce job fails to execute without any error message? What is your Big Data product build version? Could you please share a screenshot of your tHiveInput component settings with us? How did you verify your cluster and Hive connections?
Best regards
Sabrina
One Star

Re: Problem with tHiveInput

The thing is, when I click Run I get no errors, but the main row stays at "Starting" forever.
This problem does not occur when I send a generic select like:
SELECT * FROM USERS;
But when I specify the columns, I get no response, for example:
SELECT NAME, LASTNAME FROM USERS;
We are using Talend Big Data 5.5.1.
   
Employee

Re: Problem with tHiveInput

Hello,
SELECT * FROM USERS won't start a MapReduce (M/R) job on the cluster side, so the data can be retrieved very quickly if you don't have a lot of it.
As soon as you name the columns you want to read in the query, an M/R job is started on the cluster side. This job can take longer. If it hangs forever, you can find more information on the resource manager side by browsing to 192.168.25.86:8088.
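If browsing the web page is not convenient, the same information can usually be read from the ResourceManager REST API. This is only a rough sketch: it assumes the address above is the YARN ResourceManager web UI and that the REST endpoint /ws/v1/cluster/apps is enabled on your cluster. The function names are made up for illustration.

```python
import json
from urllib.request import urlopen

# Address taken from the post above; adjust it for your own cluster.
RM_WEB_UI = "http://192.168.25.86:8088"


def pending_apps_url(base_url):
    """Build the REST URL listing applications that are accepted or running."""
    return base_url.rstrip("/") + "/ws/v1/cluster/apps?states=ACCEPTED,RUNNING"


def summarize_apps(payload):
    """Return (id, name, state) for each application in a decoded response.

    The "apps" field can be null when no application matches, so fall back
    to an empty list in that case.
    """
    apps = (payload.get("apps") or {}).get("app") or []
    return [(a.get("id"), a.get("name"), a.get("state")) for a in apps]


def fetch_pending_apps(base_url=RM_WEB_UI, timeout=10):
    """Query the ResourceManager and summarize the applications it reports."""
    with urlopen(pending_apps_url(base_url), timeout=timeout) as response:
        return summarize_apps(json.load(response))
```

Calling fetch_pending_apps() from the machine running the Studio should list the Hive query's application if it was submitted. An application that stays in the ACCEPTED state may still be waiting for resources; if nothing is listed at all, the job may never have reached the cluster.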
HTH,
Rémy.
One Star

Re: Problem with tHiveInput

Hello rdubois,
When I run the job in the HortonWorks/Talend Sandbox, it works well.
But when I run it from another machine over a remote connection, it doesn't work.
I didn't mention it before, but this remote machine ISN'T part of the Hadoop cluster; I'm just connecting remotely and trying to run the job from there.
In this case, will the job work if I configure everything correctly?
Best,
Jefferson Moreira
Employee

Re: Problem with tHiveInput

Hello Jefferson Moreira,
Yes, there is absolutely no problem with executing a job remotely, from a machine that is not part of the cluster.
What error do you have when you execute from this machine?
Regards,
Rémy.
One Star

Re: Problem with tHiveInput

Hello Rémy,
That is exactly the problem: we don't get any error message, the job just never finishes.
I don't know if you saw the screenshots I sent before. Is that configuration right?
Best,
Jefferson.
Employee

Re: Problem with tHiveInput

Hello Jefferson,
The port you used for the resource manager looks wrong. Could you please try replacing 50300 with 8050?
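To confirm from the remote machine which port actually accepts connections, a quick TCP check can help. This is a minimal sketch using only the Python standard library; the helper name port_is_open is made up for illustration, and the host in the commented usage is an assumption (use whatever address your resource manager really has).

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


# Example usage (assumed host, replace with your resource manager address):
# for port in (8050, 50300):
#     print(port, "reachable" if port_is_open("192.168.25.86", port) else "not reachable")
```

A reachable port only shows that something is listening there, not that it is the right service, so the value should still match the resource manager address configured on the cluster.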
HTH,
Rémy.