Employee

Getting error while running Mapreduce job

Hi All,
I am getting error while I am trying to run the Mapreduce job using Talend. My job is very simple I am reading file from HDFS and sorting it and writing back to HDFS. I am getting the following error. I gave the following parameters on the Hadoop connection.
mapreduce.jobhistory.address = hdpmaster1.bigdatadev3.lb.*****.com:10020
yarn.app.mapreduce.am.staging-dir = /user

Before setting those parameters the job was just stuck on  map 0% reduce 0%. Any idea about the following error 
connecting to socket on port 3753
connected
Running job: job_1408646446603_62682
 map 0% reduce 0%
Job complete: job_1408646446603_62682
Counters: 0
Job Failed: Application application_1408646446603_62682 failed 2 times due to AM Container for appattempt_1408646446603_62682_000002 exited with  exitCode: 1 due to: Exception from container-launch: org.apache.hadoop.util.Shell$ExitCodeException: 
org.apache.hadoop.util.Shell$ExitCodeException: 
at org.apache.hadoop.util.Shell.runCommand(Shell.java:505)
at org.apache.hadoop.util.Shell.run(Shell.java:418)
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:650)
at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:195)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:300)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:81)
at java.util.concurrent.FutureTask.run(FutureTask.java:262)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:744)

Container exited with a non-zero exit code 1
.Failing this attempt.. Failing the application.
java.io.IOException: Job failed!
at org.talend.hadoop.mapred.lib.MRJobClient.runJob(MRJobClient.java:154)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.runMRJob(Mapreducetest.java:3114)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.access$0(Mapreducetest.java:3104)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest$2.run(Mapreducetest.java:3014)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest$2.run(Mapreducetest.java:1)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Unknown Source)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.tXInput_tSortRow_1Process(Mapreducetest.java:2945)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.tHDFSInput_1Process(Mapreducetest.java:1818)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.run(Mapreducetest.java:3084)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.runJobInTOS(Mapreducetest.java:3060)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.main(Mapreducetest.java:3046)
disconnected
Job Mapreducetest ended at 10:40 13/10/2014.
2 REPLIES
Employee

Re: Getting error while running Mapreduce job

Now able to resolved the error I mentioned but now my job is stuck on map 0% and reduce 0%. I can see the job on cluster in prep mode any idea? Here are the logs
>FO org.apache.hadoop.yarn.webapp.WebApps: Web app /mapreduce
>started at 46913
>2014-10-13 15:20:24,901 INFO org.apache.hadoop.yarn.webapp.WebApps:
>Registered webapp guice modules
>2014-10-13 15:20:24,904 INFO
>org.apache.hadoop.mapreduce.v2.app.speculate.DefaultSpeculator:
>JOB_CREATE
>job_1408646446603_63059
>2014-10-13 15:20:24,905 INFO
>org.apache.hadoop.ipc.CallQueueManager: Using callQueue class
>java.util.concurrent.LinkedBlockingQueue
>2014-10-13 15:20:24,906 INFO
>org.apache.hadoop.ipc.Server: Starting Socket Reader #1 for port 38080
>2014-10-13 15:20:24,910 INFO
>org.apache.hadoop.ipc.Server: IPC Server Responder: starting
>2014-10-13 15:20:24,911 INFO
>org.apache.hadoop.ipc.Server: IPC Server listener on 38080: starting
>2014-10-13 15:20:24,925 INFO
>org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor:
>nodeBlacklistingEnabled:true
>2014-10-13 15:20:24,925 INFO
>org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor:
>maxTaskFailuresPerNode is 3
>2014-10-13 15:20:24,925 INFO
>org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor:
>blacklistDisablePercent is 33
>2014-10-13 15:20:24,966 WARN org.apache.hadoop.conf.Configuration:
>job.xml:an attempt to override final parameter:
>mapreduce.job.end-notification.max.retry.interval;  Ignoring.
>2014-10-13 15:20:24,970 WARN org.apache.hadoop.conf.Configuration:
>job.xml:an attempt to override final parameter:
>mapreduce.job.end-notification.max.attempts;  Ignoring.
>2014-10-13 15:20:24,972 INFO org.apache.hadoop.yarn.client.RMProxy:
>Connecting to ResourceManager at /0.0.0.0:8030
>2014-10-13 15:20:25,996 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 0
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:26,997 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 1
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:27,998 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 2
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:28,998 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 3
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:29,999 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 4
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:31,000 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 5
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:32,000 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 6
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:33,001 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 7
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:34,002 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 8
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:35,002 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 9
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)

 
Employee

Re: Getting error while running Mapreduce job

Hi Namita,
Can you please check the "Define Resource manager scheduler" and define it ? You should find the information in the yarn-site, looking at the property called "yarn.resourcemanager.scheduler.address".
Cheers,