Getting error while running Mapreduce job

Employee

Getting error while running Mapreduce job

Hi All,
I am getting error while I am trying to run the Mapreduce job using Talend. My job is very simple I am reading file from HDFS and sorting it and writing back to HDFS. I am getting the following error. I gave the following parameters on the Hadoop connection.
mapreduce.jobhistory.address = hdpmaster1.bigdatadev3.lb.*****.com:10020
yarn.app.mapreduce.am.staging-dir = /user

Before setting those parameters the job was just stuck on  map 0% reduce 0%. Any idea about the following error 
connecting to socket on port 3753
connected
Running job: job_1408646446603_62682
 map 0% reduce 0%
Job complete: job_1408646446603_62682
Counters: 0
Job Failed: Application application_1408646446603_62682 failed 2 times due to AM Container for appattempt_1408646446603_62682_000002 exited with  exitCode: 1 due to: Exception from container-launch: org.apache.hadoop.util.Shell$ExitCodeException: 
org.apache.hadoop.util.Shell$ExitCodeException: 
at org.apache.hadoop.util.Shell.runCommand(Shell.java:505)
at org.apache.hadoop.util.Shell.run(Shell.java:418)
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:650)
at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:195)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:300)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:81)
at java.util.concurrent.FutureTask.run(FutureTask.java:262)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:744)

Container exited with a non-zero exit code 1
.Failing this attempt.. Failing the application.
java.io.IOException: Job failed!
at org.talend.hadoop.mapred.lib.MRJobClient.runJob(MRJobClient.java:154)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.runMRJob(Mapreducetest.java:3114)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.access$0(Mapreducetest.java:3104)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest$2.run(Mapreducetest.java:3014)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest$2.run(Mapreducetest.java:1)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Unknown Source)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.tXInput_tSortRow_1Process(Mapreducetest.java:2945)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.tHDFSInput_1Process(Mapreducetest.java:1818)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.run(Mapreducetest.java:3084)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.runJobInTOS(Mapreducetest.java:3060)
at truecar_talendsvn_qa.mapreducetest_0_1.Mapreducetest.main(Mapreducetest.java:3046)
disconnected
Job Mapreducetest ended at 10:40 13/10/2014.
Employee

Re: Getting error while running Mapreduce job

Now able to resolved the error I mentioned but now my job is stuck on map 0% and reduce 0%. I can see the job on cluster in prep mode any idea? Here are the logs
>FO org.apache.hadoop.yarn.webapp.WebApps: Web app /mapreduce
>started at 46913
>2014-10-13 15:20:24,901 INFO org.apache.hadoop.yarn.webapp.WebApps:
>Registered webapp guice modules
>2014-10-13 15:20:24,904 INFO
>org.apache.hadoop.mapreduce.v2.app.speculate.DefaultSpeculator:
>JOB_CREATE
>job_1408646446603_63059
>2014-10-13 15:20:24,905 INFO
>org.apache.hadoop.ipc.CallQueueManager: Using callQueue class
>java.util.concurrent.LinkedBlockingQueue
>2014-10-13 15:20:24,906 INFO
>org.apache.hadoop.ipc.Server: Starting Socket Reader #1 for port 38080
>2014-10-13 15:20:24,910 INFO
>org.apache.hadoop.ipc.Server: IPC Server Responder: starting
>2014-10-13 15:20:24,911 INFO
>org.apache.hadoop.ipc.Server: IPC Server listener on 38080: starting
>2014-10-13 15:20:24,925 INFO
>org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor:
>nodeBlacklistingEnabled:true
>2014-10-13 15:20:24,925 INFO
>org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor:
>maxTaskFailuresPerNode is 3
>2014-10-13 15:20:24,925 INFO
>org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor:
>blacklistDisablePercent is 33
>2014-10-13 15:20:24,966 WARN org.apache.hadoop.conf.Configuration:
>job.xml:an attempt to override final parameter:
>mapreduce.job.end-notification.max.retry.interval;  Ignoring.
>2014-10-13 15:20:24,970 WARN org.apache.hadoop.conf.Configuration:
>job.xml:an attempt to override final parameter:
>mapreduce.job.end-notification.max.attempts;  Ignoring.
>2014-10-13 15:20:24,972 INFO org.apache.hadoop.yarn.client.RMProxy:
>Connecting to ResourceManager at /0.0.0.0:8030
>2014-10-13 15:20:25,996 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 0
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:26,997 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 1
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:27,998 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 2
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:28,998 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 3
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:29,999 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 4
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:31,000 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 5
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:32,000 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 6
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:33,001 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 7
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:34,002 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 8
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)
>2014-10-13 15:20:35,002 INFO org.apache.hadoop.ipc.Client:
>Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 9
>time(s); retry policy is
>RetryUpToMaximumCountWithFixedSleep(maxRetries=10,
>sleepTime=1000 MILLISECONDS)

 
Employee

Re: Getting error while running Mapreduce job

Hi Namita,
Can you please check the "Define Resource manager scheduler" and define it ? You should find the information in the yarn-site, looking at the property called "yarn.resourcemanager.scheduler.address".
Cheers,

What’s New for Talend Spring ’19

Watch the recorded webinar!

Watch Now

Tutorial

Introduction to Talend Open Studio for Data Integration.

Watch

Downloads and Trials

Test drive Talend's enterprise products.

Downloads

Definitive Guide to Data Integration

Practical steps to developing your data integration strategy.

Download