
Another Oozie error I don't understand: [Unknown hadoop job associated with action]

jttsai posted on 2014-8-15 16:23:28
Last edited by jttsai on 2014-8-15 16:38

2014-08-15 16:12:29,364  INFO ActionStartXCommand:539 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@:start:] Start action [0000001-140815155144199-oozie-oozi-W@:start:] with user-retry state : userRetryCount [0], userRetryMax [0], userRetryInterval [10]
2014-08-15 16:12:29,365  WARN ActionStartXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@:start:] [***0000001-140815155144199-oozie-oozi-W@:start:***]Action status=DONE
2014-08-15 16:12:29,365  WARN ActionStartXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@:start:] [***0000001-140815155144199-oozie-oozi-W@:start:***]Action updated in DB!
2014-08-15 16:12:29,454  INFO ActionStartXCommand:539 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] Start action [0000001-140815155144199-oozie-oozi-W@mr-node] with user-retry state : userRetryCount [0], userRetryMax [0], userRetryInterval [10]
2014-08-15 16:12:30,512  INFO MapReduceActionExecutor:539 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] checking action, external ID [job_1408075517794_0010] status [RUNNING]
2014-08-15 16:12:30,515  WARN ActionStartXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] [***0000001-140815155144199-oozie-oozi-W@mr-node***]Action status=RUNNING
2014-08-15 16:12:30,515  WARN ActionStartXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] [***0000001-140815155144199-oozie-oozi-W@mr-node***]Action updated in DB!
2014-08-15 16:12:46,983  INFO CallbackServlet:539 - SERVER[master1.hadoop] USER[-] GROUP[-] TOKEN[-] APP[-] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] callback for action [0000001-140815155144199-oozie-oozi-W@mr-node]
2014-08-15 16:12:47,129  WARN ActionCheckXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] Exception while executing check(). Error Code [JA017], Message[JA017: Unknown hadoop job [job_1408075517794_0010] associated with action [0000001-140815155144199-oozie-oozi-W@mr-node].  Failing this action!]
org.apache.oozie.action.ActionExecutorException: JA017: Unknown hadoop job [job_1408075517794_0010] associated with action [0000001-140815155144199-oozie-oozi-W@mr-node].  Failing this action!
        at org.apache.oozie.action.hadoop.JavaActionExecutor.check(JavaActionExecutor.java:1134)
        at org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:180)
        at org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:55)
        at org.apache.oozie.command.XCommand.call(XCommand.java:280)
        at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:174)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:724)
2014-08-15 16:12:47,130  WARN ActionCheckXCommand:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] Failing Job due to failed action [mr-node]
2014-08-15 16:12:47,132  WARN LiteWorkflowInstance:542 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] Workflow Failed. Failing node [mr-node]
2014-08-15 16:12:47,161  INFO KillXCommand:539 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[-] STARTED WorkflowKillXCommand for jobId=0000001-140815155144199-oozie-oozi-W
2014-08-15 16:12:47,171  INFO KillXCommand:539 - SERVER[master1.hadoop] USER[oozie] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[-] ENDED WorkflowKillXCommand for jobId=0000001-140815155144199-oozie-oozi-W
2014-08-15 16:13:03,603  INFO CallbackServlet:539 - SERVER[master1.hadoop] USER[-] GROUP[-] TOKEN[-] APP[-] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] callback for action [0000001-140815155144199-oozie-oozi-W@mr-node]
2014-08-15 16:13:03,613 ERROR CompletedActionXCommand:536 - SERVER[master1.hadoop] USER[-] GROUP[-] TOKEN[] APP[-] JOB[0000001-140815155144199-oozie-oozi-W] ACTION[0000001-140815155144199-oozie-oozi-W@mr-node] XException,
org.apache.oozie.command.CommandException: E0800: Action it is not running its in [FAILED] state, action [0000001-140815155144199-oozie-oozi-W@mr-node]
        at org.apache.oozie.command.wf.CompletedActionXCommand.eagerVerifyPrecondition(CompletedActionXCommand.java:77)
        at org.apache.oozie.command.XCommand.call(XCommand.java:251)
        at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:174)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:724)

I have already put the jobhistory-related settings into Oozie's conf/hadoop-conf/core-site.xml, and the jobhistory server is already running. My configuration is as follows:
<property>
       <name>mapreduce.jobhistory.address</name>
       <value>master1.hadoop:10020</value>
    </property>

    <property>
       <name>mapreduce.jobhistory.webapp.address</name>
       <value>master1.hadoop:19888</value>
    </property>
    <property>
       <name>mapreduce.jobhistory.intermediate-done-dir</name>
       <value>${hadoop.tmp.dir}/mr/history-tmp</value>
    </property>

    <property>
       <name>mapreduce.jobhistory.done-dir</name>
       <value>${hadoop.tmp.dir}/mr/history-done</value>
    </property>


7 replies

howtodown posted on 2014-8-15 17:08:06
You could change it to the form below instead of using the $ variable form. Adjust the values to match your own setup.

<property>
       <name>mapreduce.jobhistory.webapp.address</name>
        <value>**:19888</value>
    </property>
  <property>
    <name>mapreduce.jobhistory.intermediate-done-dir</name>
    <value>/user/yarn/tmp</value>
  </property>
  <property>
    <name>mapreduce.jobhistory.done-dir</name>
    <value>/user/yarn/done</value>
  </property>
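
Hadoop normally expands ${...} references by itself, but the result depends on which value of the variable is visible to the process reading the file, so Oozie and the cluster can end up resolving ${hadoop.tmp.dir} to different paths. A minimal sketch for spotting such values before replacing them with literal paths follows. The function name find_placeholders and the SAMPLE text are made up for illustration; the two properties in SAMPLE are copied from the configuration posted above.

```python
import re
import xml.etree.ElementTree as ET

# Matches ${some.variable} references inside a property value.
PLACEHOLDER = re.compile(r"\$\{([^}]+)\}")


def find_placeholders(xml_text):
    """Return {property name: [variable names]} for values that contain ${...}."""
    root = ET.fromstring(xml_text)
    found = {}
    for prop in root.iter("property"):
        name = (prop.findtext("name") or "").strip()
        value = prop.findtext("value") or ""
        variables = PLACEHOLDER.findall(value)
        if variables:
            found[name] = variables
    return found


# Hypothetical file content, wrapped in <configuration> so it parses as one document.
SAMPLE = """<configuration>
  <property>
    <name>mapreduce.jobhistory.address</name>
    <value>master1.hadoop:10020</value>
  </property>
  <property>
    <name>mapreduce.jobhistory.done-dir</name>
    <value>${hadoop.tmp.dir}/mr/history-done</value>
  </property>
</configuration>"""

if __name__ == "__main__":
    for name, variables in find_placeholders(SAMPLE).items():
        print(name, "references", ", ".join(variables))
```

To check a real file, read its text and pass it to find_placeholders; any property it reports is a candidate for a literal path.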


jttsai posted on 2014-8-15 17:23:54
Yes, I had already realized that, so I changed the configuration to the following:
<property>
       <name>mapreduce.jobhistory.address</name>
       <value>master1.hadoop:10020</value>
    </property>

    <property>
       <name>mapreduce.jobhistory.webapp.address</name>
       <value>master1.hadoop:19888</value>
    </property>
    <property>
       <name>mapreduce.jobhistory.intermediate-done-dir</name>
       <value>/home/hdfs/bch/tmp/mr/history-tmp</value>
    </property>

    <property>
       <name>mapreduce.jobhistory.done-dir</name>
       <value>/home/hdfs/bch/tmp/mr/history-done</value>
    </property>


But I still get the same error. I really don't know what else is wrong.

howtodown posted on 2014-8-15 17:26:24
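Check that the permissions on these directories are consistent and that they belong to the current user. Also check which user you run the job as, and make sure it is not the wrong one.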
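
One way to act on this is to list the two history directories with `hdfs dfs -ls -d <path>` and compare the owner column. The sketch below only parses listing lines of that shape. The LISTING lines (owners, modes, timestamps) are invented for illustration and are not output from the poster's cluster, and the expected owner `mapred` is an assumption about which account runs the history server.

```python
def parse_ls_line(line):
    """Split one directory-listing line into (permissions, owner, group, path)."""
    # Columns: permissions, replication, owner, group, size, date, time, path.
    fields = line.strip().split(None, 7)
    if len(fields) != 8:
        raise ValueError("unexpected listing line: %r" % line)
    perms, _replication, owner, group, _size, _date, _time, path = fields
    return perms, owner, group, path


def paths_not_owned_by(lines, expected_owner):
    """Return the paths whose owner differs from expected_owner."""
    wrong = []
    for line in lines:
        _perms, owner, _group, path = parse_ls_line(line)
        if owner != expected_owner:
            wrong.append(path)
    return wrong


# Hypothetical listing: owners, modes and timestamps are made up.
LISTING = [
    "drwxrwx---   - mapred hadoop          0 2014-08-15 16:00 /home/hdfs/bch/tmp/mr/history-tmp",
    "drwxr-x---   - hdfs   hadoop          0 2014-08-15 16:00 /home/hdfs/bch/tmp/mr/history-done",
]

if __name__ == "__main__":
    # "mapred" is an assumed account name; substitute the user that runs your history server.
    for path in paths_not_owned_by(LISTING, "mapred"):
        print("owner mismatch:", path)
```

The same comparison can be repeated with the user that submits the Oozie workflow, since the advice above covers both the directory owner and the executing user.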

long1657 posted on 2015-5-14 12:26:13
Did you ever solve this problem? If so, how?

liuzhixin137 posted on 2016-7-1 14:38:56
Same problem here.

elbert.malone posted on 2016-9-16 01:11:09
How was this solved? I have the same problem!

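flash胜龙 posted on 2017-2-6 18:10:36
The "ActionExecutorException: JA017: Could not lookup launched hadoop Job ID" problem! (Yes, this exact one; it drove me up the wall.) The fix, from "Solutions to various problems encountered with the Oozie system, part 2" (bottom of that page):
After carefully comparing the oozie-site.xml configuration, I found some differences and changed the following.
I changed the original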
    <property>  
        <name>oozie.service.HadoopAccessorService.hadoop.configurations</name>  
        <value>*=/home/master2/oozie-4.3.0/distro/target/oozie-4.3.0-distro/oozie-4.3.0/conf/hadoop/</value>  
    </property>
to
    <property>  
        <name>oozie.service.HadoopAccessorService.hadoop.configurations</name>  
        <value>*=/home/master2/hadoop-2.7.3/etc/hadoop/</value>  
    </property>
and added the following
    <property>  
        <name>oozie.service.HadoopAccessorService.action.configurations</name>  
        <value>*=/home/master2/hadoop-2.7.3/etc/hadoop/</value>  
    </property>
This effectively fixed the problem of the hadoop job id not being found.
[Reference] http://blog.sina.com.cn/s/blog_4b1452dd0102wy1t.html
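
The fix above depends on oozie.service.HadoopAccessorService.hadoop.configurations pointing at a directory that really holds the cluster's configuration. A minimal sketch of that check follows, assuming the value is a comma-separated list of AUTHORITY=DIRECTORY entries with `*` as the wildcard authority. The function names and the list of expected file names are assumptions made for illustration, not part of Oozie.

```python
import os
import tempfile
import xml.etree.ElementTree as ET

KEY = "oozie.service.HadoopAccessorService.hadoop.configurations"


def conf_dirs(oozie_site_text, key=KEY):
    """Return {authority: directory} parsed from the AUTHORITY=DIRECTORY list."""
    root = ET.fromstring(oozie_site_text)
    for prop in root.iter("property"):
        if (prop.findtext("name") or "").strip() == key:
            mapping = {}
            for entry in (prop.findtext("value") or "").split(","):
                if "=" in entry:
                    authority, directory = entry.split("=", 1)
                    mapping[authority.strip()] = directory.strip()
            return mapping
    return {}


def missing_site_files(directory,
                       names=("core-site.xml", "mapred-site.xml", "yarn-site.xml")):
    """Return the expected file names (an assumed list) not found in directory."""
    return [n for n in names if not os.path.isfile(os.path.join(directory, n))]


if __name__ == "__main__":
    # The property value is the one from the post above; the wrapper element is added so it parses.
    site = ("<configuration><property><name>%s</name>"
            "<value>*=/home/master2/hadoop-2.7.3/etc/hadoop/</value>"
            "</property></configuration>" % KEY)
    for authority, directory in conf_dirs(site).items():
        print(authority, directory, "missing:", missing_site_files(directory))

    # Self-contained demo on a throwaway directory that holds only core-site.xml.
    with tempfile.TemporaryDirectory() as demo:
        open(os.path.join(demo, "core-site.xml"), "w").close()
        print("demo missing:", missing_site_files(demo))
```

If the directory is reported as missing files, Oozie may be reading a Hadoop configuration that differs from the one the cluster itself uses, which is the situation the post above describes correcting.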