I have created one oozie workflow for hive script to load data in a table.
My workflow.xml contains -
<workflow-app xmlns="uri:oozie:workflow:0.4" name="Hive-Table-Insertion">
<start to="InsertData"/>
<action name="InsertData">
<hive xmlns="uri:oozie:hive-action:0.4">
<job-tracker>${jobTracker}</job-tracker>
<name-node>${nameNode}</name-node>
<prepare>
<delete path="${workflowRoot}/output-data/hive"/>
<mkdir path="${workflowRoot}/output-data"/>
</prepare>
<job-xml>${workflowRoot}/hive-site.xml</job-xml>
<configuration>
<property>
<name>oozie.hive.defaults</name>
<value>${workflowRoot}/hive-site.xml</value>
</property>
</configuration>
<script>load_data.hql</script>
</hive>
<ok to="end"/>
<error to="fail"/>
</action>
<kill name="fail">
<message>Hive failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message>
</kill>
<end name="end"/>
</workflow-app>
My job.properties file contains -
nameNode=hdfs://localhost:8020
jobTracker=localhost:8021
queueName=default
workflowRoot=HiveLoadData
oozie.libpath=${nameNode}/user/oozie/share/lib
oozie.wf.application.path=${nameNode}/user/${user.name}/${workflowRoot}
When I try to submit my job using command "oozie job -oozie http://localhost:11000/oozie -config /user/oozie/HiveLoadData/job.properties -submit" I get following error,
java.io.IOException: configuration is not specified
at org.apache.oozie.cli.OozieCLI.getConfiguration(OozieCLI.java:729)
at org.apache.oozie.cli.OozieCLI.jobCommand(OozieCLI.java:879)
at org.apache.oozie.cli.OozieCLI.processCommand(OozieCLI.java:604)
at org.apache.oozie.cli.OozieCLI.run(OozieCLI.java:577)
at org.apache.oozie.cli.OozieCLI.main(OozieCLI.java:204)
configuration is not specified
The path that you give to the -config
parameter must exist on the local drive (not on HDFS). Make sure that /user/oozie/HiveLoadData/job.properties
does exist - do e.g. ls /user/oozie/HiveLoadData/job.properties
on the same machine where you execute the oozie job -oozie...
command