...
This job entry executes Oozie Workflows. It is a front end on top of the OozieClient Java API that submits jobs to an Oozie server using web service calls.
Oozie is a workflow/coordination system to manage Hadoop jobs. Oozie Workflow jobs are Directed Acyclical Graphs (DAGs) of actions. Oozie Coordinator jobs are recurrent Oozie Workflow jobs and can be configured so a job is triggered by time (frequency) and data availability.
...
Option | Definition | |
---|---|---|
Name | The name of this job instance. | |
Hadoop Cluster | Allows you to create, edit, and select a Hadoop cluster configuration for use. Hadoop cluster configurations settings can be reused in transformation steps and job entries that support this feature. In a Hadoop cluster configuration, you can specify information like host names and ports for HDFS, Job Tracker, and other big data cluster components. The Edit button allows you to edit Hadoop cluster configuration information. The New button allows you to add a new Hadoop cluster configuration. Information on Hadoop Clusters can be found in Pentaho Help. | |
Oozie URL | Field to enter an Oozie URL. This must be a valid Oozie location. |
|
Enable Blocking | Option blocks the rest of a transformation from executing until the Oozie job finishes when checked. | |
Polling Interval (ms) | Field allows you to set the interval rate to check for Oozie workflows. | |
Workflow Properties | Field to enter the Workfile Properties file. This path is required and must be a valid job properties file. In the properties file, the oozie.wf.application.path path must be set. |
Hadoop Cluster
Insert excerpt | |||||||
---|---|---|---|---|---|---|---|
|
Oozie Job Executor (Advanced Setup Mode)
...