Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migration of unmigrated content due to installation of a new plugin

Excerpt

Additional configuration required to allow access to a CDH 5.0 cluster configured for MapReduce 1. This feature was removed in CDH 5.1.

Before you start

These steps assume that you have already followed the Set Active Hadoop Distribution instructions and are completing your shim setup for a CDH 5.0 distribution. If you have not, or don't understand what any of this means, you should read Configuring Pentaho for your Hadoop Distro and Version.

Configuring MapReduce 1

CDH 5.0 is configured to use MapReduce 2 by default. If

...

your cluster is set up to use MapReduce 1 instead, you will need to change a properties file. If you are not sure what version of MapReduce your CDH 5 cluster is using, check with your system administrator.

To specify configuration settings, complete these steps.

  1. Open the config.properties file for the components you want to access CDH 5 from. The config.properties file appears in these directories.

...

  1. Navigate to the CDH5.0 shim folder located in the hadoop-configurations folder that matches shim you previously configured. For example: cdh50, etc. These steps will refer to that directory as cdhXX. This folder is different for each application and is located:
    • DI Server - data-integration-server/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/

...

    • cdhXX
    • BA Server -

...

    • biserver-ee/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/

...

    • cdhXX
    • Spoon -

...

    • data-integration/plugins/pentaho-big-data-plugin/

...

      2. Find the line of code that looks like this:

...

    • cdhXX
  1. Edit the config.properties file.
  2. Change the shim.current.config

...

  1. property from mr2 to mr1. It should look like:
    Code Block
    
              shim.current.config=mr1
    

...

  1. Save and close the file.