Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Before you start

These steps assume that you have already followed the Set Active Hadoop Distribution instructions and are completing your shim setup for a CDH 5.x distribution. If you have not, or don't understand what any of this means, you should read Configuring Pentaho for your Hadoop Distro and Version.

Configuring MapReduce 1

CDH 5.x is configured to use MapReduce 2 by default. If you want to use MapReduce 1 instead, you will need to change a properties file. If you are not sure what version of MapReduce your CDH 5 cluster is using, check with your system administrator.

To specify configuration settings, complete these steps.

  1. Open the config.properties file for the components you want to access CDH 5 from. The config.properties file appears in these directories.

...

  1. Navigate to the CDH5.x shim folder located in the hadoop-configurations folder that matches shim you previously configured. For example: cdh50, cdh51 etc. These steps will refer to that directory as cdhXX. This folder is different for each application and is located:
    • DI Server - data-integration-server/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/

...

    • cdhXX
    • BA Server -

...

    • biserver-ee/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/

...

    • cdhXX
    • Spoon -

...

    • data-integration/plugins/pentaho-big-data-plugin/

...

      2. Find the line of code that looks like this:

Code Block

          shim.current.config=mr2

...

    • cdhXX
  1. Edit the config.properties file.
  2. Change the shim.current.config property from mr2 to mr1. It should look like:
    Code Block
    
              shim.current.config=mr1
    

...

  1. Save and close the file.