Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Next »

Additional configuration required to allow access to a CDH 5.x cluster configured for MapReduce 1.

Before you start

These steps assume that you have already followed the Set Active Hadoop Distribution instructions and are completing your shim setup for a CDH 5.x distribution. If you have not, or don't understand what any of this means, you should read Configuring Pentaho for your Hadoop Distro and Version.

Configuring MapReduce 1

CDH 5.x is configured to use MapReduce 2 by default. If you want to use MapReduce 1 instead, you will need to change a properties file. If you are not sure what version of MapReduce your CDH 5 cluster is using, check with your system administrator.

  1. Navigate to the CDH5.x shim folder located in the hadoop-configurations folder that matches shim you previously configured. For example: cdh50, cdh51 etc. These steps will refer to that directory as cdhXX. This folder is different for each application and is located:
    • DI Server - data-integration-server/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/cdhXX
    • BA Server - biserver-ee/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/cdhXX
    • Spoon - data-integration/plugins/pentaho-big-data-plugin/cdhXX
  2. Edit the config.properties file.
  3. Change the shim.current.config property from mr2 to mr1. It should look like:
              shim.current.config=mr1
    
  4. Save and close the file.
  • No labels