Before you start
These steps assume that you have already followed the Set Active Hadoop Distribution instructions and are completing the shim setup for a CDH 5.x distribution. If you have not, or if these terms are unfamiliar, read Configuring Pentaho for your Hadoop Distro and Version first.
Configuring MapReduce 1
CDH 5.x is configured to use MapReduce 2 by default. To use MapReduce 1 instead, you need to edit the shim's config.properties file. If you are not sure which version of MapReduce your CDH 5 cluster is using, check with your system administrator.
To specify configuration settings, complete these steps.
- Open the config.properties file for each component from which you want to access CDH 5. The file appears in these directories.
...
- Navigate to the CDH 5.x shim folder, located in the hadoop-configurations folder, that matches the shim you previously configured (for example, cdh50 or cdh51). These steps refer to that directory as cdhXX. The folder's location differs for each application:
- DI Server - data-integration-server/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/
...
- cdhXX
- BA Server -
...
- biserver-ee/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/
...
- cdhXX
- Spoon -
...
- data-integration/plugins/pentaho-big-data-plugin/
...
2. Find the line of code that looks like this:
shim.current.config=mr2
...
- cdhXX
- Edit the config.properties file.
- Change the shim.current.config property from mr2 to mr1. The line should then read:
shim.current.config=mr1
...
- Save and close the file.
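If the same change has to be made in several shim folders, the edit described above could be scripted. The following is a minimal Python sketch, not a Pentaho-provided tool: the function names set_shim_config and update_file are hypothetical, and the script assumes the property appears on its own uncommented line, as in the example above. It writes a .bak copy next to the file before changing it.

```python
import re
from pathlib import Path

# Matches an uncommented "shim.current.config=<value>" line and captures
# everything up to and including the "=" so only the value is replaced.
_SHIM_LINE = re.compile(
    r"^([ \t]*shim\.current\.config[ \t]*=[ \t]*)[^\r\n]*",
    re.MULTILINE,
)


def set_shim_config(text: str, value: str = "mr1") -> str:
    """Return the properties text with shim.current.config set to value."""
    new_text, count = _SHIM_LINE.subn(lambda m: m.group(1) + value, text)
    if count == 0:
        # Nothing to change: the property is missing or commented out.
        raise ValueError("shim.current.config not found")
    return new_text


def update_file(path: str, value: str = "mr1") -> None:
    """Rewrite one config.properties file, keeping a .bak copy first."""
    p = Path(path)
    original = p.read_text(encoding="utf-8")
    p.with_name(p.name + ".bak").write_text(original, encoding="utf-8")
    p.write_text(set_shim_config(original, value), encoding="utf-8")
```

To use it, call update_file with the full path to the config.properties file inside the cdhXX folder of each application you configured. The path is left as an argument because it depends on your installation.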