CDH 5 is configured to use MapReduce 2 by default. If you want to use MapReduce 1 instead, you will need to change a properties file. If you are not sure what version of MapReduce your CDH 5 cluster is using, check with your system administrator.
To specify configuration settings, complete these steps.
- Open the config.properties file for the components you want to access CDH 5 from. The config.properties file appears in these directories.
- DI Server - data-integration-server/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/hadoop-configurations/cdh50beta
- BA Server - biserver-ee/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/hadoop-configurations/cdh50beta
- Spoon - data-integration/plugins/pentaho-big-data-plugin/hadoop-configurations/cdh50beta
- Report Designer - report-designer/plugins/pentaho-big-data-plugin/hadoop-configurations/cdh50beta
- Metadata Editor - metadata-editor/plugins/pentaho-big-data-plugin/hadoop-configurations/cdh50beta
2. Find the line of code that looks like this:
shim.current.config=mr2
3. To configure the shim to use MapReduce 1, change mr2 to mr1 like this:
shim.current.config=mr1
4. Save and close the file.