Pentaho develops shims to support different Hadoop distributions. Many shims are already included with the software, but new shims must sometimes be downloaded and installed manually.
Note: Before you complete the tasks on this page, you should read Configuring Pentaho for your Hadoop Distro and Version That page explains what a shim is and explains where to get the shim you want.
Not all shims are distributed with Pentaho Software. If the shim that your cluster requires is not included you'll need to follow these instructions to install the shim. You need to install the shim for each Pentaho application from which you want to access the cluster.
Pentaho applications include the DI and BA Servers as well as design tools such as Spoon, Report Designer, and Metadata Editor.
- Stop the application (e.g. Spoon, DI Server, Report Design, BA Server, Metadata Editor) if it is running.
- Copy the shim .zip file that you downloaded in Configuring Pentaho for your Hadoop Distro and Version#GetShim, to the hadoop-configurations folder. This folder is different for each application and located:
- DI Server - data-integration-server/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/hadoop-configurations
- Spoon - data-integration/plugins/pentaho-big-data-plugin/hadoop-configurations
- BA Server - biserver-ee/pentaho-solutions/system/kettle/plugins/pentaho-big-data-plugin/hadoop-configurations
- Pentaho Report Designer - report-designer/plugins/pentaho-big-data-plugin/hadoop-configurations
- Metadata Editor - metadata-editor/plugins/pentaho-big-data-plugin/hadoop-configurations
- Unzip the new shim. It should create a directory that is the name of the shim. For example, cdh42.zip should create a directory named cdh42. Do not rename the directory.
3. Go to the Set Active Hadoop Distribution section of the Configure Pentaho for Your Hadoop Distribution and Version page.