Client Configuration
These instructions are for Hadoop distros other than MapR. If you are using MapR, go to the Configure Pentaho for MapR page.
Kettle Client
- Download and extract Kettle CE from the Downloads page.
The Kettle Client comes pre-configured for Apache Hadoop 0.20.2. If you are using this distro and version, no further configuration is required.
- Configure PDI Client for a different version of Hadoop
- Delete $PDI_HOME/libext/pentaho/hadoop-0.20.2-core.jar
- For all other distributions, replace this core jar with the one from your cluster. For example, if you are using Cloudera CDH3u3:
Copy $HADOOP_HOME/hadoop-core-0.20.2-cdh3u3.jar to $PDI_HOME/libext/pentaho
- For Hadoop 0.20.205 you also need Apache Commons Configuration in your set of PDI libraries. In that case, copy commons-configuration-1.7.jar to $PDI_HOME/libext/commons
- For Cloudera CDH3 Update 3 (CDH3u3) you also need to copy $HADOOP_HOME/lib/guava-r09-jarjar.jar to $PDI_HOME/libext/pentaho.
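The Cloudera CDH3u3 case above can be sketched as a small shell function. This is only an illustration: it assumes PDI_HOME and HADOOP_HOME are set in your environment, and the function name configure_pdi_for_cdh3u3 is hypothetical, not something shipped with Kettle.

```shell
# Sketch: point the Kettle client at a Cloudera CDH3u3 cluster.
# Assumes PDI_HOME and HADOOP_HOME are set; the function name is hypothetical.
configure_pdi_for_cdh3u3() {
    pdi_lib="$PDI_HOME/libext/pentaho"

    # Remove the bundled Apache Hadoop 0.20.2 core jar
    # (-f: no error if it was already removed).
    rm -f "$pdi_lib/hadoop-0.20.2-core.jar"

    # Copy the cluster's core jar, then the Guava jar that CDH3u3 needs.
    cp "$HADOOP_HOME/hadoop-core-0.20.2-cdh3u3.jar" "$pdi_lib/" &&
    cp "$HADOOP_HOME/lib/guava-r09-jarjar.jar" "$pdi_lib/"
}
```

Call configure_pdi_for_cdh3u3 once after extracting Kettle. For other distributions the jar names will differ, so adjust the two cp lines to match what your cluster actually ships.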
Pentaho Report Designer (PRD)
- Download and extract PRD from the Downloads page.
The PRD comes pre-configured for Apache Hadoop 0.20.2. If you are using this distro and version, no further configuration is required.
- Configure PRD for a different version of Hadoop
- Delete $PRD_HOME/lib/jdbc/hadoop-0.20.2-core.jar
- Copy $HADOOP_HOME/hadoop-core.jar from your distribution into $PRD_HOME/lib/jdbc
- For Hadoop 0.20.205 you also need Apache Commons Configuration in your set of PRD libraries. In that case, copy commons-configuration-1.7.jar to $PRD_HOME/lib/jdbc
- For Cloudera CDH3 Update 3 (CDH3u3) you also need to copy $HADOOP_HOME/lib/guava-r09-jarjar.jar to $PRD_HOME/lib/jdbc.
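After swapping jars, it can help to confirm that the old core jar is gone and only the new one remains. The following is a minimal sketch of such a check; check_single_core_jar is a hypothetical helper, not part of PRD, and the file-name pattern is an assumption based on the jar names used on this page.

```shell
# Sketch: confirm a library directory holds exactly one Hadoop core jar.
# check_single_core_jar is a hypothetical helper, not part of PRD.
check_single_core_jar() {
    lib_dir=$1
    count=0
    # The pattern covers both naming styles used on this page:
    # hadoop-0.20.2-core.jar and hadoop-core*.jar.
    for jar in "$lib_dir"/hadoop-*core*.jar; do
        [ -e "$jar" ] || continue   # the glob matched nothing
        count=$((count + 1))
        echo "found: $jar"
    done
    if [ "$count" -ne 1 ]; then
        echo "expected 1 Hadoop core jar in $lib_dir, found $count" >&2
        return 1
    fi
}
```

For PRD this would be run as check_single_core_jar "$PRD_HOME/lib/jdbc". A non-zero exit status means either the bundled 0.20.2 jar was not deleted or the cluster jar was not copied.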
Pentaho BI Server
- Download and extract the BI Server from the Downloads page.
The BI Server comes pre-configured for Apache Hadoop 0.20.2. If you are using this distro and version, no further configuration is required.
- Configure BI Server for a different version of Hadoop
- Delete $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib/hadoop-0.20.2-core.jar
- Copy $HADOOP_HOME/hadoop-core.jar from your distribution into $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib/
- For Hadoop 0.20.205 you also need Apache Commons Configuration in your set of BI Server libraries. In that case, copy commons-configuration-1.7.jar to $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib
- For Cloudera CDH3 Update 3 (CDH3u3) you also need to copy $HADOOP_HOME/lib/guava-r09-jarjar.jar to $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib.
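Because the same delete-then-copy pattern applies to each product, the BI Server steps can be sketched as one parameterized function. This is an illustrative sketch only: install_hadoop_jars is a hypothetical helper, and where you obtain commons-configuration-1.7.jar is left to you, since this page does not say.

```shell
# Sketch: swap the Hadoop core jar in a library directory and add extra jars.
# install_hadoop_jars is a hypothetical helper. Arguments:
#   $1 = target lib directory, $2 = core jar from the cluster, $3... = extra jars.
install_hadoop_jars() {
    [ "$#" -ge 2 ] || { echo "usage: install_hadoop_jars LIB_DIR CORE_JAR [EXTRA_JAR...]" >&2; return 2; }
    lib_dir=$1
    core_jar=$2
    shift 2

    # Stop before deleting anything if the replacement jar is missing.
    [ -f "$core_jar" ] || { echo "missing $core_jar" >&2; return 1; }

    # Remove the bundled Apache Hadoop 0.20.2 core jar, then copy the cluster's.
    rm -f "$lib_dir/hadoop-0.20.2-core.jar"
    cp "$core_jar" "$lib_dir/" || return 1

    # Extra jars, e.g. commons-configuration-1.7.jar for Hadoop 0.20.205
    # or guava-r09-jarjar.jar for CDH3u3.
    for extra in "$@"; do
        cp "$extra" "$lib_dir/" || return 1
    done
}
```

For the BI Server, the first argument would be "$BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib" and the second "$HADOOP_HOME/hadoop-core.jar", followed by any extra jars your distribution requires. Checking for the replacement jar first avoids leaving the server with no Hadoop core jar at all.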