Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Wiki Markup
{scrollbar}

h1. Client Configuration

These instructions are for Hadoop distros other than MapR, if.  If you are using MapR go to the [Configure Pentaho for MapR] page.  If you are using Cloudera CDH4 MRv1 go to the [BAD:Configure Pentaho for Cloudera CDH4] page.

h2. Kettle Client

# Download and extract Kettle CE from the [Downloads] page.
_The Kettle Client comes pre-configured for Apache Hadoop 0.20.2. If you are using this distro and version, no further configuration is required._
# Configure PDI Client for a different version of Hadoop
## Delete $PDI_HOME/libext/bigdata/hadoop-0.20.2-core.jar
## For all other distributions you should replace this core jar with the one from your cluster. For example, if you are using Cloudera CDHu3:
Copy $HADOOP_HOME/hadoop-core-0.20.2-cdh3u3.jar to $PDI_HOME/libext/bigdata
## For Hadoop 0.20.205 you also need to have Apache Commons Configuration included in your set of PDI libraries.  In that case copy [commons-configuration-1.7.jar|http://commons.apache.org/configuration/download_configuration.cgi] to $PDI_HOME/libext/commons
## For Cloudera CDH3 Update 3 you also need to copy $HADOOP_HOME/lib/guava-r09-jarjar.jar to $PDI_HOME/libext/bigdata.

h2. Pentaho Report Designer (PRD)

# Download and extract PRD from the [Downloads] page.
_PRD comes pre-configured for Apache Hadoop 0.20.2. If you are using this distro and version, no further configuration is required._
# Configure PRD for a different version of Hadoop
## Delete $PRD_HOME/lib/bigdata/hadoop-0.20.2-core.jar
## Copy $HADOOP_HOME/hadoop-core.jar from your distribution into $PRD_HOME/lib/bigdata
## For Hadoop 0.20.205 you also need to have Apache Commons Configuration included in your set of PDI libraries.  In that case copy [commons-configuration-1.7.jar|http://commons.apache.org/configuration/download_configuration.cgi] to $PRD_HOME/lib/bigdata
## For Cloudera CDH3 Update 3 you also need to copy $HADOOP_HOME/lib/guava-r09-jarjar.jar to $PRD_HOME/lib.

h2. Pentaho BI Server

# Download and extract The BI Server from the [Downloads] page.
_The BI Server comes pre-configured for Apache Hadoop 0.20.2. If you are using this distro and version, no further configuration is required._
# Configure BI Server for a different version of Hadoop
## Delete $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib/hadoop-0.20.2-core.jar
## Copy $HADOOP_HOME/hadoop-core.jar from your distribution into $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib/
## For Hadoop 0.20.205 you also need to have Apache Commons Configuration included in your set of PDI libraries.  In that case copy [commons-configuration-1.7.jar|http://commons.apache.org/configuration/download_configuration.cgi] to $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib
## For Cloudera CDH3 Update 3) you also need to copy $HADOOP_HOME/lib/guava-r09-jarjar.jar to $PDI_HOME/libext/pentaho.

{include:Known Configuration Issues}