Configure Pentaho for MapR
Setting up and configuring the Pentaho node dist, Kettle (PDI) and Reporting
Preconfigured Packages
These instructions are specific to the MapR distribution of Hadoop, if you are not using MapR, go to the Configure Pentaho for Cloudera and Other Hadoop Versions page.
Client Configuration
MapR Client
Follow installation instructions provided by MapR for your architecture: Setting up the Client - MapR
Kettle Client
Download and extract Kettle CE from the Downloads page.
Configure PDI Client for MapR
Overview:
The MapR native libraries for your architecture must be added to the
java.library.pathMapR Hadoop Configuration directory needs to be on the classpath
MapR Hadoop Core library must be on the classpath
All architectures
Update the $PDI_HOME/launcher/launcher.properties's classpath property to include the relative path to your MapR configuration directory. e.g.:
classpath=../:../ui:../ui/images:../libext/mondrian/config:${HADOOP_HOME}/conf:../libext/bigdata/pigConf:../../../../opt/mapr/conf, or use the attachedDelete $PDI_HOME/libext/bigdata/hadoop-0.20.2-core.jar
Copy $MAPR_HOME/hadoop/hadoop-0.20.2/lib/hadoop-0.20.2-dev-core.jar into $PDI_HOME/libext/bigdata
Copy $MAPR_HOME/hadoop/hadoop-0.20.2/lib/maprfs-0.1.jar into $PDI_HOME/libext/bigdata
Linux x64
Update the $PDI_HOME/spoon.sh with the attached
Update the $PDI_HOME/pan.sh with the attached
Update the $PDI_HOME/kitchen.sh with the attached
Update the $PDI_HOME/carte.sh with the attached
Mac OS X 64-bit
Update the Data Integration 64-bit.app/Content/Info.plist with the attached
Apply the Hadoop client configuration files by adding the core-site, hdfs-site, and mapred-site.xml files in the $PDI_HOME directory.
Report Designer
Download and extract PRD from the Downloads page.
Configure PRD for MapR
Delete $PRD_HOME/lib/jdbc/hadoop-0.20.2-core.jar
Copy $MAPR_HOME/hadoop/hadoop-0.20.2/lib/hadoop-0.20.2-dev-core.jar into $PRD_HOME/lib
Copy $MAPR_HOME/hadoop/hadoop-0.20.2/lib/maprfs-0.1.jar into $PRD_HOME/lib
Linux x64:
Add "-Djava.library.path=/opt/mapr/hadoop/hadoop-0.20.2/lib/native/Linux-amd64-64" to the last line in $PRD_HOME/report-designer.sh
For MacOS:
Add "-Djava.library.path=/opt/mapr/hadoop/hadoop-0.20.2/lib/native/Mac_OS_X-x86_64-64" to the "VMOptions" entry in $PRD_HOME/Pentaho\ Report\ Designer.app/Contents/Info.plist