...
- Unzip pentaho-mapreduce-libraries.zip. It contains a single lib/ directory with the required Kettle dependencies
- Copy additional libraries to the lib/ directory
- Zip up the lib/ directory into pentaho-mapreduce-libraries-custom.zip so that the archive contains lib/ with all jars inside it. You may create subdirectories within lib/; all jars found in lib/ and its subdirectories are added to the classpath of the executing job
- Edit pentaho-mapreduce.properties and update the following properties:

      pmr.kettle.installation.id=custom
      pmr.libraries.archive.file=pentaho-mapreduce-libraries-custom.zip
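
The repackaging steps above can also be scripted. The following is a minimal Python sketch, not part of the Pentaho tooling: the function name `build_custom_archive` and its parameters are hypothetical, and it assumes the stock archive has the lib/ layout described on this page. It unzips the stock archive into a temporary directory, copies the extra jars into lib/, and writes a new archive whose entries all start with `lib/`.

```python
import shutil
import tempfile
import zipfile
from pathlib import Path


def build_custom_archive(base_zip, extra_jars, out_zip):
    """Repackage base_zip with extra_jars added under lib/; return entry names.

    Hypothetical helper for illustration only.
    """
    with tempfile.TemporaryDirectory() as tmp:
        work = Path(tmp)
        # Step 1: unzip the stock archive (expected to contain a lib/ directory).
        with zipfile.ZipFile(base_zip) as zf:
            zf.extractall(work)
        lib_dir = work / "lib"
        lib_dir.mkdir(exist_ok=True)
        # Step 2: copy the additional jars into lib/.
        for jar in extra_jars:
            shutil.copy(jar, lib_dir / Path(jar).name)
        # Step 3: zip lib/ so that every entry path starts with "lib/".
        with zipfile.ZipFile(out_zip, "w", zipfile.ZIP_DEFLATED) as zf:
            for path in sorted(lib_dir.rglob("*")):
                if path.is_file():
                    zf.write(path, path.relative_to(work).as_posix())
    # Re-open the result and report what it contains.
    with zipfile.ZipFile(out_zip) as zf:
        return zf.namelist()
```

A call such as `build_custom_archive("pentaho-mapreduce-libraries.zip", ["my-plugin.jar"], "pentaho-mapreduce-libraries-custom.zip")` (where `my-plugin.jar` is a placeholder name) would produce the custom archive; the properties file still has to be edited by hand as described in the last step.
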
...
- Remove the pentaho.* properties from your mapred-site.xml
- Remove the directories those properties referenced
- Restart the TaskTracker process
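
As an illustration of the first two cleanup steps, here is a small Python sketch that finds and removes every property whose name starts with `pentaho.` from the text of a Hadoop configuration file. The function name `strip_pentaho_properties` is hypothetical, and the sketch assumes the usual `<configuration>/<property>/<name>/<value>` layout of mapred-site.xml. The returned mapping shows which values (and therefore which directories) the removed properties referenced.

```python
import xml.etree.ElementTree as ET


def strip_pentaho_properties(xml_text):
    """Return (cleaned_xml, removed); removed maps property name -> value.

    Hypothetical helper for illustration only.
    """
    root = ET.fromstring(xml_text)
    removed = {}
    # Iterate over a copy of the list because we remove elements while looping.
    for prop in list(root.findall("property")):
        name = (prop.findtext("name") or "").strip()
        if name.startswith("pentaho."):
            removed[name] = (prop.findtext("value") or "").strip()
            root.remove(prop)
    return ET.tostring(root, encoding="unicode"), removed
```

The sketch works on a string and does not write anything back. Note that `ElementTree` discards XML comments when parsing, so it may be safer to use the returned mapping as a checklist and edit a backed-up copy of the file by hand.
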
Appendix A: pentaho-mapreduce-libraries.zip structure:

    pentaho-mapreduce-libraries.zip/
    +- lib/
       +- kettle-core-{version}.jar
       +- kettle-engine-{version}.jar
       ..
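
A custom archive can be checked against this layout before it is deployed. The sketch below is one possible sanity check, with a hypothetical function name `jars_outside_lib`: it lists any `.jar` entries that do not sit under `lib/`, on the assumption that only jars inside lib/ and its subdirectories reach the job's classpath.

```python
import zipfile


def jars_outside_lib(zip_path):
    """Return names of .jar entries that are not located under lib/.

    Hypothetical helper for illustration only.
    """
    with zipfile.ZipFile(zip_path) as zf:
        return [
            name
            for name in zf.namelist()
            if name.endswith(".jar") and not name.startswith("lib/")
        ]
```

An empty list means every jar in the archive is under lib/; any names returned point to jars that would need to be moved before zipping again.
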