Pentaho and Hadoop - Visual Development, Data Integration, Immediate Insight
Pentaho Business Analytics provides easy to use visual development tools and big data analytics that empower users to easily prepare, model, visualize and explore structured and unstructured data sets in Hadoop. Pentaho simplifies the end-to-end Hadoop data life cycle by providing a complete platform from data preparation to predictive analytics. Pentaho is unique by providing in-Hadoop execution for extremely fast performance.
Unable to render embedded object: File (align=center) not found.
The first three videos compare using Pentaho Kettle to create and execute a simple MapReduce job with using Java to solve the same problem. The Kettle transform shown here runs as a Mapper and Reducer within the cluster.
KZe1UugxXcs
What would the same task as "1) Pentaho MapReduce with Kettle" look like if you coded it in Java? At a half hour long, you may not want to watch the entire video...
cfFq1XB4kww
This is a quick summary of the previous two videos, "1) Pentaho MapReduce with Kettle" and "2) Straight Java", and why Pentaho Kettle boosts productivity and maintainability.
ZnyuTICOrhk
A quick example of loading into the Hadoop Distributed File System (HDFS) using Pentaho Kettle.
Ylekzmd6TAc
A quick example of extracting data from the Hadoop Distributed File System (HDFS) using Pentaho Kettle.
3Xew58LcMbg