Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Pentaho Cassandra How-To Guides
These how-to guides show, with step-by-step instructions, how to write data to, read data from, Cassandra using graphical tools. These guides include instructions on how to sort and group data, create reports, and combine data from Cassandra with data from other sources.
Some of these guides are available as screen casts with audio commentary. The recordings are available here:

Prerequisites

In order follow along with these how-to guides you will need the following:

Cassandra

A single-node local cluster is sufficient for these exercises but a larger and/or remote configuration will work as well.
These guides were developed using the Apache Cassandra distribution version 1.0.3. You can find Apache Cassandra downloads here: http://cassandra.apache.org/download/

Pentaho Data Integration

A desktop installation of Pentaho Data Integration. You will need version [TODO]. You can download this software here: [TODO]

Sample Files

The sample data files for these guides can be found here: [TODO]

Outline of Pentaho How-Tos

1. How To Write Data To Cassandra

This guide shows you how to read data from a data source (flat file) and write it to a column family in Cassandra.

2. How To Read Data From Cassandra

This guide shows you how to read data from a Cassandra column family and write it to a data source (flat file or database).

3. How To Create a Report Using Cassandra

This guide shows you how to read data from a Cassandra column family, sort it, rank it, and then use it as data for a PDF and Excel report.

4. How To Create a Parameterized Report Using Cassandra

This guide shows you how to submit a parameterized query to Cassandra how to use the resulting data for a PDF and Excel report.

5. How To Merge Cassandra Data with Data From Other Sources

This guide shows you how to read data from a Cassandra column family and merge it with data from other sources to create an enriched data stream.