...
- Create a PDI Transformation that sources the picklist of distinct IP Address to be used for a report parameter
- Create a PDI Transformation that sources a set of weblog data for a selected IP Address. This will be the primary data source for the report
- Create a Report that uses the PDI transformations for parameter list and report data.
Prerequisites
In order follow along with this how-to guide you will need the following:
- MapR
- Pentaho Data Integration
- Report Designer
Sample Files
The sample data file needed for this guide is:
...
NOTE: If you have already completed the "Using Pentaho MapReduce to Generate an Aggregate Dataset" guide, then the necessary files will already be in the proper location.
This file should be placed into the CLDB at /weblogs/aggregate_mr using the following commands.
hadoop fs -mkdir /weblogs
hadoop fs -mkdir /weblogs/aggregate_mr
hadoop fs -put weblogs_aggregate.txt /weblogs/aggregate_mr/part-00000
Step-By-Step Instructions
Setup
Start MapR if it is not already running.
Create a Parameter Picklist PDI Transformation
In this task you will create a PDI transformation to get a list of distinct IP Addresses from CLDB data. This transformation will later be used as the data source for a report parameter picklist.
...
- Save the Transformation: Choose 'File' -> 'Save as...' from the menu system. Save the transformation as 'cldb_ip_list.ktr' into a folder of your choice.
Create a CLDB Query PDI Transformation
In this task you will create a PDI transformation to get the data for your report from CLDB. This transformation will later be used as the primary datasource for the report.
...
- Save the Transformation: Choose 'File' -> 'Save as...' from the menu system. Save the transformation as 'cldb_to_report.ktr' into a folder of your choice.
Create a Report
In this task you will create a report that uses PDI transformations to source data from CLDB files.
...
- Do any desired formatting to the report.
Summary
Anchor | ||||
---|---|---|---|---|
|