Annotation Step
Description
The Annotation Stream step helps you refine your data table for the Streamlined Data Refinery by either creating a measure or attribute on column(s) that you specify.
If you want, you can create multiple annotations on the same field; for example, you might want to create an average measure and a sum measure on the same field. You can also create multiple Annotation Streams to modify the same data model.
Job Entry Workflow
The Annotation Stream modifies the default model produced from the Build Model job entry.
Here is an example workflow.
After you are done annotating your data model, you are ready to publish it.
Prerequisites
You must have the Streamlined Data Refinery set up on your system before using this step.
Annotate Stream
These sections describe the different ways that you can create annotations. The annotation type that you create - either Create Measure or Create Attribute - determines which properties are shown in the dialog box to complete that annotation. Â
Option |
Description |
---|---|
Step Name: |
Displays the name of the step. |
Annotations |
Displays a list of fields and annotations. |
-Edit |
Use to edit a field that you have highlighted from the Field list. |
-Delete |
Use to delete a field that you have highlighted from the Field list. |
Field |
Lists the names of the fields selected for annotation. |
Model Action |
Specifies which model action is being taken: Create Measure or Create Attribute. |
Summary |
Displays a summary of that specific annotation. |
Select Fields |
Displays a list of fields to select from. |
Help |
Provides a link to the Help documentation. |
OK |
Click OK to finish the annotation and close the window. |
Cancel |
Click Cancel to discard entries and close the window. |
Select Fields to Annotate
This section describes how to use the Select Fields to Annotate dialog box.
Option |
Description |
---|---|
Available Fields |
Lists the available fields for annotating. |
> |
Use this button to Add only the selected field(s). |
>> |
Use this button to Add All available fields. |
< |
Use this button to Remove only the selected field(s). |
<< |
Use this button to Remove All selected fields. |
Selected Fields |
Lists all of the fields selected for annotating. |
OK |
Click OK to finish the annotation and close the window. |
Cancel |
Click Cancel to discard entries and close the window. |
Annotate Individual Fields
This section describes how to annotate an individual field.
To begin annotating, select and double click on a field in the list of available fields.
Options |
Description |
---|---|
Annotate Field |
Shows the name of the field to be annotated. |
Actions: |
Select an annotation type from the drop down: either Create Measure or Create Attribute. |
Create Measure on a Column
Creates a measure or calculated measure using a field of existing measures.
Â
Options |
Description |
---|---|
Name - Value |
Displays the name of the annotation component. Click in the Value column next to the item that you want to work with. |
Measure Name |
Type a name for your annotation. |
Aggregation Type |
Choose from a list of aggregation types: SUM, AVERAGE, COUNT, COUNT_DISTINCT, MINIMUM, MAXIMUM. |
Format String |
Choose from a list of Format Strings. |
Description |
Enter a description for the annotation. |
Help |
Provides a link to the Help documentation. |
Previous |
Scrolls back through the list of created annotations on the model. |
Next |
Scrolls forward through a list of created annotations on the model. |
OK |
Click OK to finish the annotation and close the window. |
Cancel |
Click Cancel to discard entries and close the window. |
Create Attribute
Creates a single level hierarchy, using a field or existing hierarchy level.
Options |
Description |
---|---|
Name - Value |
Displays the name of the annotation component. Click in the Value column next to the item that you want to work with. |
Attribute Name |
Type a name for your annotation. |
Time Level Type |
Choose a Time Level Type from the drop down menu. |
Time Source Format |
Choose the format that you want the attribute to appear in. |
Ordinal Field |
Refers to the field used to sort the members. |
Description |
Enter a description for the annotation. |
Parent Attribute |
Populated with fields that have been previously annotated within the same dimension/hierarchy. |
Dimension |
Choose an existing dimension or create a new dimension for the attribute. |
Hierarchy |
Choose an existing hierarchy or create a new hierarchy for the attribute. |
Is Unique |
Select either True or False. |
Example/Sample(s)
Information about working with the Streamlined Data Refinery, and other general information related to PDI, is located in the Pentaho Documentation.
You can download a set of SDR sample files from our folder on Box.