Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

This step type allows you to validate an input field against regular expression. A regular expression (regex or regexp for short) is a special text string for describing a search pattern. For example, the equivalent regex for wildcard notations such as *.txt to find all text files in a file manager is:

Code Block
.*\.txt

If you enable capture groups, this step can be used to parse a complex string of text and create several new fields out of it.  For instance, if you had a field of text containing an author's name in quotes and the number of posts made by them:

Code Block

"Author, Ann" - 53 posts

You could use the following regex with two capture groups to create two new fields in the transformation, one for the name, and one for the number of posts:

Code Block

^"([^"]*)" - (\d*) posts$


See also:

IMPORTANT: Don't panic!  For people new to regular expressions, the cryptic nature of the language can be a bit daunting.  However, "RegExps" regular expressions pack a lot of punch and are very much worth the time you spend on it.  There are several websites and software packages available to help you create and test regular expressions.  Just do a web search with the terms regular expression editor.

Settings Tab

Option

Description

Step name

Name of the step.

Note: This name has to be unique in a single transformation.

Field to evaluate

Name of the field to evaluate

Result Fieldname

The name of the return field (boolean)

Create fields for capture groups

Enable this if you want to create new fields based on capture groups in the regular expression.   If this option is not enabled, the step will determine whether the row matches the regular expression or not.

Regular expression

Put here the regular expression to match.

Use variable substitution

If you use variable, return it's content by selecting this option.

Capture group fields

Here you can specify the new fields you would like to capture.
IMPORTANT: The order of occurrence is the same as the order of the capture groups in the regular expression. The different columns allow you to change to the required data type right away.Note:  A capture group is a part of the expression between 2 brackets.  For example, you can capture a sequence of numerical characters like this: .([0-9]).*

...