CEWS Pipeline Toolkit - DictionaryMatcher

CEWS Pipeline Toolkit - DictionaryMatcher

DictionaryMatcher

DictionaryMatcher performs lookups on a dictionary.

Usage

To perform dictionary lookups on input fields and populate the output field with the dictionary match values. Supports substring, regex, and exact string matching.

If regex, then group substitutions are supported, allowing part of the matching value to be inserted into the output value. Group substitution proceeds from left to right, in numeric order, starting with "<0>",which is the whole matching string.

If you have a large dictionary you should look at using the TriggerMatcherDictionary instead.

Configuration

 

<documentProcessor name="Source matcher"
             dllFile="StandardDocumentProcessors.dll"
             className="StandardDocumentProcessors.DictionaryMatcher">
    <parameterList name="InputFields" delimiter=";">Path</parameterList>
    <parameterList name="OutputFields" delimiter=";">Source</parameterList>
    <parameter name="DefinitionFile">../etc/sourceDictionary.xml</parameter>
    <parameter name="DefaultValue">Unknown</parameter>
    <parameter name="OutputDelimiter">;</parameter>
</documentProcessor>

 

Configuration Options

InputFields

List of fields to test against the dictionary.

OutputFields

List of fields to assign output to. Number of fields much match that of InputFields.
All the fields (managed properties) you list must be of type Text, and they must have HasMultipleEntries set to true if you expect to match multiple values.

DefinitionFile

Path to dictionary xml file. Sample configuration:

<dictionary>
   <entry type="substring" key="string" value="Success" />
   <entry type="regex" key=" ([\\d]+) " value="Number: {1}" />
</dictionary>

 

DefaultValue

If no match is found, use this value

OutputDelimiter

String out separate multiple values in output fields. For SharePoint 2013 you must use the Split processor down-pipeline to support multi value fields.

ErrorCodeOnMatch

If set to non-zero value then on match the document's ErrorCode will be set to this value. And if the pipelines @breakOnError is set, processing of the pipeline will stop after this state. Note that the output/default value will still be set. To avoid this, use the output property "*IGNORE*" (without the quotes)
 

 

Leave a Comment
  • Please add 1 and 6 and type the answer here:
  • Post
Wiki - Revision Comment List(Revision Comment)
Sort by: Published Date | Most Recent | Most Useful
Comments
  • Maheshkumar S Tiwari edited Revision 1. Comment: Added tags. Its better to mention that content will be added in the article or in Tags .

Page 1 of 1 (1 items)
Wikis - Comment List
Sort by: Published Date | Most Recent | Most Useful
Posting comments is temporarily disabled until 10:00am PST on Saturday, December 14th. Thank you for your patience.
Comments
  • Maheshkumar S Tiwari edited Revision 1. Comment: Added tags. Its better to mention that content will be added in the article or in Tags .

Page 1 of 1 (1 items)