CEWS Pipeline Toolkit - MetaDataInjectorSuffixTree

CEWS Pipeline Toolkit - MetaDataInjectorSuffixTree

MetaDataInjectorSuffixTree

This stage performs dictionary lookups on input fields and populates the output fields with all the matched values. This is similar to MetaDataInjector, except that it uses a SuffixTree which is more efficient in handling large numbers of triggers vs. sequential string comparison so it only supports “equals” matches.

Configuration

<documentProcessor dllFile="StandardDocumentProcessors.dll" className="StandardDocumentProcessors.MetaDataInjectorSuffixTree">
     <parameter name="InputField">in</parameter>
     <parameter name="DefinitionFile">../etc/MetaDataInjectorDictionary.xml</parameter>
     <parameter name="DefaultValue">None</parameter>
     <parameter name="OutputDelimiter">;</parameter>
</documentProcessor>

Sample DefinitionFile

<dictionary>
     <entry type="equals" key="x" value="a" field="out1" />
     <entry type="equals" key="x" value="b" field="out2" />
     <entry type="equals" key="x" value="c" field="out3" />
</dictionary>

Configuration Options

  • type: equals
  • key: the input string or regex to look for in the input field 
  • value: Is the value (or pattern) to put into the output field
  • field: The name of the output crawled property. (Can be "type|name|propsetid" if both PropSetID & PropType are not populated)

 

Sample Input XML

<Document>
     <ManagedProperty type=“String” name="in">a string that contains x</ManagedProperty>
</Document>

Sample Output XML

<?xml version="1.0" encoding="utf-8"?>
<Document>
     <ManagedProperty type=“String” name="in">a string that contains x</ManagedProperty>
     <ManagedProperty type=“String” name="out1">a</ManagedProperty>
     <ManagedProperty type=“String” name="out2">b</ManagedProperty>
     <ManagedProperty type=“String” name="out3">c</ManagedProperty>
</Document>
Leave a Comment
  • Please add 1 and 8 and type the answer here:
  • Post
Wiki - Revision Comment List(Revision Comment)
Sort by: Published Date | Most Recent | Most Useful
Comments
Page 1 of 1 (1 items)
Wikis - Comment List
Sort by: Published Date | Most Recent | Most Useful
Posting comments is temporarily disabled until 10:00am PST on Saturday, December 14th. Thank you for your patience.
Comments
  • Maheshkumar S Tiwari edited Original. Comment: Added code block and Tags

Page 1 of 1 (1 items)