Configure Line Item Column

Column definition

Name: Name a column as needed. A name is automatically generated from the OCR of the original column header, but it can be changed as needed.

Optional: Select this box to specify that this column is optional and not required for classification matching.

Column can wrap over multiple lines: Select this box to specify that the column may wrap over multiple lines in a horizontal direction, that is, more columns than the originally detected column.

Header pattern options

Header pattern: The column header pattern is automatically generated from the OCR recognition of the column header upon initial creation. The regular expressions and recognition options can be edited with the two buttons next to the field.

Global Header pattern: Global column header patterns are initially generated from the OCR recognition of a column header upon the initial creation of column definition within a configured line item. These global header patterns allow users to quickly select a globally configured regular expression and recognition options to be applied to that field.

Value pattern options

  • Match Numeric Values: Specify that this column contains only numeric values, and only those values should be extracted via OCR.

  • Match Text Values: Indicate that PSIcapture must recognize alphanumeric text syntax and extract all text-based data.

Use a custom value pattern: Set up a specific value pattern to fine-tune OCR recognition and apply Regex and text filtering options to the pattern.

Use global pattern: Select from a list of previously defined global patterns to fine-tune OCR recognition and apply Regex and text filtering options to the recognition of this field or line item.

Character filtering options

Character filter: Apply character filtering to the recognition of the characters in the column and row. These character filters are the standard filters found throughout the rest of PSIcapture.

The following options are disabled if All Characters is selected.

Enable extended characters: Define a list of extended characters to recognize, including currency symbols, unique syntax, and so on.

Invalid character action: Select one of the following options:

  • Do Not Correct: No character adjustments is applied.
  • Remove: Remove the detected invalid characters.
  • Auto Correct: Automatically correct any invalid characters with the specified replacements.
  • Replace with marker: Replace the invalid characters with a marker for review at a later time.