OCR Utils

Provides post-OCR matching utilities to find or validate words/phrases against OCR bounding boxes.

Quick Start

To get started:

Configuration varies by operation type.

word_to_search: Word or phrase (or a list of words/phrases) to search for.

list_of_bboxes: OCR corpus bounding boxes. This must be a list in one of these formats:

Word-level: [[[x1, y1, x2, y2], "text"], ...]
Phrase-level: [[[x1, y1, x2, y2], [[[x1, y1, x2, y2], "text"], ...], "phrase text"], ...]
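The two layouts above can be told apart by their shape alone. The following is a minimal Python sketch of such a check, not the block's own validation logic: the helper names (is_bbox, is_word_entry, is_phrase_entry, detect_level) are hypothetical, and treating an empty list as invalid is an assumption.

```python
from typing import Any


def is_bbox(value: Any) -> bool:
    """True for a sequence of four numbers: [x1, y1, x2, y2]."""
    return (
        isinstance(value, (list, tuple))
        and len(value) == 4
        and all(
            isinstance(v, (int, float)) and not isinstance(v, bool)
            for v in value
        )
    )


def is_word_entry(entry: Any) -> bool:
    """True for a word-level entry: [[x1, y1, x2, y2], "text"]."""
    return (
        isinstance(entry, (list, tuple))
        and len(entry) == 2
        and is_bbox(entry[0])
        and isinstance(entry[1], str)
    )


def is_phrase_entry(entry: Any) -> bool:
    """True for a phrase-level entry: [bbox, [word entries...], "phrase text"]."""
    return (
        isinstance(entry, (list, tuple))
        and len(entry) == 3
        and is_bbox(entry[0])
        and isinstance(entry[1], (list, tuple))
        and all(is_word_entry(w) for w in entry[1])
        and isinstance(entry[2], str)
    )


def detect_level(list_of_bboxes: Any) -> str:
    """Return "word" or "phrase"; raise ValueError for anything else."""
    # Assumption: an empty list is rejected because its level cannot be inferred.
    if not isinstance(list_of_bboxes, list) or not list_of_bboxes:
        raise ValueError("list_of_bboxes must be a non-empty list")
    if all(is_word_entry(e) for e in list_of_bboxes):
        return "word"
    if all(is_phrase_entry(e) for e in list_of_bboxes):
        return "phrase"
    raise ValueError("list_of_bboxes matches neither word-level nor phrase-level format")
```

Running a check like this before sending data to the block may help catch an "Invalid OCR format" failure early.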

Optionally restrict matching to one or more regions, each given as a bounding box.

Format: [[x1, y1, x2, y2], ...]
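To illustrate what restricting by region can mean, here is a minimal Python sketch that keeps only word-level boxes lying inside at least one region. It assumes a box counts as "inside" only when it is fully contained in the region; the block may use a different rule (for example, overlap), and the function and parameter names are hypothetical.

```python
from typing import List, Sequence


def inside(bbox: Sequence[float], region: Sequence[float]) -> bool:
    """True when bbox [x1, y1, x2, y2] lies fully within region [x1, y1, x2, y2]."""
    x1, y1, x2, y2 = bbox
    rx1, ry1, rx2, ry2 = region
    return rx1 <= x1 and ry1 <= y1 and x2 <= rx2 and y2 <= ry2


def filter_by_regions(word_boxes: List[list], regions: List[Sequence[float]]) -> List[list]:
    """Keep word-level entries whose bbox falls inside any of the regions."""
    # Assumption: no regions means no restriction, so every entry is kept.
    if not regions:
        return list(word_boxes)
    return [
        entry for entry in word_boxes
        if any(inside(entry[0], region) for region in regions)
    ]
```

For example, with the region [0, 0, 55, 45], a box at [10, 20, 50, 40] is kept and a box at [60, 70, 100, 90] is dropped.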

msg.payload contains an output field with the results.

Depending on the operation type, msg.payload.output takes one of two shapes:

A dictionary keyed by the searched word/phrase.
An object with a matches field.

This block currently supports:

Example input:

{
  "word_to_search": "Invoice",
  "list_of_bboxes": [[[10, 20, 50, 40], "Invoice"], [[60, 70, 100, 90], "Total"]]
}

Example output (msg.payload):

{
  "output": {
    "Invoice": [
      { "word": "Invoice", "word_bbox": [10, 20, 50, 40], "confidence": 100.0, "edit_distance": 0 }
    ]
  }
}
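The fields in this result (word, word_bbox, confidence, edit_distance) suggest fuzzy matching by edit distance. The Python sketch below shows one way such a result could be produced for word-level input. It is an illustration under assumptions, not the block's implementation: the confidence formula (100 scaled by one minus the distance over the longer length), the max_distance threshold, and the function names are all hypothetical.

```python
from typing import Dict, List


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between a and b (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # substitute, free if equal
            ))
        prev = curr
    return prev[-1]


def find_word(word_to_search: str, list_of_bboxes: List[list],
              max_distance: int = 1) -> Dict[str, dict]:
    """Return matches for one word against word-level OCR boxes."""
    matches = []
    for bbox, text in list_of_bboxes:
        distance = edit_distance(word_to_search, text)
        if distance <= max_distance:
            longest = max(len(word_to_search), len(text), 1)
            # Assumed scoring: an exact match scores 100.0, each edit lowers it.
            confidence = round(100.0 * (1 - distance / longest), 2)
            matches.append({
                "word": text,
                "word_bbox": bbox,
                "confidence": confidence,
                "edit_distance": distance,
            })
    return {"output": {word_to_search: matches}}
```

With the inputs "Invoice" and the two word-level boxes for "Invoice" and "Total", this sketch returns a single exact match with confidence 100.0 and edit_distance 0.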

When the block fails, it raises an error. Use a Catch block in your flow to handle failures and inspect the error payload.

Invalid OCR format: Input data doesn't match expected OCR format.
Missing required fields: word_to_search and list_of_bboxes are required.
Service unavailable: The service is unavailable or unreachable.
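The "Missing required fields" failure can be caught before the block runs. As a minimal Python sketch, assuming the input is a plain dictionary and that empty values count as missing (the helper name missing_fields is hypothetical):

```python
from typing import Any, Dict, List

# The two fields this document lists as required.
REQUIRED_FIELDS = ("word_to_search", "list_of_bboxes")


def missing_fields(payload: Dict[str, Any]) -> List[str]:
    """Return the names of required fields that are absent or empty."""
    # Assumption: None, an empty string, and an empty list all count as missing.
    return [
        name for name in REQUIRED_FIELDS
        if payload.get(name) in (None, "", [])
    ]
```

An empty return value means both required fields are present; anything else names the fields to fix before sending the message.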