Text Processor

Performs text processing operations on text data, including cleaning, normalization, tokenization, and transformation.

Quick Start

To get started:

  • Select an operation from the Choose Operation dropdown
  • Configure operation-specific parameters
  • Send text via msg.payload.text
  • Receive the processed text in msg.payload.output

Configuration

Configuration varies by operation type.

Figure: Text Processor configuration showing text normalization operations

Common Input Format (All Operations)

msg.payload.text (string)

Input text to process.

Example: "This is sample text to process."

Common Output Format (All Operations)

msg.payload (object)

msg.payload contains an output field with the processed text.

Example: {"output": "this is sample text to process"}

Available Operations

Supported operations include lowercase, uppercase, special-character removal, tokenization, and stemming, among others.

Example

Input (msg.payload)

{
  "text": "This is SAMPLE text to process!"
}

Output (msg.payload)

{
  "output": "this is sample text to process"
}
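
The transformation in this example can be reproduced locally for testing. The following is a minimal sketch, not the block's actual implementation: it assumes the example combines the lowercase and remove special characters operations, and that a "special character" is anything other than a letter, digit, underscore, or whitespace. The function name lowercase_and_strip_special is hypothetical.

```python
import re


def lowercase_and_strip_special(payload: dict) -> dict:
    """Hypothetical local stand-in for the block, not its real implementation."""
    text = payload["text"]
    lowered = text.lower()
    # Assumption: "special" means anything that is not a word character
    # (letter, digit, underscore) or whitespace.
    cleaned = re.sub(r"[^\w\s]", "", lowered)
    # Mirror the documented output shape: an object with an "output" field.
    return {"output": cleaned}


result = lowercase_and_strip_special({"text": "This is SAMPLE text to process!"})
print(result)  # {'output': 'this is sample text to process'}
```

A stand-in like this is mainly useful for checking expected values in tests before wiring the real block into a flow.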

Errors

When the block fails, it raises an error. Use a Catch block in your flow to handle failures and inspect the error payload.

Common Mistakes

  • Empty text: No text provided in msg.payload.text.
  • Invalid parameters: The parameters configured for the selected operation are not valid.
  • Service unavailable: The service is unavailable or unreachable.
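
The empty-text case can be caught before the message reaches the block. The sketch below assumes the message is available as a Python dictionary shaped like the documented input, and that whitespace-only text should be treated as empty (the block's own rule may differ). The function name validate_message is hypothetical.

```python
def validate_message(msg: dict) -> str:
    """Return msg.payload.text, or raise ValueError if it is missing or empty.

    Hypothetical pre-flight check; the block's own validation may differ.
    """
    payload = msg.get("payload")
    if not isinstance(payload, dict):
        raise ValueError("msg.payload must be an object with a 'text' field")
    text = payload.get("text")
    # Assumption: whitespace-only text counts as empty.
    if not isinstance(text, str) or not text.strip():
        raise ValueError("msg.payload.text is missing or empty")
    return text


msg = {"payload": {"text": "This is sample text to process."}}
print(validate_message(msg))  # This is sample text to process.
```

A check like this does not replace the Catch block, since parameter and service errors can still occur downstream.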

Best Practices

  • Chain multiple text processing operations as needed
  • Use appropriate operations for your text type
  • Test with sample data first
  • Clean text before further NLP processing
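
Chaining has one detail worth noting: each operation reads text and writes output, so the result of one step has to be passed to the next step as text. The sketch below illustrates that remapping with two local stand-in functions. The names remove_special, uppercase, and run_chain are hypothetical, and whether the platform performs this remapping automatically is not stated here, so treat it as an assumption to verify in your flow.

```python
import re
from typing import Callable, Dict, List

Payload = Dict[str, str]
Step = Callable[[Payload], Payload]


def remove_special(payload: Payload) -> Payload:
    # Stand-in step. Assumption: "special" means anything that is not
    # a word character or whitespace.
    return {"output": re.sub(r"[^\w\s]", "", payload["text"])}


def uppercase(payload: Payload) -> Payload:
    # Stand-in step for the uppercase operation.
    return {"output": payload["text"].upper()}


def run_chain(text: str, steps: List[Step]) -> str:
    """Apply each step in order, feeding every 'output' back in as 'text'."""
    for step in steps:
        text = step({"text": text})["output"]
    return text


print(run_chain("Hello, World!", [remove_special, uppercase]))  # HELLO WORLD
```

Keeping each step as a separate function makes it easy to reorder operations and test them one at a time with sample data.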