RAP Logo

Corpus (Knowledge Base)

Comprehensive management of document collections and knowledge bases for RAG capabilities with AI agents in workgroups

Pro Feature: Corpus (Knowledge Base) management and RAG capabilities are available only in Pro version.

The Corpus page provides comprehensive management of document collections and knowledge bases created through RAPFlow flows. These knowledge bases can be consumed by agents created in agent workgroups, enabling powerful RAG (Retrieval-Augmented Generation) capabilities.

Overview

The Corpus page displays all knowledge bases created through RAPFlow flows, which can be consumed by agents in agent workgroups. This interface provides comprehensive information about:

  • Collection names and their embedding models
  • Document counts and upload details
  • Organization-level access and management

Document Collections

This page displays all document collections created through RAPFlow flows, which can be consumed by agents in agent workgroups. This interface provides comprehensive information about:

  • Collection names and their embedding models
  • Document counts and upload details
  • Organization-level access and management

Document Details Page

Clicking on the document count in any collection record navigates to a detailed page showing all files uploaded to that specific knowledge base.

Document Information

The document details page provides comprehensive information about each file:

File Details

  • File Name: Original name of the uploaded document
  • File Type: Document format (PDF, DOC, TXT, etc.)

Processing Metadata

  • Chunk Size: Size of text chunks used for vectorization
  • Chunk Overlap: Overlap between consecutive chunks for better context
  • Embedding Model: Specific model used for document vectorization
  • Processing Date: Date and time when the document was processed

Use Cases

Knowledge Management

  • Document Repositories: Store and organize company documents
  • Research Libraries: Maintain research and reference materials
  • Policy Documents: Store and retrieve policy and procedure documents
  • Training Materials: Organize training and educational content

AI Agent Enhancement

  • Context Provision: Provide relevant context to AI agents
  • Knowledge Retrieval: Retrieve specific information for agent tasks
  • Response Enhancement: Improve agent response quality and accuracy
  • Domain Expertise: Provide domain-specific knowledge to agents

Best Practices

Collection Organization

  • Descriptive Names: Use clear, descriptive collection names
  • Logical Grouping: Group related documents together
  • Consistent Naming: Follow consistent naming conventions
  • Regular Updates: Keep collections current and relevant

Document Quality

  • Clean Content: Ensure documents are well-formatted and clean
  • Relevant Content: Include only relevant and useful documents
  • Regular Review: Regularly review and update document content

Performance Optimization

  • Appropriate Sizing: Keep collections at optimal sizes
  • Efficient Indexing: Use appropriate embedding models

Integration with Agentic Workgroups

RAPFlow to Agent Workgroup Flow

Knowledge bases created through RAPFlow flows are automatically available for consumption by agents in agent workgroups:

  1. Knowledge Base Creation: Documents are processed and indexed through RAPFlow
  2. Agent Configuration: Agents in workgroups can access and utilize these knowledge bases
  3. Context Enhancement: Agents use the knowledge bases to provide enhanced, contextual responses
  4. Real-time Access: Agents have real-time access to updated knowledge base content

Agent Configuration

  • Knowledge Base Selection: Select relevant collections for specific agents
  • Context Configuration: Configure how agents access and use knowledge base content
  • Search Parameters: Set retrieval parameters for optimal context matching