Custom Processors

Document AI’s Custom Processors let you generate predictions quickly with Generative AI or train your own tailored processors from scratch. Trained processor types need only a small labeled set, typically 10 or more examples, to extract custom entities, classify documents, or split files; the Summarizer generates summaries without any training.

Navigate to Processors → Custom Processors in the sidebar to create a new processor.

Custom Processors landing page showing five processor type cards (Extractor, Classifier, Splitter, Layout, Summarizer) with Generative AI badges

Processor Types

All processor types are powered by Generative AI.

Creating a Processor

  1. Navigate to Processors → Custom Processors
  2. Click Create processor on the card for the type you need
  3. Enter a processor name in the slide panel
  4. Click Create

M3 Forge automatically provisions the processor with an initial version and a default query plan. You are then redirected to My Processors, where you can begin configuring the processor, importing data, and training.

Quick Comparison

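| Feature           | Extractor            | Classifier           | Splitter              | Layout                   | Summarizer         |
|-------------------|----------------------|----------------------|-----------------------|--------------------------|--------------------|
| Purpose           | Extract field values | Categorize documents | Split multi-doc files | Detect layout variations | Generate summaries |
| Requires training | Yes                  | Yes                  | Yes                   | Yes                      | No                 |
| Min examples      | 10+                  | 10+ per class        | 10+                   | 5+ per variation         | 0                  |
| ML model          | LayoutLMv3 / LLM     | LayoutLMv3           | LayoutLMv3            | LayoutLMv3               | LLM only           |
| Output            | Structured fields    | Category label       | Split points          | Variation ID             | Free-form text     |
| AI schema gen     | Yes                  | No                   | No                    | No                       | N/A                |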
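
The minimum-example figures in the comparison can be checked before starting a training run. The following is a minimal sketch, not part of the product: the dictionary keys, the `PER_GROUP` set, and the `ready_to_train` function are hypothetical names chosen for illustration, and only the numeric thresholds come from the comparison itself.

```python
# Minimum labeled examples per processor type, copied from the comparison table.
# All names in this sketch are illustrative assumptions, not a product API.
MIN_EXAMPLES = {
    "extractor": 10,   # 10+ in total
    "classifier": 10,  # 10+ per class
    "splitter": 10,    # 10+ in total
    "layout": 5,       # 5+ per variation
    "summarizer": 0,   # no training required
}

# Types whose minimum applies to each group (class or variation) separately.
PER_GROUP = {"classifier", "layout"}


def ready_to_train(processor_type: str, counts: dict[str, int]) -> bool:
    """Return True if the labeled-example counts meet the table's minimums.

    `counts` maps a group name (a class, a variation, or any single key for
    types counted in total) to the number of labeled examples in that group.
    """
    minimum = MIN_EXAMPLES[processor_type]
    if processor_type in PER_GROUP:
        # Every class or variation must reach the minimum on its own.
        return bool(counts) and all(n >= minimum for n in counts.values())
    # Other types are counted across the whole dataset.
    return sum(counts.values()) >= minimum
```

For example, a classifier with 12 invoices and 7 receipts is not ready, because the receipt class is below the per-class minimum even though the total exceeds 10.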

All custom processor types except Summarizer follow the same workflow: Create → Define schema/labels → Import documents → Label examples → Train → Evaluate → Deploy.
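
The ordering of that workflow can be modeled as a simple sequence, which is handy when scripting checklists or progress tracking around it. This is only a sketch: the stage identifiers and the `next_stage` helper are hypothetical names, and the product may track progress differently.

```python
from typing import Optional

# Workflow stages for trained processor types, in the order listed above.
# The identifiers are illustrative; they are not product-defined values.
STAGES = (
    "create",
    "define_schema_or_labels",
    "import_documents",
    "label_examples",
    "train",
    "evaluate",
    "deploy",
)


def next_stage(current: str) -> Optional[str]:
    """Return the stage that follows `current`, or None after the last one."""
    index = STAGES.index(current)  # raises ValueError for an unknown stage
    return STAGES[index + 1] if index + 1 < len(STAGES) else None
```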

Managing Processors

Navigate to Processors → My Processors to view and manage all your created processors.

The page displays processor cards with:

  • Name, description, and type badges
  • Execution statistics (count, last executed, average time, success rate)
  • Version information with active version indicator
  • Quick actions: edit, delete, test

Search by name and filter by category (all, custom, general, specialized).

About Custom Processors

Custom processors use advanced machine learning to understand your specific document types. The underlying models combine:

  • Generative AI (LLM) — For zero-shot extraction, summarization, and schema generation
  • LayoutLMv3 — For trained classification, splitting, and layout detection using both text and visual features
  • Hybrid matching — FAISS semantic search + fuzzy string matching for robust field extraction
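
To illustrate the idea behind hybrid matching, the sketch below blends a semantic-style score with a fuzzy string score and picks the best candidate. It is a toy under stated assumptions, not the product's implementation: a bag-of-words cosine similarity stands in for embedding search with FAISS, Python's standard `difflib` stands in for the fuzzy matcher, and the function names and the 0.5 weighting are hypothetical.

```python
import math
from collections import Counter
from difflib import SequenceMatcher


def cosine(a: str, b: str) -> float:
    """Cosine similarity over word counts (toy stand-in for embedding search)."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[word] * vb[word] for word in va)
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def fuzzy(a: str, b: str) -> float:
    """Character-level similarity in [0, 1], tolerant of typos and abbreviations."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def best_match(field: str, candidates: list[str], weight: float = 0.5) -> tuple[str, float]:
    """Return the candidate with the highest blended score, plus that score.

    `weight` is the share given to the semantic-style score; the remainder
    goes to the fuzzy score. Raises ValueError if `candidates` is empty.
    """
    scored = [
        (text, weight * cosine(field, text) + (1 - weight) * fuzzy(field, text))
        for text in candidates
    ]
    return max(scored, key=lambda pair: pair[1])
```

Calling `best_match("invoice number", ["Invoice No.", "Total amount", "Due date"])` selects `"Invoice No."`: the word overlap contributes to the first score and the shared prefix to the second, so a label that is abbreviated or slightly misspelled can still be matched.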

Processing endpoints follow the pattern: extract_executor://document/{processor_type}
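
As a small sketch of how that pattern expands, the helper below fills in `{processor_type}` for each of the five processor types. The lowercase type names used as path segments are an assumption made for illustration; only the `extract_executor://document/{processor_type}` pattern itself comes from this page, and `endpoint_for` is a hypothetical helper.

```python
# Assumed path segments: the lowercase names of the five processor types.
# The page documents only the pattern, not the exact values.
PROCESSOR_TYPES = {"extractor", "classifier", "splitter", "layout", "summarizer"}


def endpoint_for(processor_type: str) -> str:
    """Build an endpoint string from the documented pattern."""
    slug = processor_type.strip().lower()
    if slug not in PROCESSOR_TYPES:
        raise ValueError(f"unknown processor type: {processor_type!r}")
    return f"extract_executor://document/{slug}"
```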
