Recipes are detailed, end-to-end guides with fully working code samples. Pick a recipe based on what you’re trying to accomplish.

By Use Case

Build Production Pipelines

Chain processors into versioned pipelines for production use

Process PDFs for RAG/LLM

Convert documents to markdown or chunks for retrieval-augmented generation

Extract Invoice/Receipt Data

Pull structured fields (amounts, dates, line items) from financial documents

Analyze Contracts

Extract parties, dates, and clauses from legal documents

Process Research Papers

Extract titles, authors, abstracts, and citations from academic papers

Fill Out Forms

Automatically fill PDF and image forms with structured data

Generate Documents

Create Word documents from markdown with track changes

Split Batch-Scanned PDFs

Separate multi-document PDFs into individual documents

Review Track Changes

Extract redlines, insertions, deletions, and comments from Word documents

By Feature

| Feature | Description | Guide |
| --- | --- | --- |
| Document Conversion | Convert PDFs, images, and office docs to markdown/HTML/JSON | Guide |
| Batch Processing | Process multiple documents concurrently | Guide |
| Structured Extraction | Extract fields using JSON schemas | Guide |
| Long Document Handling | Strategies for 100+ page documents | Guide |
| Document Segmentation | Split multi-document PDFs by section | Guide |
| Form Filling | Fill PDF and image forms programmatically | Guide |
| Create Document | Generate DOCX files from markdown | Guide |
| File Upload | Upload and manage files for reuse | Guide |
| Pipelines | Chain processors into versioned, reusable configurations | Guide |
| Pipeline Versioning | Manage drafts, publish versions, pin production deployments | Guide |
| Track Changes | Extract redlines and comments from Word docs | Guide |
| Forge Evals | Compare parsing configurations side-by-side | Guide |

Self-Hosted

For sensitive documents, all cloud API recipes also work with our on-premises containers. See the feature parity table for available features.