> ## Documentation Index
> Fetch the complete documentation index at: https://documentation.datalab.to/llms.txt
> Use this file to discover all available pages before exploring further.

# Recipes Overview

> End-to-end guides for common document processing workflows.

Recipes are detailed, end-to-end guides with fully working code samples. Pick a recipe based on what you're trying to accomplish.

## By Use Case

<CardGroup cols={2}>
  <Card title="Build Production Pipelines" icon="workflow" href="/docs/recipes/pipelines/pipeline-overview">
    Chain processors into versioned pipelines for production use
  </Card>

  <Card title="Process PDFs for RAG/LLM" icon="robot" href="/docs/recipes/conversion/conversion-api-overview">
    Convert documents to markdown or chunks for retrieval-augmented generation
  </Card>

  <Card title="Extract Invoice/Receipt Data" icon="receipt" href="/docs/recipes/structured-extraction/api-overview">
    Pull structured fields (amounts, dates, line items) from financial documents
  </Card>

  <Card title="Analyze Contracts" icon="file-contract" href="/docs/recipes/structured-extraction/api-overview">
    Extract parties, dates, and clauses from legal documents
  </Card>

  <Card title="Process Research Papers" icon="flask" href="/docs/recipes/structured-extraction/api-overview">
    Extract titles, authors, abstracts, and citations from academic papers
  </Card>

  <Card title="Fill Out Forms" icon="pen-line" href="/docs/recipes/form-filling/form-filling-api-overview">
    Automatically fill PDF and image forms with structured data
  </Card>

  <Card title="Generate Documents" icon="file-export" href="/docs/recipes/create-document/create-document-api-overview">
    Create Word documents from markdown with track changes
  </Card>

  <Card title="Split Batch-Scanned PDFs" icon="scissors" href="/docs/recipes/document-segmentation/auto-segmentation">
    Separate multi-document PDFs into individual documents
  </Card>

  <Card title="Review Track Changes" icon="file-diff" href="/docs/recipes/extract-redlines-and-comments/track-changes-from-word-documents">
    Extract redlines, insertions, deletions, and comments from Word documents
  </Card>
</CardGroup>

## By Feature

| Feature                | Description                                                 | Guide                                                                                  |
| ---------------------- | ----------------------------------------------------------- | -------------------------------------------------------------------------------------- |
| Document Conversion    | Convert PDFs, images, and office docs to markdown/HTML/JSON | [Guide](/docs/recipes/conversion/conversion-api-overview)                              |
| Batch Processing       | Process multiple documents concurrently                     | [Guide](/docs/recipes/conversion/batch-documents)                                      |
| Structured Extraction  | Extract fields using JSON schemas                           | [Guide](/docs/recipes/structured-extraction/api-overview)                              |
| Long Document Handling | Strategies for 100+ page documents                          | [Guide](/docs/recipes/structured-extraction/handling-long-documents)                   |
| Document Segmentation  | Split multi-document PDFs by section                        | [Guide](/docs/recipes/document-segmentation/auto-segmentation)                         |
| Form Filling           | Fill PDF and image forms programmatically                   | [Guide](/docs/recipes/form-filling/form-filling-api-overview)                          |
| Create Document        | Generate DOCX files from markdown                           | [Guide](/docs/recipes/create-document/create-document-api-overview)                    |
| File Upload            | Upload and manage files for reuse                           | [Guide](/docs/recipes/file-management/file-upload-api)                                 |
| Pipelines              | Chain processors into versioned, reusable configurations    | [Guide](/docs/recipes/pipelines/pipeline-overview)                                     |
| Pipeline Versioning    | Manage drafts, publish versions, pin production deployments | [Guide](/docs/recipes/pipelines/pipeline-versioning)                                   |
| Track Changes          | Extract redlines and comments from Word docs                | [Guide](/docs/recipes/extract-redlines-and-comments/track-changes-from-word-documents) |
| Forge Evals            | Compare parsing configurations side-by-side                 | [Guide](/docs/recipes/forge-evals/overview)                                            |

## Self-Hosted

All cloud API recipes work with our [on-premises containers](/docs/on-prem/overview) for sensitive documents. See the [feature parity table](/docs/on-prem/api#feature-parity) for available features.
