Installation
The CLI is included with the SDK:Authentication
Set your API key as an environment variable:Convert Documents
Convert documents to markdown, HTML, JSON, or chunks.Basic Usage
Output Options
Processing Options
Advanced Options
Directory Processing
Convert all documents in a directory:Convert Command Reference
| Option | Description |
|---|---|
--format | Output format: markdown, html, json, chunks |
--mode | Processing mode: fast, balanced, accurate |
--output_dir, -o | Output directory |
--max_pages | Maximum pages to process |
--page_range | Specific pages (e.g., "0-5,10") |
--paginate | Add page delimiters |
--add_block_ids | Add block IDs to HTML output |
--disable_image_extraction | Don’t extract images |
--disable_image_captions | Don’t generate image captions |
--page_schema | JSON schema for structured extraction |
--skip_cache | Force reprocessing |
--extensions | File extensions to process (for directories) |
--max_concurrent | Maximum concurrent requests |
--max_polls | Maximum polling attempts |
--poll_interval | Seconds between polls |
--api_key | Datalab API key |
--base_url | API base URL |
Workflow Commands
List Workflows
Get Workflow Details
Get Step Types
List available workflow step types:Create Workflow
Create a workflow from a JSON definition file:workflow.json:
Execute Workflow
Check Execution Status
Visualize Workflow
Generate a visual representation of a workflow:Examples
Batch Convert PDFs
Extract Data from Documents
High-Throughput Processing
Getting Help
Try Datalab
Get started with our API in less than a minute. We include free credits.