- A Datalab account with an API key (new accounts include $5 in free credits)
- Python 3.10+ installed
- The Datalab SDK:
pip install datalab-python-sdk - Your
DATALAB_API_KEYenvironment variable set
Version Lifecycle
Every pipeline goes through a predictable lifecycle:| State | active_version | Description |
|---|---|---|
| Draft | 0 | Edits auto-save. No published version yet. |
| Saved | 0 | Named pipeline, still no published version. |
| Published | 1, 2, … | Immutable version snapshots exist. |
Publish a Version
Create an immutable snapshot of the current pipeline steps:Edit and Iterate
After publishing, any edits create a draft that is separate from the published version:Discard a Draft
Revert unsaved changes and restore the published version’s steps:Browse Version History
List all published versions for a pipeline:Best Practices
Pin production integrations to a specific version. When callingrun_pipeline() from production code, pass an explicit version number. This protects you from accidental changes:
version=0 to run the draft version against test documents:
Next Steps
Run a Pipeline
Execute pipelines with version selection, overrides, and polling.
Create a Pipeline
Build pipelines with Forge or the SDK.
Pipeline Overview
Processor types, composition rules, and when to use pipelines.
SDK Reference
Full SDK reference for all pipeline methods.