Configuration¶

DocFlow is configured entirely through environment variables prefixed with DOCFLOW_. Copy .env.example to .env and customise as needed.

Application Settings¶

Variable	Default	Description
`DOCFLOW_ENV`	`development`	Environment: `development`, `staging`, `production`
`DOCFLOW_LOG_LEVEL`	`INFO`	Log level: `DEBUG`, `INFO`, `WARNING`, `ERROR`
`DOCFLOW_LOG_FORMAT`	`text`	Log format: `text` (dev), `json` (prod)
`DOCFLOW_PORT`	`8000`	API server port
`DOCFLOW_WORKERS`	`4`	Uvicorn worker count
`DOCFLOW_MAX_FILE_SIZE_MB`	`100`	Maximum upload file size

Variable	Default	Description
`DOCFLOW_DEFAULT_EXTRACTOR`	`auto`	Default extractor: `auto`, `tika`, `pymupdf`
`DOCFLOW_TIKA_URL`	`http://localhost:9998`	Tika server URL
`DOCFLOW_TIKA_TIMEOUT`	`120`	Tika request timeout (seconds)

Variable	Default	Description
`DOCFLOW_OCR_ENABLED`	`true`	Enable/disable OCR
`DOCFLOW_OCR_ENGINE`	`tesseract`	OCR engine: `tesseract`, `google_vision`, `none`
`DOCFLOW_OCR_LANGUAGE`	`deu`	OCR language code
`DOCFLOW_OCR_DPI`	`300`	OCR resolution

Variable	Default	Description
`DOCFLOW_DEFAULT_OUTPUT_FORMAT`	`markdown`	Output format: `markdown`, `json`, `plaintext`
`DOCFLOW_DEFAULT_LANGUAGE`	`deu`	Default document language
`DOCFLOW_PROCESSORS`	`cleanup`	Comma-separated processor list