Monitoring
FileFlux provides built-in monitoring and observability through health checks and logs. Prometheus metrics are planned for a future release.
Health Endpoints
| Endpoint | Description |
|---|---|
| `GET /health` | Backend readiness check (DB connectivity) |
Kubernetes Probes
Dedicated liveness (`/health/live`) and readiness (`/health/ready`) probes are planned for a future release.
Prometheus Metrics
Planned
Prometheus metrics at `/metrics` are planned for a future release. The following metrics are intended:
| Metric | Type | Description |
|---|---|---|
| `fileflux_transfers_total` | Counter | Total transfers by status |
| `fileflux_transfer_bytes_total` | Counter | Total bytes transferred |
| `fileflux_transfer_duration_seconds` | Histogram | Transfer duration |
| `fileflux_agents_connected` | Gauge | Currently connected agents |
| `fileflux_jobs_active` | Gauge | Active job count |
| `fileflux_chunks_transferred` | Counter | Total chunks transferred |
| `fileflux_websocket_messages_total` | Counter | WebSocket messages by type |
Grafana Dashboard
A pre-built Grafana dashboard is available in `monitoring/grafana/dashboards/`. Import it to visualize:
- Transfer throughput over time
- Agent connection status
- Job success/failure rates
- System resource usage
Structured Logging
FileFlux currently logs through the Go standard library `log` package. A migration to `log/slog` for structured JSON logging is planned (see ADR-009).
Set the log level with the `LOG_LEVEL` environment variable (`debug`, `info`, `warn`, or `error`).
Alerting
Integrate with your existing alerting stack via Prometheus Alertmanager. Example alerts:
- Agent offline for > 5 minutes
- Transfer failure rate > 5% in 15 minutes
- Disk usage > 80% on transfer temp directory