Declares how and where the output should be accessible. The developer states intent; the platform routes to the correct store via Dagster IO Managers.
S3 object storage (Ceph RGW, via Iceberg/Lakekeeper) is always written to regardless of endpoint configuration. It is the source of truth with full time-travel capability. The endpoints below are additional projections optimized for specific access patterns.
No-op — StarRocks federates to Iceberg on S3 directly
OLAP, GROUP BY, bulk scans, dashboards
type: realtime
Redis
IO Manager SET with TTL, tenant-prefixed keys
Sub-10ms reads, streaming hot path
(no endpoints)
S3 object storage only
Iceberg via Lakekeeper
Cold storage, time travel, source of truth
type: analytics is a no-op write. StarRocks queries Iceberg tables on S3 object storage directly via its External Catalog feature. The platform simply registers the table in StarRocks’ catalog. No data movement occurs.
Optional Superset integration. When enabled, the platform creates or updates a Superset dataset, applies RLS filters, and optionally provisions a dashboard.