Data Mesh Deployment Status

Data Mesh is part of Floe’s architecture direction. The current alpha includes schema, contract, inheritance, and lineage namespace primitives, but it does not yet provide a validated multi-cluster Data Mesh deployment path.

| Capability | Status |
| --- | --- |
| Enterprise and domain manifest fields | Implemented primitive |
| Manifest inheritance validation | Implemented primitive |
| Data contract schemas | Implemented primitive |
| Data Mesh lineage namespace strategy | Implemented primitive |
| Dedicated domain Helm chart | Planned |
| Multi-cluster deployment guide | Planned |
| Product registration workflow | Planned |

Use the single-platform Kubernetes deployment path for alpha. Treat the diagrams below as architecture context, not a supported deployment recipe.


+-------------------------------------------------------------------------------+
|                              KUBERNETES CLUSTER                               |
|                                                                               |
| +---------------------------------------------------------------------------+ |
| | Namespace: floe-platform (Shared Services)                                | |
| | Owner: Central Platform Team                                              | |
| |                                                                           | |
| | * Polaris Catalog (shared across all domains)                             | |
| | * OTLP Collector (federated traces/metrics/lineage)                       | |
| | * Marquez (cross-domain lineage)                                          | |
| | * MinIO / Object Storage (shared Iceberg warehouse)                       | |
| +-------------------------------------+-------------------------------------+ |
|                                       |                                       |
|             +-------------------------+-------------------------+             |
|             v                         v                         v             |
| +-----------------------+ +-----------------------+ +-----------------------+ |
| | Namespace:            | | Namespace:            | | Namespace:            | |
| | floe-sales-domain     | | floe-marketing-domain | | floe-finance-domain   | |
| |                       | |                       | |                       | |
| | Orchestrator:         | | Orchestrator:         | | Orchestrator:         | |
| | * Dagster webserver   | | * Airflow (domain     | | * Dagster webserver   | |
| | * Dagster daemon      | |   choice)             | | * Dagster daemon      | |
| | * PostgreSQL          | | * PostgreSQL          | | * PostgreSQL          | |
| |                       | |                       | |                       | |
| | Data Products:        | | Data Products:        | | Data Products:        | |
| | * customer-360        | | * campaign-perf       | | * revenue-metrics     | |
| | * order-analytics     | | * attribution         | | * cost-center         | |
| +-----------------------+ +-----------------------+ +-----------------------+ |
|                                                                               |
+-------------------------------------------------------------------------------+

| Service | Deployment | Rationale |
| --- | --- | --- |
| Catalog (Polaris) | Shared | Single source of truth for all domains |
| Object Storage | Shared | Unified Iceberg warehouse |
| Lineage (Marquez) | Shared | Cross-domain lineage visibility |
| OTLP Collector | Shared | Federated observability |
| Orchestrator | Per-domain | Domain autonomy in tooling |
| Semantic Layer | Per-domain | Domain-specific business logic |
| PostgreSQL (orchestrator) | Per-domain | Isolation of execution state |

Domains connect to shared services through Kubernetes in-cluster DNS names:

# Domain deployment references shared catalog
env:
  - name: POLARIS_HOST
    value: "<polaris-service>.<platform-namespace>.svc.cluster.local"
  - name: OTLP_ENDPOINT
    value: "http://<platform-otlp-endpoint>:4317"
  - name: OPENLINEAGE_URL
    value: "http://<marquez-service>.<platform-namespace>.svc.cluster.local:5000"

With the default chart service prefix (fullnameOverride) of floe-platform, deployed in the floe-platform namespace, the shared services are named floe-platform-polaris and floe-platform-marquez.
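
As an illustration, substituting those default names into the placeholder template gives values like the following. This is a sketch that assumes the default floe-platform prefix and namespace and a domain workload running in the same cluster. The OTLP Collector service name is not stated on this page, so its placeholder is left unfilled.

```yaml
# Illustrative only: assumes the default floe-platform prefix and namespace.
env:
  - name: POLARIS_HOST
    value: "floe-platform-polaris.floe-platform.svc.cluster.local"
  - name: OTLP_ENDPOINT
    # The OTLP Collector service name is not given here, so the placeholder stays.
    value: "http://<platform-otlp-endpoint>:4317"
  - name: OPENLINEAGE_URL
    value: "http://floe-platform-marquez.floe-platform.svc.cluster.local:5000"
```

If the chart is installed with a different prefix or namespace, the host names change accordingly.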


For large enterprises, domains may run in separate clusters:

+---------------------+      +---------------------+      +---------------------+
| SHARED SERVICES     |      | SALES CLUSTER       |      | MARKETING CLUSTER   |
| CLUSTER             |      | (Region: us-east)   |      | (Region: eu-west)   |
|                     |      |                     |      |                     |
| * Polaris Catalog   |<---->| * Dagster           |<---->| * Airflow           |
| * Marquez Lineage   |      | * Sales Products    |      | * Marketing Prods   |
| * Central OTLP      |      | * Local OTLP        |      | * Local OTLP        |
+---------------------+      +---------------------+      +---------------------+
           ^                            |                            |
           |                            |                            |
           +----------------------------+----------------------------+
                            Cross-cluster networking
                            (Service Mesh / Ingress)