Documentation Index
Fetch the complete documentation index at: https://cubed3-docs-cub-2416-update-semantic-snowflake-semantic-vie.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
- Shared — designed for development use cases. Runs on compute shared with other deployments within the selected region.
- Dedicated — designed for production workloads and high-availability. Runs on compute dedicated to your deployment.
- Multi-cluster — designed for demanding production workloads, high-scalability, high-availability, and advanced multi-tenancy configurations. Runs on multiple Dedicated deployments.
Shared
Available for free, no credit card required. Your free trial is limited to 2
Shared deployments and only 1,000 queries per day. Upgrade to
any paid plan to unlock all features.
If your account uses single-tenant infrastructure,
Shared deployments are only shared with your other deployments on that
infrastructure — never with other customers. Your environment remains fully
isolated at the infrastructure level.
Dedicated
Available on all paid plans.
Multi-cluster
Multi-cluster deployments are designed for demanding production workloads, high-scalability, high-availability, and large multi-tenancy configurations, e.g., with more than 100 tenants.Available on Premium and above plans.
- Scale the number of Dedicated deployments serving your workload, allowing to route requests over up to 10 Dedicated deployments and up to 100 API instances.
- Optionally, scale the number of Cube Store routers, allowing for increased Cube Store querying performance.
Configuring Multi-cluster
To switch your deployment to Multi-cluster, navigate to Settings → General, select it under Type, and confirm with ✓. To set the number of Dedicated deployments within your Multi-cluster deployment, navigate to Settings → Configuration and edit Number of clusters.Routing traffic between Dedicated deployments
Cube routes requests between multiple Dedicated deployments within a Multi-cluster deployment based oncontext_to_app_id.
In most cases, it should return an identifier that does not change over time
for each tenant.
The following implementation will make sure that all requests from a
particular tenant are always routed to the same Dedicated deployment. This
approach ensures that only one Dedicated deployment keeps compiled data model
cache for each tenant and serves its requests. It allows to reduce the
footprint of the compiled data model cache on individual Dedicated deployments.