The SQL API is able to execute regular queries, queries with post-processing and queries with pushdown. This page explains their format and details if they are handled differently by the SQL API.Documentation Index
Fetch the complete documentation index at: https://cubed3-docs-cub-2416-update-semantic-snowflake-semantic-vie.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Data model mapping
In the SQL API, each cube or view from the data model is represented as a table. Measures, dimensions, and segments are represented as columns in these tables.Cubes and views
Given that you have a cube or a view calledorders, you can query it as if it’s
a table:
Dimensions
Given that your cube or view has a dimension calledstatus, you can reference it
as a column in the SELECT clause. Note that you’ll also have to add it to the
GROUP BY clause:
Measures
Given that your cube or view has a measure calledcount, you can reference it
by wrapping with the MEASURE aggregate function:
Aggregate functions
The specialMEASURE function works with measures of any type.
Measure columns can also be aggregated with the following aggregate functions that
correspond to measure types:
| Measure type in Cube | Aggregate function in an aggregated query |
|---|---|
avg | MEASURE or AVG |
boolean | MEASURE |
count | MEASURE or COUNT |
count_distinct | MEASURE or COUNT(DISTINCT …) |
count_distinct_approx | MEASURE or COUNT(DISTINCT …) |
max | MEASURE or MAX |
min | MEASURE or MIN |
number | MEASURE or any other function from this table |
string | MEASURE or STRING_AGG |
sum | MEASURE or SUM |
time | MEASURE or MAX or MIN |
Segments
Segments are exposed as columns of theboolean type.
Given that your cube or view has a segment called is_completed, you can reference it
as a column in the WHERE clause:
Joins
Please refer to this page for details on joins.Post-processing and pushdown
Since 1.0 by default, the SQL API executes regular queries, queries with post-processing, and queries with pushdown.Query post-processing
The following query is performing aSELECT from the orders cube:
SELECT query fragments into a regular
query. It can be represented as follows in the REST (JSON) API query
format:
SELECT from cube tables. Please refer to the
SQL API reference to see whether a specific expression or function
is supported and whether it can be used in selection
(e.g., WHERE) or projection (e.g., SELECT) parts
of SQL queries.
For example, the following query won’t work because the SQL API can’t push down the
CASE expression to Cube for processing. It is not possible to translate CASE
expressions in measures.
SELECT
statement from a cube table (inner query) into another SELECT statement
(outer query) to perform calculations with expressions like CASE.
This outer SELECT is not part of the SQL query that being rewritten and
thus allows you to use more SQL functions, operators and expressions.
You can rewrite the above query as follows, making sure to wrap the original
SELECT statement:
CASE expression is supported in SELECT
queries not querying cube tables.
Query pushdown
Query pushdown provides a safe net for queries that can’t be rewritten into combination of a regular query and post-processing. Such queries’ SQL would be transpiled to target database query leveraging all target database capabilities for data processing. During the rewrite process, Cube validates that the target database would support transpired SQL queries. If direct conversion is not possible, different SQL transformation rewrite rules can be applied to achieve successful translation. Please refer to the SQL API reference for the list of supported SQL functions and clauses. Support varies based on the target database.Top-down and bottom-up evaluation
Fundamentally, every SQL operation results in a tabular data set. This is usually referred to as SQL operational closure or bottom-up SQL evaluation. However, for OLAP queries, most of the time, top-down evaluation is required. Top-down evaluation is whenever the outermost sub-query operation decides on how measures would be actually evaluated as opposed to innermost sub-query in case of standard SQL behavior. To balance between SQL guarantees and OLAP requirements, Cube- uses top-down evaluation from the innermost aggregation operation down to all ungrouped sub-queries,
- uses bottom-up evaluation from the innermost aggregation tabular result set up to the outermost sub-query.
| id | status | created_at | completed_percentage |
|---|---|---|---|
| 1 | shipped | 2024-01-01 | 0.0 |
| 2 | completed | 2024-01-01 | 100.0 |
| 3 | completed | 2024-01-02 | 100.0 |
| date_trunc(‘day’, created_at) | completed_percentage |
|---|---|
| 2024-01-01 | 50.0 |
| 2024-01-02 | 100.0 |
inner_query won’t be evaluated as a table.
Instead, Cube would postpone its execution until wrapping GROUP BY and would use only date_trunc('day', created_at) as a dimension to evaluate completed_percentage measure instead of full set of inner_query columns id,status and created_at.
To make it possible, Cube keeps track of ungrouped queries and evaluates them only on the first occurrence of a GROUP BY query in case there’s one.
Aggregated and non-aggregated queries
SQL API supports two types of queries against cube tables: aggregated (those withGROUP BY statement) and non-aggregated (those without).
Without query pushdown, queries that Cube runs against your database will always be aggregated,
regardless of whether you use aggregated (with
GROUP BY) or non-aggregated
queries with the SQL API.
Whenever you enable query pushdown, queries which do not contain GROUP BY clause will be executed as ungrouped queries.GROUP BY to execute such a query.
Whenever query pushdown is enabled, such query would run as ungrouped query.
As with REST (JSON) API such queries do not use GROUP BY and render measures as if those would be grouped by primary key of a cube.
Aggregated query must aggregate all measure columns and group by
all dimension columns. You can use the special MEASURE aggregate function
for measures of any type. This is quite convenient, especially
in case you’re manually writing ad-hoc queries:
Filtering
Without query pushdown, Cube supports most simple equality operators like=, <>, <, <=, >, >= as well as IN and LIKE operators.
Cube tries to push down all filters into a regular query.
In some cases, filtering can only be done during post-processing.
Time dimension filters will be converted to time dimension date ranges whenever
it’s possible.
Ordering
Without query pushdown, Cube tries to push down allORDER BY statements into
a regular query.
Row limit edge case
If theORDER BY statement can’t be pushed down, ordering would be performed during
post-processing. If there are more than 50,000 rows
in the result set, this can yield incorrect results.
However, given that queries to Cube are usually aggregated, this is a very rare case;
anyway, please keep this limitation in mind when designing your queries.
Consider the following query. Because of the SUM(total_value) + 2 expression
in the projection of the outer query, thr SQL API can’t push down ORDER BY:
EXPLAIN against the above query to look at the query plan.
As you can see, the sorting operation is done after the regular query and the projection: