docs-mintlify/docs/data-modeling/multi-fact-views.mdx
In many data models, you have multiple fact tables that share common dimensions but have no direct relationship to each other. For example, an e-commerce company tracks both orders and returns:
orders — one row per order, with customer_id and created_atreturns — one row per return, with customer_id and created_atcustomers — one row per customerdates — a date spineBoth orders and returns join to customers and dates, but they don't
join to each other:
customers
/ \
orders returns
\ /
dates
You need a report showing orders_count, total_revenue, returns_count,
and total_refunds grouped by customer and month. But joining orders and
returns directly would produce a cross product — every order matched with
every return for that customer and date — inflating all counts and sums.
In a regular view, there is a single root cube — the first
cube listed in the view's cubes array. All joins flow from this root, and
Cube uses it as the base table in the generated SQL.
Multi-fact views work differently. When a view includes measures from multiple fact tables, Cube selects the root dynamically at query time based on which measures are requested. Each fact table gets its own aggregating subquery, and the results are joined on the shared dimensions. No fanout, no manual workarounds.
<Warning>Multi-fact views are powered by Tesseract, the next-generation data modeling
engine. Tesseract is currently in preview. Use the
CUBEJS_TESSERACT_SQL_PLANNER environment variable to
enable it.
Each fact table becomes a cube with explicit joins to the shared dimension tables:
<CodeGroup>cubes:
- name: customers
sql_table: customers
dimensions:
- name: id
type: number
sql: id
primary_key: true
- name: name
type: string
sql: name
- name: city
type: string
sql: city
- name: dates
sql_table: dates
dimensions:
- name: date
type: time
sql: date
primary_key: true
- name: orders
sql_table: orders
joins:
- name: customers
relationship: many_to_one
sql: "{orders}.customer_id = {customers.id}"
- name: dates
relationship: many_to_one
sql: "DATE_TRUNC('day', {orders}.created_at) = {dates.date}"
dimensions:
- name: id
type: number
sql: id
primary_key: true
- name: status
type: string
sql: status
measures:
- name: count
type: count
- name: total_amount
type: sum
sql: amount
- name: returns
sql_table: returns
joins:
- name: customers
relationship: many_to_one
sql: "{returns}.customer_id = {customers.id}"
- name: dates
relationship: many_to_one
sql: "DATE_TRUNC('day', {returns}.created_at) = {dates.date}"
dimensions:
- name: id
type: number
sql: id
primary_key: true
measures:
- name: count
type: count
- name: total_refund
type: sum
sql: refund_amount
cube(`customers`, {
sql_table: `customers`,
dimensions: {
id: { sql: `id`, type: `number`, primary_key: true },
name: { sql: `name`, type: `string` },
city: { sql: `city`, type: `string` }
}
})
cube(`dates`, {
sql_table: `dates`,
dimensions: {
date: { sql: `date`, type: `time`, primary_key: true }
}
})
cube(`orders`, {
sql_table: `orders`,
joins: {
customers: {
relationship: `many_to_one`,
sql: `${orders}.customer_id = ${customers.id}`
},
dates: {
relationship: `many_to_one`,
sql: `DATE_TRUNC('day', ${orders}.created_at) = ${dates.date}`
}
},
dimensions: {
id: { sql: `id`, type: `number`, primary_key: true },
status: { sql: `status`, type: `string` }
},
measures: {
count: { type: `count` },
total_amount: { sql: `amount`, type: `sum` }
}
})
cube(`returns`, {
sql_table: `returns`,
joins: {
customers: {
relationship: `many_to_one`,
sql: `${returns}.customer_id = ${customers.id}`
},
dates: {
relationship: `many_to_one`,
sql: `DATE_TRUNC('day', ${returns}.created_at) = ${dates.date}`
}
},
dimensions: {
id: { sql: `id`, type: `number`, primary_key: true }
},
measures: {
count: { type: `count` },
total_refund: { sql: `refund_amount`, type: `sum` }
}
})
The critical detail: both orders and returns declare direct joins to
customers and dates. This tells Cube that these dimension tables are shared
between the two facts.
The view brings both fact tables and the shared dimension tables together.
Dimension tables are included at root-level join paths (not nested under a
specific fact), which makes their dimensions common to both facts. Use
prefix to disambiguate identically named members across fact cubes:
views:
- name: customer_overview
cubes:
- join_path: orders
prefix: true
includes:
- count
- total_amount
- join_path: returns
prefix: true
includes:
- count
- total_refund
- join_path: customers
includes:
- name
- city
- join_path: dates
includes:
- date
view(`customer_overview`, {
cubes: [
{
join_path: orders,
prefix: true,
includes: [`count`, `total_amount`]
},
{
join_path: returns,
prefix: true,
includes: [`count`, `total_refund`]
},
{
join_path: customers,
includes: [`name`, `city`]
},
{
join_path: dates,
includes: [`date`]
}
]
})
When you query orders_count, orders_total_amount, returns_count, and
returns_total_refund grouped by name, city, and date, Cube detects
the two separate fact roots and automatically executes a multi-fact query.
Cube executes the query in three stages:
Each fact table gets its own independent subquery that joins only the tables it needs, applies relevant filters, and aggregates by the common dimensions:
orders → customers and orders → dates,
computes COUNT(*) and SUM(amount), grouped by name, city, datereturns → customers and returns → dates,
computes COUNT(*) and SUM(refund_amount), grouped by name, city, dateThe subquery results are joined with FULL JOIN on all common dimension
columns (name, city, date). This preserves rows that exist in only one
fact table — a customer who placed orders but never returned anything still
appears in the results.
The combined result shows measures from each fact table side by side:
| name | city | date | orders_count | orders_total_amount | returns_count | returns_total_refund |
|---|---|---|---|---|---|---|
| Alice | New York | 2025-01-15 | 2 | 200.00 | 0 | NULL |
| Alice | New York | 2025-02-10 | 2 | 225.00 | 1 | 100.00 |
| Bob | Seattle | 2025-01-20 | 3 | 550.00 | 2 | 130.00 |
| Charlie | New York | 2025-02-05 | 0 | NULL | 2 | 100.00 |
| Diana | Boston | 2025-03-01 | 1 | 400.00 | 0 | NULL |
Charlie has no orders and Diana has no returns — both are still included
with NULL values for the missing fact table.
You don't have to define a dedicated multi-fact view to get multi-fact behavior. The SQL API produces the same query when you join two or more views on a dimension they share and group by that dimension.
Suppose orders_view and returns_view are two separate views that each
expose the customer's name (both backed by the same underlying
customers.name member). Joining them on name and grouping by it triggers a
multi-fact query:
SELECT
o.name,
MEASURE(o.total_amount),
MEASURE(r.total_refund)
FROM orders_view o
LEFT JOIN returns_view r ON r.name = o.name
GROUP BY 1
Cube recognizes that both name columns resolve to the same cube member,
merges the two view scans into a single multi-fact query, and runs it with the
separate-subquery-then-join strategy described
above.
This rewrite applies only when:
CUBEJS_TESSERACT_SQL_PLANNER.SELECT *) and queries that group
by a different dimension are not merged and fall back to standard join
handling.The rewrite is not limited to two views. Chained joins on the same shared key are merged into a single multi-fact query, with each view contributing its own aggregating subquery:
SELECT
o.name,
MEASURE(o.total_amount),
MEASURE(r.total_refund),
MEASURE(p.total_paid)
FROM orders_view o
FULL JOIN returns_view r ON r.name = o.name
FULL JOIN payments_view p ON p.name = o.name
GROUP BY 1
A common multi-fact pattern joins facts on a shared time dimension and groups by
a truncated grain. Join on DATE_TRUNC at the same granularity you group by:
SELECT DATE_TRUNC('day', o.created_at), MEASURE(o.total_amount), MEASURE(r.total_refund)
FROM orders_view o
JOIN returns_view r ON DATE_TRUNC('day', r.created_at) = DATE_TRUNC('day', o.created_at)
GROUP BY 1
The grouped column is emitted as a time dimension with its granularity. A join
written on DATE_TRUNC is an INNER join (the SQL planner expresses it as a
filtered cross join), so both sides must share a key; both truncated columns
must resolve to the same underlying time member at the same granularity.
The join-key granularity must match the GROUP BY granularity, because the
facts are stitched together at the grain you group by. This has two
consequences:
DATE_TRUNC('month', …) while grouping by DATE_TRUNC('day', …)
is not merged (it would silently stitch at day grain, diverging from the
month-grain join).ON r.created_at = o.created_at, an
exact-timestamp join) while grouping by DATE_TRUNC('day', …) is likewise not
merged — the row-grain join doesn't match the day-grain group-by. Truncate the
join key to the grain you group by instead.In both cases the query falls back to standard join handling.
You can also combine a DATE_TRUNC equality with a plain dimension equality in
the same join (a composite key), and group by both:
SELECT DATE_TRUNC('day', o.created_at), o.name, MEASURE(o.total_amount), MEASURE(r.total_refund)
FROM orders_view o
JOIN returns_view r
ON DATE_TRUNC('day', r.created_at) = DATE_TRUNC('day', o.created_at)
AND r.name = o.name
GROUP BY 1, 2
Filters on top of the join are supported and are applied to the merged query:
WHERE clause is pushed into the merged scan. A predicate on a dimension
shared by all facts filters the whole result; a predicate on a fact-specific
dimension filters only that fact's subquery.ON clause that the planner can attach to a single side
(for example, a condition on the optional side of a LEFT JOIN) becomes a
filter on that fact. Predicates that the SQL planner can't push to one side
of an outer join (such as a left-table condition in a LEFT JOIN ON) aren't
supported by the planner and will raise an error.The facts are stitched together with a FULL JOIN on the shared key, and the
JOIN type in your SQL controls which rows are kept:
| SQL join | Result |
|---|---|
FULL [OUTER] JOIN | every key from either view (default multi-fact behavior) |
INNER JOIN | only keys present in both views |
LEFT JOIN | every key from the left view; right-side measures are NULL when missing |
RIGHT JOIN | every key from the right view; left-side measures are NULL when missing |
The most common multi-fact pattern uses time as the shared dimension.
For example, you might have page_views, signups, and purchases that all
have timestamps but no direct relationship. By joining each to a shared
dates cube, you can analyze conversion funnels — page views vs. signups
vs. purchases by day — without any row multiplication.
Multi-fact queries are not limited to two fact tables. If a view includes three or more facts, each gets its own aggregating subquery, and all results are joined on the common dimensions.
Every root fact table must be joinable to the same set of common dimension tables. If a fact table doesn't naturally have a foreign key for one of the common dimensions, you can create a synthetic join:
<CodeGroup>cubes:
- name: refunds
sql: >
SELECT *, NULL AS customer_id FROM refunds
joins:
- name: customers
relationship: many_to_one
sql: "{refunds}.customer_id = {customers.id}"
- name: dates
relationship: many_to_one
sql: "DATE_TRUNC('day', {refunds}.created_at) = {dates.date}"
dimensions:
- name: id
type: number
sql: id
primary_key: true
measures:
- name: count
type: count
- name: total_amount
type: sum
sql: amount
cube(`refunds`, {
sql: `SELECT *, NULL AS customer_id FROM refunds`,
joins: {
customers: {
relationship: `many_to_one`,
sql: `${refunds}.customer_id = ${customers.id}`
},
dates: {
relationship: `many_to_one`,
sql: `DATE_TRUNC('day', ${refunds}.created_at) = ${dates.date}`
}
},
dimensions: {
id: { sql: `id`, type: `number`, primary_key: true }
},
measures: {
count: { type: `count` },
total_amount: { sql: `amount`, type: `sum` }
}
})
The NULL AS customer_id makes the join syntactically valid. Refund rows
won't match a specific customer, but the subquery can still participate in
the multi-fact join on the full set of common dimensions.
Common dimension filters (like city = 'New York' or date > '2025-01-01')
are applied to every subquery, ensuring consistent filtering across all facts.
Fact-specific filters (like orders.status = 'completed') are applied only
to that fact's subquery. Other fact subqueries remain unaffected.
Measure filters (like orders_count > 1) are applied as HAVING
conditions after the subqueries are joined.
Segments that belong to a specific fact table are applied only to that fact's subquery.
customers, not orders.customers)prefix on fact cubes to disambiguate identically named members