[CALCITE-7254] Add rule for sharing trivially equivalent RelNodes within Combine #4658

tjbanghart · 2025-12-02T03:36:26Z

This is a first pass, but I’m not yet convinced it’s the right approach. We may need to gather potential shared sub-expressions in a dedicated context and defer the rewrite until later. At the moment, the suggester identifies candidate spools and performs the replacement directly within the rule, which may unintentionally prevent the optimizer from exploring cheaper plans.

Likewise, certain rules may prevent shareable components from being recognized. For example, filter pushdown can eliminate a join tree that would otherwise be reusable.

core/src/main/java/org/apache/calcite/rel/rules/CombineSimpleEquivalenceRule.java

core/src/main/java/org/apache/calcite/adapter/enumerable/EnumerableCombine.java

core/src/test/java/org/apache/calcite/test/enumerable/EnumerableCombineTest.java

xiedeyantu · 2025-12-02T23:00:58Z

core/src/test/java/org/apache/calcite/test/enumerable/EnumerableCombineTest.java

+              // Combine both queries
+              return builder.combine(2).build();
+            })
+        .returnsUnordered(


I see that AssertQuery.returnsUnordered(String... lines) can return multiple lines. In JdbcTest, some examples use the format {column_name}={value}, such as returnsUnordered("DID2=1", "DID2=2"). I'm not sure if the current QUERY_0=xxx format is well-suited for future Quidem tests. Assuming that in the future I could use a setting like set enable_combine = true to allow multiple SQL statements to execute simultaneously, would we still want to obtain two separate result sets—just like when the two SQL statements are executed separately?

Good questions, I've updated the PR so Combine now returns one row per input query. Each row holds an array of maps, where each map represents a row from that query with column names as keys.

We are constrained by JDBC's limitations of returning a single result set per query but this would allow iterating over individual query results with resultSet.next() and then iterating over each query's results via the array.

For example, combining:
SELECT name FROM depts;
and
SELECT empid, name, deptno FROM emps WHERE deptno = 10;

Yields:

-- Row 1 QUERY=[ {name=Sales}, {name=Marketing}, {name=HR} ], -- Row 2 QUERY=[ {empid=100, name=Bill, deptno=10}, {empid=150, name=Sebastian, deptno=10} , {empid=110, name=Theodore, deptno=10} ]

Later, we should probably allow for aliases for queries so they don't have the generic QUERY column label but that may be more challenging (see the syntax for MULTI described in CALCITE-6188).

Thank you for the very detailed explanation. I think the current format of the result presentation looks quite good.

To later accommodate named queries I've changed the format once more. From the comment in EnumerableCombine:

The output format is a wide table where each column corresponds to a query (named QUERY_0, QUERY_1, etc.) and each row contains a struct with that query's column values for that row index. The number of output rows equals the maximum row count across all input queries. Queries with fewer rows have null values for the additional rows.

Example output for two queries:

QUERY_0 | QUERY_1 ------------------------ | ------------------------ {empno=100, name=Bill} | {deptno=10, name=Sales} {empno=110, name=Eric} | {deptno=20, name=HR} {empno=120, name=Ted} | null

core/src/main/java/org/apache/calcite/rel/rules/CombineSimpleEquivalenceRule.java

core/src/test/java/org/apache/calcite/test/enumerable/EnumerableCombineTest.java

core/src/main/java/org/apache/calcite/rel/rules/CombineSimpleEquivalenceRule.java

xiedeyantu

I think it looks good overall, just left a few minor comments.

xiedeyantu · 2025-12-04T05:48:57Z

core/src/main/java/org/apache/calcite/runtime/SqlFunctions.java

+   * @param queryLists array of lists, one per query
+   * @return list of Object arrays representing combined rows
+   */
+  public static List<Object[]> combineQueryResults(List[] queryLists) {


Can you add a small test for this method?

xiedeyantu · 2025-12-04T05:54:00Z

core/src/test/java/org/apache/calcite/test/enumerable/EnumerableCombineTest.java

+              // Verify both columns have values (no cross-contamination)
+              Object query0 = resultSet.getObject("QUERY_0");
+              Object query1 = resultSet.getObject("QUERY_1");
+              if (query0 == null || query1 == null) {


Would it be better to use AssertThat for positive validation? I don't understand why the throw new AssertionError approach is used here.

xiedeyantu · 2025-12-04T06:00:50Z

There are still some minor issues in the CI that need to be fixed.

xuzifu666 · 2025-12-08T02:34:03Z

core/src/main/java/org/apache/calcite/runtime/SqlFunctions.java

+    for (int rowIdx = 0; rowIdx < maxRows; rowIdx++) {
+      Object[] row = new Object[queryLists.length];
+      for (int queryIdx = 0; queryIdx < queryLists.length; queryIdx++) {
+        List queryList = queryLists[queryIdx];


The @Nullable annotation is required here; otherwise CI will fail.

Awesome, thank you - that was it.

xiedeyantu

LGTM. I think we can squash all the commits, and I'll leave the rest of the merging to you.

tjbanghart · 2025-12-08T19:40:40Z

Thank you so much for the reviews @xiedeyantu, much appreciated!

…hin Combine

sonarqubecloud · 2025-12-08T21:34:26Z

Quality Gate passed

Issues
3 New issues
0 Accepted issues

Measures
0 Security Hotspots
92.6% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

tjbanghart force-pushed the 7254 branch from 9e30133 to fcb1ecd Compare December 2, 2025 03:54

tjbanghart marked this pull request as ready for review December 2, 2025 03:55

tjbanghart force-pushed the 7254 branch from fcb1ecd to 375b4d2 Compare December 2, 2025 04:31

xiedeyantu reviewed Dec 2, 2025

View reviewed changes

core/src/main/java/org/apache/calcite/rel/rules/CombineSimpleEquivalenceRule.java Outdated Show resolved Hide resolved

core/src/main/java/org/apache/calcite/adapter/enumerable/EnumerableCombine.java Outdated Show resolved Hide resolved

tjbanghart force-pushed the 7254 branch 2 times, most recently from b60609a to 25e3ae0 Compare December 2, 2025 20:02

tjbanghart requested a review from xiedeyantu December 2, 2025 20:38

xiedeyantu reviewed Dec 2, 2025

View reviewed changes

tjbanghart force-pushed the 7254 branch 2 times, most recently from 04a59dd to 95e2419 Compare December 3, 2025 06:21

xiedeyantu reviewed Dec 3, 2025

View reviewed changes

core/src/test/java/org/apache/calcite/test/enumerable/EnumerableCombineTest.java Outdated Show resolved Hide resolved

core/src/main/java/org/apache/calcite/rel/rules/CombineSimpleEquivalenceRule.java Outdated Show resolved Hide resolved

tjbanghart requested a review from xiedeyantu December 3, 2025 20:17

xiedeyantu approved these changes Dec 4, 2025

View reviewed changes

xuzifu666 reviewed Dec 8, 2025

View reviewed changes

tjbanghart force-pushed the 7254 branch from cfc3c86 to 698d62a Compare December 8, 2025 04:53

xiedeyantu approved these changes Dec 8, 2025

View reviewed changes

[CALCITE-7254] Add rule for sharing trivially equivalent RelNodes wit…

41cc58a

…hin Combine

tjbanghart force-pushed the 7254 branch from 698d62a to 41cc58a Compare December 8, 2025 19:42

tjbanghart merged commit 3269244 into apache:main Dec 8, 2025
20 checks passed

[CALCITE-7254] Add rule for sharing trivially equivalent RelNodes within Combine #4658

[CALCITE-7254] Add rule for sharing trivially equivalent RelNodes within Combine #4658

Uh oh!

Conversation

tjbanghart commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

xiedeyantu left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xiedeyantu commented Dec 4, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xiedeyantu left a comment

Choose a reason for hiding this comment

Uh oh!

tjbanghart commented Dec 8, 2025

Uh oh!

sonarqubecloud bot commented Dec 8, 2025

Quality Gate passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tjbanghart commented Dec 2, 2025 •

edited

Loading