test(order_by): add ORE ordering test without WHERE clause operators by tobyhede · Pull Request #168 · cipherstash/encrypt-query-language

tobyhede · 2026-03-10T21:49:01Z

Verifies ORDER BY eql_v2.order_by(e) DESC works over the full dataset without relying on any comparison operators in a WHERE clause.

Summary by CodeRabbit

New Features
- Added server-side sorting utilities for encrypted values, including a dynamic ORDER BY helper and in-place sorter.
Tests
- Added fixture-driven tests for ORDER BY NULL and two new ORDER BY without-WHERE tests (asc/desc).
- Added tests simulating missing database operator registrations to validate degraded ordering and expected failures.
- Added comprehensive correctness and performance tests for the new sort/order helpers, including scaled and filtered performance comparisons.
Documentation
- Added fixture schema docs describing ORDER BY NULL test data and usage.

Verifies ORDER BY eql_v2.order_by(e) DESC works over the full dataset without relying on any comparison operators in a WHERE clause.

coderabbitai · 2026-03-10T21:49:20Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Replaces in-test setup with fixture-driven data for ORDER BY NULL tests, adds fixtures that drop specific eql_v2 operator classes, introduces multiple new test modules verifying ORDER BY behavior and failure modes when opclasses/operators are absent, and adds SQL sorting utilities plus two new helper-ordering tests without WHERE.

Changes

Cohort / File(s)	Summary
Order-by tests (updated) `tests/sqlx/tests/order_by_tests.rs`	Replaced inline CREATE/INSERT setup with fixture `order_by_null_data`; adjusted expectations to fixture-seeded rows; added two new tests: `order_by_helper_function_without_where_clause` and `..._asc`.
Fixtures - drop operator classes `tests/sqlx/fixtures/drop_operator_classes.sql`	New fixture that drops multiple `eql_v2` operator classes/families and related operators (btree/hash) via CASCADE to emulate an environment missing opclasses.
New tests - no opclass `tests/sqlx/tests/order_by_no_opclass_tests.rs`	New test module using the `drop_operator_classes` fixture to assert altered/incorrect ordering when operator classes/operators are absent; covers helper ORDER BY, direct ORDER BY, and correlated-subquery ranking variants.
New tests - ORDER BY USING operator `tests/sqlx/tests/order_by_using_operator_tests.rs`	New tests asserting `ORDER BY ... USING <
Fixtures - order_by null data & manifest `tests/sqlx/fixtures/order_by_null_data.sql`, `tests/sqlx/fixtures/FIXTURE_SCHEMA.md`	Adds `order_by_null_data` fixture creating an encrypted table and inserting four rows (including two NULLs) for deterministic NULL-ordering tests; updates fixture manifest/documentation.
Sorting utilities (SQL) `src/operators/sort.sql`	Adds `eql_v2._quicksort_compare`, `eql_v2.sort_compare`, and `eql_v2.order_by_compare`: an in-place array quicksort, a public sorter, and a dynamic-SQL wrapper that collects query results into arrays then sorts them.
Sort & perf tests `tests/sqlx/tests/order_by_sort_tests.rs`	New comprehensive tests for `sort_compare`/`order_by_compare`: correctness, filtered queries, LIMIT, empty/single inputs, and scaled performance comparisons using generated datasets and fixtures.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

Add hash operator class for cross-document equality #164 — touches eql_v2 operator-class/operator-family surface; likely related to tests that drop or alter operator-class/operator registrations.

Suggested reviewers

freshtonic
coderdan

Poem

🐇 I dug through fixtures, nibble by bite,

Dropped some opclasses, watched ORDER BY fight,
Quicksort in burrows, tests taking flight,
Rows hop in order, or scramble at night,
A carrot for passing — hop, tuck, delight! 🥕

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The PR title accurately reflects the main changes: adding new ORDER BY tests without WHERE clause operators, specifically validating eql_v2.order_by(e) DESC/ASC on full datasets.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch expand-order-tests

📝 Coding Plan

Generate coding plan for human review comments

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

…y workaround Add fixture that drops operator classes/operators to simulate Supabase environment. Tests prove ORDER BY e and ORDER BY eql_v2.order_by(e) both produce wrong results without operator classes. Add correlated subquery ranking tests demonstrating eql_v2.compare() as a working alternative for correct ordering in Supabase mode: - ASC/DESC full ordering verification - LIMIT support - WHERE clause filtering with correct sort order

Verify ORDER BY e USING </> syntax works with raw operators independently of operator class registration. Tests ascending, descending, LIMIT, and WHERE clause filtering after dropping btree/hash operator families.

- Extract shared NULL test data into order_by_null_data.sql fixture, removing ~120 lines of duplicated setup across 4 tests - Deduplicate correlated subquery WHERE test (single fetch_all instead of QueryAssertion + fetch_one running the query twice) - Add ASC variant for order_by helper without WHERE clause - Remove redundant USING+LIMIT tests (same plan-time error as non-LIMIT) - Document new fixture in FIXTURE_SCHEMA.md

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

tests/sqlx/fixtures/order_by_null_data.sql (1)
13-30: Add a small validation test for this fixture.

Lines 24 and 27 can silently insert zero rows if the seeded ore data ever changes, which would turn the NULL-ordering tests into fixture-debugging exercises. A dedicated validation test for the expected 4-row shape would make this fixture much safer to maintain.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/sqlx/fixtures/order_by_null_data.sql` around lines 13 - 30, Add a small
validation test that runs after this fixture to assert the encrypted table has
exactly four rows so missing INSERTs don’t silently break NULL-order tests:
write a test that queries SELECT count(*) FROM encrypted and asserts the result
equals 4 (and optionally checks that rows with id=1 and id=4 have NULL e and
id=2 and id=3 have non-NULL e) to fail fast if the ore-derived INSERTs (the
SELECT e FROM ore WHERE id = 42 / id = 3 lines) insert zero rows; add this test
to the fixture’s test suite so it runs immediately after loading the SQL
fixture.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@tests/sqlx/tests/order_by_using_operator_tests.rs`:
- Around line 18-25: The test currently only checks result.is_err(), which
accepts any error; replace that with extracting the error (e.g., let err =
result.unwrap_err()) and assert the specific failure by matching the SQL error
code or message for missing btree ordering support (for the query string "SELECT
id FROM ore ORDER BY e USING <" and the other similar queries in the same file),
e.g., assert that err contains the expected SQLSTATE or a substring like
"operator family" or "could not identify an ordering operator family", and apply
the same tightened assertion to the other cases referenced (lines 34-41 and
52-64).

---

Nitpick comments:
In `@tests/sqlx/fixtures/order_by_null_data.sql`:
- Around line 13-30: Add a small validation test that runs after this fixture to
assert the encrypted table has exactly four rows so missing INSERTs don’t
silently break NULL-order tests: write a test that queries SELECT count(*) FROM
encrypted and asserts the result equals 4 (and optionally checks that rows with
id=1 and id=4 have NULL e and id=2 and id=3 have non-NULL e) to fail fast if the
ore-derived INSERTs (the SELECT e FROM ore WHERE id = 42 / id = 3 lines) insert
zero rows; add this test to the fixture’s test suite so it runs immediately
after loading the SQL fixture.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: a0dde4a5-da4f-4d9e-b1e5-de97f12cdb81

📥 Commits

Reviewing files that changed from the base of the PR and between 291255a and 621b265.

📒 Files selected for processing (5)

tests/sqlx/fixtures/FIXTURE_SCHEMA.md
tests/sqlx/fixtures/order_by_null_data.sql
tests/sqlx/tests/order_by_no_opclass_tests.rs
tests/sqlx/tests/order_by_tests.rs
tests/sqlx/tests/order_by_using_operator_tests.rs

🚧 Files skipped from review as they are similar to previous changes (1)

tests/sqlx/tests/order_by_no_opclass_tests.rs

coderabbitai · 2026-03-10T23:32:27Z

+    let result = sqlx::query("SELECT id FROM ore ORDER BY e USING <")
+        .fetch_all(&pool)
+        .await;
+
+    assert!(
+        result.is_err(),
+        "ORDER BY e USING < should fail without btree operator family"
+    );


⚠️ Potential issue | 🟠 Major

Assert the specific failure, not just is_err().

These tests currently pass on any error path, so they do not prove that ORDER BY ... USING is failing because the btree ordering support is missing. In particular, the third case would also pass on an unrelated SQL/literal issue. Please check the returned error message or SQLSTATE for the expected ordering-operator failure.

🔎 Tighten one of the assertions

- let result = sqlx::query("SELECT id FROM ore ORDER BY e USING <") - .fetch_all(&pool) - .await; - - assert!( - result.is_err(), - "ORDER BY e USING < should fail without btree operator family" - ); + let err = sqlx::query("SELECT id FROM ore ORDER BY e USING <") + .fetch_all(&pool) + .await + .expect_err("ORDER BY e USING < should fail without btree operator family"); + + let msg = err.to_string(); + assert!( + msg.contains("valid ordering operator") || msg.contains("operator family"), + "expected missing btree ordering support error, got: {msg}" + );

Also applies to: 34-41, 52-64

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@tests/sqlx/tests/order_by_using_operator_tests.rs` around lines 18 - 25, The test currently only checks result.is_err(), which accepts any error; replace that with extracting the error (e.g., let err = result.unwrap_err()) and assert the specific failure by matching the SQL error code or message for missing btree ordering support (for the query string "SELECT id FROM ore ORDER BY e USING <" and the other similar queries in the same file), e.g., assert that err contains the expected SQLSTATE or a substring like "operator family" or "could not identify an ordering operator family", and apply the same tightened assertion to the other cases referenced (lines 34-41 and 52-64).

freshtonic

This works but via a technique with quadratic time complexity.

I've found no way to convince Postgres to order with a comparison function without custom types/operator classes.

However, if we dropped the original block ORE schema and only provide the lexicographical ORE variant then this problem would go away entirely.

freshtonic · 2026-03-11T00:15:18Z

+    let sql = "SELECT id FROM ore t \
+               ORDER BY (SELECT COUNT(*) FROM ore t2 WHERE eql_v2.compare(t.e, t2.e) > 0)";


While this works around the issue it does have quadratic time complexity - O(N²) - because the subquery generates the ranking across the whole table for every row in the table.

Not something that should be fixed in this PR necessarily, but we should circle back to figure out something more optimal.

…rows Expand the 99-row ORE dataset to 495 rows via generate_series cross join to demonstrate O(n log n) vs O(n^2) divergence more dramatically. At 495 rows, sort_compare achieves ~28x speedup over correlated subquery. Also adds sort.sql with the quicksort implementation (sort_compare and order_by_compare functions).

coderabbitai

🧹 Nitpick comments (2)

tests/sqlx/tests/order_by_sort_tests.rs (1)
316-322: Performance assertion is defensive but may miss regressions.

The assertion sort_elapsed < correlated_elapsed * 3 allows sort_compare to be up to 3x slower than the correlated subquery, which contradicts the test name "faster_than". This is understandable for CI stability, but consider tightening to sort_elapsed < correlated_elapsed with a retry mechanism or statistical sampling if flakiness becomes an issue.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/sqlx/tests/order_by_sort_tests.rs` around lines 316 - 322, The test
currently allows sort_compare to be up to 3x slower via the assertion using
sort_elapsed and correlated_elapsed; change the assertion in the test (the block
using sort_elapsed and correlated_elapsed in order_by_sort_tests.rs / the
"faster_than" test) to require sort_elapsed < correlated_elapsed (strictly
faster), and add a simple retry/statistical approach: run the timing measurement
multiple times (e.g., N runs), use the median or best-case timing for both
sort_elapsed and correlated_elapsed, and only assert on those aggregated values;
keep variable names sort_elapsed and correlated_elapsed and ensure retries are
bounded and logged to avoid CI flakiness.
src/operators/sort.sql (1)
128-146: Consider adding array length validation.

The documentation states arrays "must be same length" but there's no runtime check. Mismatched lengths would silently produce incorrect results (sorting stops at the shorter array's length).
🛡️ Add length validation
     n := array_length(ids, 1);
 
     IF n IS NULL OR n = 0 THEN
         RETURN;
     END IF;
+
+    IF n != array_length(vals, 1) THEN
+        RAISE EXCEPTION 'ids and vals arrays must have the same length';
+    END IF;
 
     IF n = 1 THEN
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/operators/sort.sql` around lines 128 - 146, The function
eql_v2.sort_compare currently computes n := array_length(ids,1) but never
validates that vals has the same length; add a runtime check after computing n
to get m := array_length(vals,1) and if m IS NULL OR n IS NULL OR n <> m then
RAISE EXCEPTION with a clear message (e.g., "ids and vals must be non-empty and
the same length") so mismatched or nil arrays fail fast; reference ids, vals, n
and the function name eql_v2.sort_compare when locating where to insert this
validation.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@src/operators/sort.sql`:
- Around line 128-146: The function eql_v2.sort_compare currently computes n :=
array_length(ids,1) but never validates that vals has the same length; add a
runtime check after computing n to get m := array_length(vals,1) and if m IS
NULL OR n IS NULL OR n <> m then RAISE EXCEPTION with a clear message (e.g.,
"ids and vals must be non-empty and the same length") so mismatched or nil
arrays fail fast; reference ids, vals, n and the function name
eql_v2.sort_compare when locating where to insert this validation.

In `@tests/sqlx/tests/order_by_sort_tests.rs`:
- Around line 316-322: The test currently allows sort_compare to be up to 3x
slower via the assertion using sort_elapsed and correlated_elapsed; change the
assertion in the test (the block using sort_elapsed and correlated_elapsed in
order_by_sort_tests.rs / the "faster_than" test) to require sort_elapsed <
correlated_elapsed (strictly faster), and add a simple retry/statistical
approach: run the timing measurement multiple times (e.g., N runs), use the
median or best-case timing for both sort_elapsed and correlated_elapsed, and
only assert on those aggregated values; keep variable names sort_elapsed and
correlated_elapsed and ensure retries are bounded and logged to avoid CI
flakiness.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 2e4f32a5-42a5-4a1e-84b6-7a46f91b1c24

📥 Commits

Reviewing files that changed from the base of the PR and between 621b265 and 2928cdb.

📒 Files selected for processing (2)

src/operators/sort.sql
tests/sqlx/tests/order_by_sort_tests.rs

Independent array_agg() subqueries don't guarantee matching row order, which could silently misalign ids with ciphertexts. Adding ORDER BY id inside each array_agg() ensures deterministic, aligned array construction. Updates 8 test call sites and 3 docstring examples.

coderabbitai

🧹 Nitpick comments (1)

tests/sqlx/tests/order_by_sort_tests.rs (1)
394-450: Consider adding a small margin to the scaled performance assertion.

The assertion at lines 442-447 is stricter than the 99-row test (no margin). While the O(n log n) vs O(n²) gap should be substantial at 495 rows, CI environments can have variable performance. If this test becomes flaky, consider adding a small margin (e.g., sort_elapsed * 2 < correlated_elapsed).
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/sqlx/tests/order_by_sort_tests.rs` around lines 394 - 450, The
assertion in sort_compare_performance_at_scale is too strict and can be flaky in
CI; change the final assertion to require a small margin (e.g.,
correlated_elapsed is at least twice as slow as sort_elapsed) by comparing their
seconds as floats: replace the assert!(sort_elapsed < correlated_elapsed, ...)
with an assertion like assert!(correlated_elapsed.as_secs_f64() >
sort_elapsed.as_secs_f64() * 2.0, "...") referencing the existing sort_elapsed
and correlated_elapsed variables inside the sort_compare_performance_at_scale
test.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@tests/sqlx/tests/order_by_sort_tests.rs`:
- Around line 394-450: The assertion in sort_compare_performance_at_scale is too
strict and can be flaky in CI; change the final assertion to require a small
margin (e.g., correlated_elapsed is at least twice as slow as sort_elapsed) by
comparing their seconds as floats: replace the assert!(sort_elapsed <
correlated_elapsed, ...) with an assertion like
assert!(correlated_elapsed.as_secs_f64() > sort_elapsed.as_secs_f64() * 2.0,
"...") referencing the existing sort_elapsed and correlated_elapsed variables
inside the sort_compare_performance_at_scale test.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: cd28ca3c-9f7f-45a9-9ad9-0eb5df425c67

📥 Commits

Reviewing files that changed from the base of the PR and between 2928cdb and 2b4e0d8.

📒 Files selected for processing (2)

src/operators/sort.sql
tests/sqlx/tests/order_by_sort_tests.rs

✅ Files skipped from review due to trivial changes (1)

src/operators/sort.sql

Add sort_compare(text, text, text, text, text) overload that accepts column names, table name, optional direction and filter instead of pre-aggregated arrays. Delegates to order_by_compare internally. Includes 8 tests covering ASC/DESC, default direction, LIMIT, filter, empty results, NULL handling, and equivalence with order_by_compare.

Add required Doxygen tags for _compare_sort_pivot, _insertion_sort, and _emit_sorted_rows to pass docs:validate:required-tags.

Expand numeric ORE fixture from 99 to 1000 rows and add text ORE fixture with 100 lexicographically sorted words for broader test coverage. - Expand migration 002 to 1000 numeric ORE rows - Add migration 006 with 100 text ORE rows (aardvark..zinc) - Add get_ore_text_encrypted and assert_sequential_ids helpers - Update all existing test assertions for the expanded dataset - Add 14 new text ORE ordering tests (operator classes, sort_compare, prefix ordering, similar starts edge cases) - Simplify performance tests to use 1000 base rows directly

Add 29 tests covering equality, comparison, function, and JSONB variants for text ORE encryption using ore_text table (100 words). Includes lexicographic edge cases (prefix ordering, similar starts) and boundary tests. Adds get_ore_text_encrypted_as_jsonb helper.

test(order_by): add ORE ordering test without WHERE clause operators

9ee5a37

Verifies ORDER BY eql_v2.order_by(e) DESC works over the full dataset without relying on any comparison operators in a WHERE clause.

tobyhede added 3 commits March 11, 2026 09:25

test(order_by): add ORDER BY USING operator tests

291255a

Verify ORDER BY e USING </> syntax works with raw operators independently of operator class registration. Tests ascending, descending, LIMIT, and WHERE clause filtering after dropping btree/hash operator families.

coderabbitai Bot reviewed Mar 10, 2026

View reviewed changes

freshtonic approved these changes Mar 11, 2026

View reviewed changes

coderabbitai Bot reviewed Mar 11, 2026

View reviewed changes

tobyhede force-pushed the expand-order-tests branch from 2928cdb to 2b4e0d8 Compare March 11, 2026 02:21

coderabbitai Bot reviewed Mar 11, 2026

View reviewed changes

tobyhede added 6 commits March 11, 2026 13:49

docs(sort): add missing @param and @return tags to internal functions

a0cf5bc

Add required Doxygen tags for _compare_sort_pivot, _insertion_sort, and _emit_sorted_rows to pass docs:validate:required-tags.

Fix sort_compare table resolution

9c4285a

Add ore text jsonb operator coverage

690696d

tobyhede merged commit 68c32ea into main Mar 18, 2026
5 of 9 checks passed

tobyhede deleted the expand-order-tests branch March 18, 2026 06:56

auxesis mentioned this pull request Apr 22, 2026

feat(bench): Phase 2 - benchmark table, 10K row fixture, and verification tests #171

Merged

1 task

coderabbitai Bot mentioned this pull request Apr 22, 2026

feat(bench): index performance benchmark suite #173

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(order_by): add ORE ordering test without WHERE clause operators#168

test(order_by): add ORE ordering test without WHERE clause operators#168
tobyhede merged 12 commits intomainfrom
expand-order-tests

tobyhede commented Mar 10, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Mar 10, 2026 •

edited

Loading

Reviews paused

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Mar 10, 2026

Uh oh!

freshtonic left a comment

Uh oh!

freshtonic Mar 11, 2026

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		let sql = "SELECT id FROM ore t \
		ORDER BY (SELECT COUNT(*) FROM ore t2 WHERE eql_v2.compare(t.e, t2.e) > 0)";

Conversation

tobyhede commented Mar 10, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

freshtonic left a comment

Choose a reason for hiding this comment

Uh oh!

freshtonic Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tobyhede commented Mar 10, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Mar 10, 2026 •

edited

Loading