fix: add order_by to first and last #11693

mesejo · 2025-10-16T20:13:17Z

Description of changes

N/A

Issues closed

Resolves bug: incorrect results when using last with order_by #11656

cpcloud · 2025-10-17T12:40:37Z

@mesejo Is there a test you can add to ensure this doesn't regress?

closes ibis-project#11656

mesejo · 2025-10-17T19:35:04Z

@cpcloud, Thanks for the review, just added a test

NickCrews · 2025-10-17T19:51:05Z

@mesejo I just updated the test, let me know if that is acceptable to you! I think using the handcrafted data will make it more clear what we are testing. Also added tests for descending order bys

NickCrews · 2025-10-17T19:52:13Z

LGTM

mesejo · 2025-10-17T20:04:58Z

@mesejo I just updated the test, let me know if that is acceptable to you! I think using the handcrafted data will make it more clear what we are testing. Also added tests for descending order bys

That's fine with me. Thanks!

NickCrews · 2025-10-17T20:13:54Z

@mesejo any idea why druid is failing on my version? On your version, the commit right before, it passed.

NickCrews · 2025-10-17T20:40:20Z

I'm guessing because in your version you only operated over a single column, where in mine we are ordering by a different column than we are aggregating over? Probably just mark as xfail and call it a day?

mesejo · 2025-10-17T20:45:13Z

@NickCrews, I think it is because of the memtable. The following works for the Druid backend:

@pytest.mark.parametrize(
    "method,expected",
    [
        pytest.param(lambda col: col.first(order_by="bigint_col"), 0, id="first_asc"),
        pytest.param(lambda col: col.last(order_by="bigint_col"), 9, id="last_asc"),
        pytest.param(
            lambda col: col.first(order_by=ibis._.bigint_col.desc()), 9, id="first_desc"
        ),
        pytest.param(
            lambda col: col.last(order_by=ibis._.bigint_col.desc()), 0, id="last_desc"
        ),
    ],
)
def test_first_last_ordered_in_mutate(alltypes, con, method, expected):
    # originally reported in https://github.com/ibis-project/ibis/issues/11656
    t = alltypes
    expr = t.mutate(new=method(t.int_col))
    actual = con.to_pyarrow(expr.new).to_pylist()
    assert actual == [expected] * len(actual)

NickCrews · 2025-10-17T22:00:16Z

huh, good detective work, thanks for cleaning up my mess. That is weird, since that seems like such basic usage of memtable, and memtable doesn't appear to be broken otherwise on druid.

I don't love how the test data is so far away from the test (like I have no idea if 0 and 9 are actually the right results without looking up the test data).

But I'm willing to say good enough to get this fix in, but @cpcloud may feel differently.

mesejo · 2025-10-18T08:26:01Z

@NickCrews, I changed the test to:

t = alltypes.select(
    a=ibis._.tinyint_col, val=ibis._.int_col, ob=ibis._.bigint_col
).filter(
    ((ibis._.val == 4) & (ibis._.ob == 40))
    | ((ibis._.val == 5) & (ibis._.ob == 50))
)
expr = t.mutate(new=method(t.val)).limit(10)
actual = con.to_pyarrow(expr.new).to_pylist()
assert actual == [expected] * 10

This form makes it easy to reason about the shape and content of the data.

NickCrews · 2025-10-18T14:01:45Z

That is better, thanks. still not ideal as handwritten data, but if we can't have that, then this really helps

github-actions bot added tests Issues or PRs related to tests sql Backends that generate SQL labels Oct 16, 2025

mesejo force-pushed the fix/order_by_first_and_last branch 2 times, most recently from cf7287a to 6713a4c Compare October 17, 2025 10:12

mesejo marked this pull request as ready for review October 17, 2025 10:48

mesejo force-pushed the fix/order_by_first_and_last branch from c015111 to 2fd80fc Compare October 17, 2025 16:25

mesejo added 4 commits October 17, 2025 19:30

fix: add order_by to first and last

7b6c445

closes ibis-project#11656

refactor: set order_by at creation time

1e11450

chore: add test

5d23682

chore: mark notimpl risingwave

d357c42

mesejo force-pushed the fix/order_by_first_and_last branch from 2fd80fc to d357c42 Compare October 17, 2025 17:30

chore: use named marks, test desc, use hand-crafted data

b0a59a1

NickCrews approved these changes Oct 17, 2025

View reviewed changes

chore: use table instead of memtable

ca22ef8

chore: make test data closer

f57ce17

mesejo force-pushed the fix/order_by_first_and_last branch from 3bff124 to f57ce17 Compare October 18, 2025 04:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix: add order_by to first and last #11693

fix: add order_by to first and last #11693

Uh oh!

mesejo commented Oct 16, 2025

Uh oh!

cpcloud commented Oct 17, 2025

Uh oh!

mesejo commented Oct 17, 2025

Uh oh!

NickCrews commented Oct 17, 2025

Uh oh!

NickCrews commented Oct 17, 2025

Uh oh!

mesejo commented Oct 17, 2025

Uh oh!

NickCrews commented Oct 17, 2025 •

edited

Loading

Uh oh!

NickCrews commented Oct 17, 2025

Uh oh!

mesejo commented Oct 17, 2025

Uh oh!

NickCrews commented Oct 17, 2025

Uh oh!

mesejo commented Oct 18, 2025

Uh oh!

NickCrews commented Oct 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

fix: add order_by to first and last #11693

Are you sure you want to change the base?

fix: add order_by to first and last #11693

Uh oh!

Conversation

mesejo commented Oct 16, 2025

Description of changes

Issues closed

Uh oh!

cpcloud commented Oct 17, 2025

Uh oh!

mesejo commented Oct 17, 2025

Uh oh!

NickCrews commented Oct 17, 2025

Uh oh!

NickCrews commented Oct 17, 2025

Uh oh!

mesejo commented Oct 17, 2025

Uh oh!

NickCrews commented Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NickCrews commented Oct 17, 2025

Uh oh!

mesejo commented Oct 17, 2025

Uh oh!

NickCrews commented Oct 17, 2025

Uh oh!

mesejo commented Oct 18, 2025

Uh oh!

NickCrews commented Oct 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

NickCrews commented Oct 17, 2025 •

edited

Loading