-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Open
Description
DataFusion is underperforming the Polars streaming engine on some localhost join queries (1e8 rows of data on a Macbook M3 with 16GB of RAM):

Here are the join queries.
I am guessing the join operator can be optimized, similar to how the filtering and aggregation operations were optimized.
Here is an example of how the median function was made faster: #13550
See this epic for more info: #13548
alamb, jonathanc-n, comphead, Dandandan and zhuqi-lucas
Metadata
Metadata
Assignees
Labels
No labels