Skip to content

Polars Backend over Pandas #1951

Closed as not planned
Closed as not planned
@kailukowiak

Description

@kailukowiak

Is your feature request related to a problem? Please describe.
Pandas can be slow and memory intensive. When dealing with large files I need lots more memory in my EC2 instance than if I was using Polars.

Also, and this is a matter of personal preference but the Polars API can be much cleaner.

Describe the solution you'd like
It would be really nice if I could use a faster and more memory efficient DataFrame API to ingest and export data.

Describe alternatives you've considered
I often convert Pandas DFs to Polars ones, and then process the data before writing it back out. This works fine on small data sets but it would be nice on large ones to never have to allocate all the memory needed for Pandas.

Comments
I know this is a large ask and currently Polars isn't that popular but I think this would be a huge performance increase if implemented and would make my ETL much prettier (subjectively) too.

Additional context
Add any other context or screenshots about the feature request here.

P.S. Please do not attach files as it's considered a security risk. Add code snippets directly in the message body as much as possible.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions