Skip to content

Schema error: Invalid data type for Delta Lake: Null #1946

@resulyrt93

Description

@resulyrt93

Environment

Delta-rs version: 0.14.0

Binding: Python

Environment:

  • Cloud provider: AWS and Local
  • OS: Linux and MacOS
  • Other:

Bug

What happened: When i try to write my dataframe that has full null column, it raises error. If all rows are None in a column, it seems as null type in pyarrow schema and it causes error because i guess delta-rs doesn't recognize that type.

What you expected to happen: I would expect that create/write delta table.

How to reproduce it:

data = [['kevin', 10, None], ['angela', 12, None], ['oscar', 9, None]]
df = pd.DataFrame(data, columns=['Name', 'Age', 'Gender'])

write_deltalake('/my_table_path', df)

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions