Changelog
55.1.0 (2025-05-09)
Breaking changes:
- refactor!: do not default the struct array length to 0 in Struct::try_new #7247 [arrow] (westonpace)
Implemented enhancements:
- Add a way to get max
usize
fromOffsetSizeTrait
#7474 [arrow] - Deterministic metadata encoding #7448 [arrow]
- Support Arrow type Dictionary with value FixedSizeBinary in Parquet #7445
- Parquet: Add ability to project rowid in parquet reader #7444
- Move parquet::file::metadata::reader::FooterTail to parquet::file::metadata so that it is public #7438 [parquet]
- Speedup take_bytes by precalculating capacity #7432 [arrow]
- Improve performance of interleave_primitive and interleave_bytes #7421 [arrow]
- Implement
Eq
andDefault
forScalarBuffer
#7411 [arrow] - Add decryption support for column index and offset index #7390 [parquet]
- Support writing encrypted Parquet files with plaintext footers #7320 [parquet]
- Support Parquet key management tools #7256 [parquet]
- Verify footer tags when reading encrypted Parquet files with plaintext footers #7255 [parquet]
- StructArray::try_new behavior can be unexpected when there are no child arrays #7246 [arrow]
- Parquet performance: improve performance of reading int8/int16 #7097 [parquet]
Fixed bugs:
- StructArray::try_new validation incorrectly returns an error when
logical_nulls()
returns Some() && null_count == 0 #7435 - Reading empty DataPageV2 fails with
snappy: corrupt input (empty)
#7388 [parquet]
Documentation updates:
- Improve documentation and add examples for ArrowPredicateFn #7480 [parquet] (alamb)
- Document Arrow <--> Parquet schema conversion better #7479 [parquet] (alamb)
- Fix a typo in arrow/examples/README.md #7473 [arrow] (Mottl)
Closed issues:
- Refactor Parquet DecryptionPropertiesBuilder to fix use of unreachable #7476 [parquet]
- Implement
Eq
andDefault
forOffsetBuffer
#7417 [arrow]
Merged pull requests:
- Add Parquet
arrow_reader
benchmarks for {u}int{8,16} columns #7484 [parquet] (alamb) - fix:
rustdoc::unportable_markdown
was removed #7483 [arrow] [arrow-flight] (crepererum) - Support round trip reading / writing Arrow
Duration
type to parquet #7482 [parquet] (Liyixin95) - Add const MAX_OFFSET to OffsetSizeTrait #7478 [arrow] (thinkharderdev)
- Refactor Parquet DecryptionPropertiesBuilder #7477 [parquet] (adamreeve)
- Support parsing and display pretty for StructType #7469 [arrow] (goldmedal)
- chore(deps): update sysinfo requirement from 0.34.0 to 0.35.0 #7462 [parquet] (dependabot[bot])
- Verify footer tags when reading encrypted Parquet files with plaintext footers #7459 [parquet] (rok)
- Improve comments for avro #7449 [arrow] (kumarlokesh)
- feat: Support round trip reading/writing Arrow type
Dictionary(_, FixedSizeBinary(_))
to Parquet #7446 [parquet] (albertlockett) - Fix out of bounds crash in RleValueDecoder #7441 [parquet] (apilloud)
- Make
FooterTail
public #7440 [parquet] (masonh22) - Support writing encrypted Parquet files with plaintext footers #7439 [parquet] (rok)
- feat: deterministic metadata encoding #7437 [arrow] (timsaucer)
- Fix validation logic in
StructArray::try_new
to account for array.logical_nulls() returning Some() and null_count == 0 #7436 [arrow] (phillipleblanc) - Minor: Fix typo in async_reader comment #7433 [parquet] (amoeba)
- feat: coerce fixed size binary to binary view #7431 [arrow] (chenkovsky)
- chore(deps): update brotli requirement from 7.0 to 8.0 #7430 [parquet] (dependabot[bot])
- Speedup take_bytes (-35% -69%) by precalculating capacity #7422 [arrow] (Dandandan)
- Improve performance of interleave_primitive (-15% - 45%) / interleave_bytes (-10-25%) #7420 [arrow] (Dandandan)
- Implement
Eq
andDefault
forOffsetBuffer
#7418 [arrow] (kylebarron) - Implement
Default
forBuffer
&ScalarBuffer
#7413 [arrow] (emilk) - Implement
Eq
forScalarBuffer
whenT: Eq
#7412 [arrow] (emilk) - Skip page should also support skip dict page #7409 [parquet] (zhuqi-lucas)
- Replace
RecordBatch::with_schema_unchecked
withRecordBatch::new_unchecked
#7405 [arrow] (tustvold) - feat: Adding
with_schema_unchecked
method forRecordBatch
#7402 [arrow] (comphead) - Add benchmark for parquet reader with row_filter and project settings #7401 [parquet] (zhuqi-lucas)
- Parquet: Expose accessors from
ArrowReaderOptions
#7400 (kylebarron) - Support decryption of Parquet column and offset indexes #7399 [parquet] (adamreeve)
- Handle compressed empty DataPage v2 #7389 [parquet] (EnricoMi)
- Improve performance of reading int8/int16 Parquet data #7055 [parquet] (etseidl)
* This Changelog was automatically generated by github_changelog_generator