-
Notifications
You must be signed in to change notification settings - Fork 56
Open
Description
There are many reader and writer properties that have been added to C++ Parquet that could be useful to expose in ParquetSharp.
Possible properties to add:
ReaderProperties
- memory_pool (related to Allow specifying the native allocator to use in code #545)
-
read_dense_for_nullable(only relevant when using a TypedRecordReader, which is not used by ParquetSharp) - thrift_string_size_limit (Thrift string size limit (Reader Properties) #564)
- thrift_container_size_limit (Add thrift_container_size_limit & footer_read_size properties (ReaderProperties) #567)
- footer_read_size (Add thrift_container_size_limit & footer_read_size properties (ReaderProperties) #567)
ArrowReaderProperties
- binary_type (Add binary_type to ArrowReaderProperties #569)
- list_type (Add ListType & ArrowExtensionsEnabled to ArrowReaderProperties #582)
- cache_options
- arrow_extensions_enabled (Add ListType & ArrowExtensionsEnabled to ArrowReaderProperties #582)
- smallest_decimal_enabled (Requires Parquet C++ 22)
WriterProperties
- memory_pool (related to Allow specifying the native allocator to use in code #545)
- data_page_version
- max_statistics_size (Add set_max_statistics_size to WriterProperties #581)
- store_decimal_as_integer (Add EnableStoreDecimalAsInteger to WriterProperties #572)
- size_statistics_level
ArrowWriterProperties
- set_time_adjusted_to_utc (Requires Parquet C++ 22)
Metadata
Metadata
Assignees
Labels
No labels