IO interfaces. More...
Classes | |
class | arrow_io_source |
Implementation class for reading from an Apache Arrow file. The file could be a memory-mapped file or other implementation supported by Arrow. More... | |
class | avro_reader_options |
Settings to use for read_avro() . More... | |
class | avro_reader_options_builder |
struct | binary_statistics |
Statistics for binary columns. More... | |
struct | bucket_statistics |
Statistics for boolean columns. More... | |
class | chunked_orc_writer_options |
Settings to use for write_orc_chunked() . More... | |
class | chunked_orc_writer_options_builder |
class | chunked_parquet_writer_options |
Settings for write_parquet_chunked() . More... | |
class | chunked_parquet_writer_options_builder |
struct | column_name_info |
Detailed name information for output columns. More... | |
class | column_statistics |
Contains per-column ORC statistics. More... | |
class | csv_reader_options |
Settings to use for read_csv() . More... | |
class | csv_reader_options_builder |
class | csv_writer_options |
Settings to use for write_csv() . More... | |
class | csv_writer_options_builder |
class | data_sink |
Interface class for storing the output data from the writers. More... | |
class | datasource |
Interface class for providing input data to the readers. More... | |
struct | date_statistics |
Statistics for date(time) columns. More... | |
struct | decimal_statistics |
Statistics for decimal columns. More... | |
struct | double_statistics |
Statistics for floating point columns. More... | |
struct | host_buffer |
Non-owning view of a host memory buffer. More... | |
struct | integer_statistics |
Statistics for integral columns. More... | |
class | json_reader_options |
Input arguments to the read_json interface. More... | |
class | json_reader_options_builder |
struct | minmax_statistics |
Base class for column statistics that include optional minimum and maximum. More... | |
class | orc_chunked_writer |
Chunked orc writer class writes an ORC file in a chunked/stream form. More... | |
class | orc_reader_options |
Settings to use for read_orc() . More... | |
class | orc_reader_options_builder |
class | orc_writer_options |
Settings to use for write_orc() . More... | |
class | orc_writer_options_builder |
class | parquet_chunked_writer |
chunked parquet writer class to handle options and write tables in chunks. More... | |
class | parquet_reader_options |
Settings or read_parquet() . More... | |
class | parquet_reader_options_builder |
class | parquet_writer_options |
Settings for write_parquet() . More... | |
class | parquet_writer_options_builder |
struct | parsed_orc_statistics |
Holds column names and parsed file-level and stripe-level statistics. More... | |
struct | raw_orc_statistics |
Holds column names and buffers containing raw file-level and stripe-level statistics. More... | |
struct | sink_info |
Destination information for write interfaces. More... | |
struct | source_info |
Source information for read interfaces. More... | |
struct | string_statistics |
Statistics for string columns. More... | |
struct | sum_statistics |
Base class for column statistics that include an optional sum. More... | |
struct | table_metadata |
Table metadata for io readers/writers (primarily column names) For nested types (structs, maps, unions), the ordering of names in the column_names vector corresponds to a pre-order traversal of the column tree. In the example below (2 top-level columns: struct column "col1" and string column "col2"), column_names = {"col1", "s3", "f5", "f6", "f4", "col2"}. More... | |
struct | table_metadata_with_nullability |
Derived class of table_metadata which includes flattened nullability information of input. More... | |
struct | table_with_metadata |
Table with table metadata used by io readers to return the metadata by value. More... | |
struct | timestamp_statistics |
Statistics for timestamp columns. More... | |
Enumerations | |
enum | statistics_type { NONE, INT, DOUBLE, STRING, BUCKET, DECIMAL, DATE, BINARY, TIMESTAMP } |
Enumerator for types of column statistics that can be included in column_statistics . More... | |
enum | compression_type { compression_type::NONE, compression_type::AUTO, compression_type::SNAPPY, compression_type::GZIP, compression_type::BZIP2, compression_type::BROTLI, compression_type::ZIP, compression_type::XZ } |
Compression algorithms. More... | |
enum | io_type { io_type::FILEPATH, io_type::HOST_BUFFER, io_type::VOID, io_type::USER_IMPLEMENTED } |
Data source or destination types. More... | |
enum | quote_style { quote_style::MINIMAL, quote_style::ALL, quote_style::NONNUMERIC, quote_style::NONE } |
Behavior when handling quotations in field data. More... | |
enum | statistics_freq { STATISTICS_NONE = 0, STATISTICS_ROWGROUP = 1, STATISTICS_PAGE = 2 } |
Column statistics granularity type for parquet/orc writers. More... | |
IO interfaces.
|
strong |
Compression algorithms.
Definition at line 53 of file io/types.hpp.
|
strong |
Data source or destination types.
Definition at line 67 of file io/types.hpp.
|
strong |
Behavior when handling quotations in field data.
Enumerator | |
---|---|
MINIMAL | Quote only fields which contain special characters. |
ALL | Quote all fields. |
NONNUMERIC | Quote all non-numeric fields. |
NONE | Never quote fields; disable quotation parsing. |
Definition at line 77 of file io/types.hpp.
Column statistics granularity type for parquet/orc writers.
Enumerator | |
---|---|
STATISTICS_NONE | No column statistics. |
STATISTICS_ROWGROUP | Per-Rowgroup column statistics. |
STATISTICS_PAGE | Per-page column statistics. |
Definition at line 87 of file io/types.hpp.
|
strong |
Enumerator for types of column statistics that can be included in column_statistics
.
The statistics type depends on the column data type.
Definition at line 68 of file orc_metadata.hpp.