Files | |
| file | parquet_io_utils.hpp |
| IO utilities for the Parquet and Hybrid scan readers. | |
Functions | |
| std::unique_ptr< cudf::io::datasource::buffer > | cudf::io::parquet::fetch_footer_to_host (cudf::io::datasource &datasource) |
| Fetches a host buffer of Parquet footer bytes from the input data source. | |
| std::vector< std::unique_ptr< cudf::io::datasource::buffer > > | cudf::io::parquet::fetch_footers_to_host (cudf::host_span< std::reference_wrapper< cudf::io::datasource > const > datasources) |
| Fetches host buffers of Parquet footer bytes from multiple input data sources. | |
| std::unique_ptr< cudf::io::datasource::buffer > | cudf::io::parquet::fetch_page_index_to_host (cudf::io::datasource &datasource, byte_range_info const page_index_bytes) |
| Fetches a host buffer of Parquet page index bytes from the input data source. | |
| std::vector< std::unique_ptr< cudf::io::datasource::buffer > > | cudf::io::parquet::fetch_page_indexes_to_host (cudf::host_span< std::reference_wrapper< cudf::io::datasource > const > datasources, cudf::host_span< byte_range_info const > page_index_bytes_per_source) |
| Fetches host buffers of Parquet page index bytes from multiple input data sources. | |
| std::tuple< std::vector< rmm::device_buffer >, std::vector< cudf::device_span< uint8_t const > >, std::future< void > > | cudf::io::parquet::fetch_byte_ranges_to_device_async (cudf::io::datasource &datasource, cudf::host_span< byte_range_info const > byte_ranges, rmm::cuda_stream_view stream, rmm::device_async_resource_ref mr) |
| Fetches a list of byte ranges from a datasource into device buffers. | |
| std::tuple< std::vector< rmm::device_buffer >, std::vector< std::vector< cudf::device_span< uint8_t const > > >, std::future< void > > | cudf::io::parquet::fetch_byte_ranges_to_device_async (cudf::host_span< std::reference_wrapper< cudf::io::datasource > const > datasources, cudf::host_span< std::vector< byte_range_info > const > byte_ranges_per_source, rmm::cuda_stream_view stream, rmm::device_async_resource_ref mr) |
| Fetches lists of byte ranges from multiple datasources into device buffers. | |
| std::tuple<std::vector<rmm::device_buffer>, std::vector<std::vector<cudf::device_span<uint8_t const> > >, std::future<void> > cudf::io::parquet::fetch_byte_ranges_to_device_async | ( | cudf::host_span< std::reference_wrapper< cudf::io::datasource > const > | datasources, |
| cudf::host_span< std::vector< byte_range_info > const > | byte_ranges_per_source, | ||
| rmm::cuda_stream_view | stream, | ||
| rmm::device_async_resource_ref | mr | ||
| ) |
Fetches lists of byte ranges from multiple datasources into device buffers.
| datasources | Input datasources |
| byte_ranges_per_source | Byte ranges to fetch, one vector of ranges per datasource |
| stream | CUDA stream |
| mr | Device memory resource |
| std::tuple<std::vector<rmm::device_buffer>, std::vector<cudf::device_span<uint8_t const> >, std::future<void> > cudf::io::parquet::fetch_byte_ranges_to_device_async | ( | cudf::io::datasource & | datasource, |
| cudf::host_span< byte_range_info const > | byte_ranges, | ||
| rmm::cuda_stream_view | stream, | ||
| rmm::device_async_resource_ref | mr | ||
| ) |
Fetches a list of byte ranges from a datasource into device buffers.
| datasource | Input datasource |
| byte_ranges | Byte ranges to fetch |
| stream | CUDA stream |
| mr | Device memory resource |
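
The return type bundles the owning buffers, non-owning spans over the fetched bytes, and a `std::future<void>`. The sketch below is a host-only analogue of that shape, assuming (this page does not state it) that the future signals completion of the reads and should be waited on before the spans are consumed. `byte_range`, `byte_view`, and `fetch_ranges_async` are hypothetical names; no CUDA stream or memory resource is involved, and this is not the libcudf implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <future>
#include <tuple>
#include <utility>
#include <vector>

// Hypothetical stand-ins for byte_range_info and cudf::device_span.
struct byte_range {
  std::size_t offset;
  std::size_t size;
};
struct byte_view {
  std::uint8_t const* data;
  std::size_t size;
};

using fetch_result = std::tuple<std::vector<std::vector<std::uint8_t>>,  // owning buffers
                                std::vector<byte_view>,                  // views into them
                                std::future<void>>;                      // read completion

// Hypothetical host-only analogue: allocate one buffer per range up front,
// then copy the bytes on another thread. `source` must outlive the future.
fetch_result fetch_ranges_async(std::vector<std::uint8_t> const& source,
                                std::vector<byte_range> const& ranges)
{
  std::vector<std::vector<std::uint8_t>> buffers;
  std::vector<byte_view> views;
  std::vector<std::uint8_t*> destinations;
  buffers.reserve(ranges.size());
  for (auto const& r : ranges) {
    buffers.emplace_back(r.size);  // zero-filled until the copy task runs
    views.push_back({buffers.back().data(), r.size});
    destinations.push_back(buffers.back().data());
  }

  // The copy runs asynchronously; the views are only meaningful once it is done.
  auto task = std::async(std::launch::async, [&source, ranges, destinations]() {
    for (std::size_t i = 0; i < ranges.size(); ++i) {
      if (ranges[i].size > 0) {
        std::memcpy(destinations[i], source.data() + ranges[i].offset, ranges[i].size);
      }
    }
  });

  return fetch_result(std::move(buffers), std::move(views), std::move(task));
}
```

A caller would keep the buffers alive for as long as the views are in use and call `get()` (or `wait()`) on the future before reading through any view.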
| std::unique_ptr<cudf::io::datasource::buffer> cudf::io::parquet::fetch_footer_to_host | ( | cudf::io::datasource & | datasource | ) |
Fetches a host buffer of Parquet footer bytes from the input data source.
| datasource | Input data source |
| std::vector<std::unique_ptr<cudf::io::datasource::buffer> > cudf::io::parquet::fetch_footers_to_host | ( | cudf::host_span< std::reference_wrapper< cudf::io::datasource > const > | datasources | ) |
Fetches host buffers of Parquet footer bytes from multiple input data sources.
| datasources | Input data sources |
| cudf::logic_error | if any datasource contains a corrupted Parquet magic number, header or footer, or has an invalid footer length. |
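
To make the failure conditions above concrete, the following standalone sketch locates the footer in an in-memory file image. It relies only on the unencrypted Parquet file layout: a 4-byte `PAR1` magic at both ends of the file, with a 4-byte little-endian footer length immediately before the trailing magic. `footer_range` and `locate_footer` are hypothetical names, and `std::logic_error` stands in for `cudf::logic_error`; this is an illustration of the checks, not the libcudf implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <stdexcept>
#include <vector>

// Hypothetical stand-in for a byte range inside a file image.
struct footer_range {
  std::size_t offset;  // position of the first footer byte
  std::size_t size;    // footer length in bytes
};

// Hypothetical helper (not a libcudf function): validates the magic numbers
// and the footer length, then returns where the footer bytes live.
footer_range locate_footer(std::vector<std::uint8_t> const& file)
{
  constexpr std::size_t magic_size  = 4;  // "PAR1"
  constexpr std::size_t length_size = 4;  // little-endian uint32
  constexpr std::size_t min_size    = 2 * magic_size + length_size;
  char const magic[]                = "PAR1";

  if (file.size() < min_size) { throw std::logic_error("file too small to be Parquet"); }

  std::uint8_t const* head = file.data();
  std::uint8_t const* tail = file.data() + file.size() - magic_size;
  if (std::memcmp(head, magic, magic_size) != 0) {
    throw std::logic_error("corrupted header magic");
  }
  if (std::memcmp(tail, magic, magic_size) != 0) {
    throw std::logic_error("corrupted footer magic");
  }

  // Decode the little-endian footer length stored just before the trailing magic.
  std::uint8_t const* len = tail - length_size;
  std::uint32_t const footer_len = static_cast<std::uint32_t>(len[0]) |
                                   (static_cast<std::uint32_t>(len[1]) << 8) |
                                   (static_cast<std::uint32_t>(len[2]) << 16) |
                                   (static_cast<std::uint32_t>(len[3]) << 24);

  // The footer must be non-empty and fit between the header magic and the length field.
  if (footer_len == 0 || footer_len > file.size() - min_size) {
    throw std::logic_error("invalid footer length");
  }

  return footer_range{file.size() - magic_size - length_size - footer_len, footer_len};
}
```

Reading only the last 8 bytes first, and then the `footer_len` bytes before them, is what lets a reader fetch the footer without loading the whole file.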
| std::unique_ptr<cudf::io::datasource::buffer> cudf::io::parquet::fetch_page_index_to_host | ( | cudf::io::datasource & | datasource, |
| byte_range_info const | page_index_bytes | ||
| ) |
Fetches a host buffer of Parquet page index bytes from the input data source.
| datasource | Input datasource |
| page_index_bytes | Byte range of page index |
| std::vector<std::unique_ptr<cudf::io::datasource::buffer> > cudf::io::parquet::fetch_page_indexes_to_host | ( | cudf::host_span< std::reference_wrapper< cudf::io::datasource > const > | datasources, |
| cudf::host_span< byte_range_info const > | page_index_bytes_per_source | ||
| ) |
Fetches host buffers of Parquet page index bytes from multiple input data sources.
| datasources | Input datasources |
| page_index_bytes_per_source | Byte ranges of page index, one per datasource |
| cudf::logic_error | if the number of datasources does not match the number of page index byte ranges |
| std::out_of_range | if any page index byte range is out of range for its datasource |
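
The two exceptions above correspond to two separate input checks. A minimal standalone sketch of such checks follows, assuming each datasource is represented only by its size in bytes. `byte_range` and `validate_page_index_ranges` are hypothetical names, and `std::logic_error` stands in for `cudf::logic_error`; the actual libcudf checks may differ.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical stand-in for byte_range_info: an offset and a size in bytes.
struct byte_range {
  std::size_t offset;
  std::size_t size;
};

// Hypothetical validation helper (not a libcudf function).
// source_sizes[i] is the total size in bytes of the i-th datasource.
void validate_page_index_ranges(std::vector<std::size_t> const& source_sizes,
                                std::vector<byte_range> const& ranges)
{
  // One page index byte range is expected per datasource.
  if (source_sizes.size() != ranges.size()) {
    throw std::logic_error("number of datasources and page index byte ranges differ");
  }

  for (std::size_t i = 0; i < ranges.size(); ++i) {
    byte_range const& r = ranges[i];
    // Written to avoid overflow in offset + size.
    if (r.offset > source_sizes[i] || r.size > source_sizes[i] - r.offset) {
      throw std::out_of_range("page index byte range exceeds its datasource");
    }
  }
}
```

Because `std::out_of_range` derives from `std::logic_error`, a caller that wants to tell the two cases apart should place the `std::out_of_range` handler before any handler for a `std::logic_error` base.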