Writers

Files

file  csv.hpp
 
file  io/json.hpp
 
file  orc.hpp
 
file  parquet.hpp
 

Classes

class  cudf::io::csv_writer_options
 Settings to use for write_csv().

class  cudf::io::csv_writer_options_builder
 Builder to build options for write_csv().

class  cudf::io::json_writer_options
 Settings to use for write_json().

class  cudf::io::json_writer_options_builder
 Builder to build options for write_json().
 
class  cudf::io::orc_writer_options
 Settings to use for write_orc().

class  cudf::io::orc_writer_options_builder
 Builds settings to use for write_orc().

class  cudf::io::chunked_orc_writer_options
 Settings to use for write_orc_chunked().

class  cudf::io::chunked_orc_writer_options_builder
 Builds settings to use for write_orc_chunked().

class  cudf::io::orc_chunked_writer
 Chunked ORC writer class that writes an ORC file in chunked (streamed) form.
 
struct  cudf::io::sorting_column
 Struct used to describe column sorting metadata.

class  cudf::io::parquet_writer_options_base
 Base settings for write_parquet() and parquet_chunked_writer.

class  cudf::io::parquet_writer_options_builder_base< BuilderT, OptionsT >
 Base class for Parquet options builders.

class  cudf::io::parquet_writer_options
 Settings for write_parquet().

class  cudf::io::parquet_writer_options_builder
 Class to build parquet_writer_options.

class  cudf::io::chunked_parquet_writer_options
 Settings for parquet_chunked_writer.

class  cudf::io::chunked_parquet_writer_options_builder
 Class to build chunked_parquet_writer_options.

class  cudf::io::parquet_chunked_writer
 Chunked Parquet writer class that handles options and writes tables in chunks.
 

Functions

void cudf::io::write_csv (csv_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream())
 Writes a set of columns to CSV format.

void cudf::io::write_json (json_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream())
 Writes a set of columns to JSON format.

void cudf::io::write_orc (orc_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream())
 Writes a set of columns to ORC format.

std::unique_ptr< std::vector< uint8_t > > cudf::io::write_parquet (parquet_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream())
 Writes a set of columns to Parquet format.

std::unique_ptr< std::vector< uint8_t > > cudf::io::merge_row_group_metadata (std::vector< std::unique_ptr< std::vector< uint8_t >>> const &metadata_list)
 Merges multiple raw metadata blobs that were previously created by write_parquet into a single metadata blob.
 

Function Documentation

◆ merge_row_group_metadata()

std::unique_ptr<std::vector<uint8_t> > cudf::io::merge_row_group_metadata ( std::vector< std::unique_ptr< std::vector< uint8_t >>> const &  metadata_list)

Merges multiple raw metadata blobs that were previously created by write_parquet into a single metadata blob.

Parameters
[in]  metadata_list  List of input file metadata
Returns
A parquet-compatible blob that contains the data for all row groups in the list
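
As a hedged illustration of preparing the metadata_list argument, the sketch below gathers blobs of the type returned by write_parquet() into one list and skips null or empty ones. It uses only the standard library: collect_metadata and example_metadata_list are hypothetical helpers, not part of cudf, and the byte values are placeholders rather than real Parquet metadata. The merge call itself is left as a comment because it needs libcudf and a GPU.

```cpp
#include <cstdint>
#include <memory>
#include <utility>
#include <vector>

// Same blob type that write_parquet() returns.
using metadata_blob = std::unique_ptr<std::vector<std::uint8_t>>;

// Hypothetical helper (not a cudf function): moves every non-null, non-empty
// blob into the list type that merge_row_group_metadata() takes.
std::vector<metadata_blob> collect_metadata(std::vector<metadata_blob> blobs)
{
  std::vector<metadata_blob> metadata_list;
  for (auto& blob : blobs) {
    if (blob && !blob->empty()) { metadata_list.push_back(std::move(blob)); }
  }
  return metadata_list;
}

// Placeholder inputs standing in for the results of three write_parquet() calls.
std::vector<metadata_blob> example_metadata_list()
{
  std::vector<metadata_blob> blobs;
  blobs.push_back(std::make_unique<std::vector<std::uint8_t>>(3, std::uint8_t{7}));
  blobs.push_back(std::make_unique<std::vector<std::uint8_t>>());  // empty blob
  blobs.push_back(std::make_unique<std::vector<std::uint8_t>>(2, std::uint8_t{9}));

  auto metadata_list = collect_metadata(std::move(blobs));
  // With libcudf available, the list would then be passed on:
  // auto merged = cudf::io::merge_row_group_metadata(metadata_list);
  return metadata_list;
}
```

Filtering before the merge is a design choice of this sketch only; the section does not state how merge_row_group_metadata() treats empty inputs.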

◆ write_csv()

void cudf::io::write_csv ( csv_writer_options const &  options,
rmm::cuda_stream_view  stream = cudf::get_default_stream() 
)

Writes a set of columns to CSV format.

The following code snippet demonstrates how to write columns to a file:

auto destination = cudf::io::sink_info("dataset.csv");
// Chain the settings on the builder, then build() the final options object.
auto options = cudf::io::csv_writer_options::builder(destination, table->view())
  .na_rep(na)
  .include_header(include_header)
  .rows_per_chunk(rows_per_chunk)
  .build();

cudf::io::write_csv(options);

Parameters
options  Settings for controlling writing behavior
stream  CUDA stream used for device memory operations and kernel launches
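
Each options class listed above is paired with a matching *_builder class. The following stand-alone sketch shows the general shape of that options/builder idiom so the chained calls are easier to read. Every name in it (toy_writer_options, toy_writer_options_builder, describe) is hypothetical and carries only illustrative settings; it is not the cudf implementation.

```cpp
#include <string>
#include <utility>

class toy_writer_options_builder;

// Hypothetical options class: only the builder can construct it with a path.
class toy_writer_options {
 public:
  static toy_writer_options_builder builder(std::string path);
  std::string const& path() const { return path_; }
  std::string const& na_rep() const { return na_rep_; }
  bool include_header() const { return include_header_; }

 private:
  friend class toy_writer_options_builder;
  explicit toy_writer_options(std::string path) : path_(std::move(path)) {}
  std::string path_;
  std::string na_rep_{};
  bool include_header_{true};
};

// Hypothetical builder: each setter returns *this so calls can be chained.
class toy_writer_options_builder {
 public:
  explicit toy_writer_options_builder(std::string path) : options_(std::move(path)) {}
  toy_writer_options_builder& na_rep(std::string value)
  {
    options_.na_rep_ = std::move(value);
    return *this;
  }
  toy_writer_options_builder& include_header(bool value)
  {
    options_.include_header_ = value;
    return *this;
  }
  // Hands the finished options object to the caller.
  toy_writer_options&& build() { return std::move(options_); }

 private:
  toy_writer_options options_;
};

inline toy_writer_options_builder toy_writer_options::builder(std::string path)
{
  return toy_writer_options_builder{std::move(path)};
}

// Stand-in for a write function that takes finished options by const reference.
std::string describe(toy_writer_options const& options)
{
  return options.path() + "|" + options.na_rep() + "|" +
         (options.include_header() ? "header" : "no-header");
}
```

A caller would chain the setters and finish with build(), for example `toy_writer_options::builder("dataset.csv").na_rep("NA").build()`, then pass the result to the write function.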

◆ write_json()

void cudf::io::write_json ( json_writer_options const &  options,
rmm::cuda_stream_view  stream = cudf::get_default_stream() 
)

Writes a set of columns to JSON format.

The following code snippet demonstrates how to write columns to a file:

auto destination = cudf::io::sink_info("dataset.json");
// Chain the settings on the builder, then build() the final options object.
auto options = cudf::io::json_writer_options::builder(destination, table->view())
  .na_rep(na)
  .lines(lines)
  .rows_per_chunk(rows_per_chunk)
  .build();

cudf::io::write_json(options);

Parameters
options  Settings for controlling writing behavior
stream  CUDA stream used for device memory operations and kernel launches
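
The CSV and JSON snippets both set rows_per_chunk, which this section does not define. Assuming it caps the number of rows handled per write, the sketch below shows how a row count would split into such chunks. chunk_ranges is a hypothetical helper that illustrates the arithmetic only; it is not how cudf is implemented.

```cpp
#include <algorithm>
#include <cstddef>
#include <utility>
#include <vector>

// Returns half-open [begin, end) row ranges, each holding at most
// rows_per_chunk rows. Hypothetical helper, not a cudf function.
std::vector<std::pair<std::size_t, std::size_t>> chunk_ranges(std::size_t num_rows,
                                                              std::size_t rows_per_chunk)
{
  std::vector<std::pair<std::size_t, std::size_t>> ranges;
  if (rows_per_chunk == 0) { return ranges; }  // guard against an endless loop
  for (std::size_t begin = 0; begin < num_rows; begin += rows_per_chunk) {
    ranges.emplace_back(begin, std::min(begin + rows_per_chunk, num_rows));
  }
  return ranges;
}
```

Under this assumption, 10 rows with rows_per_chunk set to 4 yield three ranges, the last one covering rows 8 and 9.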

◆ write_orc()

void cudf::io::write_orc ( orc_writer_options const &  options,
rmm::cuda_stream_view  stream = cudf::get_default_stream() 
)

Writes a set of columns to ORC format.

The following code snippet demonstrates how to write columns to a file:

auto destination = cudf::io::sink_info("dataset.orc");
// build() turns the builder into the final options object.
auto options = cudf::io::orc_writer_options::builder(destination, table->view()).build();

cudf::io::write_orc(options);

Parameters
options  Settings for controlling writing behavior
stream  CUDA stream used for device memory operations and kernel launches

◆ write_parquet()

std::unique_ptr<std::vector<uint8_t> > cudf::io::write_parquet ( parquet_writer_options const &  options,
rmm::cuda_stream_view  stream = cudf::get_default_stream() 
)

Writes a set of columns to Parquet format.

The following code snippet demonstrates how to write columns to a file:

auto destination = cudf::io::sink_info("dataset.parquet");
// build() turns the builder into the final options object.
auto options = cudf::io::parquet_writer_options::builder(destination, table->view()).build();

cudf::io::write_parquet(options);

Parameters
options  Settings for controlling writing behavior
stream  CUDA stream used for device memory operations and kernel launches
Returns
A blob that contains the file metadata (Parquet FileMetaData Thrift message) if requested in parquet_writer_options; an empty blob otherwise.