I/O#

I/O Utility Classes#

pylibcudf.io.types.ColumnEncoding#

See also cudf::column_encoding.

Enum members

  • USE_DEFAULT

  • DICTIONARY

  • PLAIN

  • DELTA_BINARY_PACKED

  • DELTA_LENGTH_BYTE_ARRAY

  • DELTA_BYTE_ARRAY

  • BYTE_STREAM_SPLIT

  • DIRECT

  • DIRECT_V2

  • DICTIONARY_V2

pylibcudf.io.types.CompressionType#

See also cudf::compression_type.

Enum members

  • NONE

  • AUTO

  • SNAPPY

  • GZIP

  • BZIP2

  • BROTLI

  • ZIP

  • XZ

  • ZLIB

  • LZ4

  • LZO

  • ZSTD

pylibcudf.io.types.DictionaryPolicy#

See also cudf::dictionary_policy.

Enum members

  • NEVER

  • ADAPTIVE

  • ALWAYS

pylibcudf.io.types.JSONRecoveryMode#

See also cudf::json_recovery_mode_t.

Enum members

  • FAIL

  • RECOVER_WITH_NULL

pylibcudf.io.types.QuoteStyle#

See also cudf::quote_style.

Enum members

  • MINIMAL

  • ALL

  • NONNUMERIC

  • NONE

class pylibcudf.io.types.SinkInfo(list sinks)#

A class containing details about destinations (sinks) to write data to.

For more details, see cudf::io::sink_info.

Parameters:
sinkslist of str, PathLike, or io.IOBase instances

A list of sinks to write data to. Each sink can be:

  • A string representing a filename.

  • A PathLike object.

  • An instance of a Python I/O class that is a subclass of io.IOBase (eg., io.BytesIO, io.StringIO).

The list must be homogeneous in type unless all sinks are instances of subclasses of io.IOBase. Mixing different types of sinks (that are not all io.IOBase instances) will raise a ValueError.

class pylibcudf.io.types.SourceInfo(list sources)#

A class containing details on a source to read from.

For details, see cudf::io::source_info.

Parameters:
sourcesList[Union[str, os.PathLike, bytes, io.BytesIO, DataSource]]

A homogeneous list of sources to read from.

Mixing different types of sources will raise a ValueError.

pylibcudf.io.types.StatisticsFreq#

See also cudf::statistics_freq.

Enum members

  • STATISTICS_NONE

  • STATISTICS_ROWGROUP

  • STATISTICS_PAGE

  • STATISTICS_COLUMN

class pylibcudf.io.types.TableWithMetadata(Table tbl, list column_names) A container holding a table and its associated metadata (e.g. column names)#

A container holding a table and its associated metadata (e.g. column names)

For details, see cudf::io::table_with_metadata.

Parameters:
tblTable

The input table.

column_nameslist

A list of tuples each containing the name of each column and the names of its child columns (in the same format). e.g. [(“id”, []), (“name”, [(“first”, []), (“last”, [])])]

Attributes

child_names

Return a dictionary mapping the names of columns with children to the names of their child columns

columns

Return a list containing the columns of the table

per_file_user_data

Returns a list containing a dict containing file-format specific metadata, for each file being read in.

tbl

tbl: pylibcudf.table.Table

Methods

column_names(self[, include_children])

Return a list containing the column names of the table

child_names#

Return a dictionary mapping the names of columns with children to the names of their child columns

column_names(self, include_children=False)#

Return a list containing the column names of the table

columns#

Return a list containing the columns of the table

per_file_user_data#

Returns a list containing a dict containing file-format specific metadata, for each file being read in.

tbl#

tbl: pylibcudf.table.Table

I/O Functions#