I/O#

I/O Utility Classes#

pylibcudf.io.types.ColumnEncoding#

See also cudf::column_encoding.

Enum members

  • USE_DEFAULT

  • DICTIONARY

  • PLAIN

  • DELTA_BINARY_PACKED

  • DELTA_LENGTH_BYTE_ARRAY

  • DELTA_BYTE_ARRAY

  • BYTE_STREAM_SPLIT

  • DIRECT

  • DIRECT_V2

  • DICTIONARY_V2

pylibcudf.io.types.CompressionType#

See also cudf::compression_type.

Enum members

  • NONE

  • AUTO

  • SNAPPY

  • GZIP

  • BZIP2

  • BROTLI

  • ZIP

  • XZ

  • ZLIB

  • LZ4

  • LZO

  • ZSTD

pylibcudf.io.types.DictionaryPolicy#

See also cudf::dictionary_policy.

Enum members

  • NEVER

  • ADAPTIVE

  • ALWAYS

pylibcudf.io.types.JSONRecoveryMode#

See also cudf::json_recovery_mode_t.

Enum members

  • FAIL

  • RECOVER_WITH_NULL

pylibcudf.io.types.QuoteStyle#

See also cudf::quote_style.

Enum members

  • MINIMAL

  • ALL

  • NONNUMERIC

  • NONE

class pylibcudf.io.types.SinkInfo(list sinks)#

A class containing details on a source to read from.

For details, see cudf::io::sink_info.

Parameters:
sinkslist of str, PathLike, BytesIO, StringIO

A homogeneous list of sinks (this can be a string filename, bytes, or one of the Python I/O classes) to read from.

Mixing different types of sinks will raise a ValueError.

class pylibcudf.io.types.SourceInfo(list sources)#

A class containing details on a source to read from.

For details, see cudf::io::source_info.

Parameters:
sourcesList[Union[str, os.PathLike, bytes, io.BytesIO, DataSource]]

A homogeneous list of sources to read from.

Mixing different types of sources will raise a ValueError.

pylibcudf.io.types.StatisticsFreq#

See also cudf::statistics_freq.

Enum members

  • STATISTICS_NONE

  • STATISTICS_ROWGROUP

  • STATISTICS_PAGE

  • STATISTICS_COLUMN

class pylibcudf.io.types.TableWithMetadata(Table tbl, list column_names) A container holding a table and its associated metadata (e.g. column names)#

A container holding a table and its associated metadata (e.g. column names)

For details, see cudf::io::table_with_metadata.

Parameters:
tblTable

The input table.

column_nameslist

A list of tuples each containing the name of each column and the names of its child columns (in the same format). e.g. [(“id”, []), (“name”, [(“first”, []), (“last”, [])])]

Attributes

child_names

Return a dictionary mapping the names of columns with children to the names of their child columns

columns

Return a list containing the columns of the table

per_file_user_data

Returns a list containing a dict containing file-format specific metadata, for each file being read in.

tbl

tbl: pylibcudf.table.Table

Methods

column_names(self[, include_children])

Return a list containing the column names of the table

child_names#

Return a dictionary mapping the names of columns with children to the names of their child columns

column_names(self, include_children=False)#

Return a list containing the column names of the table

columns#

Return a list containing the columns of the table

per_file_user_data#

Returns a list containing a dict containing file-format specific metadata, for each file being read in.

tbl#

tbl: pylibcudf.table.Table

I/O Functions#