I/O
I/O Utility Classes
- pylibcudf.io.types.ColumnEncoding
See also cudf::column_encoding.
Enum members
USE_DEFAULT
DICTIONARY
PLAIN
DELTA_BINARY_PACKED
DELTA_LENGTH_BYTE_ARRAY
DELTA_BYTE_ARRAY
BYTE_STREAM_SPLIT
DIRECT
DIRECT_V2
DICTIONARY_V2
- pylibcudf.io.types.CompressionType
See also cudf::compression_type.
Enum members
NONE
AUTO
SNAPPY
GZIP
BZIP2
BROTLI
ZIP
XZ
ZLIB
LZ4
LZO
ZSTD
- pylibcudf.io.types.DictionaryPolicy
See also cudf::dictionary_policy.
Enum members
NEVER
ADAPTIVE
ALWAYS
- pylibcudf.io.types.JSONRecoveryMode
See also cudf::json_recovery_mode_t.
Enum members
FAIL
RECOVER_WITH_NULL
- pylibcudf.io.types.QuoteStyle
See also cudf::quote_style.
Enum members
MINIMAL
ALL
NONNUMERIC
NONE
- class pylibcudf.io.types.SinkInfo(list sinks)
A class containing details on a sink to write to.
For details, see cudf::io::sink_info.
- Parameters:
- sinks : list of str, PathLike, BytesIO, StringIO
A homogeneous list of sinks to write to. Each sink can be a string filename, a path-like object, or one of the Python I/O classes (BytesIO or StringIO).
Mixing different types of sinks will raise a ValueError.
- class pylibcudf.io.types.SourceInfo(list sources)
A class containing details on a source to read from.
For details, see cudf::io::source_info.
- Parameters:
- sources : List[Union[str, os.PathLike, bytes, io.BytesIO, DataSource]]
A homogeneous list of sources to read from.
Mixing different types of sources will raise a ValueError.
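The homogeneity rule described for SourceInfo and SinkInfo can be illustrated with a small pure-Python sketch. The helper `check_homogeneous` below is hypothetical and is not part of pylibcudf; it only mimics the documented behavior (a ValueError on mixed types). Treating `str` and `os.PathLike` as the same kind is an assumption made for this illustration.

```python
import io
import os


def check_homogeneous(items):
    """Hypothetical helper: return the common kind of `items`,
    or raise ValueError if different kinds are mixed."""

    def kind(item):
        # Assumption: plain strings and path-like objects both count as paths.
        if isinstance(item, (str, os.PathLike)):
            return "path"
        return type(item).__name__

    kinds = {kind(item) for item in items}
    if len(kinds) > 1:
        raise ValueError(f"Mixed types are not allowed: {sorted(kinds)}")
    # An empty list has no kind at all.
    return kinds.pop() if kinds else None


# All file names: accepted.
print(check_homogeneous(["a.parquet", "b.parquet"]))

# A file name mixed with an in-memory buffer: rejected.
try:
    check_homogeneous(["a.parquet", io.BytesIO(b"data")])
except ValueError as err:
    print("rejected:", err)
```

In practice the list would be passed directly to SourceInfo or SinkInfo, which perform their own validation; the sketch only shows what "homogeneous" means for the caller.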
- pylibcudf.io.types.StatisticsFreq
See also cudf::statistics_freq.
Enum members
STATISTICS_NONE
STATISTICS_ROWGROUP
STATISTICS_PAGE
STATISTICS_COLUMN
- class pylibcudf.io.types.TableWithMetadata(Table tbl, list column_names)
A container holding a table and its associated metadata (e.g. column names).
For details, see cudf::io::table_with_metadata.
- Parameters:
- tbl : Table
The input table.
- column_names : list
A list of tuples, each containing the name of a column and the names of its child columns (in the same format), e.g. [("id", []), ("name", [("first", []), ("last", [])])]
Attributes
child_names: Return a dictionary mapping the names of columns with children to the names of their child columns
columns: Return a list containing the columns of the table
per_file_user_data: Return a list containing, for each file read, a dict of file-format-specific metadata
tbl: pylibcudf.table.Table
Methods
column_names(self[, include_children]): Return a list containing the column names of the table
- child_names
Return a dictionary mapping the names of columns with children to the names of their child columns
- column_names(self, include_children=False)
Return a list containing the column names of the table
- columns
Return a list containing the columns of the table
- per_file_user_data
Return a list containing, for each file read, a dict of file-format-specific metadata.
- tbl
tbl: pylibcudf.table.Table
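The nested `column_names` format accepted by TableWithMetadata (a list of `(name, children)` tuples, where `children` uses the same format) can be explored with plain Python. The two helpers below, `top_level_names` and `direct_children`, are hypothetical and written only for this illustration; they are not pylibcudf functions, and the exact structure returned by the real `column_names` method and `child_names` attribute may differ.

```python
# The example schema from the parameter description above.
schema = [("id", []), ("name", [("first", []), ("last", [])])]


def top_level_names(column_names):
    """Hypothetical helper: names of the top-level columns only."""
    return [name for name, _children in column_names]


def direct_children(column_names):
    """Hypothetical helper: map each column that has children
    to the names of its direct child columns."""
    return {
        name: [child_name for child_name, _grandchildren in children]
        for name, children in column_names
        if children
    }


print(top_level_names(schema))   # ['id', 'name']
print(direct_children(schema))   # {'name': ['first', 'last']}
```

A leaf column is written with an empty child list, so a flat table with columns `a` and `b` would be described as `[("a", []), ("b", [])]`.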