I/O#
I/O Utility Classes#
- pylibcudf.io.types.ColumnEncoding#
See also
cudf::column_encoding
.Enum members
USE_DEFAULT
DICTIONARY
PLAIN
DELTA_BINARY_PACKED
DELTA_LENGTH_BYTE_ARRAY
DELTA_BYTE_ARRAY
BYTE_STREAM_SPLIT
DIRECT
DIRECT_V2
DICTIONARY_V2
- pylibcudf.io.types.CompressionType#
See also
cudf::compression_type
.Enum members
NONE
AUTO
SNAPPY
GZIP
BZIP2
BROTLI
ZIP
XZ
ZLIB
LZ4
LZO
ZSTD
- pylibcudf.io.types.DictionaryPolicy#
See also
cudf::dictionary_policy
.Enum members
NEVER
ADAPTIVE
ALWAYS
- pylibcudf.io.types.JSONRecoveryMode#
See also
cudf::json_recovery_mode_t
.Enum members
FAIL
RECOVER_WITH_NULL
- pylibcudf.io.types.QuoteStyle#
See also
cudf::quote_style
.Enum members
MINIMAL
ALL
NONNUMERIC
NONE
- class pylibcudf.io.types.SinkInfo(list sinks)#
A class containing details about destinations (sinks) to write data to.
For more details, see
cudf::io::sink_info
.- Parameters:
- sinkslist of str, PathLike, or io.IOBase instances
A list of sinks to write data to. Each sink can be:
A string representing a filename.
A PathLike object.
An instance of a Python I/O class that is a subclass of io.IOBase (eg., io.BytesIO, io.StringIO).
The list must be homogeneous in type unless all sinks are instances of subclasses of io.IOBase. Mixing different types of sinks (that are not all io.IOBase instances) will raise a ValueError.
- class pylibcudf.io.types.SourceInfo(list sources)#
A class containing details on a source to read from.
For details, see
cudf::io::source_info
.- Parameters:
- sourcesList[Union[str, os.PathLike, bytes, io.BytesIO, DataSource]]
A homogeneous list of sources to read from.
Mixing different types of sources will raise a ValueError.
- pylibcudf.io.types.StatisticsFreq#
See also
cudf::statistics_freq
.Enum members
STATISTICS_NONE
STATISTICS_ROWGROUP
STATISTICS_PAGE
STATISTICS_COLUMN
- class pylibcudf.io.types.TableWithMetadata(Table tbl, list column_names) A container holding a table and its associated metadata (e.g. column names)#
A container holding a table and its associated metadata (e.g. column names)
For details, see
cudf::io::table_with_metadata
.- Parameters:
- tblTable
The input table.
- column_nameslist
A list of tuples each containing the name of each column and the names of its child columns (in the same format). e.g. [(“id”, []), (“name”, [(“first”, []), (“last”, [])])]
Attributes
Return a dictionary mapping the names of columns with children to the names of their child columns
Return a list containing the columns of the table
Returns a list containing a dict containing file-format specific metadata, for each file being read in.
tbl: pylibcudf.table.Table
Methods
column_names
(self[, include_children])Return a list containing the column names of the table
- child_names#
Return a dictionary mapping the names of columns with children to the names of their child columns
- column_names(self, include_children=False)#
Return a list containing the column names of the table
- columns#
Return a list containing the columns of the table
- per_file_user_data#
Returns a list containing a dict containing file-format specific metadata, for each file being read in.
- tbl#
tbl: pylibcudf.table.Table