All Classes and Interfaces
Class
Description
Utility methods for breaking apart and reassembling 128-bit values during aggregations
to enable hash-based aggregations and detect overflows.
An Aggregation instance that also holds a column number and window metadata so the aggregation
can be done over a specific window.
Column builder from Arrow data.
Options for reading data in Arrow IPC format
Settings for writing Arrow IPC data.
This class is a Helper class to assert there are no non-empty nulls in a ColumnView
The reason for the existence of this class is so that we can turn the asserts on/off when needed
by passing "-da:ai.rapids.cudf.AssertEmptyNulls".
Base class of every node in an AST
Enumeration for the types of AST nodes that can appear in a serialized AST.
Options for reading an Avro file
Base class for all MemoryBuffers that are in device memory.
Multi-buffer compressor
Decompressor that operates on multiple input buffers in a batch
Multi-buffer LZ4 compressor
LZ4 decompressor that operates on multiple input buffers in a batch
Multi-buffer ZSTD compressor
ZSTD decompressor that operates on multiple input buffers in a batch
Mathematical binary operations.
A binary operation consisting of an operator and two operands.
Enumeration of AST operators that can appear in a binary operation.
Types of buffers supported by ColumnVectors and HostColumnVectors
Capture groups setting, closely following cudf::strings::capture_groups.
JNI interface to cudf::chunked_pack.
Utility class that wraps an array of closeable instances and can be closed
Base options class for input formats that can filter columns.
A reference to a column in an input table.
This class represents the immutable vector of data.
Interface to handle events for this ColumnVector.
Holds the off heap state of the column vector so we can clean it up, even if it is leaked.
This class represents the column_view of a column analogous to its cudf cpp counterpart.
Enum to choose behaviour of listIndexOf functions:
1.
Per column settings for writing Parquet/ORC files.
ColumnWriterOptions.AbstractStructBuilder<T extends ColumnWriterOptions.AbstractStructBuilder,V extends ColumnWriterOptions>
ColumnWriterOptions.NestedBuilder<T extends ColumnWriterOptions.NestedBuilder,V extends ColumnWriterOptions>
This class wraps a native compiled AST and must be closed to avoid native memory leaks.
CompressionMetadataWriterOptions.Builder<T extends CompressionMetadataWriterOptions.Builder,V extends CompressionMetadataWriterOptions>
Enumeration of compression formats.
Enumeration of data types that can be compressed.
Used to save groups and uniq key table for `Table.contiguousSplitGroupsAndGenUniqKeys`
Each row in uniq key table is corresponding to a group
Resource management note:
This class is the owner of `groups` and
`uniqKeysTable`(or uniqKeyColumns if table is not constructed)
1: Use `closeGroups` and `closeUniqKeyTable` to close the resources separately
if you want to close eagerly.
A table that is backed by a single contiguous device buffer.
Options for reading a CSV file
Options for writing a CSV file
A class representing a CUDA stream
This is the Java mapping of CUDA device compute modes.
Exception from the cuda language/library.
The Java mirror of cudaError, which facilities the tracking of CUDA errors in JVM.
CudaFatalException is a kind of CudaException which leaves the process in an inconsistent state
and any further CUDA work will return the same error.
Represent free and total device memory.
This class represents data allocated using `cudaMalloc` directly instead of the default RMM
memory resource.
Exception thrown when CUDF operation results in a column size
exceeding CUDF column size limits
Exception thrown by cudf itself.
JNI wrapper for accessing the cuFile API.
Represents a cuFile buffer.
Represents a cuFile file handle for reading.
Represents a cuFile file handle for reading.
Base class that can be used to provide data dynamically to CUDF.
Types of datetime components that may be extracted.
This class represents data in some form on the GPU.
This class represents data in some form on the GPU.
Used for the dropListDuplicates function
Specifies which duplicate to keep
This class tracks the data associated with a gather map, a buffer of INT32 elements that index
a source table and can be passed to a table gather operation.
An aggregation that can be used for a reduce.
A GroupByAggregation for a specific column in a table.
Options for groupby (see cudf::groupby::groupby's constructor)
An aggregation that can be used for a grouped scan.
A GroupByScanAggregation for a specific column in a table.
This class represents a hash table built from the join keys of the right-side table for a
join operation.
Hash algorithm identifiers, mirroring native enum cudf::hash_id
Provides a set of APIs for consuming host buffers.
Provides a set of APIs for providing host buffers to be read.
Similar to a ColumnVector, but the data is stored in host memory and accessible directly from
the JVM.
Build
Interface to handle events for this HostColumnVector.
A class that holds Host side Column Vector APIs and the OffHeapState.
Holds the off heap state of the column vector so we can clean it up, even if it is leaked.
This class holds an off-heap buffer in the host/CPU memory.
A wrapper around native host UDF aggregations.
Serialize and deserialize CUDF tables and columns using a custom format.
Class to hold the header and buffer pair result from host-side concatenation
Holds the metadata about a serialized column.
Holds the metadata about a serialized table.
Holds the result of deserializing a table.
Options for reading in JSON encoded data.
A literal value in an AST expression.
Abstract class for representing the Memory Buffer
NOTE: MemoryBuffer is public to make it easier to work with the class hierarchy,
subclassing beyond what is included in CUDF is not recommended and not supported.
Interface to handle events for this MemoryBuffer.
ColumnVectors may store data off heap, and because of complicated processing the life time of
an individual vector can vary a lot.
API that can be used to clean up the resources for a vector, even if there was a leak
This class tracks size information associated with a mixed table join.
This is a DataSource that can take multiple HostMemoryBuffers.
How should NaNs be compared in an operation.
This class will load the native dependencies.
How should nulls be compared in an operation.
Specify whether to include nulls or exclude nulls in an operation.
Exception thrown from nvcomp indicating a CUDA error occurred.
Base class for all nvcomp-specific exceptions
This class supports push/pop NVTX profiling ranges, or "scoped" ranges.
This class supports start/end NVTX profiling ranges.
Provide an interface for reading an ORC file in an iterative manner.
Options for reading a ORC file
This class represents settings for writing ORC files.
Provides the ordering for specific columns.
Policy to account for possible out-of-bounds indices
`NULLIFY` means to nullify output values corresponding to out-of-bounds gather map values.
Metadata for a table that is backed by a single contiguous device buffer.
Provide an interface for reading a Parquet file in an iterative manner.
Options for reading a parquet file
This class represents settings for writing Parquet files.
Class to provide a PartitionedTable
This is the JNI interface to a rmm::pool_memory_resource<rmm::pinned_host_memory_resource>.
Interpolation method to use when the desired quantile lies between
two data points i and j.
Quote style for CSV records, closely following cudf::io::quote_style.
Helper utility for creating ranges.
An aggregation that can be used for a reduce.
Regex flags setting, closely following cudf::strings::regex_flags.
Regex program class, closely following cudf::strings::regex_program.
Policy to specify the position of replacement values relative to null rows.
A replacement policy for a specific column
This is the binding class for rmm lib.
What to send RMM alloc and free logs to.
A device memory resource that will pre-allocate a pool of resources and sub-allocate from this
pool to improve memory performance.
A device memory resource that uses `cudaMallocAsync` and `cudaFreeAsync` for allocation and
deallocation.
A device memory resource that uses `cudaMalloc` and `cudaFree` for allocation and deallocation.
A resource that allocates/deallocates device memory.
A device memory resource that will give callbacks in specific situations.
Exception from RMM allocator.
A device memory resource that will limit the maximum amount allocated.
A device memory resource that will log interactions.
A device memory resource that uses `cudaMallocManaged` and `cudaFreeManaged` for allocation and
deallocation.
A device memory resource that will pre-allocate a pool of resources and sub-allocate from this
pool to improve memory performance.
A device memory resource that will track some basic statistics about the memory usage.
A resource that wraps another RmmDeviceMemoryResource
An aggregation that can be used on rolling windows.
A RollingAggregation for a specific column in a table.
Rounding modes supported in round method.
A single scalar value.
An aggregation that can be used for a scan.
Scan operation type.
The schema of data to be read in.
An aggregation that can be used for a reduce.
Provides an interface for reading multiple tables from a single input source.
Class to represent a collection of ColumnVectors and operations that can be performed on them
collectively.
Enum to specify which of duplicate rows/elements will be copied to the output.
Class representing groupby operations
Create a table on the GPU with data from the CPU.
Enumeration of tables that can be referenced in an AST.
A table along with some metadata about the table.
Provides an interface for writing out Table information in multiple steps.
Mathematical unary operations.
A unary operation consisting of an operator and an operand.
Enumeration of AST operators that can appear in a unary operation.
Options for rolling windows.