partitioning

pylibcudf.partitioning.hash_partition(Table input, list columns_to_hash, int num_partitions) → tuple

Partitions rows from the input table into multiple output tables.

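Parameters:

input (Table): The table to partition
columns_to_hash (list[int]): Indices of the columns of input to hash
num_partitions (int): The number of partitions to use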

Returns:

tuple[Table, list[int]]: An output table and a list of row offsets to each partition
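
The return shape described above can be modelled in plain Python. This is a minimal sketch of the semantics only, not pylibcudf's implementation: hash_partition_sketch is a hypothetical helper, rows are modelled as tuples, Python's built-in hash() stands in for the library's hash function (so the partition a given key lands in will differ from the real output), and the offsets list is assumed to hold one start offset per partition.

```python
def hash_partition_sketch(rows, columns_to_hash, num_partitions):
    """Pure-Python model: group rows by a hash of the selected columns.

    Returns (reordered_rows, offsets), where offsets[i] is assumed to be
    the index of the first row of partition i in reordered_rows.
    """
    buckets = [[] for _ in range(num_partitions)]
    for row in rows:
        key = tuple(row[i] for i in columns_to_hash)
        # Assumption: built-in hash() replaces the library's hash function.
        buckets[hash(key) % num_partitions].append(row)

    reordered, offsets = [], []
    for bucket in buckets:
        offsets.append(len(reordered))  # start row of this partition
        reordered.extend(bucket)
    return reordered, offsets


rows = [(1, "a"), (2, "b"), (1, "c"), (3, "d"), (2, "e")]
hashed_rows, hashed_offsets = hash_partition_sketch(rows, [0], 3)
```

Rows whose hashed columns are equal always end up in the same partition, which is the property that makes hash partitioning useful ahead of joins and group-bys.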

pylibcudf.partitioning.partition(Table t, Column partition_map, int num_partitions) → tuple

Partitions rows of t according to the mapping specified by partition_map.

For details, see the libcudf C++ function cudf::partition().

Parameters:

t (Table): The table to partition
partition_map (Column): Non-nullable column of integer values that map each row in t to its partition
num_partitions (int): The total number of partitions

Returns:

tuple[Table, list[int]]: An output table and a list of row offsets to each partition
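
To illustrate how partition_map and the returned offsets relate, here is a pure-Python sketch of the semantics. It is not the library's implementation: partition_sketch and split_by_offsets are hypothetical helpers, a single column of integers stands in for the table, rows keep their input order within each partition (an assumption; the library may order them differently), and the offsets list is assumed to hold one start offset per partition with no trailing end offset.

```python
def partition_sketch(rows, partition_map, num_partitions):
    """Pure-Python model: reorder rows so each partition is contiguous."""
    if len(rows) != len(partition_map):
        raise ValueError("partition_map needs one entry per row")
    buckets = [[] for _ in range(num_partitions)]
    for row, part in zip(rows, partition_map):
        buckets[part].append(row)

    reordered, offsets = [], []
    for bucket in buckets:
        offsets.append(len(reordered))  # start row of this partition
        reordered.extend(bucket)
    return reordered, offsets


def split_by_offsets(rows, offsets):
    """Slice the reordered rows back into one list per partition."""
    bounds = list(offsets) + [len(rows)]
    return [rows[bounds[i]:bounds[i + 1]] for i in range(len(offsets))]


values = [10, 12, 14, 16, 18, 20, 22, 24, 26, 28]
mapping = [3, 1, 1, 4, 1, 1, 2, 2, 1, 4]
part_rows, part_offsets = partition_sketch(values, mapping, 5)
# part_rows    == [12, 14, 18, 20, 26, 22, 24, 10, 16, 28]
# part_offsets == [0, 0, 5, 7, 8]
pieces = split_by_offsets(part_rows, part_offsets)
# pieces == [[], [12, 14, 18, 20, 26], [22, 24], [10], [16, 28]]
```

Partition 0 receives no rows in this example, so its start offset equals that of partition 1. With pylibcudf itself, the corresponding call would be pylibcudf.partitioning.partition(t, partition_map, num_partitions) on a Table and a Column.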

pylibcudf.partitioning.round_robin_partition(Table input, int num_partitions, int start_partition=0) → tuple

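Round-robin partition. Rows of the input table are assigned to partitions in turn, based on their row index, beginning with start_partition.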

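Parameters:

input (Table): The table to partition
num_partitions (int): The number of partitions
start_partition (int, default 0): Index of the partition that receives the first row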

Returns:

tuple[Table, list[int]]: The partitioned table and the partition offsets for each partition within the table.
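
A round-robin assignment can be sketched in plain Python as follows. This models the semantics only and is not the library's implementation: round_robin_partition_sketch is a hypothetical helper, a list of integers stands in for the table, row i is assumed to go to partition (start_partition + i) % num_partitions, and the offsets list is assumed to hold one start offset per partition.

```python
def round_robin_partition_sketch(rows, num_partitions, start_partition=0):
    """Pure-Python model: deal rows out to partitions in turn."""
    buckets = [[] for _ in range(num_partitions)]
    for i, row in enumerate(rows):
        # Assumption: row i lands in partition (start_partition + i) mod n.
        buckets[(start_partition + i) % num_partitions].append(row)

    reordered, offsets = [], []
    for bucket in buckets:
        offsets.append(len(reordered))  # start row of this partition
        reordered.extend(bucket)
    return reordered, offsets


rr_rows, rr_offsets = round_robin_partition_sketch(list(range(13)), 3)
# rr_rows    == [0, 3, 6, 9, 12, 1, 4, 7, 10, 2, 5, 8, 11]
# rr_offsets == [0, 5, 9]

rr1_rows, rr1_offsets = round_robin_partition_sketch(list(range(13)), 3, 1)
# rr1_rows    == [2, 5, 8, 11, 0, 3, 6, 9, 12, 1, 4, 7, 10]
# rr1_offsets == [0, 4, 9]
```

Changing start_partition only rotates which partition receives the first row; partition sizes still differ by at most one.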