aggregation#

class cudf._lib.pylibcudf.aggregation.Aggregation#

A type of aggregation to perform.

Aggregations are passed to APIs like aggregate() to indicate what operations to perform. Using a class for aggregations provides a unified API for handling parametrizable aggregations. This class should never be instantiated directly, only via one of the factory functions.

For details, see cudf::aggregation.

Methods

kind(self)

Get the kind of the aggregation.

kind(self)#

Get the kind of the aggregation.

cudf._lib.pylibcudf.aggregation.all() Aggregation#

Create an all aggregation.

For details, see make_all_aggregation().

Returns:
Aggregation

The all aggregation.

cudf._lib.pylibcudf.aggregation.any() Aggregation#

Create an any aggregation.

For details, see make_any_aggregation().

Returns:
Aggregation

The any aggregation.

cudf._lib.pylibcudf.aggregation.argmax() Aggregation#

Create an argmax aggregation.

For details, see make_argmax_aggregation().

Returns:
Aggregation

The argmax aggregation.

cudf._lib.pylibcudf.aggregation.argmin() Aggregation#

Create an argmin aggregation.

For details, see make_argmin_aggregation().

Returns:
Aggregation

The argmin aggregation.

cudf._lib.pylibcudf.aggregation.collect_list(null_policy null_handling=null_policy.INCLUDE) Aggregation#

Create a collect_list aggregation.

For details, see make_collect_list_aggregation().

Parameters:
null_handlingnull_policy, default INCLUDE

Whether or not nulls should be included.

Returns:
Aggregation

The collect_list aggregation.

cudf._lib.pylibcudf.aggregation.collect_set(null_handling=null_policy.INCLUDE, nulls_equal=null_equality.EQUAL, nans_equal=nan_equality.ALL_EQUAL) Aggregation#

Create a collect_set aggregation.

For details, see make_collect_set_aggregation().

Parameters:
null_handlingnull_policy, default INCLUDE

Whether or not nulls should be included.

nulls_equalnull_equality, default EQUAL

Whether or not nulls should be considered equal.

nans_equalnan_equality, default ALL_EQUAL

Whether or not NaNs should be considered equal.

Returns:
Aggregation

The collect_set aggregation.

cudf._lib.pylibcudf.aggregation.correlation(correlation_type type, size_type min_periods) Aggregation#

Create a correlation aggregation.

For details, see make_correlation_aggregation().

Parameters:
typecorrelation_type

The type of correlation to compute.

min_periodsint

The minimum number of observations to consider for computing the correlation.

Returns:
Aggregation

The correlation aggregation.

cudf._lib.pylibcudf.aggregation.count(null_policy null_handling=null_policy.EXCLUDE) Aggregation#

Create a count aggregation.

For details, see make_count_aggregation().

Parameters:
null_handlingnull_policy, default EXCLUDE

Whether or not nulls should be included.

Returns:
Aggregation

The count aggregation.

cudf._lib.pylibcudf.aggregation.covariance(size_type min_periods, size_type ddof) Aggregation#

Create a covariance aggregation.

For details, see make_covariance_aggregation().

Parameters:
min_periodsint

The minimum number of observations to consider for computing the covariance.

ddofint

Delta degrees of freedom.

Returns:
Aggregation

The covariance aggregation.

cudf._lib.pylibcudf.aggregation.max() Aggregation#

Create a max aggregation.

For details, see make_max_aggregation().

Returns:
Aggregation

The max aggregation.

cudf._lib.pylibcudf.aggregation.mean() Aggregation#

Create a mean aggregation.

For details, see make_mean_aggregation().

Returns:
Aggregation

The mean aggregation.

cudf._lib.pylibcudf.aggregation.median() Aggregation#

Create a median aggregation.

For details, see make_median_aggregation().

Returns:
Aggregation

The median aggregation.

cudf._lib.pylibcudf.aggregation.min() Aggregation#

Create a min aggregation.

For details, see make_min_aggregation().

Returns:
Aggregation

The min aggregation.

cudf._lib.pylibcudf.aggregation.nth_element(size_type n, null_policy null_handling=null_policy.INCLUDE) Aggregation#

Create a nth_element aggregation.

For details, see make_nth_element_aggregation().

Parameters:
null_handlingnull_policy, default INCLUDE

Whether or not nulls should be included.

Returns:
Aggregation

The nth_element aggregation.

cudf._lib.pylibcudf.aggregation.nunique(null_policy null_handling=null_policy.EXCLUDE) Aggregation#

Create a nunique aggregation.

For details, see make_nunique_aggregation().

Parameters:
null_handlingnull_policy, default EXCLUDE

Whether or not nulls should be included.

Returns:
Aggregation

The nunique aggregation.

cudf._lib.pylibcudf.aggregation.product() Aggregation#

Create a product aggregation.

For details, see make_product_aggregation().

Returns:
Aggregation

The product aggregation.

cudf._lib.pylibcudf.aggregation.quantile(list quantiles, interpolation interp=interpolation.LINEAR) Aggregation#

Create a quantile aggregation.

For details, see make_quantile_aggregation().

Parameters:
quantileslist

List of quantiles to compute, should be between 0 and 1.

interpinterpolation, default LINEAR

Interpolation technique to use when the desired quantile lies between two data points.

Returns:
Aggregation

The quantile aggregation.

cudf._lib.pylibcudf.aggregation.rank(rank_method method, order column_order=order.ASCENDING, null_policy null_handling=null_policy.EXCLUDE, null_order null_precedence=null_order.AFTER, rank_percentage percentage=rank_percentage.NONE) Aggregation#

Create a rank aggregation.

For details, see make_rank_aggregation().

Parameters:
methodrank_method

The method to use for ranking.

column_orderorder, default ASCENDING

The order in which to sort the column.

null_handlingnull_policy, default EXCLUDE

Whether or not nulls should be included.

null_precedencenull_order, default AFTER

Whether nulls should come before or after non-nulls.

percentagerank_percentage, default NONE

Whether or not ranks should be converted to percentages, and if so, the type of normalization to use.

Returns:
Aggregation

The rank aggregation.

cudf._lib.pylibcudf.aggregation.std(size_type ddof=1) Aggregation#

Create a std aggregation.

For details, see make_std_aggregation().

Parameters:
ddofint, default 1

Delta degrees of freedom. The default value is 1.

Returns:
Aggregation

The std aggregation.

cudf._lib.pylibcudf.aggregation.sum() Aggregation#

Create a sum aggregation.

For details, see make_sum_aggregation().

Returns:
Aggregation

The sum aggregation.

cudf._lib.pylibcudf.aggregation.sum_of_squares() Aggregation#

Create a sum_of_squares aggregation.

For details, see make_sum_of_squares_aggregation().

Returns:
Aggregation

The sum_of_squares aggregation.

cudf._lib.pylibcudf.aggregation.udf(unicode operation, DataType output_type) Aggregation#

Create a udf aggregation.

For details, see make_udf_aggregation().

Parameters:
operationstr

The operation to perform as a string of PTX code.

output_typeDataType

The output type of the aggregation.

Returns:
Aggregation

The udf aggregation.

cudf._lib.pylibcudf.aggregation.variance(size_type ddof=1) Aggregation#

Create a variance aggregation.

For details, see make_variance_aggregation().

Parameters:
ddofint, default 1

Delta degrees of freedom.

Returns:
Aggregation

The variance aggregation.