cuml.preprocessing
Binarize data (set feature values to 0 or 1) according to a threshold |
|
Constructs a transformer from an arbitrary callable. |
|
Bin continuous data into intervals. |
|
Center a kernel matrix |
|
Binarize labels in a one-vs-all fashion. |
|
Encode target labels with values between 0 and n_classes - 1. |
|
Scale each feature by its maximum absolute value. |
|
Transform features by scaling each feature to a given range. |
|
Binary indicators for missing values. |
|
Normalize samples individually to unit norm. |
|
Encode categorical features as a one-hot numeric array. |
|
Generate polynomial and interaction features. |
|
Apply a power transform featurewise to make data more Gaussian-like. |
|
Transform features using quantile information. |
|
Scale features using statistics that are robust to outliers. |
|
Imputation transformer for completing missing values. |
|
Standardize features by removing the mean and scaling to unit variance |
|
A cuDF-based implementation of target encoding [R331d970812b4-1], which replaces one or more categorical variables, 'Xs', with the average of the corresponding values of the target variable, 'Y'. |
|
Augment dataset with an additional dummy feature. |
|
Boolean thresholding of array-like or sparse matrix |
|
Binarize labels in a one-vs-all fashion. |
|
Scale each feature to the [-1, 1] range without breaking the sparsity. |
|
Transform features by scaling each feature to a given range. |
|
Scale input vectors individually to unit norm (vector length). |
|
Standardize a dataset along any axis |
|
Standardize a dataset along any axis |
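
The one-line summaries above are terse, so the arithmetic behind a few of them is sketched below in plain Python on a single feature column. This is only an illustration of what the descriptions say, not cuML's GPU implementation. The helper names (`standardize_sketch`, `minmax_scale_sketch`, `maxabs_scale_sketch`, `binarize_sketch`, `label_encode_sketch`) are hypothetical and are not part of the cuML API. The sketch also assumes population variance and leaves constant columns unscaled.

```python
import math

# Hypothetical helpers: plain-Python sketches, not cuML functions.

def standardize_sketch(column):
    """Remove the mean and scale to unit variance (population variance assumed)."""
    mean = sum(column) / len(column)
    var = sum((x - mean) ** 2 for x in column) / len(column)
    std = math.sqrt(var) or 1.0  # constant column: leave values unscaled
    return [(x - mean) / std for x in column]

def minmax_scale_sketch(column, feature_range=(0.0, 1.0)):
    """Map the column linearly onto the given range."""
    lo, hi = min(column), max(column)
    span = (hi - lo) or 1.0  # constant column: avoid dividing by zero
    a, b = feature_range
    return [a + (x - lo) / span * (b - a) for x in column]

def maxabs_scale_sketch(column):
    """Divide by the maximum absolute value; zeros stay zero, so sparsity is kept."""
    peak = max(abs(x) for x in column) or 1.0
    return [x / peak for x in column]

def binarize_sketch(column, threshold=0.0):
    """Values strictly above the threshold become 1, all others 0."""
    return [1 if x > threshold else 0 for x in column]

def label_encode_sketch(labels):
    """Map each distinct label to an integer in 0 .. n_classes - 1 (sorted order)."""
    classes = sorted(set(labels))
    index = {c: i for i, c in enumerate(classes)}
    return [index[label] for label in labels]

if __name__ == "__main__":
    print(minmax_scale_sketch([2, 4, 6]))
    print(maxabs_scale_sketch([-4, 2, 1]))
    print(binarize_sketch([-1, 0, 0.5]))
    print(label_encode_sketch(["b", "a", "b", "c"]))
```

The real transformers operate on whole arrays or dataframes at once and remember the fitted statistics so that the same transformation can be applied to new data.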
Text Preprocessing
A word stemmer based on the Porter stemming algorithm. |
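
To give a feel for what a Porter-style stemmer does, the sketch below implements only step 1a of the Porter algorithm (plural suffix handling) in plain Python. The full algorithm has several further steps, and this is not cuML's implementation. The function name `porter_step_1a` is hypothetical.

```python
def porter_step_1a(word):
    """Apply step 1a of the Porter algorithm: SSES -> SS, IES -> I, SS -> SS, S -> ''."""
    if word.endswith("sses"):
        return word[:-2]  # caresses -> caress
    if word.endswith("ies"):
        return word[:-2]  # ponies -> poni
    if word.endswith("ss"):
        return word       # caress -> caress
    if word.endswith("s"):
        return word[:-1]  # cats -> cat
    return word

if __name__ == "__main__":
    for w in ["caresses", "ponies", "caress", "cats"]:
        print(w, "->", porter_step_1a(w))
```

The rules are tried longest suffix first, which is why the check for "ss" comes before the check for a bare "s".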