replace#

pylibcudf.nvtext.replace.filter_tokens(Column input, size_type min_token_length, Scalar replacement=None, Scalar delimiter=None) → Column#

Removes tokens whose lengths are less than a specified number of characters.

For details, see filter_tokens()

Parameters:

inputColumn: Strings column to replace
min_token_lengthsize_type: The minimum number of characters to retain a token in the output string
replacementScalar, optional: Optional replacement string to be used in place of removed tokens
delimiterScalar, optional: Characters used to separate each string into tokens. The default of empty string will identify tokens using whitespace.
Returns
——-
Column: New strings column of filtered strings

pylibcudf.nvtext.replace.replace_tokens(Column input, Column targets, Column replacements, Scalar delimiter=None) → Column#

Replaces specified tokens with corresponding replacement strings.

For details, see replace_tokens()

Parameters:

inputColumn: Strings column to replace
targetsColumn: Strings to compare against tokens found in input
replacementsColumn: Replacement strings for each string in targets
delimiterScalar, optional: Characters used to separate each string into tokens. The default of empty string will identify tokens using whitespace.

Returns:

Column: New strings column with replaced strings

replace#

This Page