generate_ngrams#

pylibcudf.nvtext.generate_ngrams.generate_character_ngrams(Column input, size_type ngrams=2) Column#

Returns a lists column of ngrams of characters within each string.

For details, see generate_character_ngrams()

Parameters:
inputColumn

Input strings

ngramsize_type

The ngram number to generate

Returns:
Column

Lists column of strings

pylibcudf.nvtext.generate_ngrams.generate_ngrams(Column input, size_type ngrams, Scalar separator) Column#

Returns a single column of strings by generating ngrams from a strings column.

For details, see generate_ngrams()

Parameters:
inputColumn

Input strings

ngramsize_type

The ngram number to generate

separatorScalar

The string to use for separating ngram tokens

Returns:
Column

New strings columns of tokens

pylibcudf.nvtext.generate_ngrams.hash_character_ngrams(Column input, size_type ngrams=2) Column#

Returns a lists column of hash values of the characters in each string

For details, see hash_character_ngrams()

Parameters:
inputColumn

Input strings

ngramsize_type

The ngram number to generate

Returns:
Column

Lists column of hash values