stemmer#
- pylibcudf.nvtext.stemmer.LetterType#
See also
letter_type.Enum members
CONSONANTVOWEL
- pylibcudf.nvtext.stemmer.is_letter(signatures, args, kwargs, defaults, _fused_sigindex_ref=[None])#
Returns boolean column indicating if the character or characters at the provided character index or indices (respectively) are consonants or vowels
For details, see
is_letter()- Parameters:
- inputColumn
Input strings
- check_vowelsbool
If true, the check is for vowels. Otherwise the check is for consonants.
- indicesUnion[Column, size_type]
The character position(s) to check in each string
- streamStream | None
CUDA stream on which to perform the operation.
- mrDeviceMemoryResource | None
Device memory resource used to allocate the returned column’s device memory.
- Returns:
- Column
New boolean column.
- pylibcudf.nvtext.stemmer.porter_stemmer_measure(Column input, Stream stream=None, DeviceMemoryResource mr=None) Column#
Returns the Porter Stemmer measurements of a strings column.
For details, see
porter_stemmer_measure()- Parameters:
- inputColumn
Strings column of words to measure
- streamStream | None
CUDA stream on which to perform the operation.
- mrDeviceMemoryResource | None
Device memory resource used to allocate the returned column’s device memory.
- Returns:
- Column
New column of measure values