cuml.feature_extraction

From text

CountVectorizer

Convert a collection of text documents to a matrix of token counts.
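To make "matrix of token counts" concrete, here is a minimal pure-Python sketch of the idea. It is not cuML's implementation and does not use the cuML API; the helper name `count_matrix` and the tokenizer regex (lowercased runs of two or more word characters) are assumptions made for illustration.

```python
import re
from collections import Counter

def count_matrix(docs):
    """Hypothetical helper: build a vocabulary and a dense count matrix."""
    # Assumed tokenizer: lowercase, keep runs of 2+ word characters.
    tokenized = [re.findall(r"\b\w\w+\b", d.lower()) for d in docs]
    # One column per distinct token, in sorted order.
    vocab = sorted({t for toks in tokenized for t in toks})
    index = {t: i for i, t in enumerate(vocab)}
    rows = []
    for toks in tokenized:
        row = [0] * len(vocab)
        for tok, n in Counter(toks).items():
            row[index[tok]] = n
        rows.append(row)
    return vocab, rows

docs = ["the cat sat", "the cat saw the dog"]
vocab, counts = count_matrix(docs)
print(vocab)   # column labels
print(counts)  # one row per document
```

Each row corresponds to a document and each column to a vocabulary term; a real vectorizer would normally return a sparse matrix rather than nested lists.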

HashingVectorizer

Convert a collection of text documents to a matrix of token occurrences.
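The hashing approach can be sketched in plain Python as follows. This is an illustration of the general hashing trick, not cuML's implementation: the helper name `hashed_matrix`, the use of `zlib.crc32` as the hash function, and the small `n_features=8` default are all assumptions chosen to keep the example short.

```python
import re
import zlib

def hashed_matrix(docs, n_features=8):
    """Hypothetical helper: map tokens to columns by hashing, with no vocabulary."""
    rows = []
    for doc in docs:
        row = [0] * n_features
        # Assumed tokenizer: lowercase, keep runs of 2+ word characters.
        for tok in re.findall(r"\b\w\w+\b", doc.lower()):
            # The column is derived from the token's hash, so no fitted
            # vocabulary is stored; distinct tokens may collide.
            col = zlib.crc32(tok.encode("utf-8")) % n_features
            row[col] += 1
        rows.append(row)
    return rows

docs = ["the cat sat", "the cat saw the dog"]
hashed = hashed_matrix(docs)
print(hashed)
```

Because the column index depends only on the token, the transform is stateless and the output width is fixed in advance. The trade-off is that collisions are possible and columns cannot be mapped back to tokens.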

TfidfVectorizer

Convert a collection of raw documents to a matrix of TF-IDF features.
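As a rough illustration of TF-IDF weighting, the sketch below computes weights in pure Python. It is not cuML's implementation. The helper name `tfidf_matrix`, the tokenizer, the smoothed formula idf = ln((1 + n) / (1 + df)) + 1, and the L2 row normalization are assumptions; check the class documentation for the actual defaults.

```python
import math
import re
from collections import Counter

def tfidf_matrix(docs):
    """Hypothetical helper: dense TF-IDF rows with L2 normalization."""
    # Assumed tokenizer: lowercase, keep runs of 2+ word characters.
    tokenized = [re.findall(r"\b\w\w+\b", d.lower()) for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(t for toks in tokenized for t in set(toks))
    # Assumed smoothed IDF; terms present in every document get weight 1.
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}
    rows = []
    for toks in tokenized:
        tf = Counter(toks)
        row = [tf[t] * idf[t] for t in vocab]
        # Scale each row to unit Euclidean length (guard against empty docs).
        norm = math.sqrt(sum(v * v for v in row)) or 1.0
        rows.append([v / norm for v in row])
    return vocab, idf, rows

docs = ["the cat sat", "the cat saw the dog"]
vocab, idf, tfidf = tfidf_matrix(docs)
print(vocab)
print(tfidf)
```

Terms that occur in every document, such as "the" here, receive the lowest IDF, while terms confined to one document are weighted more heavily.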