jaccard

pylibcudf.nvtext.jaccard.jaccard_index(Column input1, Column input2, size_type width, Stream stream=None) → Column

Returns the Jaccard similarity between individual rows in two strings columns.

For details, see the underlying libcudf function jaccard_index().

Parameters:
input1 : Column

Input strings column

input2 : Column

Input strings column

width : size_type

The n-gram size, that is, the number of characters in each substring compared between the two strings

stream : Stream | None

CUDA stream on which to perform the operation.

Returns:
Column

Jaccard index values, one for each pair of corresponding input rows
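
To make the row-wise computation concrete, here is a minimal pure-Python sketch of a Jaccard index over character n-grams. It is an illustration only, not the pylibcudf or libcudf implementation: the helper names char_ngrams and jaccard_rows are hypothetical, plain Python lists stand in for the two strings columns, and returning 0.0 when neither string yields an n-gram is an assumption rather than documented library behavior. Null handling is not modeled.

```python
def char_ngrams(text, width):
    """Return the set of substrings of `width` consecutive characters."""
    return {text[i:i + width] for i in range(len(text) - width + 1)}


def jaccard_rows(rows1, rows2, width):
    """Compute |A & B| / |A | B| for each pair of corresponding rows.

    A and B are the character n-gram sets of the two strings in a row.
    """
    if len(rows1) != len(rows2):
        raise ValueError("both inputs must have the same number of rows")
    result = []
    for a, b in zip(rows1, rows2):
        grams_a = char_ngrams(a, width)
        grams_b = char_ngrams(b, width)
        union = grams_a | grams_b
        # Assumption: report 0.0 when neither string produces an n-gram.
        result.append(len(grams_a & grams_b) / len(union) if union else 0.0)
    return result


# "night" and "nacht" share one bigram ("ht") out of seven distinct bigrams.
scores = jaccard_rows(["night", "abcd", "abc"], ["nacht", "abcd", "xyz"], 2)
print(scores)
```

Identical strings score 1.0 and strings with no n-gram in common score 0.0; a larger width makes the comparison stricter because longer substrings must match exactly.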