Files | Enumerations | Functions

Files

file  hashing.hpp
 

Enumerations

enum  cudf::hash_id { cudf::hash_id::HASH_IDENTITY = 0, cudf::hash_id::HASH_MURMUR3, cudf::hash_id::HASH_SPARK_MURMUR3, cudf::hash_id::HASH_MD5 }
 Identifies the hash function to be used. More...
 

Functions

std::unique_ptr< columncudf::hash (table_view const &input, hash_id hash_function=hash_id::HASH_MURMUR3, uint32_t seed=DEFAULT_HASH_SEED, rmm::mr::device_memory_resource *mr=rmm::mr::get_current_device_resource())
 Computes the hash value of each row in the input set of columns. More...
 

Detailed Description

Enumeration Type Documentation

◆ hash_id

enum cudf::hash_id
strong

Identifies the hash function to be used.

Enumerator
HASH_IDENTITY 

Identity hash function that simply returns the key to be hashed.

HASH_MURMUR3 

Murmur3 hash function.

HASH_SPARK_MURMUR3 

Spark Murmur3 hash function.

HASH_MD5 

MD5 hash function.

Definition at line 36 of file hashing.hpp.

Function Documentation

◆ hash()

std::unique_ptr<column> cudf::hash ( table_view const &  input,
hash_id  hash_function = hash_id::HASH_MURMUR3,
uint32_t  seed = DEFAULT_HASH_SEED,
rmm::mr::device_memory_resource mr = rmm::mr::get_current_device_resource() 
)

Computes the hash value of each row in the input set of columns.

Parameters
inputThe table of columns to hash
hash_functionThe hash function enum to use
seedOptional seed value to use for the hash function
mrDevice memory resource used to allocate the returned column's device memory
Returns
A column where each row is the hash of a column from the input