Lists Extract#
- group lists_extract
Functions
-
std::unique_ptr<column> extract_list_element(lists_column_view const &lists_column, size_type index, rmm::cuda_stream_view stream = cudf::get_default_stream(), rmm::device_async_resource_ref mr = rmm::mr::get_current_device_resource())#
Create a column where each row is the element at position
index
from the corresponding sublist in the inputlists_column
.Output
column[i]
is set from elementlists_column[i][index]
. Ifindex
is larger than the size of the sublist atlists_column[i]
then outputcolumn[i] = null
.l = { {1, 2, 3}, {4}, {5, 6} } r = extract_list_element(l, 1) r is now {2, null, 6}
The
index
may also be negative in which case the row retrieved is offset from the end of each sublist.l = { {"a"}, {"b", "c"}, {"d", "e", "f"} } r = extract_list_element(l, -1) r is now {"a", "c", "f"}
Any input where
lists_column[i] == null
will produce outputcolumn[i] = null
. Also, any element wherelists_column[i][index] == null
will produce outputcolumn[i] = null
.- Parameters:
lists_column – Column to extract elements from.
index – The row within each sublist to retrieve.
stream – CUDA stream used for device memory operations and kernel launches.
mr – Device memory resource used to allocate the returned column’s device memory.
- Returns:
Column of extracted elements.
-
std::unique_ptr<column> extract_list_element(lists_column_view const &lists_column, column_view const &indices, rmm::cuda_stream_view stream = cudf::get_default_stream(), rmm::device_async_resource_ref mr = rmm::mr::get_current_device_resource())#
Create a column where each row is a single element from the corresponding sublist in the input
lists_column
, selected using indices from theindices
column.Output
column[i]
is set from elementlists_column[i][indices[i]]
. Ifindices[i]
is larger than the size of the sublist atlists_column[i]
then outputcolumn[i] = null
. Similarly, ifindices[i]
isnull
, thencolumn[i] = null
.l = { {1, 2, 3}, {4}, {5, 6} } r = extract_list_element(l, {0, null, 2}) r is now {1, null, null}
indices[i]
may also be negative, in which case the row retrieved is offset from the end of each sublist.l = { {"a"}, {"b", "c"}, {"d", "e", "f"} } r = extract_list_element(l, {-1, -2, -4}) r is now {"a", "b", null}
Any input where
lists_column[i] == null
produces outputcolumn[i] = null
. Any input wherelists_column[i][indices[i]] == null
produces outputcolumn[i] = null
.- Parameters:
lists_column – Column to extract elements from.
indices – The column whose rows indicate the element index to be retrieved from each list row.
stream – CUDA stream used for device memory operations and kernel launches.
mr – Device memory resource used to allocate the returned column’s device memory.
- Throws:
cudf::logic_error – If the sizes of
lists_column
andindices
do not match.- Returns:
Column of extracted elements.
-
std::unique_ptr<column> extract_list_element(lists_column_view const &lists_column, size_type index, rmm::cuda_stream_view stream = cudf::get_default_stream(), rmm::device_async_resource_ref mr = rmm::mr::get_current_device_resource())#