cudf.core.column.string.StringMethods.character_tokenize#

StringMethods.character_tokenize() → SeriesOrIndex#

Each string is split into individual characters. The sequence returned contains each character as an individual string.

Returns:

Series or Index of object.

Examples

>>> import cudf
>>> data = ["hello world", None, "goodbye, thank you."]
>>> ser = cudf.Series(data)
>>> ser.str.character_tokenize()
  h
  e
  l
  l
  o
0
  w
  o
  r
  l
  d
  g
  o
  o
  d
  b
  y
  e
  ,
2
  t
  h
  a
  n
  k
2
  y
  o
  u
  .
dtype: object