cudf.DataFrame.to_orc

DataFrame.to_orc(fname, compression=None, *args, **kwargs)

Write a DataFrame to the ORC format.

Parameters
fname: str

File path or object where the ORC dataset will be stored.

compression: {'snappy', 'ZLIB', None}, default None

Name of the compression to use. Use None for no compression.

enable_statistics: boolean, default True

Enable writing column statistics.

stripe_size_bytes: integer or None, default None

Maximum size, in bytes, of each stripe of the output. If None, 67108864 (64 MiB) will be used.

stripe_size_rows: integer or None, default None

Maximum number of rows in each stripe of the output. If None, 1000000 will be used.

row_index_stride: integer or None, default None

Row index stride (maximum number of rows in each row group). If None, 10000 will be used.
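The row-based defaults above interact in a simple way: each stripe holds up to stripe_size_rows rows, and each stripe is divided into row groups of up to row_index_stride rows. The following is a minimal pure-Python sketch of that arithmetic, not part of cudf. The helper name estimate_layout and the constant names are hypothetical, and the sketch assumes that the byte limit (stripe_size_bytes) is never reached and that the writer fills every stripe up to the row limit. The actual writer may split the data differently, so treat the result as a rough lower bound.

```python
import math

# Defaults quoted in the parameter descriptions above.
DEFAULT_STRIPE_SIZE_BYTES = 64 * 1024 * 1024  # 67108864 bytes (64 MiB)
DEFAULT_STRIPE_SIZE_ROWS = 1_000_000
DEFAULT_ROW_INDEX_STRIDE = 10_000


def estimate_layout(num_rows, stripe_size_rows=None, row_index_stride=None):
    """Estimate (stripes, row groups) when only the row limits apply.

    Hypothetical helper for illustration; it ignores stripe_size_bytes.
    """
    rows_per_stripe = (
        DEFAULT_STRIPE_SIZE_ROWS if stripe_size_rows is None else stripe_size_rows
    )
    stride = DEFAULT_ROW_INDEX_STRIDE if row_index_stride is None else row_index_stride

    # Number of stripes needed if each one is filled to the row limit.
    num_stripes = math.ceil(num_rows / rows_per_stripe)

    # Row groups restart at every stripe boundary, so count them per stripe:
    # the full stripes first, then the final partial stripe (if any).
    full_stripes, remainder = divmod(num_rows, rows_per_stripe)
    num_row_groups = full_stripes * math.ceil(rows_per_stripe / stride)
    num_row_groups += math.ceil(remainder / stride)

    return num_stripes, num_row_groups
```

With the defaults, estimate_layout(2_500_000) returns (3, 250): two full stripes of 100 row groups each, plus a half-full stripe of 50 row groups.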

See also

cudf.read_orc

Notes

Support for writing tables with struct columns is currently experimental; the output may be less reliable than for other data types.
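Examples

The sketch below shows one way the method might be called, using only the parameters documented above together with cudf.read_orc from the See also section. The function name roundtrip_orc, the column names, and the file name example.orc are hypothetical, and the stripe and stride values are arbitrary illustrations rather than recommendations. Running it requires a working cudf installation and a supported GPU; keyword names may differ in other cudf releases.

```python
import os


def roundtrip_orc(directory):
    """Write a small DataFrame to ORC and read it back.

    Hypothetical helper for illustration; needs cudf and a GPU at call time.
    """
    # Imported inside the function so this module can still be loaded
    # on a machine where cudf is not installed.
    import cudf

    df = cudf.DataFrame({"id": [1, 2, 3], "label": ["a", "b", "c"]})
    path = os.path.join(directory, "example.orc")  # hypothetical file name

    # Snappy compression with smaller-than-default stripe and row-group limits.
    df.to_orc(
        path,
        compression="snappy",
        stripe_size_rows=100_000,
        row_index_stride=5_000,
    )

    # Read the file back to check that it is a valid ORC dataset.
    return cudf.read_orc(path)
```

Calling roundtrip_orc with a writable directory (for example, one created by tempfile.mkdtemp()) should return a DataFrame with the same three rows that were written.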