At present, this guide only covers the main cuDF library. In the future, it may be expanded to also cover dask_cudf, cudf_kafka, and custreamz.
cuDF is a GPU-accelerated, Pandas-like DataFrame library.
Under the hood, all of cuDF’s functionality relies on the CUDA-accelerated
libcudf C++ library.
Thus, cuDF’s internals are designed to efficiently and robustly map pandas APIs to
For more information about the
libcudf library, a good starting point is the
This document assumes familiarity with the overall contributing guide. The goal of this document is to provide more specific guidance for Python developers. It covers the structure of the Python code and discusses best practices. Additionally, it includes longer sections on more specific topics like testing and benchmarking.
- Library Design
- Contributing Guide
- Writing documentation
- Testing cuDF
- Benchmarking cuDF