Base Command Platform (BCP)#

NVIDIA Base Command™ Platform (BCP) is a software service in NVIDIA DGX Cloud for enterprise-class AI training that enables businesses and their data scientists to accelerate AI development.

In addition to launching batch training jobs BCP also allows you to quickly launch RAPIDS clusters running Jupyter and Dask.

Prerequisites#

To get started sign up for a DGX Cloud account.

Launch a cluster#

Login to NVIDIA NGC and navigate to the BCP service.

Select “Create Custom Project” (alternatively select a quick start project then select the three dots in the top-right to open a menu and select edit project).

Screenshot of NVIDIA NGC's BCP projects page with the "create custom project" button in the top right

Next give your project a title and description and choose an Accelerated Computing Environment (ACE) and an Instance Type.

Screenshot of the custom project page with a title, description, ace and instance type selected

Scroll down and select your mount points:

  • Datasets are existing data you have stored in NGC which you would like to mount to your cluster.

  • Workspaces are persistent storage where you may wish to keep notebooks and other working data.

  • Results is a required mount point where you will write your output results (if any).

Screenshot of the custom project page with the nyc-taxi dataset selected and a results path set

Scroll down and select the latest RAPIDS container image.

Screenshot of the custom project page with the latest RAPIDS release image selected

Note

Optionally configure your desired number of Dask workers, session wall time and any additional packages you wish to install at runtime.

When you are ready hit Create.

Accessing Jupyter#

Once you create your cluster it will go into a Queued or Starting state. Behind the scenes NVIDIA DGX Cloud will provision infrastructure either in NVIDIA data centers or in the cloud depending on your ACE selection.

Screenshot of the project status page in a starting phase

Once your cluster is running you will see various URLs to choose from:

  • Jupyter launches Jupyter Lab running on the scheduler node.

  • Dask Dashboard opens the Dask Dashboard in a new tab.

  • Scheduler URL can be used to connect a Dask client from outside of BCP.

Screenshot of the project status page in a running phase

Let’s click the Jupyter link and use RAPIDS from a notebook environment.

Screenshot of Jupyter Lab

Connecting a Dask client#

The Dask scheduler is running on the same node as Jupyter so you can connect a dask.distributed.Client to ws://localhost:8786.

Warning

BCP configures Dask to use websockets instead of the default TCP to allow for proxying ourside of NGC. So all connection urls will start with ws://.

Screenshot of connecting a Dask client in a notebook

Note

You can also copy/paste the Dask dashboard URL we saw earlier into the Dask Jupyter extension to view cluster metrics right within Jupyter.

Testing RAPIDS#

Now that we’re all set up with a Jupyter session and have connected to our Dask cluster we can import cudf to verify we can use RAPIDS.

Screenshot of importing and using cudf in a notebook

For more detailed examples see our related notebooks.