If a Magnum IO-enabled application experiences delays during the ETL phase, what troubleshooting step should be taken?
You have successfully pulled a TensorFlow container from NGC and now need to run it on your stand-alone GPU-enabled server.
Which command should you use to ensure that the container has access to all available GPUs?
A system administrator needs to scale a Kubernetes Job to 4 replicas.
What command should be used?
Which two platforms should be used with Fabric Manager? (Choose two.)
You are a Solutions Architect designing a data center infrastructure for a cloud-based AI application that requires high-performance networking, storage, and security. You need to choose a software framework to program the NVIDIA BlueField DPUs that will be used in the infrastructure. The framework must support the development of custom applications and services, as well as enable tailored solutions for specific workloads. Additionally, the framework should allow for the integration of storage services such as NVMe over Fabrics (NVMe-oF) and elastic block storage.
Which framework should you choose?
A system administrator manages a high-performance computing (HPC) cluster that uses an InfiniBand fabric for high-speed interconnects between nodes. Researchers report unusually slow data transfer rates between two specific compute nodes, and the administrator needs to verify that the path between these two nodes is optimal.
What command should be used?
You are an administrator managing a large-scale Kubernetes-based GPU cluster using Run:AI.
To automate repetitive administrative tasks and efficiently manage resources across multiple nodes, which of the following is essential when using the Run:AI Administrator CLI for environments where automation or scripting is required?
A data scientist training a deep learning model notices slower-than-expected training times and asks a system administrator to investigate. The system administrator suspects that disk I/O is the bottleneck.
What command should be used?
What is the primary purpose of assigning a provisioning role to a node in NVIDIA Base Command Manager (BCM)?