NVIDIA Exam NCA-AIIO Topic 3 Question 6 Discussion

Actual exam question for NVIDIA's NCA-AIIO exam

Question #: 6
Topic #: 3

You have deployed an AI training job on a GPU cluster, but the training time has not decreased as expected after adding more GPUs. Upon further investigation, you observe that the GPU utilization is low, and the CPU utilization is very high. What is the most likely cause of this issue?

AThe AI model is not compatible with multi-GPU training.

BThe GPUs are not properly connected in the cluster.

CIncorrect software version installed on the GPUs.

DThe data preprocessing is being bottlenecked by the CPU.

Show Suggested Answer

Suggested Answer: D

The data preprocessing being bottlenecked by the CPU is the most likely cause. High CPU utilization and low GPU utilization suggest the GPUs are idle, waiting for data, a common issue when preprocessing (e.g., data loading) is CPU-bound. NVIDIA recommends GPU-accelerated preprocessing (e.g., DALI) to mitigate this. Option A (model incompatibility) would show errors, not low utilization. Option B (connection issues) would disrupt communication, not CPU load. Option C (software version) is less likely without specific errors. NVIDIA's performance guides highlight preprocessing bottlenecks.

by Eve at Jun 11, 2025, 05:33 AM

Limited Time Offer

25%

Off

Get Premium NCA-AIIO Questions as Interactive Web-Based Practice Test or PDF