Spring Sale Limited Time 75% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code = simple75

Pass the NVIDIA-Certified Professional NCP-AII Questions and answers with Dumpstech

Exam NCP-AII Premium Access

View all detail and faqs for the NCP-AII exam

Practice at least 50% of the questions to maximize your chances of passing.
Viewing page 4 out of 4 pages
Viewing questions 31-40 out of questions
Questions # 31:

A system administrator wants to configure MIG for seven slices on an H100 GPU in an NVIDIA HGX system. Which command should be used?

Options:

A.

mig-parted

B.

nvidia-smi

C.

nvcc

D.

nvlink-config

Questions # 32:

An engineer needs to verify the current firmware versions of all components (ATF, BSP, NIC, UEFI) on a BlueField-3 DPU ' s BMC. Which Redfish API command provides this information?

Options:

A.

mlxconfig -d < dev > q

B.

curl -k -u root: < password > -X GET https:// < DPU-BMC-IP > /redfish/v1/UpdateService/FirmwareList

C.

mstflint -d < PCI_ID > query full

D.

curl -k -u root: < password > -X GET https:// < DPU-BMC-IP > /redfish/v1/UpdateService/FirmwareInventory

Questions # 33:

After initial setup and health checks, the DGX H100 system administrator wants to verify that containers can access GPUs before running production workloads. Which method is recommended for this validation?

Options:

A.

sudo docker run --gpus all --rm nvcr.io/nvidia/cuda:12.1.1-base-ubuntu22.04 systemctl

B.

sudo docker run --gpus all --rm nvcr.io/nvidia/cuda:12.1.1-base-ubuntu22.04 ls -la

C.

sudo docker run --rm nvcr.io/nvidia/cuda:12.1.1-base-ubuntu22.04 nvidia-smi

D.

sudo docker run --gpus all --rm nvcr.io/nvidia/cuda:12.1.1-base-ubuntu22.04 nvidia-smi

Questions # 34:

A user encounters " permission denied " errors when running GPU-accelerated containers on a Secure Boot-enabled system. What resolves this?

Options:

A.

Enroll the MOK and sign NVIDIA kernel modules.

B.

Reinstall Docker without the NVIDIA runtime.

C.

Disable SELinux to relax unnecessary security policies.

D.

Run Docker with sudo for elevated privileges.

Questions # 35:

As the infrastructure lead for an NVIDIA AI Factory deployment, you have just uploaded the latest supported firmware packages to your DGX system. It is now critical to ensure all hardware components run the new firmware and the DGX returns to full operational capability. Which sequence best guarantees that all relevant components are correctly running updated firmware?

Options:

A.

Perform a software-driven restart on the operating system of every compute node, then use advanced tools to check firmware status, and reissue update commands if any firmware appears inactive afterward.

B.

Execute a single AC power cycle on the DGX after the update process, then reset the software stack and verify status using diagnostic commands on each node for confirmation of all component updates.

C.

Initiate a cold power cycle on all node trays to activate firmware, follow with a DGX reboot procedure, and use the management interface to finish activating CPLD firmware on the host.

D.

Initiate a cold power cycle on the system to activate firmware for components, reset the BMC using the recommended command, and perform an AC power cycle to ensure EROT and CPLD firmware is activated.

Questions # 36:

A healthcare organization is deploying an AI system to analyze patient data for predictive diagnostics. The system must comply with strict data protection regulations such as HIPAA, ensuring that sensitive information remains confidential and secure. Considering the need for robust security measures, which combination of strategies should the organization prioritize to protect against data breaches and ensure regulatory compliance?

Options:

A.

Deploy data masking to obscure sensitive data during processing and use role-based access control (RBAC) to limit data access based on user roles.

B.

Use tokenization to replace sensitive data with non-sensitive tokens and employ multi-factor authentication (MFA) for system access.

C.

Implement symmetric encryption for all data at rest and rely solely on password-based access controls.

D.

Rely on asymmetric encryption for all communications and use data deduplication to minimize storage costs without additional security measures.

Questions # 37:

An engineer is tasked with configuring Out-of-Band management for a DGX BasePOD deployment. Which network design will best ensure secure and reliable Out-of-Band management operations?

Options:

A.

Use a single VLAN for both Out-of-Band management and compute fabric to simplify network design.

B.

Configure Out-of-Band management interfaces to be accessible from any subnet within the data center for maximum flexibility.

C.

Connect Out-of-Band management ports to the same switch as user traffic for easier troubleshooting.

D.

Place all BMC and management interfaces on an isolated Out-of-Band network with access restricted by firewall rules.

Questions # 38:

ClusterKit ' s NCCL bandwidth test shows 350 GB/s on a 400G InfiniBand fabric. How should this result be interpreted?

Options:

A.

Optimal performance, indicating healthy fabric and GPUDirect RDMA.

B.

Suboptimal performance; requires FEC tuning to reach 380+ GB/s.

C.

Critical failure; expected is > 390 GB/s for HDR InfiniBand.

D.

Inconclusive; rerun with --stress=cpu to validate.

Questions # 39:

After updating BlueField-3 DPU BMC firmware via Redfish, the engineer observes “TaskState: Running” but no progress after 15 minutes. How should they track the update’s completion status?

Options:

A.

Check /var/log/messages on the DPU operating system for update logs.

B.

Query the DPU BMC with the Task ID of the installation process.

C.

Power cycle the DPU immediately to force a rollback.

D.

Run bfrec --status on the DPU to view flash progress.

Questions # 40:

Your tasked with updating both NVIDIA GPU drivers and DOCA drivers on a set of servers used for AI workloads. The environment previously had an older driver stack and custom kernel modules. What is the most important step to successfully upgrade the drivers without causing conflicts?

Options:

A.

Update the GPU driver leaving the DOCA and OFED drivers unchanged as long as they are detecting the hardware properly.

B.

Validate the driver version post-install since the fresh install will overwrite the legacy drivers.

C.

Keep the older driver running alongside the new version in case you need to roll back the upgrade.

D.

Uninstall all existing GPU and DOCA-related drivers and associated kernel modules before the new install.

Viewing page 4 out of 4 pages
Viewing questions 31-40 out of questions