You're designing a multi-GPU system for AI training using NVIDIA GPUs with NVLink connections. You need to maximize inter-GPU communication bandwidth. Which feature included in NCCL allows for improved communication between GPUs and NICs?
In which of the following scenarios would the Network Traffic Map in UFM be least useful for troubleshooting?
A cloud service provider is deploying the NVIDIA Spectrum-X Ethernet platform in a multi-tenant environment. To ensure the security and isolation of each tenant's AI workload, the provider wants to implement a feature that prevents unauthorized access to the network.
Which of the following features of the Spectrum-X platform should the provider implement?
You have recently implemented NVIDIA Spectrum-X in your data center to optimize AI workloads. You need to verify the performance improvements and create a baseline for future comparisons.
Which tool would be most appropriate for creating performance baseline results in this Spectrum-X environment?
Which of the following NCCL environment variables enable SHARP aggregation when using the NCCL-SHARP plugin?
Pick the 2 correct responses below.
As the network administrator for a large-scale AI research cluster, you are responsible for ensuring seamless data flow across an InfiniBand east-west fabric that interconnects hundreds of compute nodes.
Which tool would you use to trace and discover the network paths between nodes on this InfiniBand east-west fabric?
You are troubleshooting InfiniBand connectivity issues in a cluster managed by the NVIDIA Network Operator. You need to verify the status of the InfiniBand interfaces. Which command should you use to check the state and link layer of InfiniBand interfaces on a node?
A fabric administrator added new servers to a 40-port edge switch. The administrator now needs to gather and map the LIDs and link speeds of the newly added ports. Which of the following commands can be used for that purpose?
When designing a multi-tenancy East/West (E/W) fabric using Unified Fabric Manager (UFM), which method should be used?
You are troubleshooting a Spectrum-X network and need to ensure that the network remains operational in case of a link failure. Which feature of Spectrum-X ensures that the fabric continues to deliver high performance even if there is a link failure?
You are concerned about potential security threats and unexpected downtime in your InfiniBand data center.
Which UFM platform uses analytics to detect security threats and operational issues and to predict network failures in InfiniBand data centers?
When upgrading Cumulus Linux to a new version, which configuration files should be migrated from the old installation?
Pick the 2 correct responses below.
Which of the following tools in Cumulus Linux are specifically useful for detecting microbursts and differentiating them from regular network congestion?
Pick the 2 correct responses below.
A financial services company is planning to implement an AI infrastructure to support real-time fraud detection and risk assessment. They need a solution that can handle both training and inference workloads while maintaining data privacy and security.
Which NVIDIA reference architecture component would be most appropriate to address the data privacy and security concerns in this AI networking setup?
In which mode of the BlueField DPU does the Arm system on the DPU control the NIC data path while still allowing access to the DPU OS from the host?