Requesting exclusive/dedicated CPU core allocation for performance-sensitive exp

Requesting exclusive/dedicated CPU core allocation for performance-sensitive exp

Log In Register Lost Password

This topic has 2 replies, 3 voices, and was last updated 2 months ago by yoursunny.

Viewing 3 posts - 1 through 3 (of 3 total)

Author

Posts
May 5, 2026 at 6:32 am #9755
Xavier Querol Bassols
Participant
Hello,

I am running distributed GPU training experiments (PyTorch FSDP / NCCL all_reduce) on a FABRIC slice with two NVIDIA A30 GPUs at the PRIN site. Our main task is to compare Fabric training times with with those from our own simulator.

For reproducible bandwidth measurements I need to eliminate CPU noise from hypervisor co-scheduling. I understand that GPU components use PCIe passthrough and are therefore exclusive to my slice. My question is:

Is there a way to request a node where the physical CPU cores are also exclusively assigned to my VM (no hypervisor overcommit / no co-tenancy)? I did not find a dedicated_cpu or exclusive parameter in the FABlib API.

Site: PRIN
Node type: VM with 2× GPU_A30

Thank you.
May 5, 2026 at 8:25 am #9756
Hussam Nasir
Moderator
There is a feature in fablib that allows for cpu/core pinining. That is as close as you can get to core exclusivity i guess.
May 7, 2026 at 2:36 pm #9771
yoursunny
Participant
You can try the cpupin_common script that eliminates four of five layers of CPU interference:

The NIST-MQNS artifact contains a concrete example of using this script to perform a CPU-bound benchmark of a Python application.
Author

Posts

Viewing 3 posts - 1 through 3 (of 3 total)

You must be logged in to reply to this topic.