
Can multiple SLURM systems be run in parallel for testing purposes?


Yes, and it is a good way to test new versions of SLURM. Install the test version in a different location with its own slurm.conf. The test system's slurm.conf should specify different pathnames and port numbers so that the two systems do not conflict. The one complication arises when more than one SLURM system is configured with switch/elan or switch/federation: the systems can then issue conflicting switch window requests. Configuring the test system with switch/none avoids this. MPI jobs started on an Elan or Federation switch system without switch windows configured will not execute properly, but other jobs will run fine. Alternatively, on Elan or Federation systems, give each SLURM system a different set of nodes, which lets both allocate switch windows without conflicts.
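
As a rough illustration of the "different pathnames and port numbers" advice, the sketch below compares two slurm.conf texts and lists settings that both files set to the same value. It is not part of SLURM. The helper names (parse_conf, find_conflicts), the sample values, and the choice of parameters to compare are assumptions made for this example, and the parameter list is probably not exhaustive. It only looks at values written explicitly in each file, so a setting left at its default in both files would not be reported.

```python
# Parameters that should differ between two SLURM systems on the same nodes.
# This list is an assumption for illustration and is likely incomplete.
MUST_DIFFER = (
    "SlurmctldPort", "SlurmdPort",
    "StateSaveLocation", "SlurmdSpoolDir",
    "SlurmctldPidFile", "SlurmdPidFile",
    "SlurmctldLogFile", "SlurmdLogFile",
)


def parse_conf(text):
    """Parse simple Key=Value lines, skipping comments and blank lines."""
    settings = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if "=" not in line:
            continue
        key, value = line.split("=", 1)
        # Keys are compared case-insensitively.
        settings[key.strip().lower()] = value.strip()
    return settings


def find_conflicts(prod_text, test_text):
    """Return the names in MUST_DIFFER that both files set to the same value."""
    prod, test = parse_conf(prod_text), parse_conf(test_text)
    conflicts = []
    for name in MUST_DIFFER:
        key = name.lower()
        if key in prod and key in test and prod[key] == test[key]:
            conflicts.append(name)
    return conflicts


# Hypothetical example configurations (values invented for illustration).
PROD_CONF = """\
SlurmctldPort=6817
SlurmdPort=6818
StateSaveLocation=/var/spool/slurm/state
SwitchType=switch/elan
"""

TEST_CONF = """\
SlurmctldPort=7817
SlurmdPort=6818          # forgot to change this one
StateSaveLocation=/opt/slurm-test/state
SwitchType=switch/none
"""

if __name__ == "__main__":
    print(find_conflicts(PROD_CONF, TEST_CONF))
```

In this made-up pair of files, the test system reuses the production SlurmdPort, so that setting is the one reported. The test file also sets SwitchType=switch/none, as suggested above for Elan or Federation systems.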
