What does this mean: “srun: First task exited 30s ago” followed by “srun Job Failed”?
The srun command monitors when tasks exit. By default, 30 seconds after the first task exists, the job is killed. This typically indicates some type of job failure and continuing to execute a parallel job when one of the tasks has exited is not normally productive. This behavior can be changed using srun’s –wait=
Related Questions
- Does the Employer following the statutory discipline and dismissal procedure mean that there are no other procedural requirements that must be followed?
- What does it mean when a V CASTSM Music song or album title is followed by the word "EXPLICIT"?
- What does this mean: "srun: First task exited 30s ago" followed by "srun Job Failed"?