# container cleanup finally blocks can run. Ask for SIGINT so we
# can get the default Python KeyboardInterrupt which third-party
# code is likely to plan for. Make sure to send it to the batch
We could make it configurable or just change it to another signal, but we'd need to make sure that e.g. the WDL and CWL Docker container management code knows to trap that signal, clean up running containers, and then exit.
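As a rough illustration of the "trap that signal, clean up, then exit" idea, here is a minimal sketch in plain Python. It is not Toil's actual code: `BatchSystemTermination`, `raise_on_signal`, and `run_with_cleanup` are hypothetical names, and the choice of `SIGTERM` is only an assumption about which signal the batch system would be asked to send.

```python
import os
import signal


class BatchSystemTermination(Exception):
    """Hypothetical exception: the batch system asked this job to stop."""


def raise_on_signal(signum, frame):
    # Raising from the handler unwinds the main thread, so enclosing
    # finally blocks (e.g. container cleanup) still get a chance to run.
    name = signal.Signals(signum).name
    raise BatchSystemTermination(
        f"Received {name}; the job was probably stopped by the batch system "
        "(for example, for exceeding the partition time limit)."
    )


def run_with_cleanup(work, cleanup, signum=signal.SIGTERM):
    """Run work() with signum trapped, and always call cleanup() afterwards.

    signum defaults to SIGTERM purely as an assumption for this sketch.
    Must be called from the main thread, where signal handlers can be set.
    """
    previous = signal.signal(signum, raise_on_signal)
    try:
        return work()
    finally:
        cleanup()
        # Put back whatever handler was installed before we started.
        signal.signal(signum, previous)
```

With something like this, the traceback would name the signal and hint at the batch system instead of showing a bare `KeyboardInterrupt`. Third-party code that only catches `KeyboardInterrupt` would not see this exception, though, which is the trade-off described above.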
In my experience, if a job runs beyond the maximum run time allowed for its Slurm partition, it gets killed (fine), but with a `KeyboardInterrupt`, which is super cryptic. It always makes me think I've somehow pressed Ctrl-C or something by accident. It's also confusing Cactus users on GitHub: ComparativeGenomicsToolkit/cactus#1554
Would it be possible to patch Toil to give a more informative error for this?
┆Issue is synchronized with this Jira Story
┆Issue Number: TOIL-1685