Bulletproof Runs #423

pgierz · 2021-08-05T07:29:27Z

Hello all,

@chrisdane has been helping me out testing the "output from echam.namelist" feature, which will (once ready) allow you to figure out which files to move around based upon both the streams defined in your YAML and on top of that whatever the echam namelist mvstreams sets up for you (For details, see the parts of Issue #384)

In the process of testing, he accidentally managed to break one of his production runs, because the virtual environment was turned off. Therefore, we came up with the following idea:

Virtual environments should be on by default
They should install whatever you currently have as default unless you directly specify something else

Turning on the virtual env by default is simple, I will send a PR for that in a moment. Less simple is figuring out all the branches you currently have. That would be a job for esm-version checker. @denizural , I'll open a separate issue in that repository to discuss more, but it would be nice to have a feature where we have a function, and it just spits back a dictionary of:

current_branches: {
    "esm_tools": "140d197"
}

Effectively, one for git-sha for each project. But, as I said, I'll make a separate issue to discuss that part.

The text was updated successfully, but these errors were encountered:

dbarbi · 2021-08-05T07:34:39Z

objection... we decided to not have any default for venv because either default will mess with people. i would rather not do that...

chrisdane · 2021-08-05T07:58:10Z

I dont unterstand you Dirk. What Paul suggests was the default in the old esm tools, right? Everything was copied to the experiment directory and then used from there forever. In my view that made sense and as a user I would like to have that as well in the new esm tools.

pgierz · 2021-08-05T07:59:05Z

@cdanek, to clarify for you. In principle, if you want to make sure nothing ever changes, always make a virtual environment.

What we currently have implemented is:

Runscript Option	Command Line Option	Already using a venv at submission time?	Tool Location Used	Tool Version Used
unset	unset	No	`~/.local/lib/python-<version>/site-packages/esm-tools`	whatever your branch is
unset	unset	Yes	`<venv_base>/lib/python-<version>/site-packages/esm-tools`	whatever your branch is
`use_venv: True`	unset	No	`<experiment_path>/.venv/site-packages/esm_tools`	`release`
unset	`--open-run`	No	`~/.local/lib/python-<version>/site-packages/esm-tools`	whatever your branch is
unset	`--open-run`	Yes	`<venv_base>/lib/python-<version>/site-packages/esm-tools`	whatever your branch is
unset	`--contained-run`	Yes	`<experiment_path>/.venv/lib/python-<version>/site-packages/esm-tools`	`release`
`use_venv: True` and `install_esm_tools_branch: develop`	unset	No	`<experiment_path>/.venv/site-packages/esm_tools`	`develop`
`use_venv: False`	unset	No	`<experiment_path>/.venv/site-packages/esm_tools`	`release`

I'm probably missing a few cases, but I think that outlines the general idea.

dbarbi · 2021-08-05T08:00:50Z

No, in the old esm-tools we didn't have any virtual environment. i agree with you it makes a lot of sense to use venv for runs, but as we have also developers to think about, they don't want to use it. that's why we have the interactive question if you don't specify anything - to make you aware that venv exists, and to think about whether you want to use it

pgierz · 2021-08-05T08:03:09Z

I think I even made that part bright green in the question:
https://github.com/esm-tools/esm_runscripts/blob/9405139c2ddee6eeb7c9454de3cd0be81e65ba2b/esm_runscripts/virtual_env_builder.py#L246

pgierz · 2021-08-05T08:06:58Z

Chris, how did you launch the run that got messed up? Did it have anything set, or did you use any command flags?

I could check if anything is set, and then be pedantic and ask the user even one more time, just to be sure. That would quickly get annoying though. Or @dbarbi we make a secret flag "developer mode" or something that skips the second question if something is already set.

pgierz · 2021-08-05T08:17:34Z

Something not on my table but probably should be, if both the runscript and the command line are set, the unscript wins. Maybe I should swap that around...

chrisdane · 2021-08-05T08:17:57Z

i agree with you it makes a lot of sense to use venv for runs, but as we have also developers to think about, they don't want to use it.

What is the reason for the developers not to to so? It takes too long to copy everything? The cp process could run in the backgroud? And if the model needs to be resubmitted before the venv copy is ready (the only problematic case?), the esm_tools could wait until the copy process is finished (btw it takes ~6 minutes on mistral, not ~3).

Chris, how did you launch the run that got messed up? Did it have anything set, or did you use any command flags?

The problem with this interactive part is that if I submit a runscript that not explicity sets use_venv, the call

esm_runscripts runscript.yaml -e test > test.log 2>&1 &

does not work since the piping of the output does not allow the interactive part (or I was too stupid). As a consequence I have set venv=false for testing and then for production I forgot to set it back to true. So, my failure of course. Is there a way to help me not to make such errors? :D

Pauls table looks rather complicated to me. Why do you need both venv and open-run? I think I could better understand if there is only one of these.

One more thing that I dont understand: venv uses the online repo. That means that I need to push the current version of e.g. the esm_tools I want to use for production run?

pgierz · 2021-08-05T08:42:50Z

What is the reason for the developers not to to so?

This is a good question. Actually, in the Python world, you should be basically always using a separate environment for each project to avoid version conflicts.

Is there a way to help me not to make such errors? :D

We could require two separate people to double check each runscript used for production. Maybe even with a GPG signature. But this is way waaaaay overkill. I'm sure there is even a python library to check against GPG, but that's one of those things I'd have fun hacking in, but then no one would ever use.

One more thing that I dont understand: venv uses the online repo. That means that I need to push the current version of e.g. the esm_tools I want to use for production run?

Yes, you do. It does a new clone and install over pip. But the idea of the venv thing is that people should be using the last stable version for their production simulations, which for us is always whatever is in the release branch. There are ways to use special branches in the virtualenv, see my table above, but by default you get release. If you need to use a separate special branch for your simulation, what you are doing is in that case not officially supported (at least not by the esm-team and my own little volunteer time).

chrisdane · 2021-08-05T08:52:43Z

If you need to use a separate special branch for your simulation, what you are doing is in that case not officially supported

In a perfect world I would use the last stable version of the esm_tools. But it never reached a stable state for my simulations: I never made a single run with release (please make a survey and realize that I am not the only one).

So the current venv implementation forces me to push a new version/branch of the esm tools even with only one character changed compared to release. This strategy spams the whole repo, in my view.

pgierz · 2021-08-05T08:59:29Z

This strategy spams the whole repo, in my view.

Isn't any typo you fix good for the overall project?

dbarbi · 2021-08-05T11:00:09Z

I can't give that any priority.

pgierz · 2021-08-24T09:55:40Z

So, what's the status here? Do we need to include a section in the handbook to clarify things?

pgierz · 2021-08-24T13:43:51Z

Re-open if needed, please.

pgierz closed this as completed Aug 24, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bulletproof Runs #423

Bulletproof Runs #423

pgierz commented Aug 5, 2021

dbarbi commented Aug 5, 2021

chrisdane commented Aug 5, 2021

pgierz commented Aug 5, 2021 •

edited

Loading

dbarbi commented Aug 5, 2021

pgierz commented Aug 5, 2021

pgierz commented Aug 5, 2021

pgierz commented Aug 5, 2021

chrisdane commented Aug 5, 2021 •

edited

Loading

pgierz commented Aug 5, 2021 •

edited

Loading

chrisdane commented Aug 5, 2021

pgierz commented Aug 5, 2021

dbarbi commented Aug 5, 2021

pgierz commented Aug 24, 2021

pgierz commented Aug 24, 2021

Bulletproof Runs #423

Bulletproof Runs #423

Comments

pgierz commented Aug 5, 2021

dbarbi commented Aug 5, 2021

chrisdane commented Aug 5, 2021

pgierz commented Aug 5, 2021 • edited Loading

dbarbi commented Aug 5, 2021

pgierz commented Aug 5, 2021

pgierz commented Aug 5, 2021

pgierz commented Aug 5, 2021

chrisdane commented Aug 5, 2021 • edited Loading

pgierz commented Aug 5, 2021 • edited Loading

chrisdane commented Aug 5, 2021

pgierz commented Aug 5, 2021

dbarbi commented Aug 5, 2021

pgierz commented Aug 24, 2021

pgierz commented Aug 24, 2021

pgierz commented Aug 5, 2021 •

edited

Loading

chrisdane commented Aug 5, 2021 •

edited

Loading

pgierz commented Aug 5, 2021 •

edited

Loading