-
Notifications
You must be signed in to change notification settings - Fork 82
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
🐌 Slow tests #603
Comments
Of course, we want tests' reproducibility and don't want our code to fail because some code before has modified our assets. But maybe we can consider more user-friendly approaches? I have those options in mind:
|
@John-P @shaneahmed, maybe you have other ideas? |
It looks like there are a few issues to address here:
|
@blaginin As John mentioned downloads should be for a session. Please let us know if this is not the case. Feel free to create a PR to fix this. |
This addresses part of issue #603 by improving model weights and datasets in tests. Previously, weights for each model were downloaded to `f"{tmp_path}/weights.pth"` and replaced each other. The only available dataset, Kather, was downloaded into `TIATOOLBOX_HOME`, but the entire home directory was deleted during test execution. This approach has several drawbacks: - Tests are extremely slow. On every test run, it downloads ~8 GB of weight files which never change. - Tests cannot be run in parallel as they depend on the same `f"{tmp_path}/weights.pth"` file. - Tests heavily load TIA's infrastructure. For each test run, the server has to transfer almost 10 GB of static assets. This might work while the number of contributors is small, but what happens when it increases? - Tests were inconsistent with their models caching. Why, for example, PatchPredictor cached their pretrained_models, but UNetModel did not? Now, all weights are downloaded to TIATOOLBOX_HOME and cached between runs. Storing more data in memory (+8 GB) makes tests much faster, especially if you're not in Coventry or have slow internet. In rare cases where you specifically need to test behaviour without cache, you can set TIATOOLBOX_HOME to a temporary directory.
Addresses part of issue #603 and improves fixtures in tests: - Some tests were using `_fetch_remote_sample` even though tests have a special `remote_sample` wrapper. - `remote_sample` previously created a new folder for each run (data-0, data-1, data-2, and so on). Now it uses just one folder for all remote fixtures during a test session. - To prevent duplicated downloads, I added download locks to the fixtures. - Combined `_fetch_remote_sample` and `download_data` since they had the same logic. --------- Co-authored-by: Shan E Ahmed Raza <[email protected]>
Description
Tests in the toolbox are extremely slow. Even with
CI=True
, all the checks take almost half an hour. Non-CI mode takes even more time. This makes an interactive development approach, with running tests after small changes, impossible. This results in writing bad test-related code.The main reasons why tests are slow are static files. Each test run is related to downloading and then deleting gigabytes of models' weights and test images. However, almost always, those files remain unchanged after the tests' execution.
Not only is this slow, but it also triggers a lot of false positive results. For example, that's the typical result of my local run:
Todo
The text was updated successfully, but these errors were encountered: