
Question: Is it possible to utilize multiple cores when training (adding measurements)? #344

Open
brandon-holt opened this issue Aug 15, 2024 · 4 comments


@brandon-holt (Contributor) commented Aug 15, 2024

Hi, I noticed that when adding measurements to a campaign object, only one core is utilized. Is there a way to parallelize this step to reduce the runtime? It is currently very slow for me.

By contrast, I noticed that when running the simulate_experiment module, all cores are in use. I know these are different processes, but I was curious why that module can use multiple cores while adding measurements cannot.

Thanks!
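For concreteness, here is a minimal sketch of the pattern I mean. The parameters, target name, and objective wrapper are placeholders for my real setup, and the exact class names may differ slightly between BayBE versions:

```python
import time

from baybe import Campaign
from baybe.objectives import SingleTargetObjective
from baybe.parameters import NumericalDiscreteParameter
from baybe.searchspace import SearchSpace
from baybe.targets import NumericalTarget

# Small stand-in for my much larger discrete search space.
parameters = [
    NumericalDiscreteParameter(name=f"x{i}", values=tuple(float(v) for v in range(10)))
    for i in range(4)
]
campaign = Campaign(
    searchspace=SearchSpace.from_product(parameters),
    objective=SingleTargetObjective(target=NumericalTarget(name="Yield", mode="MAX")),
)

# Get recommendations, attach the measured target values, and time the add step.
measurements = campaign.recommend(batch_size=5)
measurements["Yield"] = 1.0  # placeholder for real measurements
start = time.perf_counter()
campaign.add_measurements(measurements)  # runs on a single core; slow for large spaces
print(f"add_measurements took {time.perf_counter() - start:.3f} s")
```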

@AdrianSosic (Collaborator)

Hi @brandon-holt, as always, thanks for reporting the issue. The fact that the mere addition of measurements (i.e., without even recommending) causes delays is clearly suboptimal and needs to be fixed. Ideally, this step should not be noticeable at all, but the current overhead stems from a design choice that we might need to rethink: it is most likely caused by the process of "marking" measured parameter configurations in the search space metadata. This process is currently by no means optimized for speed, and I see several potential ways around it that we would need to discuss in our team (a toy sketch of the matching step follows the list):

  • Making the involved fuzzy matching more efficient
  • Switching to a more performant backend like polars
  • Following an entirely different approach to metadata handling
  • ...
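To illustrate the first point: the expensive part is locating each measured configuration among tens of millions of search-space rows. Below is a toy sketch in plain pandas, not our actual implementation; the column names and metadata flag are made up, and it covers only the exact-match case (the fuzzy matching we do for numerical parameters is the harder part). It shows how the lookup can be vectorized with a single merge instead of scanning the space once per measurement:

```python
import itertools

import pandas as pd

# Toy stand-in for the discrete search space's experimental representation
# (exp_rep) plus a "was measured" metadata flag; all names are illustrative.
values = range(40)
exp_rep = pd.DataFrame(
    itertools.product(values, values, values, values),  # 40**4 = 2,560,000 unique rows
    columns=list("abcd"),
)
metadata = pd.Series(False, index=exp_rep.index, name="was_measured")

# A batch of newly measured configurations (here simply sampled from the space).
measurements = exp_rep.sample(100, random_state=0)

# Vectorized exact matching: one merge over all parameter columns recovers the
# row indices of every measurement in a single hash join.
hits = exp_rep.reset_index().merge(measurements, on=list("abcd"))["index"]
metadata.loc[hits] = True
assert metadata.sum() == len(measurements)
```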

I suspect your search space is quite big and that this is what causes the delays? Can you give me a rough estimate of its dimensions so that I have something to work with?

@brandon-holt (Contributor, Author)

@AdrianSosic I see, that is an interesting insight!

Here are the dimensions of a typical campaign search space I am working with:

```
campaign.searchspace.discrete.comp_rep.shape = (37324800, 191)
campaign.searchspace.discrete.exp_rep.shape = (37324800, 8)
```
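For scale: assuming dense float64 storage, the comp_rep alone works out to roughly 37,324,800 × 191 × 8 B ≈ 57 GB, so any per-row matching over these tables adds up quickly.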

@AdrianSosic (Collaborator)

Thanks for sharing. That is indeed already quite a bit. I'll take this to our team meeting and see what we can do about it. Perhaps we can find a quick fix for you... but priority-wise, a full fix could take a while, since my focus is currently still on the surrogate / SHAP issue 😋

@Scienfitz (Collaborator)

@AdrianSosic this issue needs to be updated to properly describe the cause of the computational bottleneck; otherwise, I will convert it to a discussion.
