
Add max_threads_per_process and mp_context to pca by channel computation and PCA metrics #3434

Merged: 6 commits, Sep 27, 2024

Conversation

@alejoe91 (Member) commented Sep 23, 2024

No description provided.

@alejoe91 alejoe91 added postprocessing Related to postprocessing module concurrency Related to parallel processing labels Sep 23, 2024
@zm711 (Collaborator) commented Sep 23, 2024

After the tests pass I can test this in the lab. PCA is so brutal on Windows! Hopefully setting this gets us a speed boost too (although it could just be a Windows problem).

@alejoe91 (Member, Author) commented

While I'm at it, I'll propagate the same trick to the PCA metrics.

For the PCA computation, I'm getting a 5-6x speed-up on Linux!

@alejoe91 alejoe91 changed the title Add max_threads_per_process to pca fit_by_channel Add max_threads_per_process and mp_context to pca by channel computation and PCA metrics Sep 23, 2024
@alejoe91 alejoe91 marked this pull request as ready for review September 24, 2024 08:20
@alejoe91 (Member, Author) commented

@zm711 let me know if this speeds things up on Windows! @jonahpearl can you also give it a try to see if it fixes the PCA hanging?

@zm711 (Collaborator) commented Sep 24, 2024

I'm in a seminar this morning; I'll test this on an old dataset this afternoon :)

@zm711 (Collaborator) commented Sep 24, 2024

Testing now: it seems like we don't get a speed-up at all on Windows. What version of sklearn do you have? No matter what I set for n_jobs, all processors still run at 100% activity. I also tried varying max_threads_per_process between 2 and 4 and saw no difference in calculation time. So I think this is either an OS issue (Windows is just slow for this) or maybe an environment issue.

ADDITION: since I never had the hanging issue, I don't know whether this fixes that!

@alejoe91 (Member, Author) commented

Mmm, interesting! Have you tried setting max_threads to 1?

@zm711 (Collaborator) commented Sep 24, 2024

> Mmm, interesting! Have you tried setting max_threads to 1?

Nope, but I can try first thing tomorrow morning when I'm back at the Windows station.

@samuelgarcia (Member) commented

OK for me.

@zm711 (Collaborator) commented Sep 25, 2024

Same speed with 1 thread, and it still uses all cores to the max. I think this must be a Windows scheduler thing: it doesn't make sense to bounce processes around between different processors, but that must be what's happening.
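One quick way to check whether the thread limit is actually reaching the native BLAS/OpenMP pools (as opposed to the OS scheduler ignoring it) is `threadpoolctl`'s `threadpool_info()`. A diagnostic sketch, assuming `threadpoolctl` is installed (it ships as a dependency of scikit-learn):

```python
import numpy as np
from threadpoolctl import threadpool_info, threadpool_limits

# Force a BLAS call so the native thread pools are actually loaded.
np.dot(np.ones((64, 64)), np.ones((64, 64)))

# Report the thread count of each loaded pool (e.g. OpenBLAS, MKL, OpenMP),
# first unrestricted, then under a limit of 1 thread.
print([(p["user_api"], p["num_threads"]) for p in threadpool_info()])
with threadpool_limits(limits=1):
    print([(p["user_api"], p["num_threads"]) for p in threadpool_info()])
```

If the second line still reports more than one thread per pool, the limit is not taking effect; if it reports 1 and all cores are still pegged, the load is coming from somewhere other than the BLAS pools.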

@alejoe91 alejoe91 added this to the 0.101.2 milestone Sep 27, 2024
@alejoe91 alejoe91 merged commit 3df1083 into SpikeInterface:main Sep 27, 2024
15 checks passed
@jonahpearl (Contributor) commented

No luck for me, but I do have an update, which I'll put in the issue.
