-
Notifications
You must be signed in to change notification settings - Fork 190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add max_threads_per_process
and mp_context
to pca by channel computation and PCA metrics
#3434
Add max_threads_per_process
and mp_context
to pca by channel computation and PCA metrics
#3434
Conversation
After tests pass I can test this in lab. PCA is so brutal on Windows! Hopefully setting this also gets us a speed boost too (although could just be a Windows problem). |
Since I'm at it, I'll propagate the same trick to the PCA metrics. For PCA computation, I'm getting 5-6x speed up on linux! |
max_threads_per_process
to pca fit_by_channelmax_threads_per_process
and mp_context
to pca by channel computation and PCA metrics
…fix-pca-multiprocessing
@zm711 let me know if this speeds things up on Windows! @jonahpearl can you also give it a try to see if it fixes the PCA hanging? |
In a seminar this morning, I'll test this on an old dataset this afternoon :) |
So testing now it seems like we don't get a speed up at all on Windows. What version of sklearn do you have? Basically no matter what I set for n_jobs I still use all processors at 100% activity. I tried varying the ADDITION: since I never had the hanging issue I don't know if this fixes that though! |
Mmm interesting! Have you tried max_threads to 1? |
Nope, but I can try first thing tomorrow morning when I'm back at the Windows station. |
OK for me. |
Same speed for 1 thread. And still uses all cores to the max. I think this must be a Windows scheduler thing. It doesn't make sense to bounce processes around on different processors, but it must be happening. |
No luck for me but I do have an update which i'll put in the issue |
No description provided.