Feature Request: parallel matching for prescored variants #82

yangyxt · 2024-12-15T13:46:47Z

When I try to run CADD on a VCF file with 200k variants, I found the prescore match step executed by extract_scored.py is pretty time consuming. I think maybe this step can be accelerated by parallel matching per chromosome.

I suggest split the prescore file to 24 pieces by chromosome and split the input VCF to pieces by chormosome as well. For each chromosome, perform the extract_scored.py once and let them perform in parallel.

If it is OK for you, I can offer a PR later. Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: parallel matching for prescored variants #82

Feature Request: parallel matching for prescored variants #82

yangyxt commented Dec 15, 2024

Feature Request: parallel matching for prescored variants #82

Feature Request: parallel matching for prescored variants #82

Comments

yangyxt commented Dec 15, 2024