-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Somatic update #156
Comments
For 'Unify output filenames for Control Freec and CopyWriter':
Line 290 - method |
I don't understand the changes to -- ratios_fn=paste("freec.", sample_name, ".ratio.txt", sep=""),
-- log2_fn=paste("freec.", sample_name, ".log2.txt", sep=""),
-- call_fn=paste("freec.", sample_name, ".call.txt", sep=""),
-- segments_fn=paste("freec.", sample_name, ".segments.txt", sep=""),
+- ratios_fn=paste("freec.", sample_name, "_ratio.txt", sep=""),
+- log2_fn=paste("freec.", sample_name, "_gene_log2.txt", sep=""),
+- call_fn=paste("freec.", sample_name, "_gene_call.txt", sep=""),
+- segments_fn=paste("freec.", sample_name, "_segments.txt", sep=""), For the function -- paste( mapper, ".copywriter.", fullID, "_segments.txt", sep="" ) ),
+- paste( mapper, ".copywriter.", fullID, ".segments.txt", sep="" ) ), |
Another points about |
I am afraid I have created a monster here. I should have stuck to a naming convention for all CNV steps:
You are welcome to unify the naming convention. On a side note, I found that output filenames are configured differently in different places,even within the same step & tool (for example names are constructed with underscore and with dot). I don't know which is the right way to go. Ideally, I would like to:
This is probably a lot of unnecessary work, but perhaps we should consider moving to dot separation across the somatic CNV steps. |
Regarding the This is a real problem, because these packages are huge (genome sequence, genome feature annotations, gene ids & functional annotations), and they are only valid for one genome release. If we want to use GRCh38, or mouse data, then we need to use other version of these packages. The problem comes as the packages are downloaded using the wrapper's conda environment. So the only solution (at the moment) is to put in the conda environment annotation packages for several genome releases We discussed with Clemens a way around this problem. We would have an initial sub-step which creates a sub-directory in |
Name pattern: Probably a good idea to stick to one across all workflows. I think I went over all the ones present in the original PR, but it might be something to gradually fix. Workflow distinction: I suggest we keep the separation. Too much work for very little value. control_freec_write_files: Way beyond my knowledge of the topic. I was just referring to the fact that there seems to be some default values for the output arguments - in this case you needed to modify them, you might consider have no defaults and avoid that: -- ratios_fn=paste("freec.", sample_name, ".ratio.txt", sep=""),
+- ratios_fn=paste("freec.", sample_name, "_ratio.txt", sep=""), |
Sorry I misunderstood your comment. You are absolutely right: |
Tasks
Tasks extracted from PR #94 :
Based on commit 0a30513
Based on commit e3be244
mutect2
.Based on commit 1a45161
eb_filter
.jannovar
wrapper.Based on commit c259444
snappy_pipeline/workflows/somatic_variant_calling/__init__.py
-mutect2
related.Based on commit 18b8362
snappy_wrappers/wrappers/mutect2/contamination/wrapper.py
snappy_wrappers/wrappers/mutect2/filter/wrapper.py
Based on commit c469173Updatecbioportal_export
default config.Fix wrappersnappy_wrappers/wrappers/cbioportal/case_lists/wrapper.py
.Fix wrappersnappy_wrappers/wrappers/cbioportal/meta_files/wrapper.py
.Fix wrappersnappy_wrappers/wrappers/vcf2maf/vcf2maf/wrapper.py
.Based on commit d407f20
Based on commit f062fc5
The text was updated successfully, but these errors were encountered: