-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cram files? #15
Comments
I went through motions of passing IIUC, you'd specify relevant options like reference path the same way you do in hadoop-bam, e.g. as a property on the Hadoop Making those properties proper method-params to be more idiomatic Scala would be nice. Feel free to post the results of trying it! Alternatively, your application code can decide to call hadoop-bam or spark-bam based on the file's extension 🙁 |
Hey @jjfarrell, I looked through the related posts broadinstitute/gatk#4506 and HadoopGenomics/Hadoop-BAM#196 (comment) and am curious to dig a little bit. Is there a public |
@ryan-williams That cram one is not available. However, I am working on a getting a cram of a GIAB sample available for testing. |
Here is a publiclly available cram from 1000 genomes. Again I found the Spark GATK v4.0.2.1 job was quite slow processing this cram.
Here are the urls for a cram.... ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/1000_genomes_project/data/CHS/HG00419/high_cov_alignment/HG00419.alt_bwamem_GRCh38DH.20150917.CHS.high_coverage.cram |
Does spark-bam handle cram files? If so, how does the reference get specified?
The text was updated successfully, but these errors were encountered: