-
Notifications
You must be signed in to change notification settings - Fork 10
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #568 from nih-cfde/Cavatica-RNA-seq-training-updat…
…es-manifest-integration Cavatica rna seq training updates manifest integration
- Loading branch information
Showing
32 changed files
with
136 additions
and
17 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,11 +1,12 @@ | ||
nav: | ||
- rna_seq_1.md | ||
- rna_seq_2.md | ||
- rna_seq_3.md | ||
- rna_seq_4.md | ||
- rna_seq_5.md | ||
- rna_seq_6.md | ||
- rna_seq_7.md | ||
- rna_seq_8.md | ||
- rna_seq_9.md | ||
|
||
- rna_seq_01.md | ||
- rna_seq_02.md | ||
- rna_seq_03.md | ||
- rna_seq_04.md | ||
- rna_seq_05.md | ||
- rna_seq_06.md | ||
- rna_seq_07.md | ||
- rna_seq_08.md | ||
- rna_seq_09.md | ||
- rna_seq_10.md | ||
|
Binary file added
BIN
+1.56 MB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-01.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+1.19 MB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-02.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+76.7 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-03.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+116 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-04.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+126 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-05.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+1.21 MB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-06.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+66.3 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-07.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+89.5 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-08.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+29.4 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-09.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+75.4 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-10.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+130 KB
...on-Cavatica/rna-seq-images/rna-seq-10-11-copy-project-config-settings-allow.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+118 KB
...NAseq-on-Cavatica/rna-seq-images/rna-seq-10-11-copy-project-config-settings.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+51.9 KB
...nalyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-11-copy-project-config.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+181 KB
...ses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-11-copy-project-in-project.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+212 KB
...-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-11-copy-project-tile.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+51.5 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-11.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+213 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-12.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+48.2 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-13.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+44.7 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-14.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+69.3 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-15.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+459 KB
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna-seq-images/rna-seq-10-16.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
116 changes: 116 additions & 0 deletions
116
docs/Bioinformatic-Analyses/RNAseq-on-Cavatica/rna_seq_10.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,116 @@ | ||
--- | ||
layout: page | ||
title: Incorporating CFDE Portal Datasets in Kids First Analyses | ||
--- | ||
|
||
<div class="banner"><span class="banner-text">Lesson in Development</span></div> | ||
|
||
Incorporating CFDE Portal Datasets in Kids First Analyses: Find CFDE datasets on CFDE Portal and import into a CAVATICA project | ||
========================== | ||
|
||
|
||
As part of the NCPI Effort, methods have been developed to allow users to bring datasets from the other NIH platforms into CAVATICA for combined research projects. One external dataset that is supported is the [Genotype Tissue Expression (GTEx) Program](https://commonfund.nih.gov/gtex). GTEx was funded to study the relationship between genetic variants (inherited changes in DNA sequence) and gene expression (how genes are turned on and off) in multiple human tissues and across individuals. Their datasets can serve as great controls for RNA-Seq experiments, comparing expression in GTEx's "normal" brain tissue to Kids First's brain cancer tissue. | ||
|
||
The previous examples provided a walkthroughs for identifying RNA-Seq datasets from the [Kids First Data Resource Portal](https://portal.kidsfirstdrc.org/) and pushing them to [CAVATICA](https://cavatica.sbgenomics.com/) for analysis, as well as bringing in data from the [NHGRI Analysis Visualization and Informatics Lab-space (AnVIL)](https://anvilproject.org/) Portal and import these files into a CAVATICA project for a combined analysis with Kids First data. | ||
|
||
This supplemental lesson will demonstrate how to find datasets, such as GTEx and those of other CFDE Data Coordinating Centers, on the CFDE Portal and then importing a manifest list of file DRS URIs into a CAVATICA Project. | ||
|
||
|
||
## Step 1: Identify Files on the CFDE Portal and Export as an NCPI Manifest of DRS URIs | ||
|
||
- First, navigate to the [CFDE Portal](https://app.nih-cfde.org/) and select `Data Browser`, then `File` from the top toolbar. | ||
|
||
<img src="./rna-seq-images/rna-seq-10-01.png"> | ||
|
||
- From there, you can select from the filters in the left-hand toolbar to narrow the scope of your search to your desired files. In this example, we have chosen the `FASTQ` and `BAM` file formats, `RNA sequencing assay` Assay type, and the GTEx and Gabriella Miller Kids First Common Fund Programs. | ||
|
||
<img src="./rna-seq-images/rna-seq-10-02.png"> | ||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-10-03.png" width=200> | ||
<img src="./rna-seq-images/rna-seq-10-04.png" width=200> | ||
<img src="./rna-seq-images/rna-seq-10-05.png" width=200> | ||
</p> | ||
|
||
- Once you have found your desired set of files, click on the `Export` button in the top-right corner of the screen and select the `NCPI File Manifest` option from the dropdown menu. Once selected, your manifest file should then download. | ||
|
||
<img src="./rna-seq-images/rna-seq-10-06.png"> | ||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-10-07.png" width=500> | ||
</p> | ||
|
||
- This concludes the necessary steps in the CFDE Portal. We will now move to CAVATICA. | ||
|
||
## Step 2: Import the NCPI Manifest of DRS URIs into CAVATICA | ||
|
||
The process for importing the DRS URIs into a CAVATICA project is extremely straightforward and does not require coding. | ||
|
||
- First, navigate to [CAVATICA](https://cavatica.sbgenomics.com/) and log-in via your eRA Commons account. | ||
|
||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-10-08.png" width=500> | ||
</p> | ||
|
||
- Note: if you are logging into CAVATICA for the first time you will be presented with an NIH consent screen followed by a Gen3 authorization screen. | ||
|
||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-9-06-01-nih-consent.png" width=600> | ||
<img src="./rna-seq-images/rna-seq-9-06-02-gen3-authorize.png" width=500> | ||
</p> | ||
|
||
- Once you have logged into CAVATICA, you must *either* select a pre-existing project *or* create a new project where you would like to import the files from your manifest. This can be done from the `Projects` section on the homepage or using the `Projects` dropdown menu from the top bar. | ||
|
||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-10-09.png" width=600> | ||
</p> | ||
|
||
- If you are creating a new project, click on the `Create Project` button. Select a title and billing group for your new project. Be sure to choose to `Allow network access` for this project under `Advanced settings`. | ||
|
||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-10-10.png" width=400> | ||
<img src="./rna-seq-images/rna-seq-10-11.png" width=400> | ||
</p> | ||
|
||
- If you would like to work with the Data Interoperability public project, make a copy of the project by either navigating to Public Projects and click on the "Copy Project" button on the Data Interoperability tile. | ||
|
||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-10-11-copy-project-tile.png" width=400> | ||
</p> | ||
|
||
- Or if you are within the Data Interoperbility project, select the "i" next to the project name, and then select to copy the project. | ||
|
||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-10-11-copy-project-in-project.png" width=400> | ||
</p> | ||
|
||
- Both paths will bring up the project creation menu. Click Copy to finalize the creation of the project. | ||
|
||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-10-11-copy-project-config.png" width=400> | ||
</p> | ||
|
||
- Validate that the network access does not defaulted to "Block network access" but is set to to "Allow network access". You can validate and change this setting to "Allow network access" if necessary. This will enable you to use the Cloud-agnostic Data Import interactive analysis. Click on the "i" next to the copied project title. Then click on Settings. | ||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-10-11-copy-project-config-settings.png" width=400> | ||
</p | ||
- This will bring you to the page where you can select the "Allow network access" setting. | ||
<p align="center"> | ||
<img src="./rna-seq-images/rna-seq-10-11-copy-project-config-settings-allow.png" width=400> | ||
</p> | ||
|
||
- After creating or opening your target project, select the `Files` menu from the project toolbar. You may import your files into the main directory or you may choose at this point to create a folder where you would like to import your files instead. Once you have navigated to the desired location, select `+ Add files` and then click on `GA4GH Data Repository Service (DRS)` from the dropdown menu. | ||
|
||
<img src="./rna-seq-images/rna-seq-10-12.png"> | ||
<img src="./rna-seq-images/rna-seq-10-13.png"> | ||
|
||
- Next, select `From a manifest file` and upload your manifest file from the CFDE Portal. You may then add tags to the files you are about to import, as well as select whether to skip or autorename files when confronted with a naming conflict. Finally, select the checkbox acknowledging that you will adhere to acceptable use of the data, including but not limited to any applicable data use agreements. Once you have completed these steps, click on the `Submit` button to begin importing your files. | ||
|
||
<img src="./rna-seq-images/rna-seq-10-14.png"> | ||
<img src="./rna-seq-images/rna-seq-10-15.png"> | ||
|
||
- You should then be navigated back to the file explorer for your project where you should now be able to see your imported files. | ||
|
||
<img src="./rna-seq-images/rna-seq-10-16.png"> | ||
|
||
- This completes the file transfer - the files will now be accessible in the Data Cruncher as well as in the Files Tab of the Project! |