marp | theme | paginate | license | title | author | ||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
true |
marp-theme_dataplant-ceplas-ccby |
true |
[CC-BY 4.0](https://creativecommons.org/licenses/by/4.0/) |
Swate hands-on |
|
- Get familiar with ISA metadata and Swate
- Annotate data in your ARC
☑️ Make sure Swate is installed:
- Open Excel (online or Desktop)
- Go to the
Insert
tab: Click the arrow next to "My Add-ins". There you should be able to select Swate. - Go to the
Data
tab: you should see the Swate (Core) add-in.
💡 Alternatively, you can use Swate standalone
(:warning: this is however work in progress and likely to change)
- Windows Notepad
- MacOS TextEdit
Recommended text editor with code highlighting, git support, terminal, etc: Visual Studio Code
- Open the ARCitect
- Login (1) to your DataHUB account
- Navigate to Download ARC (4)
- Search for Talinum-CAM-Photosynthesis
- Click the download button, select a location and open the ARC.
💡 This is basically the ARC we created last session.
👩💻 Initiated an ARC
📂 Structured and ...
🌐 Shared with collaborators
Today we want to
... annotate the experimental data
- Navigate to the demo ARC.
- Open the lab notes
studies/talinum_drought/protocols/plant_material.txt
in a text editor. - Open the empty
studies/talinum_drought/isa.study.xlsx
workbook in Excel.
<style scoped> .columns { /* grid-template-columns: repeat(2, minmax(0, 1fr)); */ grid-template-columns: 500px 500px; gap: 30px; display: flex; justify-content: center; } </style>
Create a Swate annotation table via the create annotation table button in the yellow pop-up box OR click the Create Annotation Table quick access button.
💡 Each table is by default created with one input (Source Name
) and one output (Sample Name
) column
💡 Only one annotation table can be added per Excel sheet
- Navigate to the Building Blocks tab via the navbar. Here you can add Building Blocks to the table.
- Instead of Parameter select Characteristic from the drop-down menu (A)
- Search for
organism
in the search bar (B). This search looks for suitable Terms in our Ontology database. - Select the Term with the id
OBI:0100026
and, - Click Add building block.
💡 This adds three columns to your table, one visible and two hidden.
- Navigate to the Terms tab in the Navbar
- In the annotation table, select any number of cells below
Characteristic [organism]
- Click into the search field in Swate.
💡 You should see organism
showing in a field in front of the search field
💡 The search will now yield results related to organism
- In the search field, search for "Talinum fruticosum"
- Select the first hit and click Fill selected cells with this term
- In the Building Blocks tab, select Parameter, search for
light intensity exposure
and select the term with idPECO:0007224
. - Check the box for This Parameter has a unit and search for
microeinstein per square meter per second
in the adjacent search bar. - Select
UO:0000160
. - Click Add building block.
💡 This adds four columns to your table, one visible and three hidden.
In the annotation table, select any cell below Parameter [light intensity exposure]
and add "425" as light intensity.
💡 You can see the numbers being complemented with the chosen unit, e.g. 425.00 microeinstein per square meter per second
Hold Ctrl and click the Autoformat Table quick access button to adjust column widths and un-hide all hidden columns.
💡 You can see that your organism of choice was added with id and source Ontology in the reference (hidden) columns.
Click the Update Ontology Terms quick access buttons.
💡 This updates all reference columns according to the main column. In this case the reference columns for Parameter [light intensity exposure]
are updated with the id and source ontology of the microeinstein per square meter per second
unit.
At this point. Your table should look similar to this:
Click the Autoformat Table quick access button without holding Ctrl to hide all reference columns.
Try to add suitable building blocks for other pieces of metadata from the plant growth protocol (studies/talinum_drought/protocols/plant_material.txt
).
- In the Building Blocks tab, select Factor, search for
watering exposure
and select the term with idPECO:0007383
. - Click Add building block.
- Add the drought treatment ("no water for 12 days", "re-water for 2 days") to the respective samples
💡 There are different options to add the drought treatment.
- In the Building Blocks tab, select Protocol Columns -> Protocol REF.
- Click Add building block.
- Add the name of the protocol file (plant_material.txt) to the Protocol REF column.
💡 This allows you to reference the free-text, human-readable protocol.
Transfer the sample ids from the protocol.
- Invent names for
Source Name
(we do not have this information) - Use the sample names (DB_*) as
Sample Name
- Navigate to the demo ARC.
- Open the lab notes
assays/rnaseq/protocols/RNA_extraction.txt
in a text editor. - Open the empty
assays/rnaseq/isa.assay.xlsx)
workbook in Excel.
- Navigate to Templates in the Navbar and click Browse database in the first function block.
💡 Here you can find community created workflow annotation templates
- Search for
RNA extraction
and click select- You will see a preview of all building blocks which are part of this template.
- Click Add template to add all Building Blocks from the template to your table, which do not exist yet.
Sometimes you need to add or update the unit of an existing building block.
- Select any number of rows of the
Parameter [biosource amount]
building block to mark it for the next steps. - Open the Building Blocks tab
- In the bottom panel "Add/Update unit reference to existing building block", search for the unit "milligram". Select the unit term and click Update unit for cells. :bulb: If you already had values in the main column they will be updated automatically.
- Click the Update Ontology Terms quick access button, to update the reference columns.
If there are any Building Blocks which do not fit your experiment you can use the Remove Building Block quick access button to remove it including all related (hidden) reference columns.
- Add a new sheet to the
assays/rnaseq/isa.assay.xlsx)
workbook. - Add the template "RNASeq Assay"
Try to fill the two sheets with the protocol details:
assays/rnaseq/protocols/RNA_extraction.txt
andassays/rnaseq/protocols/Illumina_libraries.txt
- Use the
Sample Name
of studies/talinum_drought/isa.study.xlsx as theSource Name
to rna-extraction. - Use the
Sample Name
of rna-extraction as theSource Name
to illumina-libraries.
- In the Building Blocks tab, select Output Columns -> Raw Data File.
- Click Add building block.
💡 You see a warning about a changed output column.
- Click Continue.
- Go to the File Picker tab and click Pick file names.
- Select and open the *fastq.gz files from the dataset folder.
- Copy / paste them to the
Raw Data File
.
💡 This allows you to link your samples to the resulting raw data files.
Go ahead, adjust the Building Blocks you want to use to describe your experiment as you see fit. Insert values using Swate Term search and add input and output.
- Open the empty
assays/metabolomics/isa.assay.xlsx)
workbook in Excel. - Navigate to Templates in the Navbar and scroll down to "Add template(s) from file."
- Click Upload protocol
- Select the file "swate_agilent_gc.json" from the demo data.
- Click Insert json
💡 This adds not only an empty template, but a filled out table with keys (headers) and values (cells).
Swate uses Excel's "table" feature to annotate workflows. Each table represents one process from input (e.g. plant leaf material) to output (e.g. leaf extract).
Example workflows with three processes each:
- Plant growth → sampling → extraction
- Measured data files → statistical analysis → result files
💡 Excel tables allow to group data that belongs together inside one sheet. This is not to be confused with a (work)sheet or workbook.
workbook (e.g. "isa.assay.xlsx") └─── worksheet (e.g. "plant_growth") └─── table (e.g. "annotationTable")
- Annotation within ARCitect is not yet available.
- Swate and ARCitect handle isa.study.xlsx / isa.assay.xlsx files differently.
Slides presented here include contributions by
- name: Dominik Brilhaus github: https://github.com/brilator orcid: https://orcid.org/0000-0001-9021-3197
- name: Kevin Frey github: https://github.com/Freymaurer orcid: https://orcid.org/0000-0002-8493-1077
- name: Martin Kuhl github: https://github.com/Martin-Kuhl orcid: https://orcid.org/0000-0002-8493-1077
- name: Sabrina Zander orcid: https://orcid.org/0009-0000-4569-6126