E-Module Mapping Script #117

alFrie · 2023-02-09T12:45:04Z

The e-module part got separated from the rest of the CPTO. We have a working mapping script for the simple ontology and metadata to be mapped. So on this branch we will write a mapping script for the e-module. Things to do:

Edit the metadata exctraction script: Hardcode the specimen age to 28 days and add to yaml-file.
Edit the metadata exctraction script: Extract the experiment duration ("Zeit") and add to yaml-file.
Edit the metadata exctraction script: Make the metadata-dictionary keys match the placeholders.
Map the metadata in the yaml-file to the ontology by replacing placeholders.
Write a test.

alFrie · 2023-02-09T13:27:29Z

@raviapatel Please add the orange box for the ID, so I can append an ID after the underscore for linking mix and emodule.

ThiloMuth · 2023-02-09T13:30:41Z

Test

ThiloMuth · 2023-02-09T13:31:33Z

I want to review code - please let me do it =)

joergfunger · 2023-02-09T15:29:39Z

@ThiloMuth if you accept the invitation to the repo, you can review the merge requests, e.g. this one

alFrie · 2023-02-13T19:06:45Z

@mattheokru Is this way of spelling ("Modul") within the e-module ontology on purpose/ predefined by the ontology or something like that? It looks german to me.

mattheokru · 2023-02-20T07:44:41Z

No nothing predefined by the Ontology, I will change it. I updated the Ontologies in the pull request "updating Ontologies"

raviapatel · 2023-02-20T15:07:39Z

@mattheokru Is this way of spelling ("Modul") within the e-module ontology on purpose/ predefined by the ontology or something like that? It looks german to me.
Yes this is just individual and you are right this is german way of doing it I would propose to change it to YoungsModulusTestSpecimen_ or EModulusTestSpeciemen

alFrie · 2023-02-23T11:11:49Z

I have a question regarding the emodul_metadata_extraction.py:

It should create an entry for the "processedFile" key in the dictionary, having as value the path to csv file with values extracted by emodul_generate_processed_data.py. Currently as a placeholder it's a null pointer.
Where are these files stored? In the dodo file I find
processed_data_emodulus_directory = Path(emodul_output_directory, 'processed_data') # folder with csv data files
so should I do the same?

joergfunger · 2023-02-24T05:37:17Z

Currently, we store the files locally (as you have mentioned with that path), so I would add for now exactly this path. Ultimately, this would have to go to a file server/mongdb/openBIS with a URI, but please talk to @AidaZt or @ThiloMuth on how we should then reference these files in our KG.

AidaZt · 2023-02-24T13:59:59Z

Andre suggested that we can reference the link/URL to the file.
Example of an RDF triple would be:
<http://bam.de/material#experiment01> <http://bam.de/properties#rawdata> <http://bam.de/dataserver/rawdata.csv> .

joergfunger · 2023-02-24T14:09:22Z

But that links is not existing, or how do we intend to store data such that this link is actually a real reference?

firmao · 2023-02-24T14:26:05Z

But that links is not existing, or how do we intend to store data such that this link is actually a real reference?

Then, we need, at least to talk about a file server, point straight to github raw files, dereferencing URIs providing RDF content, etc.

I suggest we have a short meeting to have an agreement about the best way for us to deal with the raw files.
What about Monday after 3pm?

Best regards,
Andre Valdestilhas

alFrie · 2023-02-27T10:57:27Z

About the Transducer Column:
According to the drawio we're expecting an integer:
"$$TransducerColumn_Value$$"^^xsd:integer
The value gets defined within the metadata extraction script of emodule. We talked about saving a list to that key: [1,2,3], giving this result of the mapped onto:
con:Transducer_ a con:MeasuringGauge, owl:NamedIndividual ; ns3:hasPmdUnit ns3:Q56402798 ; mid:has_column_index "[1, 2, 3]"^^xsd:integer .
We have a list of integers instead of an integer. Is that still valid?

Edit: @raviapatel Is this how you imagined it to be?

firmao · 2023-02-27T11:09:19Z

About the Transducer Column: According to the drawio we're expecting an integer: "$$TransducerColumn_Value$$"^^xsd:integer The value gets defined within the metadata extraction script of emodule. We talked about saving a list to that key: [1,2,3], giving this result of the mapped onto: con:Transducer_ a con:MeasuringGauge, owl:NamedIndividual ; ns3:hasPmdUnit ns3:Q56402798 ; mid:has_column_index "[1, 2, 3]"^^xsd:integer . We have a list of integers instead of an integer. Is that still valid?

The data type expected is an xsd:integer, therefore it's supposed to be an integer number. If you still not sure about the data type, then store as an xsd:string.

alFrie · 2023-02-27T12:05:04Z

The data type expected is an xsd:integer, therefore it's supposed to be an integer number. If you still not sure about the data type, then store as an xsd:string.

And a list of integers doesn't fit the integer type, right?

AidaZt · 2023-02-27T12:28:45Z

I thinks so, because we either have string or integer as a type and we can't refer it as xsd:list or something? I think for now leave it as xsd:string.

firmao · 2023-02-27T13:06:13Z

if you still need to store a kind of list of values in RDF, there is an example here:
https://stackoverflow.com/questions/29669555/dynamic-array-in-rdf-xml

alFrie · 2023-02-28T13:18:26Z

So this is the current result of the mapping script. Please look at the following three issues:

Only the placeholder EModule_Value doesn't get a key from the metadata.
Height and Width get set to None, since the shape is cylindrical - this results in "None"^^xsd:decimal. That's problematic, None is not of type decimal, right? The type should stay decimal tho. since in the future there won't only be cylindric specimen, if I got that right.
We have more metadata values than placeholders (f.e. weight and so on have no place to get mapped to. This is not a problem for now tho I guess). You can still look through that list of unmapped metadata and see if you'd like to create some individuals for some of them within the ontology?

For your information:

Placeholders get generated through a function so in case we decide of a different placeholder strucutre, we only need to change this small function and not the main function itself.
Tests are still failing because they were designed for Ilias outdated script. @soudehMasoudian is on it (Update test_mapping_script.py #134 )

@joergfunger @raviapatel, maybe @ThiloMuth wants to have a look at it, too.

raviapatel · 2023-03-02T14:17:57Z

I thinks so, because we either have string or integer as a type and we can't refer it as xsd:list or something? I think for now leave it as xsd:string.

Ok this is also fine for me

alFrie · 2023-03-09T10:46:59Z

How will the info about the openBis raw data location get mapped? Will this be defined in the mapping script or created during the metadata extraction so that the mapping script will automatically map it? @joergfunger

joergfunger · 2023-03-09T11:40:03Z

That should be done during the extraction of the metadata. In the final setup, we will have the data all stored in the openBIS system (metadata), and then extracting this information together with the link to the raw data file should happen. Afterwards, the mapping script will just take that information in the metadata.json and replace that value in the ttl file obtained from the diagrams.net ttl template.

alFrie assigned soudehMasoudian, AidaZt and alFrie Feb 9, 2023

alFrie mentioned this issue Feb 9, 2023

117 Emodule mapping script #118

Closed

alFrie linked a pull request Feb 9, 2023 that will close this issue

117 Emodule mapping script #118

Closed

eriktamsen added the knowledge graph label Apr 14, 2023

This was referenced Jul 10, 2023

MixtureDesign Mapping #136

Open

Knowledge graphs update #190

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

E-Module Mapping Script #117

E-Module Mapping Script #117

alFrie commented Feb 9, 2023 •

edited

Loading

alFrie commented Feb 9, 2023

ThiloMuth commented Feb 9, 2023

ThiloMuth commented Feb 9, 2023

joergfunger commented Feb 9, 2023

alFrie commented Feb 13, 2023

mattheokru commented Feb 20, 2023 •

edited

Loading

raviapatel commented Feb 20, 2023

alFrie commented Feb 23, 2023

joergfunger commented Feb 24, 2023

AidaZt commented Feb 24, 2023 •

edited

Loading

joergfunger commented Feb 24, 2023

firmao commented Feb 24, 2023

alFrie commented Feb 27, 2023 •

edited

Loading

firmao commented Feb 27, 2023

alFrie commented Feb 27, 2023

AidaZt commented Feb 27, 2023 •

edited

Loading

firmao commented Feb 27, 2023

alFrie commented Feb 28, 2023 •

edited

Loading

raviapatel commented Mar 2, 2023

alFrie commented Mar 9, 2023

joergfunger commented Mar 9, 2023

E-Module Mapping Script #117

E-Module Mapping Script #117

Comments

alFrie commented Feb 9, 2023 • edited Loading

alFrie commented Feb 9, 2023

ThiloMuth commented Feb 9, 2023

ThiloMuth commented Feb 9, 2023

joergfunger commented Feb 9, 2023

alFrie commented Feb 13, 2023

mattheokru commented Feb 20, 2023 • edited Loading

raviapatel commented Feb 20, 2023

alFrie commented Feb 23, 2023

joergfunger commented Feb 24, 2023

AidaZt commented Feb 24, 2023 • edited Loading

joergfunger commented Feb 24, 2023

firmao commented Feb 24, 2023

alFrie commented Feb 27, 2023 • edited Loading

firmao commented Feb 27, 2023

alFrie commented Feb 27, 2023

AidaZt commented Feb 27, 2023 • edited Loading

firmao commented Feb 27, 2023

alFrie commented Feb 28, 2023 • edited Loading

raviapatel commented Mar 2, 2023

alFrie commented Mar 9, 2023

joergfunger commented Mar 9, 2023

alFrie commented Feb 9, 2023 •

edited

Loading

mattheokru commented Feb 20, 2023 •

edited

Loading

AidaZt commented Feb 24, 2023 •

edited

Loading

alFrie commented Feb 27, 2023 •

edited

Loading

AidaZt commented Feb 27, 2023 •

edited

Loading

alFrie commented Feb 28, 2023 •

edited

Loading