Closes #42 #494
… to properly represent unique keys.
…examples for source schema
@jason-fries @ruisi-su this is ready for review. Per guidance from Jason, this version includes only functionality to generate examples for the source schema, as it is not possible to represent RadGraph records properly using the KB schema as is. Thus, it will not pass tests.
@alisoncallahan is this a local dataset? Can you give us a printout of the following command?
@hakunanatasha yes, it is a local dataset b/c RadGraph is provided by PhysioNet, which requires user registration and vetting. In the source schema, relations are nested in entities. The output of
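To make "relations are nested in entities" concrete, here is a minimal sketch of what such a record could look like and how the nested relations could be pulled out into a flat list. The record below is hypothetical: the field names (`text`, `entities`, `tokens`, `label`, `relations`) and all values are assumptions for illustration, not copied from the RadGraph files or from this PR's loader.

```python
# Hypothetical record: field names and values are illustrative assumptions,
# not taken from the actual RadGraph data.
record = {
    "text": "small opacity in the left lung",
    "entities": {
        "1": {
            "tokens": "opacity",
            "label": "observation",
            # Relations live inside the entity they start from:
            # each one is a [relation_type, target_entity_id] pair.
            "relations": [["located_at", "2"]],
        },
        "2": {"tokens": "lung", "label": "anatomy", "relations": []},
    },
}


def flatten_relations(entities):
    """Collect nested relations into (head_id, relation_type, tail_id) triples."""
    triples = []
    for head_id, entity in entities.items():
        for relation_type, tail_id in entity.get("relations", []):
            triples.append((head_id, relation_type, tail_id))
    return triples


triples = flatten_relations(record["entities"])
```

A flat list like `triples` is only one possible view of the data; the source schema in this PR keeps the nesting as it is.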
Finished the data loader for the source schema only, because the BigBio KB schema does not currently support all features that exist in the source data (per conversation with @jason-fries).
Checkbox
- Create the dataloader script `biodatasets/my_dataset/my_dataset.py` (please use only lowercase and underscore for dataset naming).
- Provide values for the `_CITATION`, `_DATASETNAME`, `_DESCRIPTION`, `_HOMEPAGE`, `_LICENSE`, `_URLs`, `_SUPPORTED_TASKS`, `_SOURCE_VERSION`, and `_BIGBIO_VERSION` variables.
- Implement `_info()`, `_split_generators()` and `_generate_examples()` in dataloader script.
- Make sure that the `BUILDER_CONFIGS` class attribute is a list with at least one `BigBioConfig` for the source schema and one for a bigbio schema.
- Confirm that the dataloader script works with the `datasets.load_dataset` function.
- Confirm that the dataloader script passes the test suite run with `python -m tests.test_bigbio biodatasets/my_dataset/my_dataset.py`.
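For readers unfamiliar with the `_generate_examples()` step, the sketch below shows its general shape as a standalone function: read a local file and yield `(key, example)` pairs, with a distinct key per example. It is an illustration under assumptions, not this PR's implementation. The function name `generate_examples`, the `doc_id` field, and the input layout (one JSON object mapping document ids to records) are all hypothetical.

```python
import json
import os
import tempfile


def generate_examples(filepath):
    """Yield (key, example) pairs, one per document, using the document id as key."""
    with open(filepath, encoding="utf-8") as f:
        # Assumed layout: a single JSON object mapping document ids to records.
        data = json.load(f)
    for doc_id, record in data.items():
        # Keys of a parsed JSON object are unique, so doc_id is a safe example key.
        yield doc_id, {"doc_id": doc_id, **record}


# Usage with a throwaway file standing in for the locally downloaded data.
sample = {"doc_a": {"text": "first report"}, "doc_b": {"text": "second report"}}
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "sample.json")
    with open(path, "w", encoding="utf-8") as f:
        json.dump(sample, f)
    examples = list(generate_examples(path))

keys = [key for key, _ in examples]
```

In a real loader this logic would sit inside the builder class's `_generate_examples()` method and receive the file path from `_split_generators()`.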