Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create dataset loader for BioNLP-ST 2019 CRAFT-CA #254

Closed
jason-fries opened this issue Mar 22, 2022 · 10 comments
Closed

Create dataset loader for BioNLP-ST 2019 CRAFT-CA #254

jason-fries opened this issue Mar 22, 2022 · 10 comments
Labels
CC BY 3.0 Licence English Language NER Task XML Format

Comments

@jason-fries
Copy link
Member

Adding a Dataset

@davidstap
Copy link

#self-assign

@jason-fries
Copy link
Member Author

Hi @davidstap you let us know if you are still working on this so we can update our project board? Please just notify us the status by Friday April 8. You can response to this comment or ping us on Slack or Discord.

No worries if you are not finished but still intend to work on this!

@barthfab
Copy link
Contributor

#self-assign

@barthfab
Copy link
Contributor

I would like to unblock this issue.

@shamikbose
Copy link
Contributor

#self-assign

@shamikbose
Copy link
Contributor

According to the github linked in the paper, this is the description for the dataset. Is this sufficient information?

A collection of 97 articles from the PubMed Central Open Access subset, each of which has been annotated along a number of different axes spanning structural, coreference, and concept annotation

@shamikbose
Copy link
Contributor

@jason-fries There is already a CRAFT dataloader in #60 . Wondering if this is different in any way

@jason-fries
Copy link
Member Author

Hi @shamikbose,
You could double check to see if the #60 dataloader is the same format as this dataset. Otherwise, if it is a separate dataset or subset with potentially additional information, we should implement its own dataloader.

@shamikbose
Copy link
Contributor

#60 is a much wider dataset containing all version of CRAFT. 3.1.3 is an update for some missing or malformed information in 3.1.2 as mentioned in this comment UCDenver-ccp/craft-shared-tasks#1 (comment)

I've released an update to the CRAFT corpus that includes the fix to address the issue you reported. Please update to CRAFT v3.1.3, and to the 0.1.2 version of this project. Or, if you are running the evaluations via Docker, please use the ucdenverccp/craft-eval:3.1.3_0.1.2 container which is now available on DockerHub.

@hakunanatasha
Copy link
Collaborator

Closing - is superceded by #60

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CC BY 3.0 Licence English Language NER Task XML Format
Projects
Development

No branches or pull requests

5 participants