Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mine PMC for ethics statements #499

Open
Daniel-Mietchen opened this issue Oct 2, 2017 · 17 comments
Open

Mine PMC for ethics statements #499

Daniel-Mietchen opened this issue Oct 2, 2017 · 17 comments

Comments

@Daniel-Mietchen
Copy link
Owner

possible search terms:

  • ethical
  • "institutional review board"
  • "informed consent"
    etc.
@Daniel-Mietchen
Copy link
Owner Author

The main purpose here would be to see

  • what percentage of articles have a dedicated ethics section, and how that changes over time
  • what kind of information is provided in addition to statements of the "... received ethical approval" and "gave informed consent" kinds.
  • to what extent PIDs are being used in there and for what, and how that changes over time.

@Daniel-Mietchen
Copy link
Owner Author

A simple query for "approval number" currently yields 11404 hits:
https://www.ncbi.nlm.nih.gov/pmc/?term=%22approval+number%22

@Daniel-Mietchen
Copy link
Owner Author

Just to clarify that conflict of interest statements are within scope here as well.

@Daniel-Mietchen
Copy link
Owner Author

I just reran that "approval number" query from Oct 12, 2017, and it now yields 37963 results, i.e. an about 3.5-fold increase in about 3.5 years.

In the meantime, I have begun to collaborate with @petermr, and we are trying to use his ContentMine pipeline (which is currently being ported to Python) to extract ethics statements from PMC. On the way, we have built a first — still very rough — dictionary (i.e. a set of words highly indicative of the topic of ethics statements), and we are trying to also get a list of ethics committees mentioned in PMC-indexed papers.

@Daniel-Mietchen
Copy link
Owner Author

Daniel-Mietchen commented Apr 29, 2021

Meeting on April 29, 2021:

  • We are considering to submit something to Wikidata Workshop
  • We are also considering to submit a Research Idea to RIO and a research paper as well, perhaps in WikiJournal
  • There is an event being planned for a weekend in May that is about introducing people to Wikidata in a playful manner. Peter will think about aligning it with the Wikimedia Hackathon
  • We also looked a bit into ContentMine dictionaries.

@Daniel-Mietchen
Copy link
Owner Author

@Daniel-Mietchen
Copy link
Owner Author

A search for "approval number" now gives 38437 results, i.e. about 500 more than just two weeks ago.

@Daniel-Mietchen
Copy link
Owner Author

There are ambiguities at multiple levels.

For instance, this article states that

This study was approved by the Johns Hopkins School of Medicine IRB, Approval Number: IRB00151734. 

The problem here is that Johns Hopkins School of Medicine runs multiple IRBs, and there does not seem to be a straightforward mechanisms to resolve the approval number to get more metadata about the process.

@Daniel-Mietchen
Copy link
Owner Author

There is a Office for Human Research Protections (OHRP) Database for Registered IORGs & IRBs, Approved FWAs, and Documents Received in Last 60 Days that has identifiers for IRBs, but these do not resolve either.

@petermr
Copy link

petermr commented May 10, 2021 via email

@ShweataNHegde
Copy link

https://colab.research.google.com/drive/1sFj07mE2XRyeaplvsTs34-VaDHBjnt6U?usp=sharing

Ayush (openVirus volunteers) and I wrote a piece of code that can extract common phrases from a text file with manually scraped Ethics Statements.

@Daniel-Mietchen
Copy link
Owner Author

Some updates from this week:

@Daniel-Mietchen
Copy link
Owner Author

For more recent updates, see the notes over at Shweata's page.

@Daniel-Mietchen
Copy link
Owner Author

Here is a list of ethics-related entities Shweata has mined from articles on stem cells.

@Daniel-Mietchen
Copy link
Owner Author

Some more observations by Shweata and Peter sit here.

We now have a dedicated organization, repo and wiki:

@Daniel-Mietchen
Copy link
Owner Author

The paper How does nursing research differ internationally? A bibliometric analysis of six countries. has a Table 1 that looks at certain features of previous studies, including

Extracted specific properties (e.g., contains ethics statements)

@Daniel-Mietchen Daniel-Mietchen pinned this issue Nov 11, 2021
@Daniel-Mietchen
Copy link
Owner Author

The project with Shweata and Peter (and Ayush) has since led to a publication:

Hegde SN, Garg A, Murray-Rust P, Mietchen D (2022) Mining the literature for ethics statements: A step towards standardizing research ethics. Research Ideas and Outcomes 8: e94685. https://doi.org/10.3897/rio.8.e94685 .

It outlines a workflow for mining ethics statements and discusses motivations, applications and complications.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants