CLARINET (CLARIfying NETworks) ia a novel tool for rapid model assembly by automatically extending dynamic network models with the information published in literature. This facilitates information reuse and data reproducibility and replaces hundreds or thousands of manual experiments, thereby reducing the time needed for the advancement of knowledge.
- Functionality
- Important Abbreviations
- I/O
- Online Tutorial
- Offline Installation
- Package Structure
- Citation
- Funding
- Support
- Extraction: utilizing the knowledge published in literature and suggests model extensions, studying events extracted from literature as a collaboration graph
- Weighting: assigning weights to collaboration graph using a variety of network metrics
- Clustering: detecting communities within the collaboration graph, creating groups of interactions
- ECLG: Event ColLaboration Graph
- FC_IA: Individual assessment of events using Frequency Class concept (node weights)
- FC_PA: Pair assessment of events using Frequency Class concept (edge weights)
- IF_PA: Pair assessment of events using Inverse Frequency concept (edge weights)
- A .xlsx file containing the model to extend, in BioRECIPES model format, see
examples/input/BooleanTcell_biorecipe.xlsx
- Machine reading output file, in BioRECIPES interaction format, see
examples/input/ReadingOutput_biorecipe.csv
- Parameter for frequency class weighting, defined in Cell 14 of the notebook
- Number of return paths, defined in Cell 22 of the notebook
examples/output/freqClass
, a frequent class file that contains ECLG nodes and their freqClass level- intermediate output -
examples/output/ECLGbefore.txt
, the ECLG file containing nodes and edges before the removal of less frequent nodes - intermediate output -
examples/output/ECLGafter.txt
, the ECLG file containing nodes and edges after the removal of less frequent nodes examples/output/LSS
, a LSS file that contains ECLG edges and their weightsexamples/output/grouped_ext
, a pickle file containing grouped (clustered) extensions, specified as nested lists. Each group starts with an integer, followed by interactions specified as [regulator element, regulated element, Interaction type: Activation (+) or Inhibition (-)]examples/output/grouped_ext_Merged
, another pickle file containing the merged clusters (different than grouped_ext which is not merged), clusters are merged based on user-selected number of return pathsexamples/output/GeneratedClusters
, directory containing the resulting uninterpreted clusters- intermediate output -
examples/output/ClusterInfoFile.csv
, a .csv file showing the basic information of each uninterpreted cluster examples/output/InterpretedClusters
, directory containing the resulting interpreted clusters
Run the demonstrated example; or alternatively upload user-customized input files (see I/O) to the input/ directory on File Browser Tab (upper left corner) of Binder.
- Get familiar with and parse the input files including baseline model spreadsheet and machine reading extracted events.
- Create ECLG, process it using network algorithm and assign weights.
- Cluster the ECLG using the community detection algorithm and possibly merge clusters.
- Clone the CLARINET repository to your computer.
git clone https://github.com/pitt-miskov-zivanov-lab/CLARINET.git
- Navigate into the directory, install CLARINET and its python dependencies.
cd CLARINET pip install -e .
- Run the provided notebook (Check Jupyter notebook installation here).
jupyter notebook examples/use_CLARINET.ipynb
setup.py
: python file that help set up python dependencies installtionsrc/
: directory that includes core python CLARINET filessrc/runClarinet.py
: functions for extending discrete network models in the BioRECIPE format using knowledge from literature
examples/
: directory that includes tutorial notebook and example inputs and outputsenvironment.yml
: environment file, required by Binderdocs/
: containing files supporting the repo's host on Read the Docssupplementary/
: containing supplementary files for the studied modelsupplementary/TCELLgraph.graphml
: the graphml representation of the baseline model
LICENSE.txt
: MIT LicenseREADME.md
: this is me!
Yasmine Ahmed, Cheryl A Telmer, Natasa Miskov-Zivanov, CLARINET: efficient learning of dynamic network models from literature, Bioinformatics Advances, Volume 1, Issue 1, 2021, vbab006, https://doi.org/10.1093/bioadv/vbab006
This work was funded in part by DARPA Big Mechanism award, AIMCancer (W911NF-17-1-0135); and in part by the University of Pittsburgh, Swanson School of Engineering.
Feel free to reach out via email [email protected] for additional support if you run into any error.