Python tool to assess data for single-parameter thresholds
- python 3
- numpy
- matplotlib
- pandas
- scipy
- sklearn
- Imports
- Read Data
- Select Data
- Threshold Analysis
- generate a csv file with IDs and experimental results (exp_example.csv)
- generate an excel sheet with IDs and computed features (comp_example.csv)
- put input files and jupyter notebook in same folder
- run jupyter notebook
- Imports
- run imports cell to import required python packages
- Read Data
- edit file names to match comp. and exp. input files and run cell (lines 1, 4, (and 5))
- Select Data
- set dataset to read from experimental results and run cell (line 2)
- Threshold Analysis
- set threshold settings line 2, y_cut sets the experimental output cutoff line 3, class_weight assigns class weighting
- set parameter settings line 6, num_par sets number of parameters line 7, par_start_col sets parameter start column from csv (0-indexed)
- set features to iterate through for threshold analysis lines 12 and 13 can be edited to indicate which features the threshold analysis will iterate through
- optional
- filter buchwald ligands (or other variations)
- change lines 31-32 X_use=X_all and y_use=y_all to X_use=X_nobw and y_nobw
- toggle comment on line 76 to plot buchwald ligands in gray
- edit plot labels
- line 71 = experimental output (e.g., yield, selectivity)
- save plots
- toggle comment on line 89
- filter buchwald ligands (or other variations)
- Imports
Funding: NSF CCI Center for Computer Assisted Synthesis (CHE-1925607)
The following people have contributed to developing this workflow:
- Sleight Smith
- Ellyn Peters
- Tobias Gensch
This code is released under the MIT license. Commercial use, Modification, Distribution and Private use are all permitted. The use of this workflow can be acknowledged with the following citation: https://www.science.org/doi/10.1126/science.abj4213
Zenodo (2021); http://doi.org/10.5281/zenodo.5227162