Includes:
-
Clustering - K-means, DBSCAN, GMM --- scripts to choose close materials for better models.
-
Regression Models - Gaussian process, Kernel ridge, XGBoost, neural net (Keras Tensorflow), random forest, etc. --- scripts to use better kernel or hyperparameters.
-
Random sample generation - Brute force, and Gaussian mixture model --- with specific restrictions.
-
Backward prediction and screening - Combine all process to find solution --- allows multiple targets and show solution reliability as circle sized. Verify if standard deviation truely reflects dot size (large means small deviation)
-
Automatic outlier detection - Powerful tool to improve performance of regression models. Thank you again Kaneko sensei Meiji University. ( https://datachemeng.com/outlier_samples_detectionc_python/ )