-
Notifications
You must be signed in to change notification settings - Fork 156
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Proposal for mlflow integration #1029
Comments
Proposed behaviourWhen should MLJ actions trigger mlflow logging?It should be possible to request logging for these actions:
Moreover, it should be possible to arrange automatic logging, i.e., without explicitly requesting logging for each such action. What should be logged?1. Serializing machines
2. Performance evaluationCompulsory
And, if possible:
Optional (included by default)
3. Hyperparameter tuningCompulsoryFor the optimal model:
Optional (included by default)For each model in the search (each hyperparameter-set):
3. Controlled model iterationCompulsoryFor the final trained model (different from the last "evaluated" model, if
Optional (included by default)For the partially trained model at every "break point" in the iteration:
How should logging be structuredI'm less clear about details here, but here are some comments:
User interface pointsSome suggestions: How does user request logging?In serialization, one just replaces location in In performance evaluation, we add a new kwarg Cases 3 and 4 are similar but How does user request auto logger?Add global variable How does user suppress optional logging?We could eigher extra kwargs/fields to control level of verbosity, or if we are wrapping |
Some miscellaneous thoughts on implementation
|
Thanks for putting this together, @ablaom. Regarding
I'll start with the simplest case described in the original post here:
The most obious case is when we provide an Thus, we could define:
We can use similar logic when initiating logging from different places, such as performance evaluation, hyperparameter tuning, and controlled model iteration. |
@deyandyankov I bundled a |
mlflow is an API and web-based UI for logging parameters, code
versions, metrics, and output files when running machine learning experiments, for later
visualizing the results. Integration of mlflow already exists for these other ML
platforms: Scikit-learn, Keras, Gluon, XGBoost, LightGBM, Statsmodels, Spark, Fastai,
Pytorch.
Further to this short project
outline, and after preliminary discussions with @pebeto and @deyandyankov,
I give below a tentative design proposal for integration of
mlflow with MLJ, using
MLFlowClient.jl, which already provides a
Julia interface to mlflow.
The text was updated successfully, but these errors were encountered: