-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reducing the data footprint for large simulations #1141
Comments
This would be a great feature to add! I'm copying my original response to your email below for completeness on this thread (I hope it was enough to get you going) and listing some proposals on how to address this in a better way moving forward. Current way of doing it manually: Proposal for making more configurable:
I would love to get your thoughts on this @1fish2 (once you're back from your roadtrip!) |
It looks like we could turn off all the output tables by setting the option While you're at it, set Removing classes from _listenerClasses would save additional in-memory work but as @tahorst noted, the code expects to
Yes, it'd be cleaner to turn those listeners into states. To make this configurable, @tahorst's idea of removing listeners from BTW, all this is easy in vivarium-ecoli: Just configure Store variables to have |
We would like to explore what happens to the model over large numbers of generations, spanning upwards of 106 cells. To make this feasible we would need to drastically reduce the amount of data that is saved to disk, removing most of the tables and analysis runs. Data serialisation currently involves a number of classes (
Listeners
,Loggers
,Processes
) and happens in a few places, and I'm not sure whether the model relies on any of these for calculations or whether they can be safely excised.Can data serialisation be made more configurable?
The text was updated successfully, but these errors were encountered: