Mistral V2
No due date
95% complete
Various changes to Mistral to expand its usability and scale.
Main goals
- "Plug and Play" support for models of up to roughly 500M to 1B parameters.
- Bring your own data, including arbitrary data mixtures.
- Release models and checkpoints for GPT-2 Large and GPT-2 XL sized models.
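
As an illustration of the "arbitrary data mixtures" goal, here is a minimal sketch of weighted sampling across several user-supplied datasets. It is not Mistral's implementation or API: the function name `mix_datasets`, its parameters, and the in-memory list-of-strings datasets are all assumptions made for the example.

```python
import random
from typing import Dict, Iterator, Sequence, Tuple


def mix_datasets(
    sources: Dict[str, Sequence[str]],
    weights: Dict[str, float],
    num_samples: int,
    seed: int = 0,
) -> Iterator[Tuple[str, str]]:
    """Yield (source_name, example) pairs from a weighted mixture.

    Hypothetical helper: each draw first picks a source with probability
    proportional to its weight, then picks one example uniformly from it.
    """
    if set(sources) != set(weights):
        raise ValueError("sources and weights must name the same datasets")
    if any(w < 0 for w in weights.values()) or sum(weights.values()) <= 0:
        raise ValueError("weights must be non-negative with a positive sum")
    for name, data in sources.items():
        if weights[name] > 0 and len(data) == 0:
            raise ValueError(f"dataset {name!r} has a positive weight but is empty")

    rng = random.Random(seed)  # seeded so a given mixture is reproducible
    names = sorted(sources)    # fixed order keeps sampling deterministic
    probs = [weights[n] for n in names]

    for _ in range(num_samples):
        name = rng.choices(names, weights=probs, k=1)[0]
        yield name, rng.choice(sources[name])


if __name__ == "__main__":
    # Toy datasets standing in for user-provided corpora.
    corpora = {
        "web": ["web doc 1", "web doc 2", "web doc 3"],
        "code": ["def f(): pass", "print('hi')"],
    }
    # Roughly 70% of draws come from "web" and 30% from "code".
    for source, example in mix_datasets(corpora, {"web": 0.7, "code": 0.3}, 5):
        print(source, "->", example)
```

Sampling with replacement keeps the sketch short. A real training pipeline would more likely stream tokenized shards and track per-source epochs, which this example does not attempt.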