What's Changed
- [patch] minor fixes for 0.0.2 by @francoishernandez in #109
- Rework handling of special tokens by @francoishernandez in #45
- [patch] get_transforms_cls after update_config_with_checkpoint by @francoishernandez in #110
- [patch] get_transforms_cls after update_config_with_checkpoint BIS by @francoishernandez in #111
- Updated translator.py to handle updated special token logic when computing alignments by @dameikle in #113
- clearer log by @vince62s in #112
- fix training tensor parallel by @vince62s in #115
- restore all_reduce directly but with detach.clone first - fix #115 by @vince62s in #116
- Initial support for Metal Performance Shaders (MPS) by @dameikle in #98
- Manage
share_decoder_embeddings
inconvert_HF
, misc fixes and improvements by @francoishernandez in #121 - Deduce share_decoder_embeddings from HF tie_word_embeddings flag by @francoishernandez in #123
- [docs] Upgrading docusaurus packages, should fix dependabot warnings by @francoishernandez in #124
- add estimator in decoder-only + clean code by @vince62s in #120
- fineweb10B/gpt2 recipe, and supporting changes by @francoishernandez in #32
- enable pure bf16 training by @vince62s in #133
- Update WMT17 recipe with working tokenization transforms examples by @francoishernandez in #129
- fixes #131, module 'eole.utils' has no attribute 'distributed' error when training multi-gpu by @isanvicente in #132
- add estimator in greedy inference by @vince62s in #135
- Some QOL config/saving improvements by @francoishernandez in #134
- fix #136. Updated eole/bin/model/average_models.py to work with safetensors model format. by @isanvicente in #137
- fix head dim in rope by @vince62s in #140
- fix autocast at scoring when doing AMP by @vince62s in #141
- Some minor fixes by @francoishernandez in #143
- fix lora lm head by @vince62s in #142
- fix missing pad change by @vince62s in #148
- flash_attn_func does not support padding mask maybe we need to drop a… by @vince62s in #149
- fix maybe_retranslate when number of newline does not match by @vince62s in #150
- Supporting HF tokenizers by @francoishernandez in #122
- Model Validator Recipe by @francoishernandez in #146
- apply bytefallback at detok (onmt_tokenize with sentencepiece) by @vince62s in #155
- patch eos_token_id list handling by @francoishernandez in #158
- Compile and Ctranslate2 support by @vince62s in #161
- Move predict config update from model loading to config validation by @francoishernandez in #163
- EuroLLM Gradio (web based) translator 35 languages to 35 languages by @vince62s in #164
New Contributors
- @isanvicente made their first contribution in #132
Full Changelog: 0.0.2...0.0.3