Version 1.0.0

@wartaal wartaal released this 10 Jan 10:48
· 10 commits to master since this release
f7e8420
  • Lemmatization is now learned from training data and is language-independent to some degree
  • Training data now has one additional column: both lemma and stem are required
  • Scripts for generating training data for English from Brown-Corpus added
  • Scripts for generating training data for Dutch from Sonar-Corpus added
  • Project structure changed: subfolders added for the creation of training data