Version 1.0.0

wartaal released this 10 Jan 10:48

· 10 commits to master since this release

f7e8420

Lemmatization is now learned from training data and language independet to some degree
Training data now have one more column: lemma and stem are both needed
Scripts for generating training data for English from Brown-Corpus added
Scripts for generating training data for Dutch from Sonar-Corpus added
Structure of the project changed with subfolders for creation of training data

Assets 2