EduNLP v0.0.3
- update formula ast: supporting more symbols and functions defined in katex
- add tokens to vector tools, including word2vec and doc2vec using gensim
- sci4sif support tokenization grouped by segments
- add special tokens: \SIFTag and \SIFSep
- add item to vector tools
- add interface for getting pretrained models, where the supported model names can be accessed by
edunlp i2v
in the command console