Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support tagging phonological features #7

Open
thatbudakguy opened this issue Feb 26, 2022 · 2 comments
Open

support tagging phonological features #7

thatbudakguy opened this issue Feb 26, 2022 · 2 comments
Labels
enhancement New feature or request

Comments

@thatbudakguy
Copy link
Member

this would more properly be called the Phonologizer, and it could borrow heavily from spaCy's Morphologizer. see for reference Wikipedia on "distinctive features".

@thatbudakguy thatbudakguy added the enhancement New feature or request label Feb 26, 2022
@thatbudakguy
Copy link
Member Author

ultimately this could just be another function of the Phonemizer — when the output of the model is just a vector, it's up to the component how to translate that information into phonological data. we could have a new component type that sets phonological properties on tokens, or we could just make this a method available on the Token itself, so that the downstream consumer can request both the phonological features or the phonemes themselves from the same source data.

@thatbudakguy
Copy link
Member Author

this becomes synonymous with the existing phonemizer as part of #24; we should rename it Phonologizer accordingly.

also with #22 we should make both components respect overwrite/extend config options (as spacy builtins do) so that they can work together in concert: rule-based runs first, then the statistical version runs and fills in all the gaps (e.g. polyphones).

@thatbudakguy thatbudakguy changed the title create pipeline component for tagging phonological features support tagging phonological features Apr 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant