The Riksdagen open data often contains hyphenated words which end up like this in our rawtoken table:
Most of these tokens are garbage and need to be handled somehow. Maybe we could train a model to recognize the valid ones based on lexemes?
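Before training anything, a plain lexicon lookup might serve as a baseline to compare against. The sketch below is only an illustration of that idea, not existing code in this project: the function name `classify_hyphenated`, the three labels, and the `lexemes` set are all hypothetical, and it assumes we can load the known lexemes into a set of lowercase strings.

```python
def classify_hyphenated(token: str, lexemes: set[str]) -> str:
    """Label a hyphenated raw token as 'join', 'keep', or 'garbage'.

    Hypothetical baseline: `lexemes` is assumed to be a set of known
    lowercase lexeme forms; nothing here is tied to the real rawtoken schema.
    """
    if "-" not in token:
        raise ValueError("expected a token containing a hyphen")

    parts = token.lower().split("-")

    # Leading, trailing, or doubled hyphens produce empty parts, and
    # parts containing digits or punctuation are not words at all.
    if any(not part.isalpha() for part in parts):
        return "garbage"

    # If the parts form a known lexeme when glued together, the hyphen
    # most likely came from a line break in the source document.
    if "".join(parts) in lexemes:
        return "join"

    # If every part is a known lexeme on its own, treat the token as a
    # genuine hyphenated compound.
    if all(part in lexemes for part in parts):
        return "keep"

    return "garbage"


# Tiny made-up lexicon, for illustration only.
example_lexemes = {"riksdagen", "svensk", "finsk", "och"}
labels = {
    token: classify_hyphenated(token, example_lexemes)
    for token in ["riks-dagen", "svensk-finsk", "-och", "utskot-"]
}
```

Tokens labeled `garbage` by a baseline like this could be dropped or set aside for review, and the remaining uncertain cases are where a trained model would have to earn its keep.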