Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow mass-editing of similar words #350

Open
moeffju opened this issue Sep 17, 2023 · 0 comments
Open

Allow mass-editing of similar words #350

moeffju opened this issue Sep 17, 2023 · 0 comments
Labels
enhancement New feature or request frontend

Comments

@moeffju
Copy link
Contributor

moeffju commented Sep 17, 2023

One common failure mode I've found for whisper is that technical terms or proper names will often get mangled. It would be great to have a simple way to edit all instances of "similar" tokens at once.

For example, I transcribed Lilith Wittmann's cccamp23 talk and whisper (medium model) would often recognize "Bonify" (a company name) as "Bonifai", or "Schufa" (another company name) as "Schufer".

It would be great to have an easy way to modify all instances of the same or very similar tokens at once.

Alternatively, a "quick manual check" mode that would allow me to correct all low-confidence words in a single pass would be helpful. I'll file another issue for that.

@phlmn phlmn added enhancement New feature or request frontend labels Sep 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request frontend
Projects
None yet
Development

No branches or pull requests

2 participants