Allow mass-editing of similar words #350

moeffju · 2023-09-17T19:52:00Z

One common failure mode I've found for whisper is that technical terms or proper names will often get mangled. It would be great to have a simple way to edit all instances of "similar" tokens at once.

For example, I transcribed Lilith Wittmann's cccamp23 talk and whisper (medium model) would often recognize "Bonify" (a company name) as "Bonifai", or "Schufa" (another company name) as "Schufer".

It would be great to have an easy way to modify all instances of the same or very similar tokens at once.

Alternatively, a "quick manual check" mode that would allow me to correct all low-confidence words in a single pass would be helpful. I'll file another issue for that.

phlmn added enhancement New feature or request frontend labels Sep 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow mass-editing of similar words #350

Allow mass-editing of similar words #350

moeffju commented Sep 17, 2023

Allow mass-editing of similar words #350

Allow mass-editing of similar words #350

Comments

moeffju commented Sep 17, 2023