Whisper's confidence scores are pretty good and usually correlate well with the quality of the output, so it would be great to be able to quickly correct all low-confidence words in one pass before going into finer quality control. Similarly, non-dictionary words could be flagged for manual review.
To stick with my example from cccamp23, I would imagine it working a bit like this:
1. I upload the audio and wait for the worker to finish.
2. I click a button to start quick correction mode.
3. The transcribee editor walks me through all tokens that have a low confidence score (below the threshold for dark red text), one by one and in context, and lets me edit that instance, edit all instances of the same token, or mark the token as correct (which would raise the confidence for the same token in the following text).
In the aforementioned talk, I could quickly correct different spellings of "Bonify" or "Schufa" to a single spelling and mark all instances as "high confidence" that way.
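To make the idea a bit more concrete, here is a rough sketch of the walk-through logic in Python. Everything in it is an assumption for illustration: the `Token` class, the `LOW_CONF` threshold, and the helper names are hypothetical and not taken from transcribee's actual data model or code.

```python
from dataclasses import dataclass


# Hypothetical token model; transcribee's real document format may differ.
@dataclass
class Token:
    text: str
    conf: float  # confidence between 0.0 and 1.0


# Assumed cutoff, standing in for the "dark red text" threshold.
LOW_CONF = 0.5


def low_confidence(tokens, threshold=LOW_CONF):
    """Lazily yield (index, token) for every token below the threshold.

    Because this is a generator, tokens that get confirmed while the
    user is walking through the list are skipped later on.
    """
    for i, tok in enumerate(tokens):
        if tok.conf < threshold:
            yield i, tok


def replace_all(tokens, old, new):
    """Rewrite every instance of `old` to `new` and mark them confirmed."""
    for tok in tokens:
        if tok.text == old:
            tok.text = new
            tok.conf = 1.0


def mark_correct(tokens, text):
    """Confirm a spelling by raising the confidence of all its instances."""
    for tok in tokens:
        if tok.text == text:
            tok.conf = 1.0
```

With this, correcting one "Bonnify" to "Bonify" via `replace_all` or confirming "Schufa" via `mark_correct` would mean the remaining instances no longer show up in the walk-through. Editing only a single instance is just an assignment to that one token's `text` and `conf`.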
On top of using Whisper's confidence scores, it could be useful to also run the transcribed text through a dictionary checker, because Whisper will sometimes transcribe things like "Geburtstartum" or "Kreditauskunftsteil" with high confidence even though those aren't dictionary words.
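The dictionary check could be as simple as the following sketch. It assumes the dictionary is available as a plain collection of words; `non_dictionary_words` is a hypothetical helper, not an existing transcribee or Whisper function.

```python
import re


def non_dictionary_words(text, dictionary):
    """Return words in `text` that are not in `dictionary`.

    Matching is case-insensitive; results keep their original order
    and each flagged word is reported only once.
    """
    known = {word.lower() for word in dictionary}
    flagged = []
    # [^\W\d_]+ matches runs of letters, including umlauts and ß.
    for word in re.findall(r"[^\W\d_]+", text):
        if word.lower() not in known and word not in flagged:
            flagged.append(word)
    return flagged
```

A plain word list like this would probably also flag many legitimate German compounds, so in practice a compound-aware spell checker would likely be the better backend. The flagged words could then be fed into the same quick correction walk-through as the low-confidence tokens.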