
Automatically refine word-level alignments from sentence-level alignments #106

ryanfb opened this issue Jun 20, 2022 · 2 comments

ryanfb commented Jun 20, 2022

First of all, thanks so much for all your work on this and for making it open source! It would be cool if it were possible to do a fragment search using an existing SRT transcription without having to re-transcribe all of the audio in advance. One way to do this would be to use the existing sentence-level alignments to extract the audio ranges for sentences that match a search, then use Vosk to transcribe just those audio ranges, and finally use those word-level results to extract the fragment-level audio.
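A minimal sketch of the first step of this idea: parsing the existing SRT file into sentence-level time ranges and finding the ranges whose text matches a search. The function names are hypothetical and the SRT parsing is deliberately simple (no handling of styling tags or multi-cue overlaps); the Vosk re-transcription of the matching ranges would be a separate step.

```python
import re

def parse_srt(srt_text):
    """Parse an SRT transcript into (start_seconds, end_seconds, text) tuples."""
    entries = []
    for block in re.split(r"\n\s*\n", srt_text.strip()):
        lines = block.strip().splitlines()
        if len(lines) < 3:
            continue
        # Timestamp line, e.g. "00:00:01,000 --> 00:00:03,500"
        m = re.match(
            r"(\d+):(\d+):(\d+)[,.](\d+)\s*-->\s*(\d+):(\d+):(\d+)[,.](\d+)",
            lines[1],
        )
        if not m:
            continue
        h1, m1, s1, ms1, h2, m2, s2, ms2 = map(int, m.groups())
        start = h1 * 3600 + m1 * 60 + s1 + ms1 / 1000
        end = h2 * 3600 + m2 * 60 + s2 + ms2 / 1000
        entries.append((start, end, " ".join(lines[2:])))
    return entries

def matching_ranges(entries, query):
    """Return the (start, end) ranges whose text contains the query."""
    q = query.lower()
    return [(s, e) for s, e, text in entries if q in text.lower()]
```

Only the matching ranges would then need to be cut out of the source audio and fed to Vosk for word-level timestamps, instead of transcribing the whole file up front.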

antiboredom (Owner) commented

That's an interesting idea; I'd definitely be open to experimenting with it. Forced alignment might also work here: alphacep/vosk-api#756


cmprmsd commented Dec 22, 2023

It would also be beneficial to rely on the words sourced from the subtitle file; constraining recognition to that vocabulary could improve detection quality a lot, right? I tried to implement @ryanfb's suggestion back in 2021 with pocketsphinx, but the results weren't promising. 😢
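One way to use the subtitle vocabulary, as a hedged sketch: Vosk's `KaldiRecognizer` accepts an optional JSON grammar (a list of expected words/phrases), which restricts decoding to that vocabulary. The helper below builds such a grammar from subtitle text; the model path, sample rate, and audio handling in the commented usage are assumptions, since running the recognizer requires a downloaded Vosk model and 16 kHz mono PCM audio.

```python
import json
import re

def subtitle_grammar(subtitle_text):
    """Build a Vosk grammar (a JSON list of words) from subtitle text,
    so the recognizer only considers words we expect to hear."""
    words = sorted(set(re.findall(r"[a-z']+", subtitle_text.lower())))
    # "[unk]" gives the recognizer a catch-all token for everything else
    return json.dumps(words + ["[unk]"])

# Hypothetical usage (requires a downloaded Vosk model and 16 kHz mono PCM):
# from vosk import Model, KaldiRecognizer
# model = Model("model")  # path is an assumption
# rec = KaldiRecognizer(model, 16000, subtitle_grammar(srt_text))
# rec.SetWords(True)      # request per-word timestamps
# rec.AcceptWaveform(pcm_bytes)
# word_timings = json.loads(rec.FinalResult()).get("result", [])
```

With the grammar in place, each recognized word in `result` carries `start`/`end` times, which is exactly the word-level refinement this issue asks for.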
