First of all, thanks so much for all your work on this and making it open source! It would be cool if it were possible to do a fragment search using an existing SRT transcription without having to re-transcribe all of the audio in advance. One way to do this would be to use the existing sentence-level alignments to extract the audio ranges for sentences that match a search, then use vosk to transcribe just those audio ranges, then use the results of those transcriptions to extract the fragment-level audio.
It would also be beneficial to rely on the words sourced from the subtitle file. That way the detection quality could be improved a lot, right? I tried to implement @ryanfb's suggestion back in 2021 with pocketsphinx but the results weren't promising. 😢