Feature Request: Enable Beam Search in WhisperPipeline to Enhance Transcription Accuracy #1164

Open
HighBridgeDragon opened this issue Nov 7, 2024 · 1 comment
Labels
category: whisper Whisper pipeline

Comments

@HighBridgeDragon

Hi OpenVINO team,

I recently submitted issue #1150, and I’d also like to suggest a further enhancement for WhisperPipeline.

Currently, WhisperPipeline (openvino_genai 2024.5.0.0rc1) relies on greedy search for transcription. Greedy search is efficient, but because it commits to the single most likely token at every step, it often stops early in complex scenarios and produces prematurely shortened transcriptions.

To improve transcription quality, adding Beam Search support would allow for more comprehensive output, mitigating cases of incomplete or overly simplified results. Many applications would benefit from this feature, especially when accuracy is critical and a more extensive search process is required.

Proposed Feature:

  • Implement Beam Search as an alternative search strategy within WhisperPipeline.
  • Optionally, provide adjustable parameters to control beam size (like CTranslate2) and other relevant settings, giving users flexibility based on performance needs.
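For illustration, here is a minimal, self-contained sketch of the difference between the two strategies. It does not use openvino_genai at all: `TOY_MODEL`, `greedy_search`, and `beam_search` are hypothetical names, and the probabilities are made up so that the locally best first token leads to an early end-of-sequence.

```python
import math

EOS = "<eos>"

# Hypothetical toy "model": maps a token prefix to next-token probabilities.
TOY_MODEL = {
    (): {"hello": 0.6, "hi": 0.4},
    ("hello",): {EOS: 0.4, "world": 0.35, "there": 0.25},
    ("hello", "world"): {EOS: 1.0},
    ("hello", "there"): {EOS: 1.0},
    ("hi",): {"everyone": 0.9, EOS: 0.1},
    ("hi", "everyone"): {EOS: 1.0},
}


def greedy_search(model, max_len=10):
    """Pick the single most probable token at every step."""
    prefix, score = (), 0.0
    for _ in range(max_len):
        token, prob = max(model[prefix].items(), key=lambda kv: kv[1])
        score += math.log(prob)
        if token == EOS:
            break
        prefix += (token,)
    return prefix, score


def beam_search(model, beam_width, max_len=10):
    """Keep the `beam_width` most probable hypotheses at every step."""
    beams = [((), 0.0)]  # (tokens, cumulative log-probability)
    finished = []
    for _ in range(max_len):
        # Expand every live hypothesis by every possible next token.
        candidates = []
        for prefix, score in beams:
            for token, prob in model[prefix].items():
                candidates.append((prefix + (token,), score + math.log(prob)))
        candidates.sort(key=lambda c: c[1], reverse=True)
        # Prune to the beam width; hypotheses ending in EOS are set aside.
        beams = []
        for tokens, score in candidates[:beam_width]:
            if tokens[-1] == EOS:
                finished.append((tokens[:-1], score))
            else:
                beams.append((tokens, score))
        if not beams:
            break
    return max(finished or beams, key=lambda c: c[1])


greedy_tokens, greedy_score = greedy_search(TOY_MODEL)
beam_tokens, beam_score = beam_search(TOY_MODEL, beam_width=2)
print("greedy:", greedy_tokens, round(math.exp(greedy_score), 2))
print("beam:  ", beam_tokens, round(math.exp(beam_score), 2))
```

In this toy setup greedy search commits to "hello" and then ends immediately, while a beam width of 2 keeps "hi" alive long enough to find the longer and more probable "hi everyone". A beam width of 1 reduces to the greedy result, which is why an adjustable beam size is a natural knob for trading speed against search quality.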

Are there any plans to introduce this functionality, or might there be existing methods to achieve similar results?

Thanks in advance for considering this improvement!

@as-suvorov
Contributor

Hi, thank you for the feature request! We plan to implement Beam Search for the Whisper pipeline. The list of parameters we plan to support can be found in GenerationConfig, under the beam search and multinomial sections.
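As a rough illustration of why beam search parameters matter for the truncation problem described in this issue, the sketch below shows a length penalty applied when ranking finished hypotheses. The formula (cumulative log-probability divided by length raised to the penalty) follows the common Hugging Face convention. Whether WhisperPipeline will expose a parameter with exactly this name and behaviour is an assumption, and the function names and scores here are hypothetical.

```python
def length_normalized_score(sum_log_probs: float, length: int, length_penalty: float) -> float:
    """Score a finished hypothesis; log-probabilities are negative, so a
    larger divisor (longer sequence, higher penalty) raises the score."""
    return sum_log_probs / (length ** length_penalty)


def pick_best(hypotheses, length_penalty):
    """Return the (sum_log_probs, length) pair with the highest score."""
    return max(
        hypotheses,
        key=lambda h: length_normalized_score(h[0], h[1], length_penalty),
    )


# Made-up hypotheses: (cumulative log-probability, number of tokens).
short_hyp = (-2.0, 2)
long_hyp = (-3.0, 6)

# With no length normalization the short hypothesis wins on raw probability.
print(pick_best([short_hyp, long_hyp], length_penalty=0.0))
# With a penalty of 1.0 the per-token average favours the longer hypothesis.
print(pick_best([short_hyp, long_hyp], length_penalty=1.0))
```

Under these assumptions, raising the length penalty is one way a beam search configuration can counteract a bias towards short outputs.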


4 participants