Is overlapped speech detection needed for speaker diarization? #390

Open
wcqy-ye opened this issue Dec 1, 2024 · 1 comment

wcqy-ye commented Dec 1, 2024

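I have a question.
The RTTM file already contains the time segments for each speaker
image
and
image
overlapping speech already seems to be merged here. Does that mean overlapped speech detection is no longer needed?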

@JiJiJiang
Collaborator

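The diarization system in wespeaker does not handle overlapped speech in any special way. merge_segment only merges adjacent sliding windows that overlap in time, which is a different concept from overlapped speech (two speakers talking at the same moment).
If you want to optimize for overlapped speech, you generally also need an overlapped speech detection model and a speech separation model.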
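
To make the distinction concrete, here is a minimal Python sketch. It is not wespeaker's actual merge_segment code. The function names, the `(start, end, label)` tuple layout, the window sizes, and the choice to split differently labelled overlapping windows at their midpoint are all assumptions made for illustration. The point is that each sliding window carries exactly one cluster label, so merging windows can never yield two speakers active at the same time, whereas a reference annotation with real overlapped speech can.

```python
from typing import List, Tuple

# (start_sec, end_sec, speaker_label); layout is an assumption for this sketch
Segment = Tuple[float, float, int]


def merge_adjacent_windows(windows: List[Segment]) -> List[Segment]:
    """Merge time-overlapping sliding windows into non-overlapping segments."""
    merged: List[Segment] = []
    for start, end, label in sorted(windows):
        if not merged:
            merged.append((start, end, label))
            continue
        prev_start, prev_end, prev_label = merged[-1]
        if start <= prev_end and label == prev_label:
            # Same speaker, windows overlap or touch: extend the segment.
            merged[-1] = (prev_start, max(prev_end, end), prev_label)
        elif start < prev_end:
            # Different speaker, windows overlap: split at the midpoint
            # (an illustrative choice, not necessarily what wespeaker does).
            pivot = (start + prev_end) / 2.0
            merged[-1] = (prev_start, pivot, prev_label)
            merged.append((pivot, end, label))
        else:
            merged.append((start, end, label))
    return merged


def overlapped_regions(segments: List[Segment]) -> List[Tuple[float, float]]:
    """Return time regions where two different speakers are active at once."""
    regions = []
    for i, (s1, e1, l1) in enumerate(segments):
        for s2, e2, l2 in segments[i + 1:]:
            start, end = max(s1, s2), min(e1, e2)
            if l1 != l2 and start < end:
                regions.append((start, end))
    return sorted(regions)


# Hypothetical 1.5 s windows with a 0.75 s shift, one cluster label each.
windows = [(0.0, 1.5, 0), (0.75, 2.25, 0), (1.5, 3.0, 1), (2.25, 3.75, 1)]
system_output = merge_adjacent_windows(windows)

# Hypothetical reference annotation in which both speakers talk from 1.5 to 2.0 s.
reference = [(0.0, 2.0, 0), (1.5, 3.75, 1)]

print(system_output)
print(overlapped_regions(system_output))
print(overlapped_regions(reference))
```

In this sketch the merged system output contains no region with two simultaneous speakers, while the reference contains one from 1.5 s to 2.0 s. Recovering that region would require an extra component such as overlapped speech detection, as the reply above notes.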
