-
Notifications
You must be signed in to change notification settings - Fork 124
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pull request johan BUT sre #326
Pull request johan BUT sre #326
Conversation
…essing before backend. PLDA multisession scoring.
Merge branch 'master' of github.com:wenet-e2e/wespeaker
Please fix Lint errors. |
Hello Johan @gulamungon, thanks for the contribution. Can you first fix the Lint errors. Locally, you can use the |
Sure, I'll try to fix it asap. |
I fixed it hopefully. |
Changed tabs to spaces. |
@gulamungon Hi, Johan, thanks for the contribution, it seems there are still some lint errors (trailing whitespaces) |
Hi Shuai,
Ok. I see. I can also try to fix it but in the coming two weeks I'm quite
busy so most likely it will not manage during this time.
Best,
Johan
…On Sun, Jun 16, 2024 at 1:59 AM Shuai Wang ***@***.***> wrote:
@gulamungon <https://github.com/gulamungon> Hi, Johan, thanks for the
contribution, it seems there are still some lint errors (trailing
whitespaces)
@czy97 <https://github.com/czy97> @JiJiJiang
<https://github.com/JiJiJiang> Maybe you guys start the reviewing first,
and we do the lint fix afterwards
—
Reply to this email directly, view it on GitHub
<#326 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAXXG3A2DZY6V5LQLPAASBTZHTIOXAVCNFSM6AAAAABIYYC7OKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCNZQHE3TQNZTHE>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
examples/sre/v3/path.sh
Outdated
export PYTHONIOENCODING=UTF-8 | ||
export PYTHONPATH=../../../:$PYTHONPATH | ||
|
||
export PATH=$PATH:/mnt/matylda6/rohdin/software/kaldi_20200214/tools/sph2pipe_v2.5/ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi, Johan @gulamungon , thanks for your codes.
Since installing Kaldi is a little complex, we only adopt some useful shell/perl/python scripts in WeSpeaker rather than installing the whole Kaldi.
You may consider two methods here:
- Download sph2pipe_v2.5.tar.gz and decompress it into a
external_tools
dir; - Use some other tools to convert sph into wav, i.e., ffmpeg.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure, I'll fix it. I'm on a trip this week so probably next week.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@gulamungon Hi Johan, can you fix the lint errors, .etc, then we can proceed the merging process
I changed sph2pipe to ffmpeg. Since before, we simply used sre16 data prepared elsewhere by Kaldi (as in ../v2) and this uses sph2pipe, I here instead copied in the sre16 datapreparation scripts and modified them to use ffmpeg instead as well as some other minor changes to fit better here. I fixed the trailing spaces. Since now quite many things have been changed I'm rerunning the recipe to see that nothing is broken. I think you can review it but perhaps better to wait with the merge until the run has finished. |
@gulamungon Hi Johan, thank you for your update! But there still seems some Lint errors. |
Ok trying again. Actually, I did those checks manually but I was doing it from examples/sre/v3 and it seems errors in files that are links were not detected properly. pre-commit is convenient. Thanks for the tip. |
It is not clear to me what the flake8 issue is. I didn't see it when I ran it locally. |
This flake8 error was fixed in the recent updates. Maybe you need to merge the master branch first and fix the conflicts (if exists). |
Ok. I pulled the recent changes to master than merged it into this branch. Hopefully it works now. |
SRE recipe using CTS superset + voxceleb as embedding extractor training data (See README). There are very few changes outside the recipe. Let me know if this is not appropriate:
this recording will be used as is, i.e. no VAD will be applied. This is not ideal since
absence of VAD info currently occurs also if VAD was estimated but no speech was found.
However, there may be situations where we don't want to apply any sets, e.g., we may
want to apply VAD to CTS but not voxceleb. Then we need it this way. This means that
utterances for which VAD was ran but no speech detected, should be filtered before
the shards are created. This will be the case if the sets are filtered with
local/filter_utt_accd_dur.py since this script discards recordings with no Speech
according to VAD.
Ideally this should be improved so that recording for which for which no speech was
detected will be removed while file that we don't want to apply VAD to will be kepts.
Possibly by
while keeping the ones with no VAD info, i.e., those for which we do not want to apply VAD.
we don't want to apply VAD for. This info would simply mark the whole segment as speech.
whenever an entry is in the scp but not in utt2spk. This embedding will be skipped.
(This can be the case if files have been added added to the original CTS data folder
since the scp is created by finding all wav files in this directory. In the case of
BUT, we have some extra files here for sanity checks.) If this solution is not appropriate
we can change in the data preparation script so that mismatch between the created scp
and utt2spk is fixed already there.