For song vc what should I do #61

Open
panxin801 opened this issue Sep 14, 2022 · 9 comments
Labels
discussion New research topic

Comments

@panxin801

Hello, and thank you for sharing your great work. I have a few questions.

  1. For singing VC in Mandarin, I tried training a new StarGANv2-VC model with the pretrained ASR and F0 models, but the results do not sound good. Do you have any advice?
  2. For singing VC in Mandarin, do I need to retrain the ASR or F0 model?
    I'm looking forward to your reply, and thank you again.
@sophiefy

sophiefy commented Sep 16, 2022

Hello, panxin! I'm also working on singing VC with StarGANv2-VC. I didn't retrain the F0 and ASR models. Instead, I built a dataset consisting of Mandarin songs plus Mandarin, Japanese, and English speech. This is my result.

@panxin801
Author

@Francis-Komizu Thank you for your reply. Indeed, I think using StarGANv2-VC for singing VC still needs some further work.

@yl4579
Owner

yl4579 commented Sep 16, 2022

@panxin801 I'm currently working on singing conversion using this model with some further modifications for better performance. I may submit my work to INTERSPEECH next year.

@panxin801
Author

@yl4579 Congratulations. I'm looking forward to your work.

@MuruganR96

@yl4579, is INTERSPEECH 2022 in September? If so, could you share the paper link here?

@yl4579
Owner

yl4579 commented Nov 22, 2022

@mraj96 Sorry, I meant INTERSPEECH next year, so it'll be 2023.

@yl4579 yl4579 added the discussion New research topic label Jan 31, 2023
@mayank-git-hub

@yl4579, thank you for your work on StarGANv2-VC. We have been working on making StarGANv2-VC work in the singing domain. Please see our paper, https://arxiv.org/abs/2210.11096, which extends StarGANv2-VC to the singing domain while handling the any-to-any case.

@mayank-git-hub

The main modification that makes StarGANv2-VC work on singing voice is removing the pitch features from the instance normalization layers of the generator and using an absolute pitch reconstruction loss instead of a normalized pitch reconstruction loss.
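
To make the difference between the two pitch losses concrete, here is a minimal sketch in plain Python. It is not code from either paper or from the StarGANv2-VC repository: the function names are hypothetical, "normalized" is assumed to mean dividing each F0 contour by its temporal mean, both losses are taken as a mean absolute difference, and every frame is assumed to be voiced (F0 > 0).

```python
def mean_abs_diff(a, b):
    """Mean absolute (L1) difference between two equal-length F0 contours."""
    if len(a) != len(b):
        raise ValueError("contours must have the same number of frames")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def normalize_by_mean(f0):
    """Divide a contour by its temporal mean (assumes all frames are voiced)."""
    mean = sum(f0) / len(f0)
    return [x / mean for x in f0]


def normalized_pitch_loss(f0_src, f0_conv):
    # Assumed form: compares only the shape of the contours, not their level.
    return mean_abs_diff(normalize_by_mean(f0_src), normalize_by_mean(f0_conv))


def absolute_pitch_loss(f0_src, f0_conv):
    # Assumed form: compares the contours directly in Hz.
    return mean_abs_diff(f0_src, f0_conv)


# Toy contour in Hz and the same contour transposed up one octave.
source = [220.0, 247.0, 262.0, 294.0]
transposed = [2.0 * f for f in source]

print(normalized_pitch_loss(source, transposed))  # shape is identical
print(absolute_pitch_loss(source, transposed))    # level differs by an octave
```

Under these assumptions the normalized loss cannot tell the octave-shifted contour from the original, while the absolute loss penalizes the shift, which is one way to see why the choice matters when the converted singing has to stay on the source melody.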

@billnye2

@mayank-git-hub Do you have a GitHub repository for ROSVC? I couldn't find the source code and am very interested!
