Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inquiry About EDC Loss Setting, YAML Configuration for Training from Scratch #1

Open
Fly-beep-256 opened this issue Oct 24, 2024 · 0 comments

Comments

@Fly-beep-256
Copy link

Fly-beep-256 commented Oct 24, 2024

Thank you for your excellent work on this repository. The implementation has been very insightful, and I truly appreciate the effort you’ve put into making this project available for the community.

However, I’ve encountered a few questions while working with the code, and I was hoping to get some clarification.

  1. EDC Loss Configuration:
    In the YAML configuration file, I noticed that the use_edc_loss_rir is set to false. Could you kindly explain the reasoning behind disabling this loss function in the current setup? Is there a specific scenario where you recommend turning this on, or is it omitted intentionally for better performance?

  2. Training from Scratch:
    If we intend to train the model entirely from scratch, are there any additional changes to the YAML configuration that we should be aware of? Specifically, would you suggest any adjustments to the optimizer, learning rates, or loss functions when starting fresh?

  3. Epoch Discrepancy:
    I also noticed a discrepancy between the repository’s implementation and the paper regarding the number of training epochs. In the paper, 200 epochs are mentioned, but the current setup doesn’t seem to correspond to this. Could you clarify if this is an intentional change in the code, and whether there are any recommendations on the number of epochs for best results?

Could you also share your specific configuration and techniques for training?

Thank you in advance for your help!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant