You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for your open-source contributions! I have a few questions about the performance reported in the paper. Is the q value always set to 1? If so, it is not intuitive that opt-13B model can reach such high accuracy in those tasks by using ZO optimizers. Could you please provide the specific experimental parameters in Fig. 1?
The text was updated successfully, but these errors were encountered:
Thank you for your open-source contributions! I have a few questions about the performance reported in the paper. Is the q value always set to 1? If so, it is not intuitive that opt-13B model can reach such high accuracy in those tasks by using ZO optimizers. Could you please provide the specific experimental parameters in Fig. 1?
The text was updated successfully, but these errors were encountered: