I want to perform the fine-tuning of the audio subnetwork to fit my audio classification problem.
To this end, I plan to use the `_construct_linear_audio_network`, `_construct_mel128_audio_network`, and `_construct_mel256_audio_network` functions to load the pre-trained Keras model and then append one or more fully-connected layers to perform the classification.
However, I don't understand the input shape of these models. According to `models.py`, the input shape is `input_shape = (1, asr * audio_window_dur)`, where `asr = 48000` and `audio_window_dur = 1`. What is `asr`, and why does it have that value? Could you please provide an example of feeding the Keras model from a `.wav` file?
I really appreciate any help you can provide.
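For reference, the input shape quoted above can be illustrated with a small NumPy sketch. It assumes that `asr` stands for the audio sample rate in Hz (so `asr * audio_window_dur` is 48000 samples, i.e. one second of audio) and that the clip is already mono and sampled at 48 kHz. The function name `frame_waveform` and the `hop_dur` parameter are hypothetical, introduced only for illustration; they are not taken from `models.py`.

```python
import numpy as np

ASR = 48000            # assumed to mean "audio sample rate" in Hz
AUDIO_WINDOW_DUR = 1   # window length in seconds, as quoted from models.py


def frame_waveform(waveform, sr=ASR, window_dur=AUDIO_WINDOW_DUR, hop_dur=0.5):
    """Split a mono waveform into windows shaped (n_frames, 1, sr * window_dur).

    hop_dur (seconds between window starts) is a hypothetical parameter.
    """
    waveform = np.asarray(waveform, dtype=np.float32)
    if waveform.ndim != 1:
        raise ValueError("expected a mono (1-D) waveform")
    win = int(sr * window_dur)
    hop = int(sr * hop_dur)
    # Zero-pad short clips so that at least one full window exists.
    if waveform.shape[0] < win:
        waveform = np.pad(waveform, (0, win - waveform.shape[0]))
    n_frames = 1 + (waveform.shape[0] - win) // hop
    frames = np.stack([waveform[i * hop:i * hop + win] for i in range(n_frames)])
    # Add the singleton axis so each example matches (1, asr * audio_window_dur).
    return frames[:, np.newaxis, :]


# Two seconds of synthetic audio standing in for samples read from a .wav file.
clip = np.random.uniform(-1.0, 1.0, size=2 * ASR)
batch = frame_waveform(clip)
print(batch.shape)  # (3, 1, 48000)
```

If the assumption about `asr` holds, a batch shaped like this is what a Keras model built with `input_shape = (1, asr * audio_window_dur)` would accept, for example via `model.predict(batch)`. Reading a real `.wav` file would additionally require decoding it to a float array and resampling it to 48 kHz, which this sketch leaves out.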
Hi @mattiacampana, could you please tell me how you got the pre-trained Keras weights for the audio subnetwork, or share any code that reads the model and loads the pre-trained weights?
Thank you.