Example of fine-tuning the audio sub-network. #91

mattiacampana · 2022-07-16T13:33:32Z

I want to fine-tune the audio subnetwork for my own audio classification problem.
To do this, I plan to use the _construct_linear_audio_network, _construct_mel128_audio_network, and _construct_mel256_audio_network functions to load the pre-trained Keras model and then append one or more fully-connected layers for classification.

However, I don't understand the input shape of these models. According to models.py, the input shape is input_shape = (1, asr * audio_window_dur), where asr = 48000 and audio_window_dur = 1. What is asr, and why does it have that value? Could you please provide an example of running the Keras model on a .wav file?

I really appreciate any help you can provide.
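
For readers with the same question, here is a minimal sketch of what that input shape implies. It assumes that asr stands for the audio sample rate in Hz, so that each model input is one 1-second window of 48000 samples with a leading channel axis of size 1. The helper frame_audio and its hop_seconds parameter are hypothetical and not part of the library, and a synthetic tone stands in for a decoded .wav file, which would first need to be converted to mono and resampled to 48 kHz.

```python
import numpy as np

ASR = 48000            # value from models.py; ASSUMED to mean "audio sample rate" in Hz
AUDIO_WINDOW_DUR = 1   # value from models.py; window duration in seconds
WINDOW_LEN = ASR * AUDIO_WINDOW_DUR  # samples per window, matching input_shape[1]


def frame_audio(audio, hop_seconds=0.5):
    """Hypothetical helper: split a mono waveform into windows.

    Returns an array of shape (n_windows, 1, WINDOW_LEN), i.e. a batch whose
    per-example shape is (1, asr * audio_window_dur).
    """
    audio = np.asarray(audio, dtype=np.float32)
    if audio.ndim != 1:
        raise ValueError("expected a mono, 1-D waveform")
    hop_len = int(round(hop_seconds * ASR))
    if hop_len <= 0:
        raise ValueError("hop_seconds must be positive")

    # Number of windows needed to cover the whole signal.
    if audio.size <= WINDOW_LEN:
        n_windows = 1
    else:
        n_windows = 1 + int(np.ceil((audio.size - WINDOW_LEN) / hop_len))

    # Zero-pad the tail so that the last window is complete.
    padded = np.zeros(WINDOW_LEN + (n_windows - 1) * hop_len, dtype=np.float32)
    padded[:audio.size] = audio

    starts = np.arange(n_windows) * hop_len
    windows = np.stack([padded[s:s + WINDOW_LEN] for s in starts])
    return windows[:, np.newaxis, :]  # add the channel axis of size 1


# Synthetic 2.5 s, 440 Hz tone standing in for a decoded 48 kHz mono .wav file.
t = np.arange(int(2.5 * ASR)) / ASR
tone = (0.1 * np.sin(2 * np.pi * 440.0 * t)).astype(np.float32)

batch = frame_audio(tone)
print(batch.shape)
```

If the loaded Keras model does take raw waveform windows of this shape, a batch like this could be passed to its predict method; whether that holds for a given front end is something to check against models.py.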

sreenivasaupadhyaya · 2023-02-14T08:20:22Z

Hi @mattiacampana, could you please tell me how you got the pre-trained Keras weights for the audio subnetwork, or share any code that builds the model and loads the pre-trained weights?
Thank you.
