Add Casual Conversation V2 audio scenario #3158

ImKeTT · 2024-11-14T00:09:09Z

Fix scenario subset name errors in audio_run_specs.py
Add missing tags to multilinguality scenarios
Add the Casual Conversation V2 scenario

The Casual Conversation V2 dataset is quite large: 80 .zip files of 30-60 GB each. It has to be downloaded and unzipped manually into the folder.
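For reference, the manual unzip step could be scripted along these lines. This is only a sketch: the function name `extract_archives` and both directory arguments are hypothetical, and it assumes the .zip files have already been downloaded into one local directory. It is not part of this PR.

```python
import zipfile
from pathlib import Path


def extract_archives(archive_dir: str, target_dir: str) -> int:
    """Unzip every .zip file found in archive_dir into target_dir.

    Both paths are placeholders chosen for illustration; point them at
    wherever the archives were downloaded and where the data should live.
    Returns the number of archives that were extracted.
    """
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)

    count = 0
    # Sort for a deterministic order; process one archive at a time so
    # only a single 30-60 GB file is open at any moment.
    for archive in sorted(Path(archive_dir).glob("*.zip")):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
        count += 1
    return count
```

Given the archive sizes, it is worth checking free disk space before running something like this, since the extracted data sits alongside the original .zip files until they are removed.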

I adjusted the age and gender classification tasks to a multiple-choice format.

For age classification, most speakers have a specific age on record, but it’s difficult for current audio LLMs to recognize exact ages. So I grouped the ages into four ranges: "18-30", "31-50", "51+", and "others".
For gender classification, which originally had six categories, I simplified the labels into three options: "male", "female", and "others".
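The grouping described above can be sketched roughly as follows. The function names are hypothetical, and two details are assumptions rather than facts from this PR: that "others" covers missing ages and ages below 18, and that the original gender labels contain the exact strings "male" and "female" (the six original categories are not listed here, so anything else simply falls into "others").

```python
from typing import Optional

# Answer options as described in the PR text.
AGE_CHOICES = ["18-30", "31-50", "51+", "others"]
GENDER_CHOICES = ["male", "female", "others"]


def age_to_choice(age: Optional[int]) -> str:
    """Map a speaker's age to one of the four multiple-choice ranges.

    Assumption: missing ages and ages under 18 are treated as "others".
    """
    if age is None or age < 18:
        return "others"
    if age <= 30:
        return "18-30"
    if age <= 50:
        return "31-50"
    return "51+"


def gender_to_choice(label: Optional[str]) -> str:
    """Collapse an original gender label into three options.

    Assumption: only labels that are exactly "male" or "female"
    (ignoring case and surrounding whitespace) keep their value.
    """
    normalized = (label or "").strip().lower()
    return normalized if normalized in ("male", "female") else "others"
```

For example, `age_to_choice(42)` gives `"31-50"`, and `gender_to_choice(None)` gives `"others"`.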

I'm attaching the scenario_state.json files:

scenario_state_ccv2_age.json
scenario_state_ccv2_gender.json

ImKeTT added 3 commits November 13, 2024 15:29

fix run specs names

6fde6bd

add tags

820678b

add casual conversations2

537abb1

ImKeTT requested a review from teetone November 14, 2024 00:09

ImKeTT added 3 commits November 14, 2024 12:11

fix

2eace7d

fix

21297e6

fix

351bcfe

teetone approved these changes Nov 15, 2024


teetone merged commit dd8a58a into stanford-crfm:main Nov 15, 2024
8 checks passed

ImKeTT deleted the fairness_audio_scenarios branch December 2, 2024 18:56
