Citation:

```bibtex
@inproceedings{chen2023pre,
  title={Pre-Finetuning for Few-Shot Emotional Speech Recognition},
  author={Chen, Maximillian and Yu, Zhou},
  booktitle={INTERSPEECH 2023},
  year={2023}
}
```
Paper Link: https://arxiv.org/abs/2302.12921
Request access to the Wav2Vec2.0 Base checkpoint pre-finetuned on four corpora: https://drive.google.com/file/d/1N1JxqN8Ts2OWcoBTiHYt693DZF2sackV/view?usp=share_link
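Once access is granted and the checkpoint is downloaded, it should be loadable with the transformers library for downstream fine-tuning. A minimal sketch, assuming the archive is unpacked to a local directory; the directory name and `num_labels` below are placeholders, not values from this repository:

```python
import numpy as np
import torch
from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2ForSequenceClassification

# Hypothetical local directory containing the unpacked pre-finetuned checkpoint.
CHECKPOINT_DIR = "./wav2vec2-base-prefinetuned"

# Feature extraction is the same as for the original Wav2Vec2.0 Base model.
feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained("facebook/wav2vec2-base")

# num_labels is a placeholder; set it to the number of emotion classes in your
# target corpus. The classification head is freshly initialized, so the model
# still needs to be fine-tuned on your few-shot data before use.
model = Wav2Vec2ForSequenceClassification.from_pretrained(CHECKPOINT_DIR, num_labels=4)

# Sanity check: a forward pass over one second of silent 16 kHz audio.
dummy_audio = np.zeros(16000, dtype=np.float32)
inputs = feature_extractor(dummy_audio, sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
print(logits.shape)  # torch.Size([1, 4])
```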
Repository under construction.
Please additionally cite the corresponding corpora if you use any of them for fine-tuning or pre-finetuning:
- Emotional Speech Dataset: https://github.com/HLTSingapore/Emotional-Speech-Data
- IEMOCAP: https://sail.usc.edu/iemocap/
- Mandarin Affective Speech: https://catalog.ldc.upenn.edu/LDC2007S09
- MSP-Podcast: https://ecs.utdallas.edu/research/researchlabs/msp-lab/MSP-Podcast.html
- MSP-Improv: https://ecs.utdallas.edu/research/researchlabs/msp-lab/MSP-Improv.html
Dependencies: transformers 4.18.0 (install with `pip install transformers==4.18.0`).
Currently, the Trainer class for multitask learning supports only a single GPU.
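On a multi-GPU machine, one workaround is to pin the training process to a single device before any CUDA initialization. A minimal sketch, grounded only in standard CUDA/PyTorch behavior rather than this repository's code:

```python
import os

# Must be set before torch (or anything else) initializes CUDA, so that the
# multitask Trainer only ever sees one device.
os.environ["CUDA_VISIBLE_DEVICES"] = "0"

import torch
print(torch.cuda.device_count())  # 1
```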