Fine-tuning the titanet-large model

by nodlehs - opened Dec 5, 2022

Dec 5, 2022

Hi, I am working on a project that uses the titanet model to compute embeddings for audio wav files, but the results aren't that good. How can I fine-tune the model to get better results on my dataset?

nithinraok

NVIDIA org Mar 13, 2023

Some notes on fine-tuning can be found in this tutorial notebook: https://github.com/NVIDIA/NeMo/blob/main/tutorials/speaker_tasks/Speaker_Identification_Verification.ipynb

pgwi

Oct 16, 2023

Hi @nithinraok,

I hope you are doing well. I have fine-tuned the model on the Common Voice dataset (Turkish): https://huggingface.co/pgwi/en_tr_titanet_large. However, I am struggling to work out how to evaluate the EER and WER. Do you have any references I could learn from? Please let me know. Thank you very much.

nithinraok

NVIDIA org Jan 28, 2024

You may refer to the VoxCeleb EER evaluation in this script: https://github.com/NVIDIA/NeMo/blob/main/examples/speaker_tasks/recognition/voxceleb_eval.py
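For anyone who wants to see what the EER number means before running that script, here is a minimal NumPy-only sketch. It is not the code from `voxceleb_eval.py`; the function names `cosine_score` and `compute_eer` are made up for illustration, and it assumes you already have one embedding per audio file plus a list of trials labeled 1 (same speaker) or 0 (different speaker).

```python
import numpy as np


def cosine_score(emb_a, emb_b):
    """Cosine similarity between two embedding vectors (hypothetical helper)."""
    a = np.asarray(emb_a, dtype=float)
    b = np.asarray(emb_b, dtype=float)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))


def compute_eer(scores, labels):
    """Approximate equal error rate for verification trials.

    scores: similarity per trial (higher means more likely same speaker).
    labels: 1 for same-speaker trials, 0 for different-speaker trials.
    Tied scores are not treated specially; this is only a sketch.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)
    n_target = int(labels.sum())
    n_nontarget = len(labels) - n_target
    if n_target == 0 or n_nontarget == 0:
        raise ValueError("need at least one target and one non-target trial")

    # Sort trials from highest to lowest score, then sweep a threshold
    # that accepts the top-k trials for k = 0 .. N.
    order = np.argsort(-scores, kind="mergesort")
    sorted_labels = labels[order]
    accepted_targets = np.concatenate([[0], np.cumsum(sorted_labels)])
    accepted_nontargets = np.concatenate([[0], np.cumsum(1 - sorted_labels)])

    far = accepted_nontargets / n_nontarget   # false acceptance rate
    frr = 1.0 - accepted_targets / n_target   # false rejection rate

    # The EER is where FAR and FRR cross; take the closest sweep point.
    idx = int(np.argmin(np.abs(far - frr)))
    return float((far[idx] + frr[idx]) / 2.0)


if __name__ == "__main__":
    # Toy trial list: two same-speaker and two different-speaker pairs.
    toy_scores = [0.9, 0.6, 0.7, 0.2]
    toy_labels = [1, 1, 0, 0]
    print(f"EER: {compute_eer(toy_scores, toy_labels):.2%}")
```

Note that EER is the usual metric for a speaker verification model like this one. WER is a speech recognition metric and does not apply to speaker embeddings.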
