Model: nvidia/speakerverification_en_titanet_large (NeMo)
Using on numpy array instead of audio files
#7
by Respair - opened
Hi. I checked the code, but it was really finicky to make this work on NumPy audio arrays.
Is there an easier way to get the embeddings from array inputs?
I assume you already figured this out, but for future reference you can just use "speaker_model.infer_segment(wav_np_array)".
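For anyone landing here later, this is a rough sketch of how the infer_segment call could be wrapped for array input. The helper names (to_mono_float32, embed, cosine_similarity, load_titanet) are made up for illustration and are not part of NeMo. It assumes the array is already sampled at 16 kHz (no resampling is done here), that 2-D input is shaped (samples, channels), and that infer_segment returns an (embedding, logits) tuple as recent NeMo versions appear to. Check your installed version, since older releases may not have this method.

```python
import numpy as np


def to_mono_float32(wav):
    """Convert an audio array to 1-D float32 (hypothetical helper, not a NeMo API)."""
    wav = np.asarray(wav)
    if np.issubdtype(wav.dtype, np.integer):
        # Integer PCM (e.g. int16) is scaled into roughly [-1, 1).
        wav = wav.astype(np.float32) / float(np.iinfo(wav.dtype).max + 1)
    else:
        wav = wav.astype(np.float32)
    if wav.ndim == 2:
        # Assumption: shape is (samples, channels); average channels to mono.
        wav = wav.mean(axis=1)
    elif wav.ndim != 1:
        raise ValueError(f"expected 1-D or 2-D audio, got shape {wav.shape}")
    return wav


def embed(speaker_model, wav):
    """Return a flat NumPy embedding for one in-memory audio segment."""
    segment = to_mono_float32(wav)
    result = speaker_model.infer_segment(segment)
    # Assumption: the result is (embedding, logits); accept a bare embedding too.
    emb = result[0] if isinstance(result, tuple) else result
    if hasattr(emb, "detach"):
        # Torch tensor: move to CPU and convert to NumPy.
        emb = emb.detach().cpu().numpy()
    return np.asarray(emb, dtype=np.float32).reshape(-1)


def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    a = np.asarray(a, dtype=np.float64).reshape(-1)
    b = np.asarray(b, dtype=np.float64).reshape(-1)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))


def load_titanet():
    """Load the pretrained model; imported lazily so the helpers work without NeMo."""
    from nemo.collections.asr.models import EncDecSpeakerLabelModel

    return EncDecSpeakerLabelModel.from_pretrained(
        "nvidia/speakerverification_en_titanet_large"
    )
```

Usage would then look something like: speaker_model = load_titanet(), emb_a = embed(speaker_model, wav_a), emb_b = embed(speaker_model, wav_b), followed by cosine_similarity(emb_a, emb_b) to compare the two speakers. Any decision threshold on that score is up to you and is not something this sketch sets.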