Estudo sobre a robustez de técnicas de verificação de locutores com D-vectors
Victor Costa Beraldo, Murilo Bellezoni Loiola

DOI: 10.14209/sbrt.2021.1570724207
Evento: XXXIX Simpósio Brasileiro de Telecomunicações e Processamento de Sinais (SBrT2021)
Keywords: Speaker Verification Neural Networks D-Vectors
Abstract
Recently, speaker verification techniques have been conceived through deep neural networks using d-vectors. In this work, we made experiments to compare different models in situations where we have data for training that were not obtained from the same source as the test data, representing a real problem, where it is necessary to choose a model, without having training data similar to the test data. Comparisons were made with the SincNet, GE2E and Triplet Loss Network models. This work also proposes the SincNet + GE2E model, whose performance is superior to the original GE2E. The SincNet, however, obtained the best performance results in the current scenarios.

Download