Text-to-Speech Conversion for Brazilian Portuguese Using Tacotron 2 with a Griffin-Lim Vocoder
Rodrigo K Rosa, Danilo Silva
DOI: 10.14209/sbrt.2021.1570727280
Event: XXXIX Simpósio Brasileiro de Telecomunicações e Processamento de Sinais (SBrT2021)
Keywords: Neural networks; Speech synthesis; Tacotron 2; Brazilian Portuguese
This paper presents the training of a state-of-the-art neural network model, Tacotron 2, on an open-source voice dataset from the Common Voice project. Two approaches were evaluated: training the model from scratch and applying transfer learning from a pre-trained English model. The results show that the model can be trained with limited data resources, and that the model trained from scratch produced fewer synthesis errors.