End-to-End Residual CNN with L-GM Loss Speaker Verification System

Mengyao Zhu; Xingjian Du; Xuan Shi

arxiv: 1805.00645 · v2 · pith:YRCELBF6new · submitted 2018-05-02 · 💻 cs.SD

End-to-End Residual CNN with L-GM Loss Speaker Verification System

Xuan Shi , Xingjian Du , Mengyao Zhu This is my paper

classification 💻 cs.SD

keywords lossspeakerlarge-marginverificationend-to-endfunctiongaussianmixture

0 comments

read the original abstract

We propose an end-to-end speaker verification system based on the neural network and trained by a loss function with less computational complexity. The end-to-end speaker verification system in this paper consists of a ResNet architecture to extract features from utterance, then produces utterance-level speaker embeddings, and train using the large-margin Gaussian Mixture loss function. Influenced by the large-margin and likelihood regularization, large-margin Gaussian Mixture loss function benefits the speaker verification performance. Experimental results demonstrate that the Residual CNN with large-margin Gaussian Mixture loss outperforms DNN-based i-vector baseline by more than 10% improvement in accuracy rate.

This paper has not been read by Pith yet.

End-to-End Residual CNN with L-GM Loss Speaker Verification System

discussion (0)