An overview of voice conversion systems

· 2020 · DOI 10.1016/j.specom

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Streaming Structured Inference with Flash-SemiCRF

cs.LG · 2026-04-20 · unverdicted · novelty 6.0

Flash-SemiCRF enables exact semi-CRF inference on long sequences by evaluating edge potentials from compact prefix sums and streaming the forward-backward pass while preserving exact gradients.

Hierarchical Sequence to Sequence Voice Conversion with Limited Data

eess.AS · 2019-07-15 · unverdicted · novelty 4.0

Hierarchical seq2seq model for parallel voice conversion pretrained as autoencoder on single-speaker data then adapted to limited multispeaker data, using mel spectrograms converted via wavenet vocoder.

Towards Robust Arabic Speech Emotion Recognition with Deep Learning

cs.SD · 2026-06-09 · unverdicted · novelty 3.0

CNN-Transformer hybrid reaches 98.1% accuracy on Arabic SER using EYASE and BAVED datasets, outperforming CNN-LSTM and fine-tuned wav2vec 2.0.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Streaming Structured Inference with Flash-SemiCRF cs.LG · 2026-04-20 · unverdicted · none · ref 18
Flash-SemiCRF enables exact semi-CRF inference on long sequences by evaluating edge potentials from compact prefix sums and streaming the forward-backward pass while preserving exact gradients.
Hierarchical Sequence to Sequence Voice Conversion with Limited Data eess.AS · 2019-07-15 · unverdicted · none · ref 22
Hierarchical seq2seq model for parallel voice conversion pretrained as autoencoder on single-speaker data then adapted to limited multispeaker data, using mel spectrograms converted via wavenet vocoder.
Towards Robust Arabic Speech Emotion Recognition with Deep Learning cs.SD · 2026-06-09 · unverdicted · none · ref 3
CNN-Transformer hybrid reaches 98.1% accuracy on Arabic SER using EYASE and BAVED datasets, outperforming CNN-LSTM and fine-tuned wav2vec 2.0.

An overview of voice conversion systems

fields

years

verdicts

representative citing papers

citing papers explorer