Autoencoder-based codebook for Bag-of-Audio-Words raises CCC for arousal from 0.225 to 0.322 and valence from 0.244 to 0.368 on AVEC 2017 audio data versus standard BoW.
Speaker indexing in large audio databases using anchor models
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
eess.AS 1years
2019 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Bag-of-Audio-Words based on Autoencoder Codebook for Continuous Emotion Prediction
Autoencoder-based codebook for Bag-of-Audio-Words raises CCC for arousal from 0.225 to 0.322 and valence from 0.244 to 0.368 on AVEC 2017 audio data versus standard BoW.