pith. sign in

arxiv: 1805.04136 · v1 · pith:JTYWXGATnew · submitted 2018-05-10 · 💻 cs.CV

Unsupervised Deep Representations for Learning Audience Facial Behaviors

classification 💻 cs.CV
keywords audiencebehaviorsdeepfacialgenerativelatentlearningnetwork
0
0 comments X
read the original abstract

In this paper, we present an unsupervised learning approach for analyzing facial behavior based on a deep generative model combined with a convolutional neural network (CNN). We jointly train a variational auto-encoder (VAE) and a generative adversarial network (GAN) to learn a powerful latent representation from footage of audiences viewing feature-length movies. We show that the learned latent representation successfully encodes meaningful signatures of behaviors related to audience engagement (smiling & laughing) and disengagement (yawning). Our results provide a proof of concept for a more general methodology for annotating hard-to-label multimedia data featuring sparse examples of signals of interest.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.