Information Theoretic Meta Learning with Gaussian Processes

Alexandre Galashov; Francisco J. R. Ruiz; Michalis K. Titsias; Sotirios Nikoloutsopoulos

arxiv: 2009.03228 · v3 · pith:RKXN6Y6Onew · submitted 2020-09-07 · 💻 cs.LG · cs.AI· stat.ML

Information Theoretic Meta Learning with Gaussian Processes

Michalis K. Titsias , Francisco J. R. Ruiz , Sotirios Nikoloutsopoulos , Alexandre Galashov This is my paper

classification 💻 cs.LG cs.AIstat.ML

keywords informationlearningmetaalgorithmsderiveencodingexistingfew-shot

0 comments

read the original abstract

We formulate meta learning using information theoretic concepts; namely, mutual information and the information bottleneck. The idea is to learn a stochastic representation or encoding of the task description, given by a training set, that is highly informative about predicting the validation set. By making use of variational approximations to the mutual information, we derive a general and tractable framework for meta learning. This framework unifies existing gradient-based algorithms and also allows us to derive new algorithms. In particular, we develop a memory-based algorithm that uses Gaussian processes to obtain non-parametric encoding representations. We demonstrate our method on a few-shot regression problem and on four few-shot classification problems, obtaining competitive accuracy when compared to existing baselines.

This paper has not been read by Pith yet.

Information Theoretic Meta Learning with Gaussian Processes

discussion (0)