Stress and Adaptation: Applying Anna Karenina Principle in Deep Learning for Image Classification

Hanna Antson; Jaesik Choi; Kallol Roy; Nesma Mahmoud; Osamu Shimmi

arxiv: 2302.11380 · v1 · pith:2NTLJS6Cnew · submitted 2023-02-22 · 💻 cs.LG · cs.AI

Stress and Adaptation: Applying Anna Karenina Principle in Deep Learning for Image Classification

Nesma Mahmoud , Hanna Antson , Jaesik Choi , Osamu Shimmi , Kallol Roy This is my paper

classification 💻 cs.LG cs.AI

keywords deepmodelsfamiliesgeneralizableannahappyinternalkarenina

0 comments

read the original abstract

Image classification with deep neural networks has reached state-of-art with high accuracy. This success is attributed to good internal representation features that bypasses the difficulties of the non-convex optimization problems. We have little understanding of these internal representations, let alone quantifying them. Recent research efforts have focused on alternative theories and explanations of the generalizability of these deep networks. We propose the alternative perturbation of deep models during their training induces changes that lead to transitions to different families. The result is an Anna Karenina Principle AKP for deep learning, in which less generalizable models unhappy families vary more in their representation than more generalizable models happy families paralleling Leo Tolstoy dictum that all happy families look alike, each unhappy family is unhappy in its own way. Anna Karenina principle has been found in systems in a wide range: from the surface of endangered corals exposed to harsh weather to the lungs of patients suffering from fatal diseases of AIDs. In our paper, we have generated artificial perturbations to our model by hot-swapping the activation and loss functions during the training. In this paper, we build a model to classify cancer cells from non-cancer ones. We give theoretical proof that the internal representations of generalizable happy models are similar in the asymptotic limit. Our experiments verify similar representations of generalizable models.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Robusto-2: Benchmarking Humans & VLMs for Autonomous Driving in Lima & New York City
cs.CV 2026-06 unverdicted novelty 4.0

Humans and VLMs diverge in VQA responses on driving footage, with human answers consistent across origins and no strong geography modulation observed, likely due to high OOD nature.