pith. sign in

Bishop.Pattern Recognition and Machine Learning

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

citation-role summary

background 1 dataset 1

citation-polarity summary

years

2026 7

verdicts

UNVERDICTED 7

representative citing papers

Fitting Multilinear Polynomials for Logic Gate Networks

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

Fitting logic gates as 4D multilinear polynomials with covariance Jacobian selection matches or beats 16D softmax baselines on seven datasets and remains stable at 12-layer depth where the baseline drops 37 points on CIFAR-10.

In-Context Learning Operates as Concept Subspace Learning

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

In-context learning decomposes into concept-coordinate regression plus off-subspace leakage, with recoverable task information concentrating in a 68-73 dimensional task-aligned subspace of the residual stream that restores 78.8% of the accuracy gap in Llama-3-8B experiments.

Why Self-Supervised Encoders Want to Be Normal

cs.IT · 2026-04-30 · unverdicted · novelty 6.0

Self-supervised encoders prefer isotropic Gaussian latent states because the Information Bottleneck, recast as rate-distortion over the predictive manifold, makes these states optimal for target-neutral representations.

citing papers explorer

Showing 7 of 7 citing papers.

  • Fitting Multilinear Polynomials for Logic Gate Networks cs.LG · 2026-05-09 · unverdicted · none · ref 27

    Fitting logic gates as 4D multilinear polynomials with covariance Jacobian selection matches or beats 16D softmax baselines on seven datasets and remains stable at 12-layer depth where the baseline drops 37 points on CIFAR-10.

  • In-Context Learning Operates as Concept Subspace Learning cs.LG · 2026-05-12 · unverdicted · none · ref 46

    In-context learning decomposes into concept-coordinate regression plus off-subspace leakage, with recoverable task information concentrating in a 68-73 dimensional task-aligned subspace of the residual stream that restores 78.8% of the accuracy gap in Llama-3-8B experiments.

  • Bayesian low-rank latent-cluster regression for mixed health outcomes stat.ME · 2026-05-12 · unverdicted · none · ref 4

    A Bayesian finite mixture of cluster-specific low-rank regressions for mixed Gaussian-Bernoulli-negative binomial outcomes, with posterior contraction results and WAIC-based tuning of clusters and rank.

  • Laplace Variational Inference for Dirichlet Process Mixtures of Marked Poisson Point Processes stat.ME · 2026-05-10 · unverdicted · none · ref 5

    A Dirichlet process mixture model for marked Poisson point processes with squared-link intensities and Laplace variational inference jointly infers clusters, cluster count, and continuous mark-specific intensity surfaces.

  • Why Self-Supervised Encoders Want to Be Normal cs.IT · 2026-04-30 · unverdicted · none · ref 9

    Self-supervised encoders prefer isotropic Gaussian latent states because the Information Bottleneck, recast as rate-distortion over the predictive manifold, makes these states optimal for target-neutral representations.

  • From Imitation to Interaction: Mastering Game of Schnapsen with Shallow Reinforcement Learning cs.AI · 2026-05-16 · unverdicted · none · ref 2

    Reinforcement learning with shallow networks produces stronger Schnapsen agents than supervised imitation and yields statistically significant gains against strong search-based baselines when paired with lookahead.

  • Supervised Latent Restructuring for Small-Data Quantum Learning in Plant Phenomics cs.LG · 2026-05-19 · unverdicted · none · ref 14

    Supervised LDA restructuring of PCA-compressed embeddings raises silhouette separability from near zero to 0.197 in plant phenomics but yields mixed classical ML gains and persistent challenges for quantum kernel alignment under limited compute.