Pith | pre-print review for science

6

physics.bio-ph 2026-05-25 2 theorems

Lorentzian family is unique invariant under Riccati transport

by Hugues Berry (AISTROSIGHT), Leonardo Trujillo (AISTROSIGHT)

Geometric Origin of Exact Mean-Field Reductions: M{\"o}bius Symmetry and the Lorentzian Ansatz

Reformulating dynamics on the circle shows the Cauchy law is the sole rotation-invariant measure, unifying exact mean-field reductions.

abstract click to expand

Low-dimensional descriptions of large systems of coupled oscillators and spiking neurons rely heavily on the Lorentzian Ansatz. We show that its privileged role is geometric rather than heuristic: for the transport induced by Riccati dynamics, the Cauchy-Lorentz family indeed emerges as the unique connected two-dimensional family of continuous probability densities that is invariant under the induced projective transport. The key step of the demonstration is to reformulate the dynamics on the circle, where the problem reduces to the uniqueness of the rotation-invariant probability measure. Under stereographic projection, this yields the standard Cauchy law and, under the full projective action, the Lorentzian family. This result gives a unified geometric foundation for the Ott-Antonsen [Chaos 18, 037113 (2008)] and Montbri{\'o}-Paz{\'o}-Roxin [Phys. Rev. X 5, 021028 (2015)] reductions, explains the failure of Gaussian closures, and identifies the structural condition underlying exact two-parameter reductions.

0

4

cs.LG 2026-05-25 2 theorems

Low dimension suffices for near-max retrieval margins

by Kiril Bangachev, Guy Bresler +2 more

Is Dimensionality a Barrier for Retrieval Models?

Dimension O(m^{-2} log n) nearly matches the infinite-dimension margin for any relevance matrix A.

abstract click to expand

Why does the low dimensionality of representations, typically $d\approx 1000$, not prevent modern embedding-based retrieval models from scaling to billions, or even trillions, of data points? To answer this question, we study maximal-margin embeddings in the following retrieval model, classically studied in communication complexity [PS86] and more recently in embedding-based retrieval [WBNL26]. Let $A\in \{0,1\}^{N\times n}$ be a matrix indicating whether each of $N$ queries is relevant to each of $n$ documents. We are interested in the largest margin $m>0,$ denoted by $\mathsf{m}^{\mathsf{rd}}(d, A),$ for which there exist unit norm embeddings of the queries and documents $\{U_j\}_{j = 1}^N, \{V_i\}_{i = 1}^n$ with the following property. $\langle U_j, V_i\rangle \ge m$ whenever $A_{ji} = 1$ and $\langle U_j, V_i\rangle \le -m$ otherwise. A large margin is a key proxy for representation quality: it controls both robustness to perturbations and compositional generalization across queries. Our main theorem establishes that the best possible margin without a restriction on the dimension, $\mathsf{m}^{\mathsf{rd}}(+\infty, A),$ can be nearly achieved in dimension $d = O(\mathsf{m}^{\mathsf{rd}}(+\infty, A)^{-2}\log n)$ which improves a theorem of [BDES02]. Together with a matching lower bound in Theorem 1.5, we conclude that when $A\in \{0,1\}^{\binom{n}{k}\times n}$ is the matrix containing all possible $k$-sparse rows once, dimension $d = O(k\log (n/k))$ is necessary and sufficient for the maximal possible margin $\mathsf{m}^{\mathsf{rd}}(+\infty, A) = \Theta(k^{-1/2})$ in this setting. This fully resolves the setup of [WBNL26]. We also give several constructions for large margins when $d = o(k\log (n/k)).$ Finally, we empirically test the InfoNCE and sigmoid losses for producing large margin embeddings and demonstrate a clear advantage of the sigmoid loss.

0

4

cs.CV 2026-05-25 2 theorems

Transformer predicts saliency from event camera streams

by Romaric Mazna, Jean Martinet +1 more

Exploring deep learning for Event-Based Saliency Prediction with a Transformer-based model

Trained on synthetic data from RGB benchmarks, SEST beats prior event methods and works on real streams without retraining.

abstract click to expand

Saliency prediction has been extensively studied in RGB images and videos as a computational model of human visual attention. In contrast, predicting saliency from event-based data remains largely unexplored, despite the biological inspiration and favorable sensing properties of event cameras. Two obstacles have held this direction back: the absence of large-scale event saliency datasets, and the lack of a strong baseline. In this paper, we introduce SEST (Swin Event-based Saliency Transformer), a transformer-based model for saliency prediction from event data, bridging the data scarcity barrier through event-native pretraining and synthetic supervision. SEST leverages a self-supervised pretrained event-based Swin Transformer backbone combined with a lightweight CNN decoder to produce dynamic saliency maps. To address the scarcity of annotated event-based saliency data, we introduce two new benchmark datasets, N-DHF1K and N-UCF Sports, generated from large-scale RGB saliency benchmarks. Experimental results show that SEST clearly outperforms existing event-based saliency methods and narrows the performance gap with state-of-the-art RGB models. Zero-shot evaluation on a real event camera dataset further demonstrates that our model trained on synthetic data remains transferable on real event streams. To the best of our knowledge, this work is the first to apply deep learning to event-based saliency prediction, opening a new research direction at the intersection of event-based vision and neuromorphic visual attention.

0

3

math.CO 2026-05-22 Recognition

Ore degree-sum condition yields near-perfect H-tilings

by Yuping Gao, Yilin Guo +2 more

An Ore-type condition for H-tilings in graphs

For any fixed H a constant C(H) exists so that large graphs meeting the non-edge degree threshold contain an H-tiling missing at most C(H)

abstract click to expand

A graph $G$ admits an $H$-tiling if it contains a collection of vertex-disjoint copies of $H$. In this paper, we confirm a conjecture proposed by K\"{u}hn, Osthus, and Treglown by showing that for any given graph $H$, there exists a constant $C(H)$ such that the following holds. If $G$ is a sufficiently large $n$-vertex graph satisfying $d(x) + d(y) \geq 2\left(1 - 1/\chi_{\text{cr}}(H)\right)n$ for all nonadjacent vertices $x, y \in V(G)$, then $G$ contains an $H$-tiling covering all but at most $C(H)$ vertices. Here $\chi_{\text{cr}}(H)$ denotes the critical chromatic number of $H$.

0

3

math.CO 2026-05-22 2 theorems

Degree sum condition ensures almost H-tiling in large graphs

by Yuping Gao, Yilin Guo +2 more

An Ore-type condition for H-tilings in graphs

Nonadjacent vertices with d(x)+d(y) at least 2(1-1/χ_cr(H))n allow an H-tiling missing only bounded vertices.

abstract click to expand

A graph $G$ admits an $H$-tiling if it contains a collection of vertex-disjoint copies of $H$. In this paper, we confirm a conjecture proposed by K\"{u}hn, Osthus, and Treglown by showing that for any given graph $H$, there exists a constant $C(H)$ such that the following holds. If $G$ is a sufficiently large $n$-vertex graph satisfying $d(x) + d(y) \geq 2\left(1 - 1/\chi_{\text{cr}}(H)\right)n$ for all nonadjacent vertices $x, y \in V(G)$, then $G$ contains an $H$-tiling covering all but at most $C(H)$ vertices. Here $\chi_{\text{cr}}(H)$ denotes the critical chromatic number of $H$.

0

3

cs.CL 2026-05-25 2 theorems

Language flips which jailbreaks work on frontier MLLMs

by Casey Ford, Madison Van Doren +2 more

Same Model, Different Weakness: How Language and Modality Reshape the Jailbreak Attack Surface in Frontier MLLMs

Spanish reduces role-play success but increases visual attack success, reversing safety rankings across models.

abstract click to expand

The attack surface of a multimodal large language model (MLLM) is language-dependent in ways that reveal the mechanistic structure of alignment failures. We present the first systematic cross-lingual, multimodal red-teaming study comparing jailbreak vulnerability in US English (en-US) and Mexican Spanish (es-MX) across four frontier MLLMs: Claude Sonnet 4.5, GPT-5, Pixtral Large, and Qwen Omni. Using a fixed adversarial benchmark of 363 diverse prompt scenarios administered in text-only and multimodal conditions, we collected 52,272 harm ratings and binary attack success judgements from matched panels of nine native-speaker annotators per language group. Our central finding is that language does not scale vulnerability uniformly. Bayesian mixed-effects analyses reveal that linguistic framing attacks such as role-play become substantially less effective under Spanish prompting, while visually explicit multimodal attacks become more effective, which directly implicates the prompt-language interface rather than global annotator leniency. This dissociation indicates that linguistic and visual alignment failures operate through distinct mechanisms, and that switching language is sufficient to expose that separation. The practical consequence is that safety rankings are not preserved across languages. Qwen Omni overtakes Pixtral Large as the most vulnerable model among es-MX participants, a rank reversal no scalar correction of English-condition scores could recover, and absolute attack success rates have declined across model generations without closing the gaps between them. These findings demonstrate that safety evaluation frameworks treating language and modality as independent dimensions fundamentally misspecify the attack surface of globally deployed MLLMs, and must be redesigned accordingly.

0

1

math.CO 2026-05-20 2 theorems

Hypercube geodesics change color at most (π/2)√n times

by Lawrence Hollom

Hypercube geodesics with few colour changes

Any 2-edge-coloring admits a shortest antipodal path with expected (π/2 + o(1))√n color changes, replacing linear bounds.

abstract click to expand

What is the maximum, over all 2-colourings of the edges of the $n$-dimensional hypercube $Q_n$, of the minimal number of times a path between a vertex $v$ and its antipode $\bar{v}$ changes colour? A conjecture of Norine, in a form due to Feder and Subi, states that this maximum should be 1. The previous best-known upper bound on the number of colour changes was $(\tfrac{5}{16} + o(1))n$ due to Kirchweger, Peitl, Subercaseaux, and Szeider. We improve this bound and answer a question of Leader and Long by finding a geodesic path with at most $(\tfrac{\pi}{2} + o(1))\sqrt{n}$ colour changes. In fact, we show that this is the expected number of colour changes for a uniformly random start vertex. This is optimal (up to the constant) when the start vertex is chosen uniformly at random.

0

1

math.CO 2026-05-20 1 theorem

Hypercube geodesics need only sqrt(n) color changes in worst 2-coloring

by Lawrence Hollom

Hypercube geodesics with few colour changes

The expected number for random starts is (π/2 + o(1))√n, proving an upper bound that matches the lower bound up to constant.

abstract click to expand

What is the maximum, over all 2-colourings of the edges of the $n$-dimensional hypercube $Q_n$, of the minimal number of times a path between a vertex $v$ and its antipode $\bar{v}$ changes colour? A conjecture of Norine, in a form due to Feder and Subi, states that this maximum should be 1. The previous best-known upper bound on the number of colour changes was $(\tfrac{5}{16} + o(1))n$ due to Kirchweger, Peitl, Subercaseaux, and Szeider. We improve this bound and answer a question of Leader and Long by finding a geodesic path with at most $(\tfrac{\pi}{2} + o(1))\sqrt{n}$ colour changes. In fact, we show that this is the expected number of colour changes for a uniformly random start vertex. This is optimal (up to the constant) when the start vertex is chosen uniformly at random.

0

5

cs.LG 2026-05-22 3 theorems

RICA defines local disentanglement with a Hessian-Ricci tensor

by Edmond Cunningham

Disentanglement Beyond Generative Models with Riemannian ICA

The construction drops ICA's global generative requirement while recovering sources consistently across manifold representations.

abstract click to expand

There is a gap between the theoretical foundations of disentanglement and the practice of modern representation learning. Existing theoretical frameworks, particularly Independent Component Analysis (ICA) and its nonlinear variants, assume a generative model with statistically independent latent variables underlying the data so that disentanglement amounts to identifying the latents that could have generated the data. This generative framework is interpretable and theoretically justified, but its strong assumptions make it difficult to apply to modern representation learning. Modern pretrained encoders often learn features that exhibit disentangled properties without making generative assumptions, yet there is no general theory for interpreting these features as independent factors of variation. We take a step toward such a theory by introducing Riemannian ICA (RICA), which replaces ICA's global generative model with local geometric structure. RICA is founded on the observation that in ICA, the factors of variation underlying a data point can be understood through radial curves emanating from the point that map to axis-aligned lines in the latent space. We formalize this perspective using Riemannian geometry and introduce our theory in a way that is consistent with the existing generative approach. Our main contribution is the disentanglement tensor, which encodes a second-order notion of disentanglement that we call pointwise disentanglement. This tensor depends on the Hessian of the data log likelihood as well as the Ricci curvature induced by the model. In a controlled source recovery setting with known ground-truth sources, RICA recovers sources across several manifolds, while the success of ICA baselines depends on the coordinates used to represent the observations. Our work provides a theoretical basis for studying local disentanglement without assuming a global generative model.

0

5

cs.LG 2026-05-22 2 theorems

Graph tokenization fixes transformer depth for structure recovery

by Maya Bechler-Speicher, Gilad Yehudai +4 more

Lost in Tokenization: Fundamental Trade-offs in Graph Tokenization for Transformers

Random-walk maps lose information permanently while spectral maps preserve it but hinder local tasks, creating provable depth gaps between 2

abstract click to expand

Transformers have become a central architecture for graph learning, but their application to graphs requires first choosing a tokenization: a graph-to-token map that determines which structural information is exposed at the input. In this work, we show that this choice is a fundamental component of transformer expressivity. We examine three tokenizations that serve as building blocks for many existing graph tokenizations: spectral, random-walk, and adjacency tokenizations. We prove that different tokenizations induce distinct depth regimes: the same graph computation may be realizable by a shallow transformer under one tokenization, while requiring substantially larger depth under another. For example, we prove that random-walk tokenization is lossy for any walk length, making it impossible in general to recover the graph from it, and that while spectral tokenization is lossless, it is ill-conditioned for local tasks. We further show that although both random-walk and spectral tokenizations are derived from adjacency information, it is impossible for a limited-depth transformer to convert between tokenization families in general. In particular, we establish lower bounds and impossibility results showing that unfavorable tokenizations may preclude the efficient recovery of more suitable structural representations. Finally, we complement our theory with controlled experiments on synthetic and real-world tasks, validating the predicted separations and showing that different tasks favor different structural views, and combining complementary tokenizations allows the transformer to leverage distinct signals from each representation.

0

5

cs.LG 2026-05-22 2 theorems

Stronger backdoor triggers can raise clean accuracy in high dimensions

by Donald Flynn, Hadas Yaron Goldhirsh +2 more

When Stronger Triggers Backfire: A High-Dimensional Theory of Backdoor Attacks

Proportional-regime analysis shows attack success peaks then falls while clean performance improves with training trigger strength.

abstract click to expand

Backdoor poisoning attacks behave counter-intuitively in high dimensions: stronger training triggers can help the defender. We study regularised generalised linear models on Gaussian-mixture data in the proportional regime ($p/n \to \kappa$), varying the training trigger strength $\alpha$ against a fixed test trigger. Three phenomena emerge: (i) clean test accuracy increases with $\alpha$; (ii) attack success peaks at a finite $\alpha$ and then declines; and (iii) the most damaging trigger direction is the minimum eigenvector of the data covariance. We prove all three results in closed form for the squared loss, and extend (i) and (ii) to general convex GLM losses via a Gaussian-proxy fixed-point system. We identify a finite-sample noise floor proportional to $\kappa$ as the mechanism behind (i), invisible to classical $n \gg p$ analysis. Experiments on CIFAR-10 and Gaussian surrogates match the theory closely; ResNet-18 experiments show the same phenomena beyond the convex setting.

0

5

math.CO 2026-05-22 2 theorems

Projection of flags complex gives sub-polynomial expander

by Max Hopkins, Arka Ray

A Simple Sub-Polynomial Degree Coboundary Expander

A combinatorial construction from subspace chains achieves spectral and coboundary expansion at once, yielding near-linear PCPs and hypergr

abstract click to expand

High dimensional expanders simultaneously satisfying spectral and combinatorial (coboundary) expansion have recently played a major role in breakthroughs in PCP and coding theory, but the only known construction of such complexes is extremely involved, requiring deep algebraic number theory. In this work, we give an extremely simple combinatorial construction of a sub-polynomial degree complex based on projections of the flags complex (subspace chains) that is (i) a local spectral expander, (ii) a coboundary expander, and (iii) a swap coboundary expander. As a corollary, we also give the first near-linear size combinatorial hypergraphs with good agreement tests in the '1%' regime, and a simple PCP construction with near-linear size.

0

5

astro-ph.GA 2026-05-22 2 theorems

This paper reports a mean velocity difference of about 0.05 km/s between ions traced by…

by Doris Arzoumanian, Silvia Spezzano +10 more

Probing the ion-neutral drift velocity towards the L1544 prestellar core: Detection of ambipolar diffusion using N₂D^+ and para-NH₂D

Detection of ~0.05 km/s ion-neutral velocity drift in L1544 interpreted as the first observational signature of ambipolar diffusion in a…

abstract click to expand

The dynamical role of the magnetic field in the star formation process is tightly linked to the coupling between matter and the field. This coupling is due to the interaction between ions and neutrals in the partially ionized interstellar medium. When the ionization degree drops in the dense environment of prestellar cores, the magnetic field and the matter may decouple, leading to differences in the infalling velocities of ions and neutrals known as ambipolar diffusion. The onset of gravitational collapse resulting from ion-neutral decoupling has never been observed. The aim of this work is to search for signatures of ambipolar diffusion within a prestellar core. We observed the deuterated N$_2$D$^+$ ion and the neutral para-NH$_2$D species towards the prototypical prestellar core L1544. These two species are ideal tracers of prestellar cores sampling the same high densities in the core interior. We compared the velocity centroid and linewidth maps of the ion-neutral pair. We find a mean ion-neutral velocity difference of $\sim$0.05 km/s towards the core. By comparing with predictions from self-consistent calculations of the ambipolar resistivity including dust grain growth, we interpret the observed ion-neutral velocity difference in L1544 as a signature of ambipolar diffusion. We do not detect a significant ion-neutral linewidth difference that may be attributed to the subsonic infall motions of the gas in L1544 and geometrical effects in the presence of inclination. These results emphasize the role of dust grain growth at the prestellar core stage in setting the ambipolar resistivity and regulating the dynamical evolution of dense cores towards their collapse into protostars. We propose that measurements of ion-neutral drift velocities provide new constraints on the total magnetic field strength and the dust size distribution within prestellar cores.

0

5

hep-lat 2026-05-22 2 theorems

Lattice QCD yields first full form factors for rare kaon decay

by R. Di Palma, R. Frezzotti +7 more

Complete lattice QCD calculation of K⁻to ell⁻bar{ν}_(ell)ell^('+)ell^('-) form factors

Physical-mass ensembles and spectral reconstruction control errors across all four lepton channels

abstract click to expand

We present the first complete lattice QCD calculation of the four structure-dependent form factors governing the rare charged kaon decay $K^- \to \ell^- \bar{\nu}_\ell \ell'^+ \ell'^-$, with fully controlled statistical and systematic uncertainties. Our calculation is based on gauge ensembles generated by the Extended Twisted Mass Collaboration (ETMC) with $N_f = 2+1+1$ flavors of Wilson-clover twisted-mass fermions. Simulations are performed directly at the physical values of the light and strange quark masses, and include an estimate of the quark-disconnected contributions in which the virtual photon couples to sea quarks. All four form factors are determined across the kinematical region probed by experiments. The Spectral Function Reconstruction (SFR) method of Ref. [1] is employed to overcome the analytic continuation problem for dilepton invariant masses above the two-pion threshold. Finite-volume effects are investigated using ensembles with spatial extents $L\simeq [3.8,7.6]~\mathrm{fm}$, while the continuum limit is obtained from three lattice spacings in the range $a\in[0.057, 0.08]~\mathrm{fm}$. Our results for the form factors enable the evaluation of decay rates and differential observables for all four channels, $K^- \to e^- \bar{\nu}_e e^+ e^-$, $K^- \to e^- \bar{\nu}_e \mu^+ \mu^-$, $K^- \to \mu^- \bar{\nu}_\mu e^+ e^-$, and $K^- \to \mu^- \bar{\nu}_\mu \mu^+ \mu^-$, thereby providing first-principles Standard Model predictions against which existing and upcoming measurements can be directly compared. A detailed phenomenological analysis of the decay rates and associated observables is presented in a companion paper [2].

0

5

cond-mat.soft 2026-05-22 2 theorems

Exact solution shows no critical point wetting in fluid mixtures

by A.O. Parry, C. Rascón

The exact solution of the Koga-Widom-Indekeu model and related models of wetting in fluid mixtures

Local XY symmetry governs the absence of critical point wetting up to critical end points in the KWI model and variants.

abstract click to expand

We show how a broad class of two-component square-gradient models of wetting may be solved exactly for the surface tensions and density profile paths, and clarify how the presence or absence of critical point wetting, in binary and ternary mixtures, is related to universality and symmetry principles at critical end points. We begin by solving a model of fluid interfaces, first introduced by Koga and Widom, in ternary mixtures showing three phase coexistence. Numerical studies had revealed interesting wetting transitions, as well as curious geometrical properties of the profile paths in the density plane, and led these authors to conjecture expressions for the surface tensions. These conjectures were extended by Koga and Indekeu and predicted that partial wetting may persist up to the line of critical end points, i.e. critical point wetting was absent. Here, we obtain the exact density profiles and surface tensions for the Koga-Widom-Indekeu (KWI) model using complex analysis and drawing on the theory of algebraic curves. The exact solution determines the location and order of wetting transitions in the surface phase diagram, confirming that critical point wetting is absent. The model also displays the remarkable property that microscopic density profiles are mapped, by a conformal transform, onto the shape of a macroscopic drop near the contact line whose tensions satisfy the Neumann triangle. Two related models, which illustrate the role of the component isotropy, are also discussed. These models suggest that a universality principle governs wetting in fluid mixtures, resolving contradicting results from earlier studies: Critical point wetting is present if the order-parameter components of the mixture describe Ising-like criticality, but is absent if there is a local XY symmetry. Implications for wetting transitions in more microscopic models and in experiments are discussed.

0

5

math.GT 2026-05-22 2 theorems

Four elements generate Goeritz groups of S^3 Heegaard splittings

by Daiki Iguchi

A proof of Powell's conjecture on the Goeritz group of S³

Powell's conjecture holds for every genus g at least 3, proved via topological minimality of the splitting surface.

abstract click to expand

For a genus $g$ Heegaard splitting of the $3$-sphere, the Goeritz group is defined to be the group of isotopy classes of diffeomorphisms of the $3$-sphere that preserve the splitting setwise. In this paper, we prove the following conjecture proposed by Powell: For every $g \ge 3$, the Goeritz group of a genus $g$ Heegaard splitting is generated by four specific elements. Our proof relies crucially on the fact that a Heegaard surface of the $3$-sphere is topologically minimal, that is, its disk complex has nontrivial homotopy group in some dimension. Along the way, we also give a new proof of the fact that a genus $g$ Heegaard surface of the $3$-sphere has topological index $2g-1$.

0

5

math.AG 2026-05-22 2 theorems

Integrable observables prove Π-hierarchy equivalences

by Xavier Blot, Danilo Lewański +1 more

Beyond descendants: integrable observables for cohomological field theories

They replace psi classes while keeping integrability, establish Miura links to Dubrovin-Zhang and ramification hierarchies, and give a short

abstract click to expand

We introduce the concept of integrable observables and propose them as alternatives to the standard Witten's psi classes (a.k.a. descendants in $2D$ quantum gravity) to be coupled with cohomological field theories and their generalisations. The main property of integrable observables is that they retain the integrability properties. We present three examples of integrable observables. The first two recover the Dubrovin-Zhang and double ramification hierarchies, while revealing new structural features in this framework. The third, a new example, builds on recently established properties of the so-called $\mathbb{\Pi}$-class, extending them and placing this class naturally within the theory of integrable systems. Notably, our integrable observables framework yields a proof that the new $\mathbb{\Pi}$-hierarchies are Miura equivalent both to the Dubrovin-Zhang hierarchies and to the double ramification hierarchies. A new very short proof of Witten's conjecture is also provided.

0

5

math.SG 2026-05-22 2 theorems

Any two (p,q)-pinwheel embeddings in B_{p,q} are Hamiltonian isotopic

by Nikolas Adaloglou, Gerard Bargalló i Gómez +1 more

The nearby Lagrangian conjecture for pinwheels

The nearby Lagrangian conjecture holds for this class of immersed singular Lagrangians inside rational homology balls.

abstract click to expand

The Lagrangian skeleton of the rational homology ball $B_{p,q}$, for $0<q<p$ coprime integers, is an immersed but not embedded Lagrangian, called a $(p,q)$-pinwheel. We show that any two embeddings of Lagrangian $(p,q)$-pinwheels in $B_{p,q}$ are related by a compactly supported Hamiltonian isotopy, establishing Arnold's nearby Lagrangian conjecture for this wide class of singular Lagrangians. Our proof has two largely independent parts: the first uses neck-stretching and the symplectic rational blow-up to understand embeddings of pinwheels up to symplectomorphism; the second computes that $\text{Symp}_c(B_{p,q})$ is generated by a twist about the pinwheel, which we call the pintwist $\tau_{p,q}$. We provide three applications of our methods: Gromov non-squeezing for pin-balls; a new proof of the local Lagrangian unknotting theorem of Eliashberg--Polterovich; and that the only Lagrangian $(n,m)$-pinwheel in $B_{p,q}$ is of type $(p,q)$.

0

5

math.SG 2026-05-22 Recognition

All (p,q)-pinwheel embeddings in B_{p,q} are Hamiltonian isotopic

by Nikolas Adaloglou, Gerard Bargalló i Gómez +1 more

The nearby Lagrangian conjecture for pinwheels

The nearby Lagrangian conjecture holds for these singular Lagrangians because the symplectomorphism group is generated by a single twist.

abstract click to expand

The Lagrangian skeleton of the rational homology ball $B_{p,q}$, for $0<q<p$ coprime integers, is an immersed but not embedded Lagrangian, called a $(p,q)$-pinwheel. We show that any two embeddings of Lagrangian $(p,q)$-pinwheels in $B_{p,q}$ are related by a compactly supported Hamiltonian isotopy, establishing Arnold's nearby Lagrangian conjecture for this wide class of singular Lagrangians. Our proof has two largely independent parts: the first uses neck-stretching and the symplectic rational blow-up to understand embeddings of pinwheels up to symplectomorphism; the second computes that $\text{Symp}_c(B_{p,q})$ is generated by a twist about the pinwheel, which we call the pintwist $\tau_{p,q}$. We provide three applications of our methods: Gromov non-squeezing for pin-balls; a new proof of the local Lagrangian unknotting theorem of Eliashberg--Polterovich; and that the only Lagrangian $(n,m)$-pinwheel in $B_{p,q}$ is of type $(p,q)$.

0

4

quant-ph 2026-05-21 2 theorems

Optimal QPA needs input copies scaling only with outputs and gap

by Zhaoyi Li, Elias Theil +2 more

Quantum Purity Amplification for Arbitrary Eigenstates and Multiple Outputs

For constant eigenvalue gap the copy count is independent of local dimension and follows explicit phase laws when outputs grow with inputs.

abstract click to expand

Quantum purity amplification (QPA) is the task of coherently transforming $n$ copies of a mixed state into high-fidelity copies of a chosen eigenstate. We solve QPA in the general setting of $n$ input copies, $m$ output copies, arbitrary target eigenstates, arbitrary local dimension $d$, and generic input spectra. We characterize the optimal channel and derive its all-site and one-site performance laws across output regimes. For the asymptotic analysis, we use a path-graph parametrization to show that, when the target eigenvalue has a constant spectral gap $D_{k,\mathrm{min}}$, achieving all-site error $\varepsilon$ requires a number of input copies independent of $d$ and scaling as $O(m/(\varepsilon D_{k,\mathrm{min}}^2))$. When $m/n$ approaches a constant, the performance exhibits phase-like regimes, which we characterize explicitly. For the nonasymptotic analysis, we develop a theory of generalized Young diagrams that yields tight sample complexity bounds and provides the first dimension-uniform guarantee for optimal QPA. We also provide asymptotically efficient implementations of the optimal protocol. Together, these results establish QPA as a rigorous example of coherent quantum information processing with dimension-uniform sample complexity, supplying the technical foundation for the coherent-incoherent separation developed in the companion work.

0

4

cs.LG 2026-05-21 2 theorems

Entmax turns KV cache truncation into exact support recovery

by Gonçalo Duarte, Miguel Couceiro +1 more

EntmaxKV: Support-Aware Decoding for Entmax Attention

When selected pages capture the entmax support, sparse decoding matches the full version exactly and error vanishes with the dropped mass.

abstract click to expand

Long-context decoding is increasingly limited by KV-cache memory traffic since each generated token attends over a cache whose size grows linearly with context length. Existing sparse decoding methods reduce this cost by selecting subsets of tokens or pages, but are designed for softmax attention, whose dense tails make any truncation discard nonzero probability mass. In contrast, $\alpha$-entmax produces exact zeros, turning sparse decoding from dense-tail approximation into support recovery: if the selected candidates contain the entmax support, sparse decoding remains exact. While recent entmax kernels enable efficient training, they do not address the autoregressive decoding bottleneck, where dense inference still streams the full KV cache before sparsity is known. In this work, we introduce EntmaxKV, an entmax-native sparse decoding framework that exploits sparsity before KV pages are loaded. EntmaxKV combines query-aware page scoring, support-aware candidate selection, and sparse entmax attention. We analyze truncation error through the dropped probability mass $\delta$, showing that output error is controlled by $\delta$ and vanishes when the entmax support is recovered. We further introduce a Gaussian-aware entmax selector that estimates the entmax threshold from lightweight page statistics, adapting the selected budget to the score distribution. Empirically, EntmaxKV drops less probability mass, retains more support tokens, and achieves lower output error than softmax-based sparse decoding at matched KV budgets. On long-context and language modeling benchmarks, it closely matches full-cache entmax while using a small fraction of the KV cache, achieving up to $3.36\times$ (softmax) and $5.43\times$ (entmax) speedup over full attention baselines at 1M context length. Code available at: https://github.com/deep-spin/entmaxkv.

0

4

gr-qc 2026-05-21 2 theorems

Modified gravity changes low-frequency gravitational-wave lensing

by Alice Garoffolo, Gianmassimo Tasinato

Wave-optics gravitational wave lensing in modified gravity

A curvature-coupled propagation equation prevents the amplification factor from reaching unity at zero frequency.

abstract click to expand

We initiate the study of gravitational-wave lensing in the wave-optics regime within modified gravity. We consider a phenomenological setup in which the gravitational-wave amplitude obeys a curvature-coupled propagation equation. This framework reproduces the standard GR behaviour in the geometric-optics regime, while leading to qualitatively different infrared dynamics. In particular, the usual argument implying that the amplification factor approaches unity in the zero-frequency limit no longer applies. This is due to the persistence of curvature-induced interactions in the infrared, which modify the natural propagation basis itself. As a result, the standard Fresnel treatment ceases to be valid at sufficiently low frequency. The correct infrared regime is instead controlled by an interacting static Green function, with a finite-frequency completion provided by a partial-wave formulation. We show that this structure admits an equivalent distorted-wave interpretation, in which the curvature interaction is absorbed into a dressed reference propagation basis, while the residual lensing effect is encoded in finite-frequency phase shifts. We further demonstrate that these phenomena admit a natural interpretation in the language of scattering amplitudes. Wave-optics lensing can therefore probe propagation-level departures from GR that remain entirely invisible in geometric optics.

0

4

cs.DS 2026-05-21 2 theorems

Generalized Thresholding Mechanism tests DP mechanisms near-optimally

by Anamay Chaturvedi, Monika Henzinger +1 more

Near-Optimal Generalized Private Testing

It accepts the first sufficiently successful mechanism from a sequence, rejects the rest, and uses a bounded number of evaluations while保持纯ε

abstract click to expand

In differential privacy (DP), the generalized private testing problem was introduced by Liu and Talwar (STOC 2019). Given a dataset $X \in \mathcal{X}$ and a sequence of black-box $\varepsilon_t$-DP mechanisms $M_t:\mathcal{X}\to\{+1,-1\}$, the analyst must accept the first mechanism whose success probability $p_t=\Pr[M_t(X)=+1]$ exceeds a given threshold $p^*\in(0,1)$, while achieving DP. Accuracy is measured by the gap between $p^*$ and a rejection threshold $\bar{p}$, such that with probability $1-\beta$ for all $t\geq1$, if $p_t\leq\bar{p}$, then $M_t$ is rejected, and if $p_t\geq p^*$, then it is accepted. This generalizes the standard private testing problem, whose solution, the Sparse Vector Technique, is ubiquitous in DP. We introduce the Generalized Thresholding Mechanism (GTM) for generalized private testing. For $\varepsilon>0$ and any sequence of $(\varepsilon_t,\delta_t)$-DP mechanisms $M_t$, the GTM is pure $\varepsilon$-DP. For $\theta>0$, $\gamma\in(1,2]$, and $\beta\in(0,1)$, $\bar{p}_t=\max(p^*/\gamma\Lambda_t, 1 - \gamma\Lambda_t(1-p^*))-\delta_t/\varepsilon_t$ for $\Lambda_t=(5t\ln^3(t+2))^{(2+\theta)\varepsilon_t/\varepsilon}(4/\beta)^{(3+\theta+2/\theta)\varepsilon_t/\varepsilon}$. With probability $1-\beta$, the number of evaluations of $M_t$ is at most $O((\ln(t/\beta)/(\gamma-1)^2)\max(\Lambda_t/p^*,(1-p^*)^{-1}))$ for all $t\geq 1$. Our lower bounds prove near-optimality of our accuracy and sample complexity guarantees. Via the GTM, we give a black-box reduction for DP optimization from the continual observation (CO) setting to the batch setting. This gives us the first DP-CO algorithms for many maximization problems. Further, the GTM permits an adaptive choice of acceptance thresholds $(p^*_t)_{t\geq1}$, addressing a challenge mentioned in prior work on using generalized private testing for hyperparameter optimization (Papernot and Steinke (ICLR 2022)).

0

4

cs.SE 2026-05-21 2 theorems

Refusal rate misranks LLMs on bio safety

by Lukas Weidener, Marko Brkić +3 more

RefusalBench: Why Refusal Rate Misranks Frontier LLMs on Biological Research Prompts

Matched prompts show top risk discriminators often refuse fewer queries than high-refusal peers.

abstract click to expand

Frontier large language models are increasingly deployed as orchestration backbones for biological research workflows, yet no shared evidence base exists for comparing their refusal behaviour on legitimate research prompts. RefusalBench, introduced here, is a matched-triple benchmark of 141 prompts in 47 bundles that holds task framing constant while varying only biological risk tier (benign, borderline, dual-use), enabling tier-conditioned comparisons robust to subdomain confounding. A 15-prompt should-refuse positive-control module establishes per-model calibration floors; three models fail to refuse even these prompts. Across 19 frontier models in the May 2026 snapshot, strict refusal rates span 0.1% to 94.6% on identical prompts. Jurisdiction does not predict refusal in this snapshot (Mann-Whitney U, p = 0.393; EU n = 1, US bimodal); provider identity does, with Anthropic's API stack predicting refusal at OR = 21.03 (95% CI: 14.58-30.34 prompt-clustered; 5.70-77.55 under model-clustered GEE). This effect is best read as access-path-level rather than model-weight-level: 99.8% of Anthropic's strict refusals carry the same safety_policy adjudicated reason code, consistent with a small set of canonical refusal templates rather than case-by-case model reasoning. Strict refusal rate misranks safety calibration: Grok 4.20 achieves the highest tier discrimination (Youden's J = 0.787) while ranking only seventh by overall refusal rate, and Claude Opus 4.7's J dropped 65% from prior versions with no improvement in dual-use detection. Nine of 18 frontier models exhibit a hedge-but-help partial-compliance pattern at dual-use tier that binary refusal metrics cannot detect.

0

4

cs.CC 2026-05-21 2 theorems

Any sequence reduces to a poly-time random one in quasi-polynomial time

by Satyadev Nandakumar, Akhil S +1 more

Resource bounded Kuv{c}era-G\'{a}cs Theorems

The reduction uses only n plus little-o-n oracle bits and equates decompression ratios to Kolmogorov complexity rates.

abstract click to expand

The Ku\v{c}era--G\'{a}cs theorem is a fundamental result in algorithmic randomness. It states that every infinite sequence $X$ is Turing reducible to a Martin-L\"of random $R$. This paper studies resource-bounded analogues of the Ku\v{c}era-G\'acs Theorem, at the resource bounds of polynomial-time and finite-state computation. We prove a {quasi-polynomial-time}{ Ku\v{c}era-G\'acs Theorem}, showing that every infinite sequence $X$ is quasi-polynomial-time reducible to a \emph{polynomial-time random} sequence $R$. We also show that for any $X$, the oracle use of $R$ is $n+o(n)$ bits for obtaining the first $n$ bits of $X$. We then study the relationship between compressibility and Turing reductions, in the polynomial-time setting. We establish that $\rho^-_{\mathsf{poly}}(X) = K_{poly}(X)$, demonstrating that the lower polynomial-time Turing decompression ratio is precisely characterized by the polynomial-time Kolmogorov complexity rate. We note that this characterization fails for the polynomial-time dimension if one-way functions exist, resolving an open problem from Doty's work. We use these results to strengthen the {quasi-polynomial-time}{ Ku\v{c}era-G\'acs Theorem}. We show that every infinite sequence $X$ is quasi-polynomial-time reducible to a {polynomial-time random} sequence $R$, where the lower oracle use rate of the reduction is less than ${K}_{poly}(X)$. We also show that any sequence extracted from the (even larger) set of \emph{normal sequences} by a finite-state reduction must have a convergent asymptotic frequency for its symbols. Since sequences lacking this invariant property exist, they cannot be finite-state reduced from any normal sequence. Hence we show that the Ku\v{c}era-G\'acs theorem \emph{fails} for finite-state reductions.

0

3

cs.CV 2026-05-21 2 theorems

Context rewrite lifts 3D grounding accuracy by up to 22 points

by Anna Deichler, Jim O'Regan +5 more

MM-Conv: A Multimodal Dataset and Benchmark for Context-Aware Grounding in 3D Dialogue

Two-stage pipeline resolves dialogue ambiguity before detecting objects in dynamic VR scenes

abstract click to expand

Grounding language in the physical world requires AI systems to interpret references that emerge dynamically during conversation. While current vision-language models (VLMs) excel at static image tasks, they struggle to resolve ambiguous expressions in spontaneous, multi-turn dialogue. We address this gap by introducing (1) a benchmark for referential communication in dynamic 3D environments, built from 6.7 hours of egocentric VR interaction with synchronized speech, motion, gaze, and 3D scene geometry, and (2) a two-stage grounding pipeline that explicitly resolves conversational ambiguity before visual localization. The benchmark includes over 4,200 manually verified referring expressions spanning full, partitive, and pronominal types. Our contextual rewriting approach improves grounding performance by 11-22 percentage points on average, with a pure detector (GroundingDINO) reaching 56.7% on pronominals after rewriting, nearly double the best end-to-end baseline. Results demonstrate that decoupling linguistic reasoning from visual perception is more effective than end-to-end approaches for conversational grounding.

1 0

3

cond-mat.mes-hall 2026-05-22 2 theorems

Quantum metric drives Bloch oscillations without Berry curvature

by M. Maneesh Kumar, Md Kaif Faiyaz +2 more

Quantum-metric Bloch oscillations in weakly inhomogeneous electric fields

A weak electric-field gradient produces real-space oscillations via the quantum metric even when curvature vanishes.

abstract click to expand

Geometric analogs of Bloch oscillations studied so far have relied on Berry curvature. We show that a weakly inhomogeneous electric field adds a distinct quantum-metric term to semiclassical wavepacket dynamics, generating an oscillatory real-space contribution even when the Berry curvature vanishes. The associated transport response comprises an intrinsic and a scattering-time-dependent part. In the regime studied, the latter can dominate and approach finite saturation at high field when the relative field inhomogeneity is held fixed. A tilted Dirac model illustrates the mechanism. Realistic platforms will likely require synthetically engineered superlattices, with a finite quantum metric and an adequate band gap.

1 0

3

physics.optics 2026-05-22 2 theorems

Two-photon absorption at silicon enables 40x bandwidth MIR ghost imaging

by Ziyu He, Kun Huang +4 more

Mid-infrared temporal ghost imaging via two-photon structured encoding

Compact system transfers near-IR modulation to mid-IR signals for high-sensitivity detection across 2.5-3.8 μm without crystals.

abstract click to expand

Temporal ghost imaging (TGI) enables ultrafast signal reconstruction beyond electronic bandwidth limits. Extending this concept to the mid-infrared (MIR) regime through nonlinear frequency conversion offers new opportunities for high-fidelity temporal detection, but remains constrained by stringent phase-matching condition, limited spectral coverage, and intricate optical alignment. Here, we propose and demonstrate a broadband MIR TGI system based on non-degenerate two-photon absorption. A temporally encoded near-infrared pump transfers structured modulation onto a MIR signal directly at a silicon detector, which facilitates concurrent modulation and detection without external nonlinear crystals. The reconstructed temporal waveforms exceed the detector bandwidth by more than fortyfold, achieve a detection sensitivity of 0.05 pJ/pulse, allow compressed sensing with 80\% fewer measurements, and support broadband operation across 2.5-3.8 $\mu$m. This compact, alignment-free, and room-temperature system establishes a practical route for fast and sensitive MIR time-domain analysis, holding great promise for applications in time-resolved molecular spectroscopy, high-precision infrared ranging, and high-speed free-space communication.

1 0

8

math.CO 2026-05-18 3 theorems

Weak isomorphisms of graphings reduce to countable Whitney operations

by Márton Borbényi, Grigory Terlov +1 more

Whitney's 2-isomorphism theorem for graphings

This extends classical graph theory to give the first general condition for when two graphings are isomorphic in the measurable setting.

abstract click to expand

We prove measurable analogues of Whitney's classical theorems on weak isomorphisms of finite graphs. In the setting of locally finite graphings, we introduce a notion of weak isomorphism as an edge-measure-preserving Borel bijection that preserves cycles and hyperfinite subgraphs, modulo null sets. We first show a rigidity theorem, proving that for weakly 3-connected infinitely-ended graphings, every weak isomorphism is induced by an isomorphism of graphings. To our knowledge, this gives the first general sufficient condition in measurable combinatorics for the existence of an isomorphism between two given graphings. Next, we give a full measurable version of Whitney's theorem, showing that every weak isomorphism between graphings can be implemented by countably many measurable Whitney operations, which we introduce in this setting. The proofs require new measurable-combinatorial tools, including a careful analysis of infinitely-ended subforests. This work further develops the limit theory of matroids recently initiated by Lov\'asz.

0

5

cs.CV 2026-05-20 2 theorems

AI models lag behind text-only on 3D brain MRI benchmark

by Mohammad H. Abbasi, Favour Nerrise +13 more

NeuroQA: A Large-Scale Image-Grounded Benchmark for 3D Brain MRI Understanding

Top vision-language models reach only 47.5 percent on verified questions while text statistics yield 49.4 percent.

abstract click to expand

We present NeuroQA, a large-scale benchmark for visual question answering in 3D brain magnetic resonance imaging (MRI), with 56,953 QA pairs from 12,977 subjects across 12 datasets. It spans ages 5-104 and five clinical domains: Alzheimer's, Parkinson's, tumors, white matter disease, and neurodevelopment. Unlike prior medical Visual Question Answering (VQA) efforts that operate on 2D slices or rely on narrow diagnostic labels, NeuroQA pairs every item with a full 3D volume. It evaluates 11 clinically grounded reasoning skills across Yes/No, multiple-choice, and open-ended formats. Of the 203 templates, 131 are image-grounded (answerable from a 3-plane viewer) and 72 are image-informed (ground truth from quantitative volumetry or clinical instruments). To remove text-only shortcuts, we apply answer-distribution refinement, reducing closed-format text-only accuracy from $>$80% to 44.6%; image necessity is assessed separately through an image-grounding protocol released with the benchmark. A 38-rule deterministic pipeline and two rounds of expert review verify every QA pair against FreeSurfer measurements, metadata, or radiology report fields, with zero same-subject contradictions across templates. We conduct a clinician evaluation in which two clinicians independently assess 100 frozen test items on a three-plane viewer. On closed-format (Yes/No + multiple-choice) test-public items, the best zero-shot vision-language model and a supervised 3D CNN baseline reach 47.5% and 43.7% accuracy respectively, both below the 49.4% text-only majority-template floor. NeuroQA adopts a two-tier release with public QA pairs for open-access datasets and reproducible generation scripts for datasets restricted by data use agreements (DUAs), plus subject-level splits, a held-out private test set, and an online leaderboard.

0

5

stat.ML 2026-05-20 2 theorems

Contradiction graph decides VC dimension threshold for any m

by Jesse Campbell, Daniel Ibaibarriaga +1 more

Contradiction Graphs Determine VC Dimension

Vertices are realizable label sequences of length m; edges mark label disagreements on shared points, fixing whether dimension meets or tops

abstract click to expand

We study the contradiction graphs associated with binary concept classes. For a class $H \subseteq \{0,1\}^X$, the order-$m$ contradiction graph $G_m(H)$ has as vertices the $H$-realizable labeled sequences of length $m$, with two vertices adjacent when the two sequences assign opposite labels to some common domain point. Our main result is that the single graph $G_m(H)$ determines the threshold predicate $\mathrm{VCdim}(H)\ge m$. Consequently, the full sequence $(G_m(H))_{m \ge 1}$ determines the exact VC dimension and, in particular, detects finite versus infinite VC dimension, answering a question posed by Alon et al. (2024).

0

5

quant-ph 2026-05-21 2 theorems

Coherent inference reaches ε error with O(1/ε) copies vs Ω(d/ε) incoherent

by Zhaoyi Li, Elias Theil +2 more

An Exponential Sample-Complexity Advantage for Coherent Quantum Inference

For d-dimensional purity amplification, preserving output coherence cuts the required input copies by a factor linear in dimension.

abstract click to expand

Standard quantum inference converts quantum data into classical outputs. We study an alternative inference setting in which the desired output is quantum, preserving coherence. Such settings include quantum purity amplification (QPA), mixed-state approximate purification or cloning, and density matrix exponentiation. We show that such protocols can achieve exponentially lower sample complexity than incoherent, measurement-mediated protocols. For QPA with principal eigenstate targets and $d$-dimensional inputs, coherent processing achieves error $\varepsilon$ using $O(1/\varepsilon)$ copies, versus the $\Omega(d/\varepsilon)$ copies required by any incoherent protocol. Together, these sharp coherent-incoherent separations seed a theory of coherent quantum inference, with an entanglement-breaking limit identifying the optimal incoherent counterpart of each coherent protocol.

0

5

math.CO 2026-05-21 2 theorems

Odd type D has exactly 2^r-1 rational Weyl group elements

by Yutong Zhang, Yaoran Yang

Rational Weyl group elements of odd type D

They are the longest element plus two signed cyclic families indexed by subsets, forming two Boolean halves joined only at w0.

abstract click to expand

Voloshyn introduced rational Weyl group elements in connection with rational normal forms on complex reductive groups and conjectured that, in type $D_r$ with $r$ odd, their number is $2^r-1$. We prove a stronger structural statement. For $r\geq 5$ odd, the rational Weyl group elements in $W(D_r)$ are exactly the longest element $w_0$ together with two explicitly described signed cyclic elements $c_I$ and $d_I$ for every non-empty subset $I\subseteq\{1,\ldots,r-1\}$. Consequently the rationality graph $\Gamma(D_r)$ is two explicitly labelled Boolean-type halves glued at $w_0$, its number of vertices is $2^r-1$, and its only vertices of valency one are $c_{\{1\}}$ and $d_{\{1\}}$. The proof combines an acyclic two-level description of the rationality graphs $\Gamma(c_I)$ with a rigidity argument for all one-step rational descents from $w_0$. The latter uses Voloshyn's descent lemma, while all type-$D$ exclusions are given by explicit loops or two-cycles in the root-poset rationality graph.

0

5

math.PR 2026-05-21 2 theorems

Particles stay close to mean-field limit for all time in second-order CBO

by Seung-Yeal Ha, Franca Hoffmann +1 more

Uniform-in-time propagation of chaos for Second-Order Consensus-Based Optimization

First uniform-in-time propagation of chaos result gives Monte Carlo rate without time restriction for derivative-free optimization.

abstract click to expand

We study second-order Consensus-Based Optimization (CBO), a derivative-free global optimization algorithm in which the consensus force and the multiplicative exploratory noise act on particle velocities. We prove quantitative uniform-in-time propagation of chaos for the unmodified second-order CBO dynamics, together with an almost uniform-in-time stability estimate for the microscopic particle system. The proof is not a direct adaptation of the first-order CBO argument. Although both first- and second-order CBO have multiplicative noise that degenerates near consensus and a shift-invariant weighted interaction, the kinetic model has an additional structural obstruction: the consensus mechanism and the stochastic forcing act only on the velocity variable, while the position variable evolves by transport. Thus spatial concentration has to be recovered indirectly through velocity dissipation. Moreover, the shift-invariant interaction leaves a translation mode that is not directly damped by the consensus force, so a standard synchronous coupling in the Euclidean phase-space distance does not close uniformly in time. The main idea of the paper is to introduce shifted internal variables that separate the contracting fluctuation modes from the undamped translation mode. In these variables we build a Lyapunov functional with a position-velocity cross term and prove exponential decay of centered moments. This decay is the mechanism that makes the time-dependent coupling coefficient integrable. Combining it with uniform-in-time raw moment bounds, concentration inequalities, stability estimates for the weighted mean, and a Monte Carlo estimate, we obtain the classical Monte Carlo rate for propagation of chaos uniformly in time. The system-to-system stability estimate avoids the sampling error and yields the faster rate $O(J^{-q})$.

0

5

math.NT 2026-05-21 2 theorems

Valuation criterion fixes Zadoff-Chu equivalence rule

by Yutong Zhang, Yaoran Yang

A Local Valuation Criterion for Quadratic-Permutation Interleaved Zadoff--Chu Sequences

Checking the quadratic coefficient's divisibility by prime powers determines equivalence for any length and corrects the earlier conjecture.

abstract click to expand

Berggren and Popovi\'c introduced quadratic-permutation-polynomial interleaved Zadoff--Chu sequences and, from exhaustive data, conjectured that all normalized QPP-interleaved Zadoff--Chu sequences are inequivalent to ordinary Zadoff--Chu sequences precisely for prime-power lengths $N=p^n$ with $p>3$ and $n>1$. We give an exact local arithmetic criterion. For a normalized QPP $\pi_{a,b}(k)=ak^2+bk\pmod N$, the interleaved sequence is equivalent, under the standard five CAZAC-preserving operations, to a Zadoff--Chu sequence if and only if, for every prime power $p^\alpha\Vert N$, the valuation of $a$ satisfies \[ \nu_p(a)\ge \begin{cases} 0, & p=2,\ \alpha=1,\\ \alpha-1, & p=2,\ \alpha\ge2,\\ \alpha-1, & p=3,\\ \alpha, & p>3. \end{cases} \] The proof is based on a third finite-difference invariant of the lifted Zadoff--Chu phase, namely \[ \Delta^3\bigl((ak^2+bk+\varepsilon_N+2q)(ak^2+bk)\bigr) =12a(2ak+3a+b). \] As a consequence, the conjectured prime-power boundary is not correct: the exact non-vacuous condition for all nonzero normalized QPPs to be inequivalent to Zadoff--Chu sequences is that $N$ is odd, $9\nmid N$, and $p^2\mid N$ for at least one prime $p\ge5$. In particular, $N=75=3\cdot5^2$ is the smallest non-prime-power counterexample to the conjectured ``only if'' direction. A second corollary records the corresponding statement for irreducible QPPs.

0

5

math.CA 2026-05-21 2 theorems

Bessel integral bound holds uniformly for all gamma in (0,1)

by Yaoran Yang, Yutong Zhang

Weighted Uniform Endpoint Majorants for Integrals Involving Modified Bessel Functions

The endpoint majorant controls the integral from zero to x with an explicit constant, removing the earlier small-gamma restriction and fully

abstract click to expand

We give an affirmative full-range solution to Gaunt's 2019 Open Problem~2.10. The problem asks whether, for every $\nu>-1/2$ and $0<\gamma<1$, the reciprocal-power integral $\int_0^x e^{-\gamma t}I_\nu(t)t^{-\nu}\,\dd t$ is bounded by a constant multiple of $e^{-\gamma x}I_{\nu+1}(x)x^{-\nu}$, uniformly for all $x>0$. Earlier exponential-tilt estimates proved such endpoint majorants only under an additional smallness condition on $\gamma$. We prove the estimate throughout the natural range $0<\gamma<1$, with an explicit admissible constant. More generally, if $\mu>-1$, $q>-1$, $0<\gamma<1$, and $w(x)x^{-q}$ is nondecreasing on $(0,\infty)$, then for every $\theta\in(\gamma,1)$, $\int_0^x e^{-\gamma t}w(t)t^{-\mu}I_\mu(t)\,\dd t$ is controlled by an explicit multiple of $e^{-\gamma x}w(x)x^{-\mu}I_{\mu+1}(x)$. The case $w\equiv1$, $q=0$, and $\mu=\nu$ resolves Gaunt's problem. The weighted theorem also yields shifted-order and moment estimates, applies to approximate power weights and monotone regularly varying amplitudes, and provides two-sided estimates under a reversed comparison. We further analyze the sharp power-weighted quotient via endpoint expansions, a stationary equation, and parameter monotonicity.

0

5

nlin.SI 2026-05-21 2 theorems

Polynomial Hamiltonians yield meromorphic solutions only for degrees 3,4,5,7

by Marta Dell'Atti, Thomas Kecker

Modified Painlev\'e systems with meromorphic solutions for polynomial Hamiltonians of all degrees

Twelve standard forms are obtained, including new quartic and quintic examples, for use in the Painlevé equivalence problem.

abstract click to expand

We review non-autonomous Hamiltonian systems, polynomial in two dependent variables, with the property that all of their solutions are meromorphic functions in the complex plane. These are related to known Hamiltonian systems with the Painlev\'e property, for which the solutions are single-valued outside a set of fixed singularities. Our systems are equivalent to them in the absence of fixed singularities, and give modified Painlev\'e equations otherwise. Using the geometric approach by computing the Okamoto's spaces of initial conditions for certain Hamiltonian systems with general coefficient functions, we obtain differential constraints on these functions for the systems to have only meromorphic solutions. Guided by the Newton polygon of the Hamiltonian function, we obtain all such systems with polynomial Hamiltonian of degree three, four, five, and seven, up to affine equivalence in the dependent variables, while there are none for degree six or degree higher than seven. We thus obtain a list of 12 standard polynomial Hamiltonians that can serve as reference for the Painlev\'e equivalence problem. This list contains also some new Hamiltonians not previously written down, such as quartic Hamiltonians for Painlev\'e I and II, quartic Hamiltonians for the modified Painlev\'e III and V equations, a quintic Hamiltonian for Painlev\'e IV and quintic and septic Hamiltonians for a modified Painlev\'e VI equation.

0

4

cs.LG 2026-05-19 2 theorems

Wrapper gives pathwise risk control for updating LLMs

by Hamed Khosravi, Xiaoming Huo

Conformal Selective Acting: Anytime-Valid Risk Control for RLVR-Trained LLMs

CSA maintains per-round selective risk bounds under predictable updates without pooling across deployments.

abstract click to expand

A local specialist LLM, fine-tuned with reinforcement learning from verifiable rewards (RLVR) on operator-local data, is installed in a regulated organization with per-deployment error budget $\alpha$. The operator needs a safety certificate for this deployment's stream at every round: no pooling across deployments, no waiting for a long-run average. Existing wrappers cannot deliver this on adaptive, online-updated streams: offline conformal-risk methods require exchangeability; online-conformal methods bound only long-run averages; non-exchangeable extensions are marginally valid; and the closest anytime wrapper, A-RCPS, controls marginal rather than selective risk. Using a (test statistic, validity guarantee, deployment rule) framework, we identify one empty cell forced by deployment requirements: e-process per threshold, selective risk, anytime-pathwise validity, max-certified-threshold rule. Conformal Selective Acting (CSA) fills it as a per-round wrapper maintaining a Ville-type e-process per threshold on a Bonferroni grid, evaluated against the RLVR filtration. Under predictable updates and isotonic-calibrated monotone risk we prove (i) an anytime-pathwise selective-risk bound $R_T^{\mathrm{act}}\le\alpha+O(N_T^{-1/2})$, (ii) rate-optimal certification matching $\Theta(\bar\eta^{-2}\log(1/\delta))$, and (iii) a horizon-independent release-rate gap. Across eight specialist benchmarks ($480$ streams), sixteen adversarial distribution-shift cells ($160$ streams), and five live Expert-Iteration RLVR cells with online LoRA over four base models in three architecture families ($10{,}300$ rounds), CSA is the only method among ten compared that satisfies pathwise validity and non-refusing deployment on every cell. We do not propose a new LLM, training algorithm, or policy class; CSA is the deployment-side complement, orthogonal to the model, for operators who cannot use a frontier API.

0

4

cs.LG 2026-05-19 2 theorems

Low-rank bandits recover drifting subspaces from scalar rewards

by Hamed Khosravi, Xiaoming Huo

Catching a Moving Subspace: Low-Rank Bandits Beyond Stationarity

Three probe conditions suffice for identification and replace ambient d sqrt(T) regret with intrinsic r sqrt(T) plus detection cost.

abstract click to expand

Many bandit deployments (recommendation, clinical dosing, ad targeting) share two facts prior work handles only in isolation: rewards live on a low-dimensional latent subspace, and that subspace drifts. Stationary low-rank bandits exploit rank but break under subspace change; non-stationary linear bandits adapt to drift but pay ambient rate $\widetilde{O}(d\sqrt{T})$. We study piecewise-stationary low-rank linear contextual bandits with scalar feedback: $\theta_t = B_k^\star w_t$ with rank-$r$ factor $B_k^\star\in\mathbb{R}^{d\times r}$ constant within each of $K$ unknown segments and able to shift at boundaries. Our results are tight along three axes. (i) Identification boundary. With single-play scalar rewards, the moving subspace is recoverable through quadratic functionals of rewards iff three probe-side conditions hold: known noise variance, bounded state-noise coupling, and full-dimensional probe support. Each is necessary in the unrestricted-second-moment problem, and jointly they are sufficient, characterizing the boundary of the solvable region. (ii) Algorithm and dynamic regret. SPSC interleaves isotropic probes with windowed projected ridge-UCB exploitation inside the learned $r$-dimensional subspace; a CUSUM-style variant discovers segment boundaries online. The costed dynamic regret is $\widetilde{O}(r\sqrt{T})+\widetilde{O}(T^{2/3})+O(W\,V_{\mathrm{in}})$, replacing the ambient $d\sqrt{T}$ rate with the intrinsic rank. (iii) Empirics. On eleven benchmarks spanning synthetic, UCI/MovieLens, semi-synthetic clinical, and ZOZOTOWN production-log data, SPSC outperforms non-stationary and low-rank baselines whenever $d-r\gtrsim T^{1/6}$, matching the analytical crossover. To our knowledge, this is the first work to characterize the identification boundary and attain the intrinsic-rank dynamic-regret rate in this setting.

0

4

astro-ph.CO 2026-05-20 2 theorems

Photometric data alone yields first splashback mass function for clusters

by Lucas Gabriel-Silva, Laerte Sodré Jr

The Splashback Mass Function of Galaxy Clusters from Photometric Data

SDSS photometry locates cluster edges and produces abundances matching simulations at high masses.

abstract click to expand

The splashback radius marks the physical boundary of galaxy clusters, separating orbiting from infalling material, and provides a halo definition free from pseudo-evolution. In this work, we present a fully photometric framework to measure individual cluster splashback radii and masses, and to construct an observational splashback mass function. Using Sloan Digital Sky Survey data, we develop a probabilistic cluster membership method based on radial and photometric redshift information, optimized through an adaptive probability cut that maximizes the detection significance of the cluster core relative to its outskirts. We apply this methodology to a sample of 499 galaxy clusters from the \textsc{CoMaLit} weak-lensing compilation and recover splashback radii from modeling cumulative galaxy number profiles. The resulting splashback radii exhibit a median ratio $R_{\mathrm{sp}}/R_{200\mathrm{m}} \simeq 1.1$, consistent with previous observational studies. Using these measurements, we recalibrate the $M_{\mathrm{sp}}$--$R_{\mathrm{sp}}$ scaling relation over a wide redshift range ($0.01 < z < 0.8$), finding a slope shallower than the constant-density expectation and no significant redshift evolution. We then apply this relation to \textsc{redMaPPer} clusters in the SDSS Northern Galactic Cap to derive splashback masses for more than $1.5\times10^4$ systems and construct the first observational splashback mass function based solely on photometric data. The resulting mass function agrees with simulation-based predictions at the high-mass end, while deviations at lower masses are consistent with known completeness limits of optical cluster catalogs. Our results demonstrate that splashback-based cluster sizes, masses, and abundances can be robustly measured in photometric surveys, enabling cosmological studies without spectroscopic or lensing data.

0

Pre-print review for science