Expander SAEs apply left-d-regular expander masks to TopK SAEs, learning only dn decoder parameters instead of mn and tracing a storage-fidelity frontier that reaches 293x compression with 84% retained performance on Qwen2.5-3B.
Canonical reference
Title resolution pending
Canonical reference. 80% of citing Pith papers cite this work as background.
citation-role summary
citation-polarity summary
authors
co-cited works
representative citing papers
The normalized orbit of a bounded normal operator can be a frame, providing a counterexample to Conjecture 3.
Introduces the largest freely available Italian clinical notes corpus with 4M notes and expert-annotated subset for a new CRF-filling benchmark.
Machine learning methods discover a new noncrossing-partition statistic interpreting q,t-Narayana polynomials and yield a combinatorial proof of their symmetry.
SLayerGen generates crystals invariant to any space or layer group via autoregressive lattice and Wyckoff sampling plus equivariant diffusion, achieving gains over bulk models on diperiodic materials after correcting a prior loss inconsistency for hexagonal groups.
Every proper minor-closed graph class admits an optimal (1+o(1)) log n bit adjacency labeling scheme.
A directed weighted two-graph model separates feasibility from movement in solution discovery and yields a detailed complexity classification for path and shortest-path discovery.
The method reformulates ALE mesh motion as independent multi-patch spline parameterizations per time step, using barrier functions, tangential-slip reparameterization, and constant-preserving quasi-interpolation to enable large-rotation FSI simulations.
Superconductivity in high-pressure MnB4 is induced by altermagnetic spin fluctuations, yielding extended-s pairing symmetry.
A new qubit-efficient HUBO encoding for graph partitioning problems like minimum coloring uses logarithmic bits and a lexicographic penalty to cut resources while providing provable optimality conditions.
A survey of 172 open educational datasets from 204 papers across LAK, EDM, and AIED conferences reveals trends, 143 previously uncatalogued datasets, field gaps, and an 8-item PRACTICE checklist for better data publication.
A microlocal lift of Navier-Stokes dynamics on manifolds yields an if-and-only-if geometric criterion for solution blow-up in terms of deformation integrability, directional entropy, and lifted energy.
A 9U CubeSat detector can identify a thermonuclear weapon on a satellite from 4 km away by observing spallation neutrons induced by GeV protons in roughly one week.
O(n log n) algorithm and matching Omega(n log n) lower bound for partitioning a simple polygon's boundary into the minimum number of contiguous visible segments.
Introduces a method to design structure-specific relational inductive biases for a base transformer architecture, enabling end-to-end transcription of documents with intrinsic structures, demonstrated on sheet music, shape drawings, and mechanical engineering drawings.
The paper introduces a probabilistic sign rule for quotients of positive series and integral transforms that reduces monotonicity, log-supermodularity, and log-convexity to kernel sign criteria via moment identities, and applies it to derive new inequalities for hypergeometric, Stieltjes, and Prabha
SPoILeR uses multimodal pre-training to enable accurate novel view synthesis of infrared, polarimetric, and multispectral data from RGB-supervised fine-tuning on new scenes.
NEvo performs evolutionary search guided by a dynamic voxel-level encoding model to synthesize videos that maximize predicted activity in target brain ROIs, recovering known selectivities and revealing temporal dynamics differences.
Every n-vertex H-minor-free graph admits a 3-coloring with monochromatic components of size O_H(n^{4/9}).
The Spin-MInt algorithm is proven symplectic for general K electronic states via explicit verification of the condition MJM^T = J on the coadjoint orbit of the su(K) Lie-Poisson algebra.
The authors synthesize a typology of fourteen OSS sub-genres from a review of 3,925 papers and present a research agenda on cross-sub-genre generalization.
Structural identifiability analysis shows point sources restore identifiability for inferring spatial stochastic dynamics parameters from static snapshots, unlike distributed sources, with limits depending on modeling choices.
Maximal quantum leakage upper-bounds quantum inference accuracy; optimal encodings are pure states, with tight frames and equiangular tight frames optimal when system dimension is small.
In multistage SI(k)R models, the relationship between prevalence peak and weighted stage functional maxima varies with scaling of progression rates, converging under Erlang scaling to a delay model that justifies the factor-two approximation with error bounds and corrections.
citing papers explorer
-
Parameterization-driven arbitrary Lagrangian-Eulerian method for large-deformation isogeometric fluid-structure interaction
The method reformulates ALE mesh motion as independent multi-patch spline parameterizations per time step, using barrier functions, tangential-slip reparameterization, and constant-preserving quasi-interpolation to enable large-rotation FSI simulations.
-
Projection-Based Reconstruction for Achieving High-Order Accuracy from Low-Order DGSEM Simulations
A cP_n P_m scheme for DGSEM-LGL achieves m+1 convergence order via projected high-order components and a compact reconstruction operator that corrects the highest Legendre mode.
-
Entropy correction artificial viscosity for high order DG methods using multiple artificial viscosities
Multiple artificial viscosities with analytical optimal parameters enable more flexible entropy-stable DG simulations than monolithic viscosity for 1D and 2D problems.
-
A spectral-subspace-augmented POD-Galerkin method for parametrized PDEs with limited snapshot data
SS-POD augments standard POD-Galerkin with a spectral-subspace partition and local POD to achieve lower out-of-sample error than either plain POD or pure spectral-Galerkin when only a handful of snapshots are available.
-
Complementary families of approximating polynomials with applications to finite element methods applied to differential equations of arbitrary even spatial order
Complementary polynomial families are introduced to construct C^m finite element bases of order p >= 2m+2, with explicit formulas on canonical intervals, an extended interpolation error formula, and a superconvergence result for linear problems in H^{m+1}.
-
Adjoint-based Perfusion Estimation from Dynamic Contrast-Enhanced Ultrasound: Advection-Diffusion and Two-Compartment Models
Adjoint equations with Tikhonov regularization enable gradient-based reconstruction of perfusion parameters from DCE-US data in advection-diffusion and two-compartment models, validated on synthetic and in vivo data.
-
Kernel-based linear system identification using augmented Krylov subspaces
Augmented Krylov subspaces jointly approximate quadratic forms and log-dets for faster MLE-based hyperparameter tuning in kernel-based linear system identification.
-
Invariant domain preserving limiting of time explicit and time implicit discretizations for systems of conservation laws
A generalized flux-corrected transport limiter for systems of conservation laws enforces invariant domain preservation by expressing the high-order solution as a convex combination of low-order invariant-domain-preserving states, applicable to both explicit and implicit time discretizations.
-
On the Practical Impact of Local Linear Instabilities in Entropy-Stable Schemes
Local linear instabilities in entropy-stable discretizations cause negligible practical errors because their growth is small, oscillatory, boundary-localized, and suppressible, with no direct extension to nonlinear two-point-flux cases.
-
Adaptive anisotropic composite quadratures for residual minimisation in neural PDE approximations
An adaptive anisotropic composite quadrature strategy combined with refresh-based training narrows the gap between training and reference losses in neural residual minimization for PDEs while using quadrature points more efficiently.
-
A Shifted Cohesive-Zone Method for Non-Interface-Fitted Meshes with Applications to Crystal Plasticity
SCZM enables accurate cohesive interface modeling and crystal plasticity on non-interface-fitted meshes by shifting traction-separation laws to a nearby surrogate interface.
-
Neural parametric representations for thin-shell shape optimisation
A neural network with periodic activations parameterizes thin-shell mid-surfaces so that network weights can be optimized to minimize structural compliance subject to a volume limit.
-
A high order stabilization-free virtual element method for general second-order elliptic eigenvalue problem
A novel high-order stabilization-free virtual element method is developed for general second-order elliptic eigenvalue problems, with optimal a priori error estimates for eigenspaces and eigenvalues, validated on various polygonal meshes.
-
Mixed formulation and structure-preserving discretization of Cosserat rod dynamics in a port-Hamiltonian framework
A mixed formulation for Cosserat rod dynamics is cast as an infinite-dimensional nonlinear port-Hamiltonian system and discretized with structure-preserving finite elements to yield energy-momentum consistent time integration.
-
A pressure-projection formulation in a least-squares meshfree method for the incompressible Navier-Stokes equations using a staggered-variable arrangement
A least-squares meshfree method for incompressible flows introduces a local primal-dual grid to achieve consistent staggered discretization and local divergence-free velocities.
-
Conserving mass, momentum, and energy for the Benjamin-Bona-Mahony, Korteweg-de Vries, and nonlinear Schr\"odinger equations
High-order essentially explicit discretizations using Fourier Galerkin plus projection-relaxation conserve mass, momentum, and energy for BBM, KdV, and NLS equations.
-
Mixed-precision iterative refinement for low-rank Lyapunov equations
Develops mixed-precision iterative refinement for low-rank Lyapunov equations with rounding error analysis enabling reduced precision for moderately conditioned problems.
-
A lattice Boltzmann method for Biot's consolidation model of linear poroelasticity
A centered coupling scheme for lattice Boltzmann methods solves Biot's poroelasticity model stably for strong coupling and captures discontinuous solutions in consolidation problems.
-
Evolving finite elements for advection diffusion with an evolving interface
Develops an evolving finite element method for parabolic PDEs with evolving interfaces, derives a suitable weak formulation, proves optimal error bounds for isoparametric elements of arbitrary order, and verifies convergence numerically.
-
Efficient Krylov solvers for inverse source problem in 2D space-time fractional diffusion equation
GLT-based preconditioners are built for multilevel Toeplitz systems from finite-difference discretization of the quasi-boundary regularized inverse source problem, producing eigenvalue clustering around one and faster GMRES convergence.
-
Locally conservative redistribution limiting and applications to the approximation of conservation equations, Part II
Introduces a conservative, discretization- and PDE-agnostic redistribution limiting method for high-order approximations of conservation equations.
-
Multiscale Fourier Neural Operator for Inverse Wave Scattering in Highly Oscillatory Media
MscaleFNO learns mappings from oscillatory media to wavefields for Helmholtz inverse problems and pairs it with diffusion regularization for partial-aperture 2D reconstructions.
-
An indefinite LOBPCG type of algorithm for detecting a definite Hermitian matrix pair
A new subspace algorithm using LOBPCG-style subspaces detects definiteness of large Hermitian matrix pairs (A,B) with indefinite B faster than prior methods on medium, large, and banded cases.
-
A-priori error estimation for space-time Galerkin POD for linear evolution problems
An a-priori error estimate for space-time Galerkin POD is derived for linear parabolic evolution problems, bounding the FOM-ROM error by singular-value tail sums and problem-specific constants.
-
Singularity Formation: Synergy in Theoretical, Numerical and Machine Learning Approaches
The work introduces a modulation-based analytical method for singularity proofs in singular PDEs and refines ML techniques like PINNs and KANs to identify blowup solutions, with application to the open 3D Keller-Segel problem.
-
A Multi-Order Extension of Fractional HBVMs (FHBVMs)
Extension of FHBVMs to multi-order fractional differential equations with a Matlab implementation for two orders.
-
A Block-Shifted Cyclic Reduction Algorithm for Solving a Class of Quadratic Matrix Equations
Presents Block-Shifted CR algorithm using SVD and deflation to extend convergence of cyclic reduction for quadratic matrix equations with multiple unit-circle eigenvalues.
-
Adaptive hyperviscosity stabilisation for the RBF-FD method in solving advection-dominated transport equations
An adaptive hyperviscosity stabilization for RBF-FD is proposed that sets the constant from the spectral radius of the evolution matrix, supports general nodes, and is demonstrated on linear advection and Burgers' equation with limited dissipation.
-
Isogeometric Topology Optimization Based on Topological Derivatives
An isogeometric topology optimization approach using topological derivatives and level-set methods in an immersed framework enables seamless geometry updates without remeshing and benefits from higher-order basis functions for solution accuracy.
-
Neural operators for solving nonlinear inverse problems
Tikhonov regularization is analyzed using neural operators as learned surrogates for ill-posed nonlinear operator equations, with error balancing and approximation results extended to Sobolev and Lebesgue spaces.
-
Some new properties of an Active flux type scheme: PamPa
Active Flux/PamPa schemes incorporate discontinuous Galerkin methods as a building block, possess intrinsic bound-preserving properties illustrated numerically, and satisfy the summation-by-parts property in one dimension.
-
An Adaptive Subdomain Coupling Approach in Domain Decomposition for Multiphase Porous Media Flow
Presents an adaptive subdomain coupling approach in domain decomposition for multiphase porous media flow, with new strategies and initial guesses to improve nonlinear solver convergence and parallel scalability.
-
A Cartesian grid-based boundary integral method for moving interface problems
A boundary integral method on Cartesian grids is introduced for moving interface problems, using θ-L variables for stable evolution and fast solvers for efficiency, demonstrated on complex fingering and solidification cases.
-
Justification and structure- and asymptotic-preserving discretizations of a hyperbolized Cahn-Hilliard equation
Develops energy-stable asymptotic-preserving discretizations of a hyperbolized Cahn-Hilliard equation via SBP operators and IMEX Runge-Kutta methods guided by relative-energy error estimates.
-
Spectral Bounds for Tensors Derived from Trace Functionals and Wasserstein Distance in Tensor Spaces
Defines trace-based metric and Bures-Wasserstein distance for PSD tensors, derives spectral eigenvalue bounds, and analyzes dependence on PSD condition with examples and complexity.
-
Parallel matching-based AMG preconditioners for elliptic equations discretized by IgA
Matching-based AMG preconditioners deliver robust and scalable performance for solving large ill-conditioned systems from IgA discretizations in parallel HPC settings.
-
Effect of different clustering approaches on the multilevel fast multipole method for the Helmholtz equation
Clustering approaches significantly affect the stability and efficiency of the multilevel fast multipole method for the Helmholtz equation, with poor choices causing instabilities on non-uniform meshes.
- Computing k-means in mixed precision