Token Rankings are Unforgeable Language Model Signatures

Andreas Grivas; Matthew Finlayson; Swabha Swayamdipta; Xiang Ren

arxiv: 2606.04459 · v1 · pith:C6VJOE3Snew · submitted 2026-06-03 · 💻 cs.CR · cs.AI· cs.CC· cs.CL

Token Rankings are Unforgeable Language Model Signatures

Matthew Finlayson , Andreas Grivas , Xiang Ren , Swabha Swayamdipta This is my paper

Pith reviewed 2026-06-28 06:23 UTC · model grok-4.3

classification 💻 cs.CR cs.AIcs.CCcs.CL

keywords token rankingslanguage model signaturesunforgeable signaturesmodel stealingNP-hardnessAPI securitytop-k rankingslogit geometry

0 comments

The pith

Token rankings from language models form unique signatures that cannot be forged in polynomial time.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Language models impose geometric constraints on their logit outputs that produce a distinct set of feasible top-k token rankings for each model. The paper demonstrates that these rankings identify the originating model even when an API reveals only orderings and not probability values. Because recovering a model that matches a given set of feasible rankings is NP-hard, the signature resists efficient forgery. This property enables APIs to release rankings as a verifiable identifier while limiting exposure enough to block parameter stealing when k is kept small. The k threshold needed to reveal the signature is typically lower than the threshold required to stop stealing attacks.

Core claim

Every language model possesses a unique collection of feasible top-k rankings for large enough k. Finding any other model that shares exactly the same collection is NP-hard, establishing the ranking set as the first known polynomially unforgeable signature. Rankings still permit coarse approximation of the final-layer weights, yet restricting the exposed top-k to a sufficiently small value prevents useful stealing while preserving the signature.

What carries the argument

The set of feasible top-k rankings induced by a model's logit geometry, which uniquely identifies the model and renders forgery NP-hard.

If this is right

APIs can release token rankings to prove model identity without enabling forgery.
Parameter stealing from rankings remains possible but is blocked by limiting the exposed k.
Signature revelation requires smaller k than the k needed to prevent stealing.
Rankings supply a more restrictive interface than full logits while retaining identifiability.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same geometric hardness might apply to other partial output interfaces such as top-k with ties or sampled tokens.
APIs could combine ranking signatures with other lightweight checks to strengthen model provenance without full logit exposure.
Determining the smallest k that guarantees uniqueness for a given model family would be a direct next measurement.

Load-bearing premise

A model's parameters create geometric constraints on logits that yield a set of feasible rankings both unique to that model and whose matching problem is NP-hard.

What would settle it

Finding two different models that share identical feasible top-k ranking sets for all sufficiently large k, or exhibiting a polynomial-time algorithm that constructs a model matching an observed set of rankings.

Figures

Figures reproduced from arXiv: 2606.04459 by Andreas Grivas, Matthew Finlayson, Swabha Swayamdipta, Xiang Ren.

**Figure 2.** Figure 2: Distribution of feasibility boundaries (smallest [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: The realization of the simple allowable sequence (1). Projecting the points onto a vector gives a token ordering. Rotating the vector gives a sequence of token rankings [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 5.** Figure 5: Checking assumptions for parameter stealing. Left: [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗

**Figure 6.** Figure 6: Approximate parameter recovery from rankings on Pythia 70M as the number of training [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗

**Figure 7.** Figure 7: Exposing the signature, but not the parameters. Top: as top- [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗

**Figure 8.** Figure 8: The probability that a randomly selected ranking can be output by a language model with [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗

**Figure 9.** Figure 9: Probability that a random top-𝑘 ranking is feasible for random 10,000×100 matrices sampled from the standard normal distribution N (0, 1). The shaded region indicates the 95% confidence interval, estimated from 30 sampled matrices for each top-𝑘 size. The data show a phase-change between 𝑘 = 9 and 𝑘 = 16, where the matrices go from high to low coverage of top-𝑘 argsorts. By definition, the Pearson correlat… view at source ↗

read the original abstract

Language model parameters are known to impose unique (to each model) geometric constraints on their logit outputs, which serves as a signature that identifies the model, but also leaks the model's final layer parameters when an API distributes logits. We investigate more restrictive APIs that expose token rankings (i.e., their ordering by probability, but not the probability values) and find that rankings also constitute a signature: every model has a unique set of feasible top-$k$ rankings for sufficiently large $k$. Furthermore, the ranking signature is the first known (polynomially) unforgeable signature, since finding a model with the same set of feasible rankings is NP-hard. On the security front, we find that token rankings are already sufficient to approximately steal the final layer of the model, similar to logits, though the approximation is too coarse to forge the signature, and can be effectively countered by restricting the API to top-$k$ tokens with sufficiently small $k$. Since the top-$k$ required to present the model signature is generally smaller than the $k$ required to prevent stealing, it is possible for an API to present an unforgeable signature without leaking model parameters.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Token rankings can serve as unforgeable model signatures via NP-hardness, but the reduction needs verification.

read the letter

The punchline is that token rankings can serve as a signature for language models because each model has a unique set of feasible top-k rankings, and recovering a model that matches a given set of rankings is NP-hard. This is presented as the first such polynomially unforgeable signature.

The paper does a good job extending previous work on logit-based identification to the more limited information in rankings. This fits better with restricted APIs that only return orderings rather than probability values. They also provide an analysis showing that while rankings can be used to approximately recover the final layer, the approximation is too rough to allow forging the signature. They note that the k needed for the signature is generally smaller than the k needed to prevent stealing, which is a useful practical point for API design.

The hardness result is the key new element, relying on the geometric constraints that logits impose on possible rankings.

On the downside, the uniqueness and hardness claims depend on those geometric properties holding in a way that makes the matching problem hard. Without the full details of the reduction, it's difficult to assess if there might be efficient ways to find matching models or if the uniqueness is strict for all models. Empirical checks on actual language models would help strengthen that part, though the abstract suggests the theory is the focus.

This paper would interest people working on AI security, model ownership, and API protections. A reader focused on cryptographic-style guarantees for ML systems would find the approach worth considering. The work shows clear engagement with the problem and has enough substance that it deserves to go through peer review rather than being desk rejected.

Referee Report

2 major / 2 minor

Summary. The paper claims that language model APIs exposing only token rankings (orderings by probability, without values) still yield model-unique signatures: each model induces a distinct set of feasible top-k rankings for sufficiently large k, owing to geometric constraints on its logit outputs. It further asserts that this signature is polynomially unforgeable because recovering any model whose feasible ranking set matches a given one is NP-hard. The work also shows that rankings suffice for coarse approximation of the final layer (similar to logit leakage) but that the approximation cannot forge the signature, and that choosing a small enough k prevents stealing while still revealing the signature.

Significance. If the uniqueness and NP-hardness results hold, the contribution would be notable for providing the first polynomially unforgeable signature derived from model outputs, enabling API designs that authenticate models without parameter leakage. The separation between the k needed for signature visibility and the k needed to block stealing is a concrete, actionable observation for API security.

major comments (2)

[Abstract, §3] Abstract and §3 (Uniqueness argument): the claim that geometric constraints on logits produce a unique set of feasible top-k rankings for large k is central but stated without an explicit characterization of the feasible set or a proof that distinct models cannot share the same set; a concrete example or theorem establishing injectivity is required.
[Abstract, §4] Abstract and §4 (NP-hardness): the assertion that finding a model with the same feasible ranking set is NP-hard is load-bearing for the unforgeability claim, yet no reduction, hardness assumption, or proof sketch appears; the manuscript must supply the reduction from a known NP-hard problem and verify that it applies to the ranking-set matching problem as defined.

minor comments (2)

[§2] Define 'feasible top-k ranking' formally at first use and clarify how k is chosen relative to vocabulary size.
[§5] The experimental section on approximate stealing should report the precise k values used and the resulting approximation error to allow direct comparison with the signature k.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. The points raised correctly identify that the uniqueness and NP-hardness claims would be strengthened by explicit formal statements. We will revise the manuscript to incorporate these elements.

read point-by-point responses

Referee: [Abstract, §3] Abstract and §3 (Uniqueness argument): the claim that geometric constraints on logits produce a unique set of feasible top-k rankings for large k is central but stated without an explicit characterization of the feasible set or a proof that distinct models cannot share the same set; a concrete example or theorem establishing injectivity is required.

Authors: We agree that an explicit characterization and injectivity proof would improve clarity. In the revised manuscript we will add a theorem that formally defines the feasible set of top-k rankings induced by the geometric constraints on logit vectors and proves that this set is distinct for distinct models. A concrete numerical example illustrating non-overlapping feasible sets for two different models will also be included. revision: yes
Referee: [Abstract, §4] Abstract and §4 (NP-hardness): the assertion that finding a model with the same feasible ranking set is NP-hard is load-bearing for the unforgeability claim, yet no reduction, hardness assumption, or proof sketch appears; the manuscript must supply the reduction from a known NP-hard problem and verify that it applies to the ranking-set matching problem as defined.

Authors: We acknowledge that a formal reduction is required to substantiate the NP-hardness claim. The revised version will contain an explicit polynomial-time reduction from a canonical NP-hard problem (e.g., 3-SAT) to the decision problem of whether there exists a model whose feasible ranking set matches a given target set, together with a verification that the reduction preserves the ranking-set equivalence relation used in the paper. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The derivation begins from the stated geometric constraints that model parameters impose on logit outputs (a premise presented as known) and extends this to show that top-k rankings induce unique feasible sets for large k. The NP-hardness of recovering a model matching a given ranking set is invoked directly as the basis for unforgeability. No equations or steps reduce a claimed prediction or uniqueness result to a fitted parameter, self-citation chain, or definitional renaming within the provided text. The central claims remain independent of the target result and rest on external complexity support rather than internal construction.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the domain assumption that logit geometry induces unique ranking sets and on the standard complexity assumption that the matching problem is NP-hard; no free parameters or invented entities are introduced in the abstract.

axioms (2)

domain assumption Model parameters impose geometric constraints on logit outputs that determine a unique collection of feasible top-k rankings for sufficiently large k.
Stated directly in the abstract as the basis for the signature property.
standard math Finding another model with an identical set of feasible rankings is NP-hard.
Invoked to establish polynomial unforgeability.

pith-pipeline@v0.9.1-grok · 5749 in / 1277 out tokens · 34613 ms · 2026-06-28T06:23:31.519606+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

66 extracted references · 21 canonical work pages

[1]

, journal=

Cover, Thomas M. , journal=. Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition , year=
[2]

2023 , url=

Using oriented matroids to find low rank structure in presence of nonlinearity , author=. 2023 , url=

2023
[3]

Sign rank versus

Alon, Noga and Moran, Shay and Yehudayoff, Amir , booktitle =. Sign rank versus. 2016 , editor =

2016
[4]

1996 , issn =

On Limited Nondeterminism and the Complexity of the V-C Dimension , journal =. 1996 , issn =. doi:https://doi.org/10.1006/jcss.1996.0058 , url =

work page doi:10.1006/jcss.1996.0058 1996
[5]

Topics in Hyperplane Arrangements , author=
[6]

On k-Nearest Neighbor Voronoi Diagrams in the Plane , year=

Der-Tsai Lee , journal=. On k-Nearest Neighbor Voronoi Diagrams in the Plane , year=
[7]

Combinatorica , year=

Padrol, Arnau and Philippe, Eva , title=. Combinatorica , year=. doi:10.1007/s00493-023-00062-3 , url=

work page doi:10.1007/s00493-023-00062-3
[8]

Cohen , booktitle=

Zhilin Yang and Zihang Dai and Ruslan Salakhutdinov and William W. Cohen , booktitle=. Breaking the Softmax Bottleneck: A High-Rank. 2018 , url=

2018
[9]

Cover , journal =

Thomas M. Cover , journal =. The Number of Linearly Inducible Orderings of Points in d-Space , urldate =
[10]

1977 , issn =

Stirling numbers and a geometric structure from voting theory , journal =. 1977 , issn =. doi:https://doi.org/10.1016/0097-3165(77)90077-2 , url =

work page doi:10.1016/0097-3165(77)90077-2 1977
[11]

and Tanner, Jared , TITLE =

Donoho, David L. and Tanner, Jared , TITLE =. J. Amer. Math. Soc. , FJOURNAL =. 2009 , NUMBER =. doi:10.1090/S0894-0347-08-00600-0 , URL =

work page doi:10.1090/s0894-0347-08-00600-0 2009
[12]

Lectures on Polytopes: Updated Seventh Printing of the First Edition

Ziegler, G \"u nter M. Lectures on Polytopes: Updated Seventh Printing of the First Edition. 1995. doi:10.1007/978-1-4613-8431-1_6

work page doi:10.1007/978-1-4613-8431-1_6 1995
[13]

Ranking patterns of unfolding models of codimension one , journal =

Hidehiko Kamiya and Akimichi Takemura and Hiroaki Terao , keywords =. Ranking patterns of unfolding models of codimension one , journal =. 2011 , issn =. doi:https://doi.org/10.1016/j.aam.2010.11.002 , url =

work page doi:10.1016/j.aam.2010.11.002 2011
[14]

Björner, Anders and Las Vergnas, Michel and Sturmfels, Bernd and White, Neil and Ziegler, Gunter M. , year=. Oriented Matroids , DOI=
[15]

2018 , eprint=

A universality theorem for allowable sequences with applications , author=. 2018 , eprint=

2018
[16]

and Pollack, Richard , TITLE =

Goodman, Jacob E. and Pollack, Richard , TITLE =. J. Combin. Theory Ser. A , FJOURNAL =. 1980 , NUMBER =. doi:10.1016/0097-3165(80)90011-4 , URL =

work page doi:10.1016/0097-3165(80)90011-4 1980
[17]

N. E. Mnev , title =. Topology and Geometry --- Rohlin Seminar , series =. 1988 , doi =

1988
[18]

Applied Geometry and Discrete Mathematics: The Victor Klee Festschrift , series=

Stretchability of Pseudolines is NP-Hard , author=. Applied Geometry and Discrete Mathematics: The Victor Klee Festschrift , series=. 1991 , doi=

1991
[19]

Combinatorica , year=

How Many Circuits Determine an Oriented Matroid? , author=. Combinatorica , year=
[20]

Forty-first International Conference on Machine Learning , year=

Trustless Audits without Revealing Data or Models , author=. Forty-first International Conference on Machine Learning , year=
[21]

2024 , isbn =

Sun, Haochen and Li, Jason and Zhang, Hongyang , title =. 2024 , isbn =. doi:10.1145/3658644.3670334 , booktitle =

work page doi:10.1145/3658644.3670334 2024
[22]

2024 , eprint=

Trust the Process: Zero-Knowledge Machine Learning to Enhance Trust in Generative AI Interactions , author=. 2024 , eprint=

2024
[23]

Approximating the Existential Theory of the Reals

Deligkas, Argyrios and Fearnley, John and Melissourgos, Themistoklis and Spirakis, Paul G. Approximating the Existential Theory of the Reals. Web and Internet Economics. 2018

2018
[24]

N. J. A. Sloane , title =. 1996 , url =

1996
[25]

European Journal of Combinatorics , volume=

On the number of reduced decompositions of elements of Coxeter groups , author=. European Journal of Combinatorics , volume=. 1984 , publisher=

1984
[26]

Directed Switching Games on graphs and matroids , journal =

Yahya Ould Hamidoune and Michel. Directed Switching Games on graphs and matroids , journal =. 1986 , issn =. doi:https://doi.org/10.1016/0095-8956(86)90083-3 , url =

work page doi:10.1016/0095-8956(86)90083-3 1986
[27]

Discrete & Computational Geometry , year=

Complete Enumeration of Small Realizable Oriented Matroids , author=. Discrete & Computational Geometry , year=
[28]

and Fukuda, K

Finschi, L. and Fukuda, K. , title =. 2002 , issue_date =. doi:10.1007/s00454-001-0056-5 , journal =

work page doi:10.1007/s00454-001-0056-5 2002
[29]

Mohammad Tavakoli, Alireza Salemi, Carrie Ye, Mohamed Abdalla, Hamed Zamani, and J

Neumann, Stefan and Gemulla, Rainer and Miettinen, Pauli , booktitle=. What You Will Gain By Rounding: Theory and Algorithms for Rounding Rank , year=. doi:10.1109/ICDM.2016.0049 , annote=

work page doi:10.1109/icdm.2016.0049 2016
[30]

Discrete & Computational Geometry , volume=

Vapnik-Chervonenkis dimension and (pseudo-) hyperplane arrangements , author=. Discrete & Computational Geometry , volume=. 1994 , publisher=

1994
[31]

2002 , issue_date =

Abello and Kumar , title =. 2002 , issue_date =. doi:10.1007/s00454-002-2881-6 , journal =

work page doi:10.1007/s00454-002-2881-6 2002
[32]

2021 , issue_date =

Abrahamsen, Mikkel and Adamaszek, Anna and Miltzow, Tillmann , title =. 2021 , issue_date =. doi:10.1145/3486220 , journal =

work page doi:10.1145/3486220 2021
[33]

Logits of

Matthew Finlayson and Xiang Ren and Swabha Swayamdipta , booktitle=. Logits of. 2024 , url=

2024
[34]

Feder and Lee, Katherine and Jagielski, Matthew and Nasr, Milad and Conmy, Arthur and Wallace, Eric and Rolnick, David and Tram\`

Carlini, Nicholas and Paleka, Daniel and Dvijotham, Krishnamurthy (Dj) and Steinke, Thomas and Hayase, Jonathan and Cooper, A. Feder and Lee, Katherine and Jagielski, Matthew and Nasr, Milad and Conmy, Arthur and Wallace, Eric and Rolnick, David and Tram\`. Stealing part of a production language model , year =. Proceedings of the 41st International Confer...
[35]

The Fourteenth International Conference on Learning Representations , year=

Every Language Model Has a Forgery-Resistant Signature , author=. The Fourteenth International Conference on Learning Representations , year=
[36]

Spirakis , keywords =

Argyrios Deligkas and John Fearnley and Themistoklis Melissourgos and Paul G. Spirakis , keywords =. Approximating the existential theory of the reals , journal =. 2022 , issn =. doi:https://doi.org/10.1016/j.jcss.2021.11.002 , url =

work page doi:10.1016/j.jcss.2021.11.002 2022
[37]

Smoothing the gap between NP and ER , year=

Erickson, Jeff and van der Hoog, Ivor and Miltzow, Tillmann , booktitle=. Smoothing the gap between NP and ER , year=
[38]

Training Fully Connected Neural Networks is

Daniel Bertschinger and Christoph Hertrich and Paul Jungeblut and Tillmann Miltzow and Simon Weber , booktitle =. Training Fully Connected Neural Networks is. 2023 , url =

2023
[39]

Training Neural Networks is ER-complete , url =

Abrahamsen, Mikkel and Kleist, Linda and Miltzow, Tillmann , booktitle =. Training Neural Networks is ER-complete , url =
[40]

, year =

Heintz, J. , year =. On the Theoretical and Practical Complexity of the Existential Theory of Reals , volume =. The Computer Journal , publisher =. doi:10.1093/comjnl/36.5.427 , number =

work page doi:10.1093/comjnl/36.5.427
[41]

The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

Better Language Model Inversion by Compactly Representing Next-Token Distributions , author=. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=
[42]

and Zhao, Wenting and Chiu, Justin and Shmatikov, Vitaly and Rush, Alexander , booktitle =

Morris, John X. and Zhao, Wenting and Chiu, Justin and Shmatikov, Vitaly and Rush, Alexander , booktitle =. Language Model Inversion , url =
[43]

2022 , month = nov, day =

New Logit Bias experimental parameter , author =. 2022 , month = nov, day =

2022
[44]

Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice

Grivas, Andreas and Bogoychev, Nikolay and Lopez, Adam. Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2022. doi:10.18653/v1/2022.acl-long.465

work page doi:10.18653/v1/2022.acl-long.465 2022
[45]

Proceedings of the 40th International Conference on Machine Learning , pages =

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling , author =. Proceedings of the 40th International Conference on Machine Learning , pages =. 2023 , editor =

2023
[46]

arXiv preprint arXiv:2101.00027 , year =

The Pile: An 800GB Dataset of Diverse Text for Language Modeling , author =. arXiv preprint arXiv:2101.00027 , year =

Pith/arXiv arXiv
[47]

Mathematical Programming Computation , volume =

Parallelizing the dual revised simplex method , author =. Mathematical Programming Computation , volume =. 2018 , doi =

2018
[48]

Proceedings of the AAAI Conference on Artificial Intelligence , author=

Taming the Sigmoid Bottleneck: Provably Argmaxable Sparse Multi-Label Classification , volume=. Proceedings of the AAAI Conference on Artificial Intelligence , author=. 2024 , month=. doi:10.1609/aaai.v38i11.29110 , number=

work page doi:10.1609/aaai.v38i11.29110 2024
[49]

2025 , eprint=

A Fingerprint for Large Language Models , author=. 2025 , eprint=. doi:https://doi.org/10.2352/EI.2026.38.4.MWSF-309 , url=

work page doi:10.2352/ei.2026.38.4.mwsf-309 2025
[50]

ACM Computing Surveys , volume=

Securing Large Language Models: A Survey of Watermarking and Fingerprinting Techniques , author=. ACM Computing Surveys , volume=. 2026 , doi=

2026
[51]

2021 , doi=

Cao, Xiaoyu and Jia, Jinyuan and Gong, Neil Zhenqiang , booktitle=. 2021 , doi=

2021
[52]

MOSEK Fusion API for Python 11.1.10 , year =

ApS. MOSEK Fusion API for Python 11.1.10 , year =
[53]

2025 , eprint=

Olmo 3 , author=. 2025 , eprint=

2025
[54]

2014 , url =

Jonathan Katz and Yehuda Lindell , title =. 2014 , url =

2014
[55]

Authorship Attribution for Neural Text Generation

Uchendu, Adaku and Le, Thai and Shu, Kai and Lee, Dongwon. Authorship Attribution for Neural Text Generation. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). 2020. doi:10.18653/v1/2020.emnlp-main.673

work page doi:10.18653/v1/2020.emnlp-main.673 2020
[56]

International Conference on Machine Learning,

John Kirchenbauer and Jonas Geiping and Yuxin Wen and Jonathan Katz and Ian Miers and Tom Goldstein , title =. International Conference on Machine Learning,. 2023 , url =

2023
[57]

Are You Getting What You Pay For? Auditing Model Substitution in

Will Cai and Tianneng Shi and Xuandong Zhao and Dawn Song , year=. Are You Getting What You Pay For? Auditing Model Substitution in. 2504.04715 , archivePrefix=

arXiv
[58]

Graph Drawing, 17th International Symposium,

Marcus Schaefer , title =. Graph Drawing, 17th International Symposium,. 2009 , url =

2009
[59]

Canny , title =

John F. Canny , title =. Proceedings of the 20th Annual. 1988 , url =

1988
[60]

Kingma and Jimmy Ba , title =

Diederik P. Kingma and Jimmy Ba , title =. 3rd International Conference on Learning Representations,. 2015 , url =

2015
[61]

Liu and Jorge Nocedal , title =

Dong C. Liu and Jorge Nocedal , title =. Math. Program. , volume =. 1989 , url =

1989
[62]

The Electronic Journal of Combinatorics , author=

An Asymptotic Expansion for the Number of Permutations with a Certain Number of Inversions , volume=. The Electronic Journal of Combinatorics , author=. 2000 , month=. doi:10.37236/1528 , abstractNote=

work page doi:10.37236/1528 2000
[63]

Oriented matroids , journal =

Jon Folkman and Jim Lawrence , abstract =. Oriented matroids , journal =. 1978 , issn =. doi:https://doi.org/10.1016/0095-8956(78)90039-4 , url =

work page doi:10.1016/0095-8956(78)90039-4 1978
[64]

Kendall's Tau for Elliptical Distributions

Lindskog, Filip and McNeil, Alexander and Schmock, Uwe. Kendall's Tau for Elliptical Distributions. Credit Risk. 2003

2003
[65]

The Fourteenth International Conference on Learning Representations , year=

The Softmax Bottleneck Does Not Limit the Probabilities of the Most Likely Tokens , author=. The Fourteenth International Conference on Learning Representations , year=
[66]

Kruskal , journal =

William H. Kruskal , journal =. Ordinal Measures of Association , urldate =

[1] [1]

, journal=

Cover, Thomas M. , journal=. Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition , year=

[2] [2]

2023 , url=

Using oriented matroids to find low rank structure in presence of nonlinearity , author=. 2023 , url=

2023

[3] [3]

Sign rank versus

Alon, Noga and Moran, Shay and Yehudayoff, Amir , booktitle =. Sign rank versus. 2016 , editor =

2016

[4] [4]

1996 , issn =

On Limited Nondeterminism and the Complexity of the V-C Dimension , journal =. 1996 , issn =. doi:https://doi.org/10.1006/jcss.1996.0058 , url =

work page doi:10.1006/jcss.1996.0058 1996

[5] [5]

Topics in Hyperplane Arrangements , author=

[6] [6]

On k-Nearest Neighbor Voronoi Diagrams in the Plane , year=

Der-Tsai Lee , journal=. On k-Nearest Neighbor Voronoi Diagrams in the Plane , year=

[7] [7]

Combinatorica , year=

Padrol, Arnau and Philippe, Eva , title=. Combinatorica , year=. doi:10.1007/s00493-023-00062-3 , url=

work page doi:10.1007/s00493-023-00062-3

[8] [8]

Cohen , booktitle=

Zhilin Yang and Zihang Dai and Ruslan Salakhutdinov and William W. Cohen , booktitle=. Breaking the Softmax Bottleneck: A High-Rank. 2018 , url=

2018

[9] [9]

Cover , journal =

Thomas M. Cover , journal =. The Number of Linearly Inducible Orderings of Points in d-Space , urldate =

[10] [10]

1977 , issn =

Stirling numbers and a geometric structure from voting theory , journal =. 1977 , issn =. doi:https://doi.org/10.1016/0097-3165(77)90077-2 , url =

work page doi:10.1016/0097-3165(77)90077-2 1977

[11] [11]

and Tanner, Jared , TITLE =

Donoho, David L. and Tanner, Jared , TITLE =. J. Amer. Math. Soc. , FJOURNAL =. 2009 , NUMBER =. doi:10.1090/S0894-0347-08-00600-0 , URL =

work page doi:10.1090/s0894-0347-08-00600-0 2009

[12] [12]

Lectures on Polytopes: Updated Seventh Printing of the First Edition

Ziegler, G \"u nter M. Lectures on Polytopes: Updated Seventh Printing of the First Edition. 1995. doi:10.1007/978-1-4613-8431-1_6

work page doi:10.1007/978-1-4613-8431-1_6 1995

[13] [13]

Ranking patterns of unfolding models of codimension one , journal =

Hidehiko Kamiya and Akimichi Takemura and Hiroaki Terao , keywords =. Ranking patterns of unfolding models of codimension one , journal =. 2011 , issn =. doi:https://doi.org/10.1016/j.aam.2010.11.002 , url =

work page doi:10.1016/j.aam.2010.11.002 2011

[14] [14]

Björner, Anders and Las Vergnas, Michel and Sturmfels, Bernd and White, Neil and Ziegler, Gunter M. , year=. Oriented Matroids , DOI=

[15] [15]

2018 , eprint=

A universality theorem for allowable sequences with applications , author=. 2018 , eprint=

2018

[16] [16]

and Pollack, Richard , TITLE =

Goodman, Jacob E. and Pollack, Richard , TITLE =. J. Combin. Theory Ser. A , FJOURNAL =. 1980 , NUMBER =. doi:10.1016/0097-3165(80)90011-4 , URL =

work page doi:10.1016/0097-3165(80)90011-4 1980

[17] [17]

N. E. Mnev , title =. Topology and Geometry --- Rohlin Seminar , series =. 1988 , doi =

1988

[18] [18]

Applied Geometry and Discrete Mathematics: The Victor Klee Festschrift , series=

Stretchability of Pseudolines is NP-Hard , author=. Applied Geometry and Discrete Mathematics: The Victor Klee Festschrift , series=. 1991 , doi=

1991

[19] [19]

Combinatorica , year=

How Many Circuits Determine an Oriented Matroid? , author=. Combinatorica , year=

[20] [20]

Forty-first International Conference on Machine Learning , year=

Trustless Audits without Revealing Data or Models , author=. Forty-first International Conference on Machine Learning , year=

[21] [21]

2024 , isbn =

Sun, Haochen and Li, Jason and Zhang, Hongyang , title =. 2024 , isbn =. doi:10.1145/3658644.3670334 , booktitle =

work page doi:10.1145/3658644.3670334 2024

[22] [22]

2024 , eprint=

Trust the Process: Zero-Knowledge Machine Learning to Enhance Trust in Generative AI Interactions , author=. 2024 , eprint=

2024

[23] [23]

Approximating the Existential Theory of the Reals

Deligkas, Argyrios and Fearnley, John and Melissourgos, Themistoklis and Spirakis, Paul G. Approximating the Existential Theory of the Reals. Web and Internet Economics. 2018

2018

[24] [24]

N. J. A. Sloane , title =. 1996 , url =

1996

[25] [25]

European Journal of Combinatorics , volume=

On the number of reduced decompositions of elements of Coxeter groups , author=. European Journal of Combinatorics , volume=. 1984 , publisher=

1984

[26] [26]

Directed Switching Games on graphs and matroids , journal =

Yahya Ould Hamidoune and Michel. Directed Switching Games on graphs and matroids , journal =. 1986 , issn =. doi:https://doi.org/10.1016/0095-8956(86)90083-3 , url =

work page doi:10.1016/0095-8956(86)90083-3 1986

[27] [27]

Discrete & Computational Geometry , year=

Complete Enumeration of Small Realizable Oriented Matroids , author=. Discrete & Computational Geometry , year=

[28] [28]

and Fukuda, K

Finschi, L. and Fukuda, K. , title =. 2002 , issue_date =. doi:10.1007/s00454-001-0056-5 , journal =

work page doi:10.1007/s00454-001-0056-5 2002

[29] [29]

Mohammad Tavakoli, Alireza Salemi, Carrie Ye, Mohamed Abdalla, Hamed Zamani, and J

Neumann, Stefan and Gemulla, Rainer and Miettinen, Pauli , booktitle=. What You Will Gain By Rounding: Theory and Algorithms for Rounding Rank , year=. doi:10.1109/ICDM.2016.0049 , annote=

work page doi:10.1109/icdm.2016.0049 2016

[30] [30]

Discrete & Computational Geometry , volume=

Vapnik-Chervonenkis dimension and (pseudo-) hyperplane arrangements , author=. Discrete & Computational Geometry , volume=. 1994 , publisher=

1994

[31] [31]

2002 , issue_date =

Abello and Kumar , title =. 2002 , issue_date =. doi:10.1007/s00454-002-2881-6 , journal =

work page doi:10.1007/s00454-002-2881-6 2002

[32] [32]

2021 , issue_date =

Abrahamsen, Mikkel and Adamaszek, Anna and Miltzow, Tillmann , title =. 2021 , issue_date =. doi:10.1145/3486220 , journal =

work page doi:10.1145/3486220 2021

[33] [33]

Logits of

Matthew Finlayson and Xiang Ren and Swabha Swayamdipta , booktitle=. Logits of. 2024 , url=

2024

[34] [34]

Feder and Lee, Katherine and Jagielski, Matthew and Nasr, Milad and Conmy, Arthur and Wallace, Eric and Rolnick, David and Tram\`

Carlini, Nicholas and Paleka, Daniel and Dvijotham, Krishnamurthy (Dj) and Steinke, Thomas and Hayase, Jonathan and Cooper, A. Feder and Lee, Katherine and Jagielski, Matthew and Nasr, Milad and Conmy, Arthur and Wallace, Eric and Rolnick, David and Tram\`. Stealing part of a production language model , year =. Proceedings of the 41st International Confer...

[35] [35]

The Fourteenth International Conference on Learning Representations , year=

Every Language Model Has a Forgery-Resistant Signature , author=. The Fourteenth International Conference on Learning Representations , year=

[36] [36]

Spirakis , keywords =

Argyrios Deligkas and John Fearnley and Themistoklis Melissourgos and Paul G. Spirakis , keywords =. Approximating the existential theory of the reals , journal =. 2022 , issn =. doi:https://doi.org/10.1016/j.jcss.2021.11.002 , url =

work page doi:10.1016/j.jcss.2021.11.002 2022

[37] [37]

Smoothing the gap between NP and ER , year=

Erickson, Jeff and van der Hoog, Ivor and Miltzow, Tillmann , booktitle=. Smoothing the gap between NP and ER , year=

[38] [38]

Training Fully Connected Neural Networks is

Daniel Bertschinger and Christoph Hertrich and Paul Jungeblut and Tillmann Miltzow and Simon Weber , booktitle =. Training Fully Connected Neural Networks is. 2023 , url =

2023

[39] [39]

Training Neural Networks is ER-complete , url =

Abrahamsen, Mikkel and Kleist, Linda and Miltzow, Tillmann , booktitle =. Training Neural Networks is ER-complete , url =

[40] [40]

, year =

Heintz, J. , year =. On the Theoretical and Practical Complexity of the Existential Theory of Reals , volume =. The Computer Journal , publisher =. doi:10.1093/comjnl/36.5.427 , number =

work page doi:10.1093/comjnl/36.5.427

[41] [41]

The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

Better Language Model Inversion by Compactly Representing Next-Token Distributions , author=. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

[42] [42]

and Zhao, Wenting and Chiu, Justin and Shmatikov, Vitaly and Rush, Alexander , booktitle =

Morris, John X. and Zhao, Wenting and Chiu, Justin and Shmatikov, Vitaly and Rush, Alexander , booktitle =. Language Model Inversion , url =

[43] [43]

2022 , month = nov, day =

New Logit Bias experimental parameter , author =. 2022 , month = nov, day =

2022

[44] [44]

Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice

Grivas, Andreas and Bogoychev, Nikolay and Lopez, Adam. Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2022. doi:10.18653/v1/2022.acl-long.465

work page doi:10.18653/v1/2022.acl-long.465 2022

[45] [45]

Proceedings of the 40th International Conference on Machine Learning , pages =

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling , author =. Proceedings of the 40th International Conference on Machine Learning , pages =. 2023 , editor =

2023

[46] [46]

arXiv preprint arXiv:2101.00027 , year =

The Pile: An 800GB Dataset of Diverse Text for Language Modeling , author =. arXiv preprint arXiv:2101.00027 , year =

Pith/arXiv arXiv

[47] [47]

Mathematical Programming Computation , volume =

Parallelizing the dual revised simplex method , author =. Mathematical Programming Computation , volume =. 2018 , doi =

2018

[48] [48]

Proceedings of the AAAI Conference on Artificial Intelligence , author=

Taming the Sigmoid Bottleneck: Provably Argmaxable Sparse Multi-Label Classification , volume=. Proceedings of the AAAI Conference on Artificial Intelligence , author=. 2024 , month=. doi:10.1609/aaai.v38i11.29110 , number=

work page doi:10.1609/aaai.v38i11.29110 2024

[49] [49]

2025 , eprint=

A Fingerprint for Large Language Models , author=. 2025 , eprint=. doi:https://doi.org/10.2352/EI.2026.38.4.MWSF-309 , url=

work page doi:10.2352/ei.2026.38.4.mwsf-309 2025

[50] [50]

ACM Computing Surveys , volume=

Securing Large Language Models: A Survey of Watermarking and Fingerprinting Techniques , author=. ACM Computing Surveys , volume=. 2026 , doi=

2026

[51] [51]

2021 , doi=

Cao, Xiaoyu and Jia, Jinyuan and Gong, Neil Zhenqiang , booktitle=. 2021 , doi=

2021

[52] [52]

MOSEK Fusion API for Python 11.1.10 , year =

ApS. MOSEK Fusion API for Python 11.1.10 , year =

[53] [53]

2025 , eprint=

Olmo 3 , author=. 2025 , eprint=

2025

[54] [54]

2014 , url =

Jonathan Katz and Yehuda Lindell , title =. 2014 , url =

2014

[55] [55]

Authorship Attribution for Neural Text Generation

Uchendu, Adaku and Le, Thai and Shu, Kai and Lee, Dongwon. Authorship Attribution for Neural Text Generation. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). 2020. doi:10.18653/v1/2020.emnlp-main.673

work page doi:10.18653/v1/2020.emnlp-main.673 2020

[56] [56]

International Conference on Machine Learning,

John Kirchenbauer and Jonas Geiping and Yuxin Wen and Jonathan Katz and Ian Miers and Tom Goldstein , title =. International Conference on Machine Learning,. 2023 , url =

2023

[57] [57]

Are You Getting What You Pay For? Auditing Model Substitution in

Will Cai and Tianneng Shi and Xuandong Zhao and Dawn Song , year=. Are You Getting What You Pay For? Auditing Model Substitution in. 2504.04715 , archivePrefix=

arXiv

[58] [58]

Graph Drawing, 17th International Symposium,

Marcus Schaefer , title =. Graph Drawing, 17th International Symposium,. 2009 , url =

2009

[59] [59]

Canny , title =

John F. Canny , title =. Proceedings of the 20th Annual. 1988 , url =

1988

[60] [60]

Kingma and Jimmy Ba , title =

Diederik P. Kingma and Jimmy Ba , title =. 3rd International Conference on Learning Representations,. 2015 , url =

2015

[61] [61]

Liu and Jorge Nocedal , title =

Dong C. Liu and Jorge Nocedal , title =. Math. Program. , volume =. 1989 , url =

1989

[62] [62]

The Electronic Journal of Combinatorics , author=

An Asymptotic Expansion for the Number of Permutations with a Certain Number of Inversions , volume=. The Electronic Journal of Combinatorics , author=. 2000 , month=. doi:10.37236/1528 , abstractNote=

work page doi:10.37236/1528 2000

[63] [63]

Oriented matroids , journal =

Jon Folkman and Jim Lawrence , abstract =. Oriented matroids , journal =. 1978 , issn =. doi:https://doi.org/10.1016/0095-8956(78)90039-4 , url =

work page doi:10.1016/0095-8956(78)90039-4 1978

[64] [64]

Kendall's Tau for Elliptical Distributions

Lindskog, Filip and McNeil, Alexander and Schmock, Uwe. Kendall's Tau for Elliptical Distributions. Credit Risk. 2003

2003

[65] [65]

The Fourteenth International Conference on Learning Representations , year=

The Softmax Bottleneck Does Not Limit the Probabilities of the Most Likely Tokens , author=. The Fourteenth International Conference on Learning Representations , year=

[66] [66]

Kruskal , journal =

William H. Kruskal , journal =. Ordinal Measures of Association , urldate =