An ErdH{o}s--Szekeres type result for words with repeats
Pith reviewed 2026-05-18 03:19 UTC · model grok-4.3
The pith
Every word with kn^6+1 repeats must contain one of seven specific patterns.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We prove that every word with kn^6+1 repeats contains one of the following patterns: 0^{k+2}, 0011⋯nn, nn⋯1100, 012⋯n012⋯n, 012⋯nn⋯210, n⋯210012⋯n, n⋯210n⋯210. Moreover, when k=1, this is best possible by constructing a word with n^6 repeats that does not contain any of these patterns.
What carries the argument
The seven enumerated patterns that become unavoidable once the number of repeats in the word exceeds kn^6.
If this is right
- Any word exceeding the repeat threshold is forced to realize at least one of the seven order-isomorphic subwords.
- The quantitative bound kn^6+1 is tight for k=1 because an explicit avoiding word of length n^6 exists.
- The result supplies a concrete threshold beyond which certain rising, falling, or repeated blocks must appear in sequences that allow repetitions.
Where Pith is reading between the lines
- Similar avoidance thresholds might be derived for other finite collections of target patterns.
- The construction for k=1 could be generalized to produce large avoiding words for higher k.
- The patterns identified here may serve as a basis set for studying longer unavoidable configurations in words with bounded repeat multiplicity.
Load-bearing premise
The claim depends on these seven patterns being exactly the complete set of minimal unavoidable configurations once the repeat count passes the given threshold.
What would settle it
A single explicit word containing kn^6+1 repeats but none of the seven listed patterns would falsify the statement.
read the original abstract
We prove an Erd\H{o}s--Szekeres type result for finite words over $\mathbb{N}$ with repeated values. Specifically, we define a \emph{repeat} in a word to be an occurrence of a value which is not its first occurrence. We define an occurrence of a \emph{pattern} $\pi$ in a word $w$ to be a (not necessarily consecutive) subword of $w$ that is order isomorphic to $\pi$. In this note, we show that every word with $kn^6+1$ repeats contains one of the following patterns: $0^{k+2}$, $0011\cdots nn$, $nn\cdots1100$, $012 \cdots n012 \cdots n$, $012 \cdots nn\cdots 210$, $n\cdots 210012\cdots n$, $n\cdots 210n\cdots 210$. Moreover, when $k=1$, we show that this is best possible by constructing a word with $n^6$ repeats that does not contain any of these patterns.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proves an Erdős–Szekeres-type result for finite words over ℕ: any word with kn^6 + 1 repeats must contain one of seven specified patterns (0^{k+2}, 0011⋯nn, nn⋯1100, 012⋯n012⋯n, 012⋯nn⋯210, n⋯210012⋯n, or n⋯210n⋯210). For k=1 the bound is shown to be tight by an explicit construction of a word with exactly n^6 repeats that avoids all seven patterns.
Significance. The result supplies a quantitative bound on unavoidable order-isomorphic patterns in words once the number of repeats exceeds a polynomial threshold in n. The matching construction for k=1 is a concrete strength, as it demonstrates that the stated threshold is best possible in that case and supplies an explicit extremal example.
major comments (2)
- [§3] §3 (upper-bound proof): the structural decomposition that classifies words avoiding the seven patterns according to the order of first occurrences and the placement of subsequent repeats is load-bearing for the kn^6 claim. The argument must show that every possible placement of repeats either creates one of the seven patterns or is bounded by kn^6; if any configuration is omitted, the quantitative bound fails. Please supply an explicit enumeration of the cases or a diagram that makes the exhaustion visible.
- [§4] §4 (construction for k=1): the explicit word with n^6 repeats is asserted to avoid all seven patterns, but the verification that it contains no occurrence of, e.g., 012⋯n012⋯n or n⋯210n⋯210 is only sketched. A short table or inductive argument confirming that each forbidden pattern is blocked would strengthen the lower-bound claim.
minor comments (2)
- [§1] The seven target patterns are introduced in the abstract and §1 with abbreviated notation (e.g., 0011⋯nn). A single displayed definition block that writes each pattern in full, together with a brief explanation of what “order-isomorphic” means in this context, would improve readability.
- [§2] Notation for repeats (occurrences that are not first appearances) is used throughout but never given a formal symbol. Introducing a short-hand such as r(w) for the number of repeats would make the statements of the main theorems easier to parse.
Simulated Author's Rebuttal
We thank the referee for the careful reading, the positive assessment of the result, and the recommendation for minor revision. We address each major comment below and will revise the manuscript to improve the clarity of the arguments.
read point-by-point responses
-
Referee: [§3] §3 (upper-bound proof): the structural decomposition that classifies words avoiding the seven patterns according to the order of first occurrences and the placement of subsequent repeats is load-bearing for the kn^6 claim. The argument must show that every possible placement of repeats either creates one of the seven patterns or is bounded by kn^6; if any configuration is omitted, the quantitative bound fails. Please supply an explicit enumeration of the cases or a diagram that makes the exhaustion visible.
Authors: We agree that the structural decomposition in Section 3 would benefit from greater explicitness to make the exhaustion of cases fully transparent. In the revised version we will insert a diagram that depicts the possible orderings of first occurrences together with the admissible placements of subsequent repeats. We will also add an enumerated list of the resulting configurations, showing in each case either that one of the seven patterns is forced or that the total number of repeats is at most kn^6. This revision directly addresses the concern that an omitted configuration could invalidate the quantitative bound. revision: yes
-
Referee: [§4] §4 (construction for k=1): the explicit word with n^6 repeats is asserted to avoid all seven patterns, but the verification that it contains no occurrence of, e.g., 012⋯n012⋯n or n⋯210n⋯210 is only sketched. A short table or inductive argument confirming that each forbidden pattern is blocked would strengthen the lower-bound claim.
Authors: We thank the referee for this concrete suggestion. In the revision we will augment Section 4 with a short table that records, for each of the seven patterns, the structural reason it is absent from the constructed word. For the two more involved patterns 012⋯n012⋯n and n⋯210n⋯210 we will supply a brief inductive argument on n that confirms avoidance. These additions will make the verification of the extremal example fully rigorous while preserving the explicit nature of the construction. revision: yes
Circularity Check
Direct combinatorial proof and explicit construction are self-contained
full rationale
The paper establishes the upper bound via structural decomposition of words avoiding the seven listed patterns, using the sequence of first occurrences and relative placements of repeats to bound their total number by kn^6. This is paired with an explicit construction achieving n^6 repeats for k=1 without any of the patterns. No step reduces by definition to its own output, no parameters are fitted to data and then relabeled as predictions, and no load-bearing premise rests on a self-citation chain. The argument is a standard case-analysis proof in extremal combinatorics on words and remains independent of its own conclusions.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math A pattern occurrence is a (not necessarily consecutive) subword order-isomorphic to the template sequence.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinctionreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Every word with kn⁶+1 repeats contains one of the following patterns: 0^{k+2}, 0011⋯nn, nn⋯1100, 012⋯n012⋯n, …
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.