Early-Stabilizing Counting

Christoph Lenzen; Julian Loss

arxiv: 2605.18171 · v1 · pith:MP6KZUIYnew · submitted 2026-05-18 · 💻 cs.DC

Early-Stabilizing Counting

Christoph Lenzen , Julian Loss This is my paper

Pith reviewed 2026-05-20 00:34 UTC · model grok-4.3

classification 💻 cs.DC

keywords self-stabilizing countingByzantine faultsearly stabilizationsynchronous distributed systemsconsensus reductionadaptive complexity

0 comments

The pith

Self-stabilizing counting reaches agreement on a round counter in O(f+1) rounds when there are f actual Byzantine faults.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces early stabilization for the synchronous counting problem in systems with Byzantine faults. Instead of stabilizing in time depending on the maximum number t of faults, the algorithm stabilizes in time depending on the actual number f of persistent faults in the execution. This is achieved by developing modular building blocks inspired by early-stopping consensus protocols and composing them with an overhead-free reduction from consensus. A sympathetic reader would care because this adaptivity saves rounds and communication when the number of faults is smaller than the worst case, which is common in practice. The result shows that all known lower bounds for consensus carry over to counting, but also provides an efficient construction for counting.

Core claim

Taking inspiration from early-stopping consensus protocols, the authors introduce the concept of early stabilization for counting. If there are 0 ≤ f ≤ t persistent faults in an execution, the algorithm should stabilize in a number of rounds that depends on f only. By developing a number of modular building blocks suitable to these goals, they develop a C-counting algorithm that stabilizes within asymptotically optimal O(f+1) rounds, has message size O(log² n + log C), and has amortized bit complexity O(n(f log C + log² n)).

What carries the argument

Modular building blocks for early stabilization that compose with the overhead-free reduction from consensus to counting, preserving the adaptive O(f+1) bound.

If this is right

If there are no persistent faults, the algorithm stabilizes in O(1) rounds.
The message size remains O(log² n + log C) regardless of f.
The amortized bit complexity is O(n(f log C + log² n)), which is adaptive to f.
All lower bounds and impossibilities for consensus apply directly to this counting problem.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This approach could be extended to other self-stabilizing tasks beyond counting by applying similar early-stabilization techniques.
Practical distributed systems with variable fault rates might see significant performance gains from adaptive stabilization times.
Further research could investigate whether these building blocks can reduce overhead in asynchronous settings or with different fault models.

Load-bearing premise

The modular building blocks for early stabilization correctly compose with the overhead-free reduction from consensus while preserving the adaptive O(f+1) bound and the stated bit complexity.

What would settle it

An execution trace with a specific number of persistent faults f where the non-faulty nodes fail to agree on the common round counter within O(f+1) rounds after transient faults cease would falsify the claim.

read the original abstract

Synchronous Counting is the task of reaching agreement on a common round counter in a synchronous system of $n$ nodes with up to $t$ Byzantine faults in a self-stabilizing manner. That is, after transient faults may have arbitrarily corrupted the system state and ceased, the at least $n-t$ non-faulty nodes need to (re-)establish that (i) their local outputs are identical and (ii) increase by $1$ modulo $C$ in each round. An overhead-free reduction from consensus shows that all known lower bounds and impossibilities for consensus carry over to the counting problem. In the other direction, prior work has established that a consensus algorithm $\mathcal{A}$ can be turned into a counting algorithm at small overhead relative to the running time and bit complexity of $\mathcal{A}$, without losing resilience. Taking inspiration from early-stopping consensus protocols, in this work we introduce the concept of early stabilization. That is, if there are $0\le f\le t$ (persistent) faults in an execution, the algorithm should stabilize in a number of rounds that depends on $f$ only. Likewise, we seek to achieve an amortized bit complexity that is adaptive in the number of actual faults $f$. By developing a number of modular building blocks suitable to these goals, we develop a $C$-counting algorithm that stabilizes within asymptotically optimal $O(f+1)$ rounds, has message size $O(\log^2 n + \log C)$, and has amortized bit complexity $O(n(f\log C +\log^2 n))$.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Lenzen and Loss show how to get O(f+1) stabilization for synchronous counting by adding early-stabilizing modular blocks to the existing overhead-free consensus reduction.

read the letter

The main point is that this paper makes self-stabilizing counting adaptive to the actual number of faults. Instead of always paying for the worst-case t, their algorithm stabilizes in O(f+1) rounds when only f faults occur. They also keep message size at O(log² n + log C) and amortized bit complexity at O(n(f log C + log² n)). That is the concrete advance over prior reductions that stayed non-adaptive in time and cost. They achieve it by building several modular pieces that support early stopping and then feeding them through the known consensus-to-counting reduction without breaking the adaptivity. The construction stays reasonably clean and the bounds are presented as asymptotically tight, which matches what one would hope for. The citation pattern is straightforward and does not rely on self-referential fitting. The soft spot is narrow but worth checking: the reduction must not introduce any fixed overhead or state change that forces the system to wait out t even when f is small. The abstract claims the composition works, so the paper presumably contains the lemmas that track the f-dependent termination through the transformation. If those lemmas are tight, the concern is minor. If they leave a small gap, it would only affect the exact constant factors rather than the overall claim. This is aimed at people who work on self-stabilizing and fault-tolerant distributed protocols. Readers who care about recovering quickly when faults are sporadic rather than constant will find the adaptive bounds useful. The technical detail is sufficient that the paper deserves a full referee rather than a desk rejection. I would send it to peer review with a request to verify the composition lemmas in detail.

Referee Report

2 major / 2 minor

Summary. The paper introduces early stabilization for synchronous self-stabilizing C-counting in the presence of up to t Byzantine faults. Building on an overhead-free reduction from consensus and new modular building blocks inspired by early-stopping consensus, it claims a counting algorithm that stabilizes in asymptotically optimal O(f+1) rounds (f actual faults), with message size O(log² n + log C) and amortized bit complexity O(n(f log C + log² n)).

Significance. If the composition and bounds hold, the result is significant: it transfers early-stopping optimality from consensus to counting while preserving adaptivity in both rounds and amortized communication, using modular blocks that could be reusable. The overhead-free reduction and explicit adaptive bit complexity are strengths that strengthen the contribution beyond non-adaptive self-stabilizing counting.

major comments (2)

[building blocks and prior reduction paragraph] The central claim of O(f+1) stabilization after composition (abstract and building-blocks paragraph) rests on the modular early-stabilizing blocks preserving their f-dependent termination under the state transformations of the overhead-free consensus reduction. No explicit lemma or equation is visible showing that the reduction introduces no fixed additive round overhead independent of f; if even a constant additive term appears, the adaptive guarantee reduces to the non-adaptive O(t+1) bound.
[main theorem / complexity analysis] The amortized bit-complexity bound O(n(f log C + log² n)) (abstract) must be shown to survive the composition without inflation from the modular blocks' internal messages. The proof sketch should contain an equation or invariant that bounds the per-round message volume after the reduction is applied, rather than only arguing it for the blocks in isolation.

minor comments (2)

[abstract and introduction] Notation for the counter modulus C and the distinction between persistent faults f and total bound t should be introduced earlier and used consistently in the complexity statements.
[related work] A short table comparing the new adaptive bounds against prior non-adaptive counting results would improve readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful and constructive review of our manuscript. The comments correctly identify areas where the composition arguments and complexity analysis can be made more explicit. We respond to each major comment below and will revise the manuscript to incorporate the suggested clarifications.

read point-by-point responses

Referee: [building blocks and prior reduction paragraph] The central claim of O(f+1) stabilization after composition (abstract and building-blocks paragraph) rests on the modular early-stabilizing blocks preserving their f-dependent termination under the state transformations of the overhead-free consensus reduction. No explicit lemma or equation is visible showing that the reduction introduces no fixed additive round overhead independent of f; if even a constant additive term appears, the adaptive guarantee reduces to the non-adaptive O(t+1) bound.

Authors: The overhead-free reduction is constructed precisely so that the f-dependent early stabilization of the modular blocks is preserved under the state transformations, with no additive round overhead independent of f. The reduction invokes the consensus instances in a manner that directly transfers the early-stopping behavior. We nevertheless agree that the absence of an explicit lemma makes this preservation less transparent than it should be. In the revised version we will insert a dedicated lemma (with a short proof) establishing that the composed stabilization time remains O(f+1) with no fixed additive term. revision: yes
Referee: [main theorem / complexity analysis] The amortized bit-complexity bound O(n(f log C + log² n)) (abstract) must be shown to survive the composition without inflation from the modular blocks' internal messages. The proof sketch should contain an equation or invariant that bounds the per-round message volume after the reduction is applied, rather than only arguing it for the blocks in isolation.

Authors: We accept that the current proof sketch analyzes the blocks in isolation and does not yet supply an explicit invariant for the message volume after the reduction is applied. Because the reduction itself is communication-overhead-free, the per-round volume bound carries over, but this needs to be stated formally. We will augment the complexity analysis section with an invariant that bounds the total bits sent per round in the reduced system, confirming that the amortized bound O(n(f log C + log² n)) is preserved under composition. revision: yes

Circularity Check

0 steps flagged

Derivation is self-contained via new modular building blocks and established prior reduction

full rationale

The paper introduces new modular building blocks designed to achieve early stabilization in a number of rounds depending only on the actual number of faults f, then composes them with an overhead-free reduction from consensus established in prior work. The claimed O(f+1) stabilization time, O(log² n + log C) message size, and O(n(f log C + log² n)) amortized bit complexity follow from these components rather than being equivalent to inputs by construction, fitted parameters, or load-bearing self-citations. No equations or steps in the provided description reduce the target result to a renaming, self-definition, or unverified self-citation chain; the derivation introduces independent content for the adaptive guarantees.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The result rests on the standard synchronous message-passing model with up to t Byzantine faults and the correctness of an existing overhead-free consensus-to-counting reduction; no new free parameters or invented entities are introduced in the abstract.

axioms (2)

domain assumption Synchronous rounds with reliable message delivery in each round among non-faulty nodes
Standard model assumption invoked for the counting task definition and stabilization guarantee.
domain assumption Existence of an overhead-free reduction from consensus to counting
Cited prior result used to transfer lower bounds and to build the new algorithm.

pith-pipeline@v0.9.0 · 5806 in / 1265 out tokens · 40988 ms · 2026-05-20T00:34:10.595084+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Theorem 1.1. There is a C-counting algorithm with stabilization time O(f+1), message size O(log² n + log C), and bit complexity of O(n(f log C + log² n)) amortized over n rounds.
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We develop a C-counting algorithm that stabilizes within asymptotically optimal O(f+1) rounds

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

20 extracted references · 20 canonical work pages

[1]

Hubert Chan, Danny Dolev, Kartik Nayak, Rafael Pass, Ling Ren, and Elaine Shi

Ittai Abraham, T.-H. Hubert Chan, Danny Dolev, Kartik Nayak, Rafael Pass, Ling Ren, and Elaine Shi. Communication Complexity of Byzantine Agreement, Revisited. In Peter Robinson and Faith Ellen, editors,Symposium on Principles of Distributed Computing (PODC), pages 317–326. ACM, 2019

work page 2019
[2]

Awerbuch and G

B. Awerbuch and G. Varghese. Distributed Program Checking: a Paradigm for Building Self-stabilizing Distributed Protocols. InSymposium of Foundations of Computer Science (FOCS), pages 258–267, 1991

work page 1991
[3]

Michael Ben-Or, Danny Dolev, and Ezra N. Hoch. Fast self-stabilizing byzantine tolerant digital clock synchronization. In Rida A. Bazzi and Boaz Patt-Shamir, editors,Symposium on Principles of Distributed Computing (PODC), pages 385–394. ACM, 2008

work page 2008
[4]

Berman, J.A

P. Berman, J.A. Garay, and K.J. Perry. Towards Optimal Distributed Consensus. InSymposium on Foundations of Computer Science (FOCS), pages 410–415, 1989

work page 1989
[5]

Benny Chor, Michael Merritt, and David B. Shmoys. Simple constant-time consensus protocols in realistic failure models.J. ACM, 36(3):591–614, July 1989

work page 1989
[6]

Threshold Cryptosystems

Yvo Desmedt and Yair Frankel. Threshold Cryptosystems. In Gilles Brassard, editor,Advances in Cryptology (CRYPTO), volume 435, pages 307–315. Springer, 1989

work page 1989
[7]

The Byzantine Generals Strike Again.Journal of Algorithms, 3(1):14–30, 1982

Danny Dolev. The Byzantine Generals Strike Again.Journal of Algorithms, 3(1):14–30, 1982

work page 1982
[8]

Bounds on Information Exchange for Byzantine Agreement.J

Danny Dolev and Rüdiger Reischuk. Bounds on Information Exchange for Byzantine Agreement.J. ACM, 32(1):191–204, January 1985

work page 1985
[9]

Raymond Strong

Danny Dolev, Ruediger Reischuk, and H. Raymond Strong. ’Eventual’ is Earlier than ’Immediate’. InSymposium on Foundations of Computer Science (SFCS), pages 196–203, 1982

work page 1982
[10]

Raymond Strong

Danny Dolev, Ruediger Reischuk, and H. Raymond Strong. Early stopping in Byzantine agreement.J. ACM, 37(4):720– 741, 1990

work page 1990
[11]

Shlomi Dolev and Jennifer L. Welch. Self-stabilizing clock synchronization in the presence of Byzantine faults.J. ACM, 51(5):780–799, September 2004

work page 2004
[12]

An Optimal Probabilistic Protocol for Synchronous Byzantine Agreement.SIAM Journal on Computing, 26(4):873–933, 1997

Pesech Feldman and Silvio Micali. An Optimal Probabilistic Protocol for Synchronous Byzantine Agreement.SIAM Journal on Computing, 26(4):873–933, 1997

work page 1997
[13]

Fischer and Nancy A

Michael J. Fischer and Nancy A. Lynch. A Lower Bound for the Time to Assure Interactive Consistency.Information Processing Letters, 14(4):183–186, 1982

work page 1982
[14]

Explicit Constructions of Linear Size Superconcentrators

Ofer Gabber and Zvi Galil. Explicit Constructions of Linear Size Superconcentrators. InSymposium on Foundations of Computer Science (FOCS), pages 364–370. IEEE Computer Society, 1979

work page 1979
[15]

Self-Stabilizing Byzantine Clock Synchronization with Optimal Precision

Pankaj Khanchandani and Christoph Lenzen. Self-Stabilizing Byzantine Clock Synchronization with Optimal Precision. Theory Comput. Syst., 63(2):261–305, 2019

work page 2019
[16]

Efficient Construction of Global Time in SoCs Despite Arbitrary Faults

Christoph Lenzen, Matthias Függer, Markus Hofstatter, and Ulrich Schmid. Efficient Construction of Global Time in SoCs Despite Arbitrary Faults. InEuromicro Conference on Digital System Design (DSD), pages 142–151. IEEE Computer Society, 2013

work page 2013
[17]

Near-optimal self-stabilising counting and firing squads.Distributed Comput., 32(4):339–360, 2019

Christoph Lenzen and Joel Rybicki. Near-optimal self-stabilising counting and firing squads.Distributed Comput., 32(4):339–360, 2019. Early-Stabilizing Counting 31

work page 2019
[18]

Self-Stabilising Byzantine Clock Synchronisation Is Almost as Easy as Consensus

Christoph Lenzen and Joel Rybicki. Self-Stabilising Byzantine Clock Synchronisation Is Almost as Easy as Consensus. J. ACM, 66(5), August 2019

work page 2019
[19]

Efficient Counting with Optimal Resilience.SIAM J

Christoph Lenzen, Joel Rybicki, and Jukka Suomela. Efficient Counting with Optimal Resilience.SIAM J. Comput., 46(4):1473–1500, 2017

work page 2017
[20]

Pease, R

M. Pease, R. Shostak, and L. Lamport. Reaching Agreement in the Presence of Faults.J. ACM, 27(2):228–234, April 1980. A Results Implicit in Prior Work Lemma 2.3 (implicit in [ 17, 19]). (𝐶, 𝑋 , 𝑇)-Clock Filtering can be solved with stabilization time 𝑆=𝑋+2, where each node sends an𝑂(log|𝐶|)-sized message to each other node in each round. Proof.The claim t...

work page 1980

[1] [1]

Hubert Chan, Danny Dolev, Kartik Nayak, Rafael Pass, Ling Ren, and Elaine Shi

Ittai Abraham, T.-H. Hubert Chan, Danny Dolev, Kartik Nayak, Rafael Pass, Ling Ren, and Elaine Shi. Communication Complexity of Byzantine Agreement, Revisited. In Peter Robinson and Faith Ellen, editors,Symposium on Principles of Distributed Computing (PODC), pages 317–326. ACM, 2019

work page 2019

[2] [2]

Awerbuch and G

B. Awerbuch and G. Varghese. Distributed Program Checking: a Paradigm for Building Self-stabilizing Distributed Protocols. InSymposium of Foundations of Computer Science (FOCS), pages 258–267, 1991

work page 1991

[3] [3]

Michael Ben-Or, Danny Dolev, and Ezra N. Hoch. Fast self-stabilizing byzantine tolerant digital clock synchronization. In Rida A. Bazzi and Boaz Patt-Shamir, editors,Symposium on Principles of Distributed Computing (PODC), pages 385–394. ACM, 2008

work page 2008

[4] [4]

Berman, J.A

P. Berman, J.A. Garay, and K.J. Perry. Towards Optimal Distributed Consensus. InSymposium on Foundations of Computer Science (FOCS), pages 410–415, 1989

work page 1989

[5] [5]

Benny Chor, Michael Merritt, and David B. Shmoys. Simple constant-time consensus protocols in realistic failure models.J. ACM, 36(3):591–614, July 1989

work page 1989

[6] [6]

Threshold Cryptosystems

Yvo Desmedt and Yair Frankel. Threshold Cryptosystems. In Gilles Brassard, editor,Advances in Cryptology (CRYPTO), volume 435, pages 307–315. Springer, 1989

work page 1989

[7] [7]

The Byzantine Generals Strike Again.Journal of Algorithms, 3(1):14–30, 1982

Danny Dolev. The Byzantine Generals Strike Again.Journal of Algorithms, 3(1):14–30, 1982

work page 1982

[8] [8]

Bounds on Information Exchange for Byzantine Agreement.J

Danny Dolev and Rüdiger Reischuk. Bounds on Information Exchange for Byzantine Agreement.J. ACM, 32(1):191–204, January 1985

work page 1985

[9] [9]

Raymond Strong

Danny Dolev, Ruediger Reischuk, and H. Raymond Strong. ’Eventual’ is Earlier than ’Immediate’. InSymposium on Foundations of Computer Science (SFCS), pages 196–203, 1982

work page 1982

[10] [10]

Raymond Strong

Danny Dolev, Ruediger Reischuk, and H. Raymond Strong. Early stopping in Byzantine agreement.J. ACM, 37(4):720– 741, 1990

work page 1990

[11] [11]

Shlomi Dolev and Jennifer L. Welch. Self-stabilizing clock synchronization in the presence of Byzantine faults.J. ACM, 51(5):780–799, September 2004

work page 2004

[12] [12]

An Optimal Probabilistic Protocol for Synchronous Byzantine Agreement.SIAM Journal on Computing, 26(4):873–933, 1997

Pesech Feldman and Silvio Micali. An Optimal Probabilistic Protocol for Synchronous Byzantine Agreement.SIAM Journal on Computing, 26(4):873–933, 1997

work page 1997

[13] [13]

Fischer and Nancy A

Michael J. Fischer and Nancy A. Lynch. A Lower Bound for the Time to Assure Interactive Consistency.Information Processing Letters, 14(4):183–186, 1982

work page 1982

[14] [14]

Explicit Constructions of Linear Size Superconcentrators

Ofer Gabber and Zvi Galil. Explicit Constructions of Linear Size Superconcentrators. InSymposium on Foundations of Computer Science (FOCS), pages 364–370. IEEE Computer Society, 1979

work page 1979

[15] [15]

Self-Stabilizing Byzantine Clock Synchronization with Optimal Precision

Pankaj Khanchandani and Christoph Lenzen. Self-Stabilizing Byzantine Clock Synchronization with Optimal Precision. Theory Comput. Syst., 63(2):261–305, 2019

work page 2019

[16] [16]

Efficient Construction of Global Time in SoCs Despite Arbitrary Faults

Christoph Lenzen, Matthias Függer, Markus Hofstatter, and Ulrich Schmid. Efficient Construction of Global Time in SoCs Despite Arbitrary Faults. InEuromicro Conference on Digital System Design (DSD), pages 142–151. IEEE Computer Society, 2013

work page 2013

[17] [17]

Near-optimal self-stabilising counting and firing squads.Distributed Comput., 32(4):339–360, 2019

Christoph Lenzen and Joel Rybicki. Near-optimal self-stabilising counting and firing squads.Distributed Comput., 32(4):339–360, 2019. Early-Stabilizing Counting 31

work page 2019

[18] [18]

Self-Stabilising Byzantine Clock Synchronisation Is Almost as Easy as Consensus

Christoph Lenzen and Joel Rybicki. Self-Stabilising Byzantine Clock Synchronisation Is Almost as Easy as Consensus. J. ACM, 66(5), August 2019

work page 2019

[19] [19]

Efficient Counting with Optimal Resilience.SIAM J

Christoph Lenzen, Joel Rybicki, and Jukka Suomela. Efficient Counting with Optimal Resilience.SIAM J. Comput., 46(4):1473–1500, 2017

work page 2017

[20] [20]

Pease, R

M. Pease, R. Shostak, and L. Lamport. Reaching Agreement in the Presence of Faults.J. ACM, 27(2):228–234, April 1980. A Results Implicit in Prior Work Lemma 2.3 (implicit in [ 17, 19]). (𝐶, 𝑋 , 𝑇)-Clock Filtering can be solved with stabilization time 𝑆=𝑋+2, where each node sends an𝑂(log|𝐶|)-sized message to each other node in each round. Proof.The claim t...

work page 1980