pith. machine review for the scientific record. sign in

arxiv: 2604.07069 · v1 · submitted 2026-04-08 · 📡 eess.SY · cs.LG· cs.SY· math.DS

Recognition: 2 theorem links

· Lean Theorem

Controller Design for Structured State-space Models via Contraction Theory

Authors on Pith no claims yet

Pith reviewed 2026-05-10 18:08 UTC · model grok-4.3

classification 📡 eess.SY cs.LGcs.SYmath.DS
keywords structured state-space modelscontraction theorycontrollabilityobservabilitylinear matrix inequalitiesseparation principleoutput feedback controlnonlinear systems
0
0 comments X

The pith

Structured state-space models permit the first controllability and observability analysis, enabling LMI-based contraction controllers and a separation principle for output feedback.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops an indirect data-driven approach to output feedback controller design for nonlinear systems by treating structured state-space models as surrogate models. It supplies the initial controllability and observability conditions for these models. Those conditions produce scalable linear-matrix-inequality designs that rely on contraction theory. A separation principle is also derived, so that observers and state-feedback controllers can be synthesized independently while the closed-loop system remains exponentially stable.

Core claim

The authors establish controllability and observability conditions for structured state-space models. These conditions lead to linear matrix inequality formulations for designing contracting controllers. They further prove a separation principle that permits independent synthesis of observers and state-feedback laws without compromising the exponential stability of the closed-loop system.

What carries the argument

Structured state-space models analyzed for controllability and observability using contraction theory to derive LMI-based control conditions.

If this is right

  • Scalable control design becomes possible for long-sequence dynamical systems via LMIs.
  • Observer and controller can be designed separately while preserving closed-loop exponential stability.
  • The method supports data-driven identification of nonlinear systems followed by controller synthesis.
  • Exponential stability guarantees are obtained for the surrogate model under the derived conditions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar controllability and separation analysis might extend to other sequence models that maintain linear complexity in sequence length.
  • If the surrogate matches the plant dynamics sufficiently well, the framework could support real-time nonlinear control on embedded hardware.
  • Robustness extensions could incorporate bounded disturbances or model mismatch directly into the LMI conditions.

Load-bearing premise

The structured state-space model must serve as an accurate surrogate for the underlying nonlinear system so that its controllability, observability, and contraction properties transfer to guarantee stability on the real plant.

What would settle it

A counterexample in which an SSM meets the derived LMI conditions and separation principle yet the corresponding real nonlinear system under the designed output-feedback controller fails to exhibit exponential stability would falsify the transfer of guarantees.

Figures

Figures reproduced from arXiv: 2604.07069 by Alireza Karimi, Efe C. Balta, Giancarlo Ferrari-Trecate, Muhammad Zakwan, Vaibhav Gupta.

Figure 1
Figure 1. Figure 1: Typical SSM layer The recurrent unit is typically a discrete-time state-space model. In this paper, we focus on an LRU [10], defined using the following discrete-time state-space equations: xk+1 = Axk + Bu¯k (1a) y¯k = Cxk + Du¯k (1b) where x ∈ R nx , u¯ ∈ R nu¯ , and y¯ ∈ R ny¯ denote the state, input, and output, respectively.2 The matrices A ∈ R nx×nx , B ∈ R nx×nu¯ , C ∈ R ny¯×nx , and D ∈ R ny¯×nu¯ ar… view at source ↗
Figure 2
Figure 2. Figure 2: A sample validation trajectory after the training [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Controller response on the nonlinear mathematical [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
read the original abstract

This paper presents an indirect data-driven output feedback controller synthesis for nonlinear systems, leveraging Structured State-space Models (SSMs) as surrogate models. SSMs have emerged as a compelling alternative in modelling time-series data and dynamical systems. They can capture long-term dependencies while maintaining linear computational complexity with respect to the sequence length, in comparison to the quadratic complexity of Transformer-based architectures. The contributions of this work are threefold. We provide the first analysis of controllability and observability of SSMs, which leads to scalable control design via Linear Matrix Inequalities (LMIs) that leverage contraction theory. Moreover, a separation principle for SSMs is established, enabling the independent design of observers and state-feedback controllers while preserving the exponential stability of the closed-loop system. The effectiveness of the proposed framework is demonstrated through a numerical example, showcasing nonlinear system identification and the synthesis of an output feedback controller.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes an indirect data-driven output-feedback controller synthesis method for nonlinear systems that uses Structured State-space Models (SSMs) as surrogate models. It claims to deliver the first controllability and observability analysis for SSMs, which enables scalable LMI-based state-feedback and observer design via contraction theory, establishes a separation principle that preserves exponential stability, and demonstrates the approach on a numerical nonlinear system identification and control example.

Significance. If the stability transfer from the SSM surrogate to the true nonlinear plant can be made rigorous, the work would offer a computationally attractive bridge between efficient sequence modeling and contraction-theoretic control, potentially enabling scalable output-feedback design for systems where direct nonlinear analysis is intractable. The separation principle and LMI formulation are potentially valuable contributions if the underlying controllability/observability results hold.

major comments (2)
  1. [Introduction and closed-loop stability section] The central claim that the synthesized controller stabilizes the original nonlinear plant (Introduction and Section on closed-loop stability) rests on the SSM serving as a faithful surrogate, yet no incremental stability margin, approximation-error bound, or robustness result is supplied to guarantee that the contraction metric and LMI-derived gains remain valid under realistic identification residuals. The numerical example shows only one successful case without quantifying the required model accuracy.
  2. [Controllability and observability analysis section] § on controllability/observability analysis: the claimed 'first analysis' and resulting LMI conditions for contraction are presented without an explicit statement of the precise SSM state-space realization used (e.g., the structured matrices A, B, C and their dependence on the learned parameters), making it impossible to verify whether the controllability rank conditions or the LMI feasibility indeed imply the stated exponential stability for the surrogate.
minor comments (2)
  1. [Abstract and Introduction] The abstract and introduction use 'scalable' and 'parameter-free' without quantifying computational complexity relative to sequence length or comparing against standard LMI solvers for the specific SSM dimension.
  2. [Preliminaries and main results] Notation for the contraction metric and the LMI variables is introduced without a dedicated nomenclature table or consistent cross-referencing to the SSM equations.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We are grateful to the referee for their thorough review and valuable suggestions. We address each of the major comments in detail below and indicate the revisions we plan to make to the manuscript.

read point-by-point responses
  1. Referee: [Introduction and closed-loop stability section] The central claim that the synthesized controller stabilizes the original nonlinear plant (Introduction and Section on closed-loop stability) rests on the SSM serving as a faithful surrogate, yet no incremental stability margin, approximation-error bound, or robustness result is supplied to guarantee that the contraction metric and LMI-derived gains remain valid under realistic identification residuals. The numerical example shows only one successful case without quantifying the required model accuracy.

    Authors: We thank the referee for highlighting this important aspect. Our manuscript focuses on providing controllability and observability analysis for SSMs to facilitate scalable LMI-based design using contraction theory, and establishing a separation principle. The exponential stability is rigorously shown for the closed-loop system under the SSM dynamics. Regarding the transfer of stability to the true nonlinear plant, the paper assumes that the SSM serves as an accurate surrogate model obtained via system identification, which is a standard assumption in indirect data-driven control approaches. The numerical example illustrates the practical application but does not include a comprehensive sensitivity analysis. We agree that adding a discussion on potential robustness margins and the limitations due to identification errors would be beneficial. In the revised manuscript, we will include a new subsection or paragraph in the closed-loop stability section discussing the assumptions on model fidelity and suggesting future directions for incorporating approximation error bounds. revision: partial

  2. Referee: [Controllability and observability analysis section] § on controllability/observability analysis: the claimed 'first analysis' and resulting LMI conditions for contraction are presented without an explicit statement of the precise SSM state-space realization used (e.g., the structured matrices A, B, C and their dependence on the learned parameters), making it impossible to verify whether the controllability rank conditions or the LMI feasibility indeed imply the stated exponential stability for the surrogate.

    Authors: We apologize for the lack of clarity in this section. The SSM considered in our analysis follows the standard formulation used in structured state-space models for time-series modeling, typically involving a state matrix A that is structured (e.g., diagonal with learned parameters) and input/output matrices B and C that may also have specific structures depending on the parameterization. We will revise the controllability and observability analysis section to explicitly define the state-space realization, including the forms of A, B, and C in terms of the model parameters, and show how the rank conditions and LMI feasibility lead to the contraction properties and exponential stability. revision: yes

Circularity Check

0 steps flagged

No circularity: original controllability/observability analysis and separation principle for SSMs

full rationale

The paper derives controllability and observability properties for Structured State-space Models from first principles, then applies contraction theory to obtain LMI-based controller synthesis and establishes a separation principle for output-feedback design. These steps are presented as novel contributions without any reduction to self-definitional inputs, fitted parameters renamed as predictions, or load-bearing self-citations. The abstract and described contributions show independent mathematical analysis that does not tautologically reproduce its assumptions or data. The numerical example serves only for illustration, not as the source of the core claims. The derivation chain remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

Limited information available from abstract only. The framework rests on standard properties of contraction theory for exponential stability and the assumption that SSMs can serve as faithful surrogate models for nonlinear dynamics. No explicit free parameters or invented entities are described.

axioms (2)
  • domain assumption Contraction theory supplies sufficient conditions for exponential stability of nonlinear systems that can be encoded as LMIs.
    The paper uses contraction theory to obtain LMI conditions for controller design, invoking its incremental stability properties without deriving them.
  • domain assumption The SSM surrogate preserves the controllability, observability, and contraction properties of the true nonlinear system.
    The control design is performed on the SSM and claimed to work for the original system; this transfer is not justified in the abstract.

pith-pipeline@v0.9.0 · 5473 in / 1412 out tokens · 54524 ms · 2026-05-10T18:08:18.818882+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. A Nonlinear Separation Principle via Contraction Theory: Applications to Neural Networks, Control, and Learning

    eess.SY 2026-04 unverdicted novelty 5.0

    A contraction-theory separation principle yields global exponential stability for controller-observer pairs and sharp LMI certificates for contractive RNNs, enabling stable output tracking and implicit neural network design.

Reference graph

Works this paper leans on

43 extracted references · 4 canonical work pages · cited by 1 Pith paper · 1 internal anchor

  1. [1]

    Nonlinear System Identification: A User-Oriented Road Map,

    J. Schoukens and L. Ljung, “Nonlinear System Identification: A User-Oriented Road Map,”IEEE Control Systems Magazine, vol. 39, pp. 28–99, Dec. 2019

  2. [2]

    S. A. Billings,Nonlinear system identification: NARMAX methods in the time, frequency, and spatio-temporal domains. John Wiley & Sons, 2013

  3. [3]

    Ker- nel methods in system identification, machine learning and function estimation: A survey,

    G. Pillonetto, F. Dinuzzo, T. Chen, G. De Nicolao, and L. Ljung, “Ker- nel methods in system identification, machine learning and function estimation: A survey,”Automatica, vol. 50, no. 3, pp. 657–682, 2014

  4. [4]

    System identification of nonlinear state-space models,

    T. B. Sch ¨on, A. Wills, and B. Ninness, “System identification of nonlinear state-space models,”Automatica, vol. 47, no. 1, pp. 39–49, 2011

  5. [5]

    S. L. Brunton and J. N. Kutz,Data-driven science and engineering: Machine learning, dynamical systems, and control. Cambridge Uni- versity Press, 2022

  6. [6]

    System identification: A machine learn- ing perspective,

    A. Chiuso and G. Pillonetto, “System identification: A machine learn- ing perspective,”Annual Review of Control, Robotics, and Autonomous Systems, vol. 2, no. 1, pp. 281–304, 2019

  7. [7]

    Mamba: Linear-time sequence modeling with selective state spaces,

    A. Gu and T. Dao, “Mamba: Linear-time sequence modeling with selective state spaces,” inFirst Conference on Language Modeling, 2024

  8. [8]

    State space models as foundation models: A control theoretic overview,

    C. A. Alonso, J. Sieber, and M. N. Zeilinger, “State space models as foundation models: A control theoretic overview,” in2025 American Control Conference (ACC), pp. 146–153, IEEE, 2025

  9. [9]

    Structured state-space models are deep wiener models,

    F. Bonassi, C. Andersson, P. Mattsson, and T. B. Sch ¨on, “Structured state-space models are deep wiener models,”IFAC-PapersOnLine, vol. 58, no. 15, pp. 247–252, 2024

  10. [10]

    Resurrecting recurrent neural networks for long sequences,

    A. Orvieto, S. L. Smith, A. Gu, A. Fernando, C. Gulcehre, R. Pascanu, and S. De, “Resurrecting recurrent neural networks for long sequences,” inInternational Conference on Machine Learning, pp. 26670–26698, PMLR, 2023

  11. [11]

    On contraction analysis for non- linear systems,

    W. Lohmiller and J.-J. E. Slotine, “On contraction analysis for non- linear systems,”Automatica, vol. 34, no. 6, pp. 683–696, 1998

  12. [12]

    Bullo,Contraction Theory for Dynamical Systems

    F. Bullo,Contraction Theory for Dynamical Systems. Kindle Direct Publishing, 1.2 ed., 2024

  13. [13]

    Control contraction metrics: Convex and intrinsic criteria for nonlinear feedback design,

    I. R. Manchester and J.-J. E. Slotine, “Control contraction metrics: Convex and intrinsic criteria for nonlinear feedback design,”IEEE Transactions on Automatic Control, vol. 62, no. 6, pp. 3046–3053, 2017

  14. [14]

    Regional sta- bility conditions for recurrent neural network-based control systems,

    A. La Bella, M. Farina, W. D’Amico, and L. Zaccarian, “Regional sta- bility conditions for recurrent neural network-based control systems,” arXiv preprint arXiv:2409.15792, 2024

  15. [15]

    LMI-Based design of a robust model predictive controller for a class of recurrent neural networks with guaranteed properties,

    D. Ravasio, M. Farina, and A. Ballarino, “LMI-Based design of a robust model predictive controller for a class of recurrent neural networks with guaranteed properties,”IEEE Control Systems Letters, 2024

  16. [16]

    On the stability properties of gated recurrent units neural networks,

    F. Bonassi, M. Farina, and R. Scattolini, “On the stability properties of gated recurrent units neural networks,”Systems & Control Letters, vol. 157, p. 105049, 2021

  17. [17]

    Simba: System identification methods leveraging backpropagation,

    L. Di Natale, M. Zakwan, P. Heer, G. Ferrari-Trecate, and C. N. Jones, “Simba: System identification methods leveraging backpropagation,” IEEE Transactions on Control Systems Technology, 2024

  18. [18]

    Stable linear subspace identification: A machine learning approach,

    L. Di Natale, M. Zakwan, B. Svetozarevic, P. Heer, G. Ferrari-Trecate, and C. N. Jones, “Stable linear subspace identification: A machine learning approach,” in2024 European Control Conference (ECC), pp. 3539–3544, IEEE, 2024

  19. [19]

    dynoNet: A neural network architecture for learning dynamical systems,

    M. Forgione and D. Piga, “dynoNet: A neural network architecture for learning dynamical systems,”International Journal of Adaptive Control and Signal Processing, vol. 35, no. 4, pp. 612–626, 2021

  20. [20]

    Distributed neural network control with dependability guarantees: a compositional port-hamiltonian approach,

    L. Furieri, C. L. Galimberti, M. Zakwan, and G. Ferrari-Trecate, “Distributed neural network control with dependability guarantees: a compositional port-hamiltonian approach,” inlearning for dynamics and control conference, pp. 571–583, PMLR, 2022

  21. [21]

    Neural distributed controllers with port-hamiltonian structures,

    M. Zakwan and G. Ferrari-Trecate, “Neural distributed controllers with port-hamiltonian structures,” in2024 IEEE 63rd Conference on Decision and Control (CDC), pp. 8633–8638, IEEE, 2024

  22. [22]

    Neural port-hamiltonian models for nonlinear distributed control: An unconstrained parametrization approach,

    M. Zakwan and G. Ferrari-Trecate, “Neural port-hamiltonian models for nonlinear distributed control: An unconstrained parametrization approach,”arXiv preprint arXiv:2411.10096, 2024

  23. [23]

    Neu- ral ordinary differential equations,

    R. T. Chen, Y . Rubanova, J. Bettencourt, and D. K. Duvenaud, “Neu- ral ordinary differential equations,”Advances in neural information processing systems, vol. 31, 2018

  24. [24]

    Neural system level synthesis: Learning over all stabilizing policies for nonlinear systems,

    L. Furieri, C. L. Galimberti, and G. Ferrari-Trecate, “Neural system level synthesis: Learning over all stabilizing policies for nonlinear systems,” in2022 IEEE 61st Conference on Decision and Control (CDC), pp. 2765–2770, IEEE, 2022

  25. [25]

    Learning to Boost the Performance of Stable Nonlinear Systems,

    L. Furieri, C. L. Galimberti, and G. Ferrari-Trecate, “Learning to Boost the Performance of Stable Nonlinear Systems,”IEEE Open Journal of Control Systems, vol. 3, pp. 342–357, 2024

  26. [26]

    Neural exponential stabilization of control-affine nonlinear systems,

    M. Zakwan, L. Xu, and G. Ferrari-Trecate, “Neural exponential stabilization of control-affine nonlinear systems,” in2024 IEEE 63rd Conference on Decision and Control (CDC), pp. 8602–8607, IEEE, 2024

  27. [27]

    Mamba: Linear-Time Sequence Modeling with Selective State Spaces

    A. Gu and T. Dao, “Mamba: Linear-Time Sequence Modeling with Selective State Spaces,”arXiv preprint arXiv:2312.00752, 2024

  28. [28]

    Prefix Sums and Their Applications,

    G. E. Blelloch, “Prefix Sums and Their Applications,” Tech. Rep. CMU-CS-90-190, School of Computer Science, Carnegie Mellon University, Nov. 1990

  29. [29]

    Efficient and Accurate Estimation of Lipschitz Constants for Deep Neural Networks,

    M. Fazlyab, A. Robey, H. Hassani, M. Morari, and G. Pappas, “Efficient and Accurate Estimation of Lipschitz Constants for Deep Neural Networks,” inAdvances in Neural Information Processing Systems, vol. 32, Curran Associates, Inc., 2019

  30. [30]

    A Unified Algebraic Perspective on Lipschitz Neural Networks,

    A. Araujo, A. J. Havens, B. Delattre, A. Allauzen, and B. Hu, “A Unified Algebraic Perspective on Lipschitz Neural Networks,” inThe Eleventh International Conference on Learning Representations, Sept. 2022

  31. [31]

    Monotone, Bi- Lipschitz, and Polyak-Łojasiewicz networks,

    R. Wang, K. D. Dvijotham, and I. R. Manchester, “Monotone, Bi- Lipschitz, and Polyak-Łojasiewicz networks,” inProceedings of the 41st International Conference on Machine Learning, vol. 235 of ICML’24, (Vienna, Austria), pp. 50379–50399, JMLR.org, July 2024

  32. [32]

    Contraction theory for nonlinear stability analysis and learning-based control: A tutorial overview,

    H. Tsukamoto, S.-J. Chung, and J.-J. E. Slotine, “Contraction theory for nonlinear stability analysis and learning-based control: A tutorial overview,”Annual Reviews in Control, vol. 52, pp. 135–169, 2021

  33. [33]

    Local controllability does imply global controllability,

    U. Boscain, D. Cannarsa, V . Franceschi, and M. Sigalotti, “Local controllability does imply global controllability,”Comptes Rendus. Math´ematique, vol. 361, no. G11, pp. 1813–1822, 2023

  34. [34]

    Computing optimal uncertainty models from frequency domain data,

    H. Hindi, C.-Y . Seong, and S. Boyd, “Computing optimal uncertainty models from frequency domain data,” inProceedings of the 41st IEEE Conference on Decision and Control, vol. 3, pp. 2898–2905, Dec. 2002

  35. [35]

    J. P. Hespanha,Linear systems theory. Princeton university press, 2018

  36. [36]

    Control contraction metrics, robust control and observer duality,

    I. R. Manchester and J.-J. E. Slotine, “Control contraction metrics, robust control and observer duality,”arXiv preprint arXiv:1403.5364, 2014

  37. [37]

    Contracting nonlinear observers: Convex opti- mization and learning from data,

    I. R. Manchester, “Contracting nonlinear observers: Convex opti- mization and learning from data,” in2018 annual American control conference (ACC), pp. 1873–1880, IEEE, 2018

  38. [38]

    Output-feedback control of nonlinear systems using control contraction metrics and convex opti- mization,

    I. R. Manchester and J.-J. E. Slotine, “Output-feedback control of nonlinear systems using control contraction metrics and convex opti- mization,” in2014 4th Australian control conference (AUCC), pp. 215– 220, IEEE, 2014

  39. [39]

    A separation principle for the stabilization of a class of nonlinear systems,

    A. N. Atassi and H. K. Khalil, “A separation principle for the stabilization of a class of nonlinear systems,”IEEE Transactions on Automatic Control, vol. 44, no. 9, pp. 1672–1687, 2002

  40. [40]

    Sepa- ration principle for a class of nonlinear feedback systems augmented with observers,

    A. Shiriaev, R. Johansson, A. Robertsson, and L. Freidovich, “Sepa- ration principle for a class of nonlinear feedback systems augmented with observers,”IFAC Proceedings Volumes, vol. 41, no. 2, pp. 6196– 6201, 2008

  41. [41]

    A separation principle for nonlinear systems,

    R. Mart ´ınez-Guerra and C. D. Cruz-Ancona, “A separation principle for nonlinear systems,” inAlgorithms of Estimation for Nonlinear Sys- tems: A Differential and Algebraic Viewpoint, pp. 105–157, Springer, 2017

  42. [42]

    Nonlinear modeling and identification of a dc motor for bidirectional operation with real time experiments,

    T. Kara and I. Eker, “Nonlinear modeling and identification of a dc motor for bidirectional operation with real time experiments,”Energy Conversion and Management, vol. 45, no. 7-8, pp. 1087–1106, 2004

  43. [43]

    The python control systems library (python-control),

    S. Fuller, B. Greiner, J. Moore, R. Murray, R. van Paassen, and R. Yorke, “The python control systems library (python-control),” in 60th IEEE Conference on Decision and Control (CDC), pp. 4875– 4881, IEEE, 2021