pith. machine review for the scientific record. sign in

arxiv: 2604.16908 · v1 · submitted 2026-04-18 · 📡 eess.SY · cs.SY

Recognition: unknown

End-to-End ILC for Repetitive Untrackable Tasks: A Cooperative Game Perspective

Hongfeng Tao, Rodrigo A. Gonz\'alez, Tom Oomen, Wojciech Paszke, Zhihe Zhuang

Authors on Pith no claims yet

Pith reviewed 2026-05-10 07:12 UTC · model grok-4.3

classification 📡 eess.SY cs.SY
keywords iterative learning controlcooperative gameuntrackable tasksend-to-end ILCnorm-optimal ILCrepetitive controlclosed-loop systems
0
0 comments X

The pith

For untrackable repetitive tasks, jointly updating both reference and ILC feedforward input yields lower cost than standard norm-optimal ILC under a sufficient condition.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Standard iterative learning control assumes a perfect tracking input exists, yet many practical tasks are untrackable because of inherent system limits. The paper develops an end-to-end ILC scheme that updates the reference signal and the feedforward input together from trial to trial using only measurement data. By casting the resulting closed-loop behavior as a two-player cooperative game whose players share aligned objectives, the authors derive a sufficient condition guaranteeing that this joint-update strategy produces a lower cost than conventional one-player norm-optimal ILC. The approach is illustrated on a numerical example of a repetitive untrackable task. Readers should care because it offers a concrete way to improve performance precisely when perfect tracking is impossible.

Core claim

The paper discovers a sufficient condition under which the two-player end-to-end ILC has a lower cost than the one-player norm-optimal ILC for repetitive untrackable tasks. The reference input and the ILC feedforward input are updated simultaneously on the basis of measurement data, and the closed-loop dynamics are analyzed as a cooperative game in which the two updaters act as players with aligned objectives.

What carries the argument

The cooperative-game representation of the closed-loop ILC dynamics in which the reference updater and the feedforward updater are treated as players whose objectives are aligned.

Load-bearing premise

The closed-loop ILC dynamics admit a cooperative-game representation in which the reference updater and feedforward updater have aligned objectives, and the derived sufficient condition remains valid under real disturbances and model mismatch.

What would settle it

A simulation or experiment in which the stated sufficient condition holds yet the measured cost of the two-player end-to-end ILC exceeds that of the one-player norm-optimal ILC.

Figures

Figures reproduced from arXiv: 2604.16908 by Hongfeng Tao, Rodrigo A. Gonz\'alez, Tom Oomen, Wojciech Paszke, Zhihe Zhuang.

Figure 1
Figure 1. Figure 1: Control block diagram of the end-to-end ILC. [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: The 30th tracking outputs of the end-to-end ILC [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Cost convergence of the end-to-end ILC compared [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
read the original abstract

An inherent assumption of perfect tracking in iterative learning control (ILC) is that there exists an ILC input such that the generated output can track the desired trajectory reference. This assumption may fail in practice, which gives rise to desired but untrackable tasks. This paper gives an end-to-end ILC design for repetitive untrackable tasks in closed-loop systems. The reference input is trial-to-trial updated together with the ILC feedforward input based on the measurement data. This two-player behavior of the closed-loop ILC system is investigated from a cooperative game perspective. A sufficient condition for the two-player end-to-end ILC to have a lower cost than the one-player norm optimal ILC (NOILC) is discovered. Finally, a numerical example is given to verify the effectiveness of the developed method.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper proposes an end-to-end ILC design for repetitive untrackable tasks in closed-loop systems. Both the reference input and ILC feedforward input are updated trial-to-trial from measurement data. The closed-loop behavior is recast as a two-player cooperative game with aligned objectives, yielding a sufficient condition under which this two-player scheme achieves strictly lower cost than standard one-player norm-optimal ILC (NOILC). A numerical example is included to illustrate effectiveness.

Significance. If the sufficient condition is rigorously derived and the cooperative-game representation is faithful, the work supplies a principled way to handle untrackable references by jointly optimizing reference and feedforward updates. The game-theoretic framing is novel within ILC and could generalize to other multi-component learning schemes. The numerical example provides concrete verification under nominal conditions, which is a positive step toward reproducibility.

major comments (2)
  1. [Main Results / Cooperative Game Formulation] The derivation of the sufficient condition (presumably the central theorem in the main-results section) is presented under the standard ILC assumptions of perfect model knowledge and zero disturbances. The manuscript must explicitly state whether the inequality remains valid under bounded model mismatch or additive disturbances; if the proof relies on exact cancellation of the plant inverse, the condition is not load-bearing for practical untrackable tasks.
  2. [Numerical Example] The numerical example only demonstrates the nominal case. To support the claim that the two-player scheme outperforms NOILC, the example should include at least one run with realistic disturbances or parametric uncertainty and report the realized cost ratio relative to the derived sufficient-condition threshold.
minor comments (1)
  1. [Abstract / Problem Formulation] Notation for the two players' cost functions and the joint objective should be introduced once and used consistently; the current abstract leaves the precise definition of 'aligned objectives' implicit.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the major comments point by point below, agreeing to revisions that clarify the scope of the results and strengthen the numerical validation while preserving the nominal theoretical contribution.

read point-by-point responses
  1. Referee: [Main Results / Cooperative Game Formulation] The derivation of the sufficient condition (presumably the central theorem in the main-results section) is presented under the standard ILC assumptions of perfect model knowledge and zero disturbances. The manuscript must explicitly state whether the inequality remains valid under bounded model mismatch or additive disturbances; if the proof relies on exact cancellation of the plant inverse, the condition is not load-bearing for practical untrackable tasks.

    Authors: We agree that the sufficient condition is derived under the standard assumptions of perfect model knowledge and zero disturbances. The cooperative-game recasting and the proof of the cost inequality use the nominal lifted-system representation of the closed-loop dynamics; this does involve operations equivalent to the nominal plant inverse in the lifted domain. Consequently, we cannot assert that the inequality holds under bounded model mismatch or additive disturbances without further analysis. In the revised manuscript we will explicitly state these assumptions in the theorem and add a dedicated remark clarifying that the result is nominal, while noting that robustness extensions constitute future work. This accurately delimits the contribution without overstating its practical scope. revision: yes

  2. Referee: [Numerical Example] The numerical example only demonstrates the nominal case. To support the claim that the two-player scheme outperforms NOILC, the example should include at least one run with realistic disturbances or parametric uncertainty and report the realized cost ratio relative to the derived sufficient-condition threshold.

    Authors: We acknowledge that the present numerical example is confined to the nominal case. To address the concern, the revised manuscript will incorporate an additional simulation that includes bounded additive disturbances and parametric uncertainty. We will report the realized costs for both the proposed end-to-end ILC and standard NOILC, compute the cost ratio, and compare it against the threshold supplied by the sufficient condition. This will provide concrete evidence of relative performance beyond the ideal setting. revision: yes

Circularity Check

0 steps flagged

No circularity: sufficient condition derived from independent game reformulation

full rationale

The visible abstract and reader's summary indicate that the central result is a sufficient condition obtained after recasting closed-loop ILC dynamics as a cooperative game with aligned player objectives. No equations, fitted parameters, or self-citations are shown that would make the inequality hold by construction or reduce the prediction to the input data. The derivation therefore retains independent mathematical content under the stated nominal assumptions, consistent with a self-contained analysis.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the modeling choice that the reference and ILC updates constitute a cooperative two-player game whose cost can be compared directly to single-player NOILC.

axioms (1)
  • domain assumption The closed-loop ILC system admits a cooperative-game representation between reference updater and feedforward updater
    Invoked to justify the two-player formulation and the cost comparison.

pith-pipeline@v0.9.0 · 5448 in / 1141 out tokens · 43297 ms · 2026-05-10T07:12:38.038458+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

22 extracted references · 1 canonical work pages · 1 internal anchor

  1. [1]

    Amann, N., Owens, D.H., and Rogers, E. (1996). Iterative learning control for discrete-time systems with expo- nential rate of convergence. IEE Proceedings-Control Theory and Applications , 143(2), 217–224

  2. [2]

    Arimoto, S., Kawamura, S., and Miyazaki, F. (1984). Bettering operation of robots by learning. Journal of Robotic Systems, 1(2), 123–140

  3. [3]

    and Alleyne, A.G

    Barton, K.L. and Alleyne, A.G. (2008). A cross-coupled iterative learning control design for precision motion control. IEEE Transactions on Control Systems Tech- nology, 16(6), 1218–1231

  4. [4]

    Flepp, B., Goyal, P., Jackel, L.D., Monfort, M., Muller, U., Zhang, J., et al. (2016). End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316

  5. [5]

    and Oomen, T

    Bolder, J. and Oomen, T. (2016). Inferential iterative learning control: A 2D-system approach. Automatica, 71, 247–253

  6. [6]

    Bristow, D.A., Tharayil, M., and Alleyne, A.G. (2006). A survey of iterative learning control. IEEE Control Systems Magazine , 26(3), 96–114

  7. [7]

    Cobb, M.K., Barton, K., Fathy, H., and Vermillion, C. (2019). Iterative learning-based path optimization for repetitive path planning, with application to 3-D cross- wind flight of airborne wind energy systems. IEEE Transactions on Control Systems Technology , 28(4), 1447–1459. de Roover, D. (1997). Motion control of a wafer stage: a design approach for sp...

  8. [8]

    Freeman, C. (2016). Control system design for electrical stimulation in upper limb rehabilitation. Switzerland: Springer International Publishing

  9. [9]

    and Norrl¨ of, M

    Gunnarsson, S. and Norrl¨ of, M. (2001). On the design of ILC algorithms using optimization. Automatica, 37(12), 2011–2016

  10. [10]

    Lambrechts, P., Boerlage, M., and Steinbuch, M. (2005). Trajectory planning and feedforward design for elec- tromechanical motion systems. Control Engineering Practice, 13(2), 145–157

  11. [11]

    and Wu, Y

    Meng, D. and Wu, Y. (2021). Control design for iterative methods in solving linear algebraic equations. IEEE Transactions on Automatic Control , 67(10), 5039–5054

  12. [12]

    and Zhang, J

    Meng, D. and Zhang, J. (2023). Fundamental trackability problems for iterative learning control. IEEE/CAA Journal of Automatica Sinica , 10(10), 1933–1950

  13. [13]

    Mishra, S., Topcu, U., and Tomizuka, M. (2010). Optimization-based constrained iterative learning con- trol. IEEE Transactions on Control Systems Technol- ogy, 19(6), 1613–1621

  14. [14]

    and Rojas, C.R

    Oomen, T. and Rojas, C.R. (2017). Sparse iterative learn- ing control with application to a wafer stage: Achiev- ing performance, resource efficiency, and task flexibility. Mechatronics, 47, 134–147

  15. [15]

    Owen, G. (2013). Game Theory. Emerald Group Publish- ing

  16. [16]

    Shapley, L.S. (1971). Cores of convex games. International journal of game theory , 1(1), 11–26

  17. [17]

    Harley, T., Dulac-Arnold, G., Reichert, D., Rabinowitz, N., Barreto, A., et al. (2017). The predictron: End-to- end learning and planning. In International Conference on Machine Learning , 3191–3199. PMLR

  18. [18]

    Son, T.D., Ahn, H.S., and Moore, K.L. (2013). Iterative learning control in optimal tracking problems with spec- ified data points. Automatica, 49(5), 1465–1472. van Meer, M., Poot, M., Portegies, J., and Oomen, T. (2022). Gaussian process based feedforward control for nonlinear systems with flexible tasks: With application to a printer with friction. IF AC...

  19. [19]

    Wang, C., Meng, D., and Wu, Y. (2024). Trackability compensation for iterative learning control: A data- based approach. In 2024 IEEE 63rd Conference on Decision and Control (CDC) , 4893–4898. IEEE

  20. [20]

    and Meng, D

    Wu, Y. and Meng, D. (2023). Data-based trackability cri- teria and control design for disturbed learning systems. Automatica, 155, 111113

  21. [21]

    Zhang, Z., Jiang, H., Shen, D., and Saab, S.S. (2023). Data-driven learning control algorithms for unachiev- able tracking problems. IEEE/CAA Journal of Auto- matica Sinica, 11(1), 205–218

  22. [22]

    Zhou, R., Hu, C., Wang, Z., Zhu, Y., and Tomizuka, M. (2024). Real-time iterative compensation control using plant-injection feedforward architecture with applica- tion to ultraprecision wafer stages. IEEE Transactions on Industrial Informatics