pith. machine review for the scientific record. sign in

arxiv: 2604.26481 · v1 · submitted 2026-04-29 · ⚛️ physics.flu-dyn

Recognition: unknown

A Provably Robust Multi-Jet Framework applied to Active Flow Control of an Airfoil in Weakly Compressible Flow

Andrea Beck, Anna Schwarz, Rohan Kaushik

Pith reviewed 2026-05-07 10:51 UTC · model grok-4.3

classification ⚛️ physics.flu-dyn
keywords multi-jet active flow controlreinforcement learningairfoilnon-injectivitydrag reductionaerodynamic efficiencyweakly compressible flowflow separation control
0
0 comments X

The pith

A new multi-jet formulation resolves redundant actions in reinforcement learning flow control and raises airfoil aerodynamic efficiency from 53 to 73 percent.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a multi-jet framework for active flow control that uses reinforcement learning to actuate multiple jets around bodies such as airfoils. It first identifies that conventional mean-centered multi-jet configurations are non-injective, so that different action predictions can produce identical jet intensities and thereby restrict the discovery of complex strategies. The authors supply a theoretical alternative that removes this redundancy and derive upper bounds on running costs that remain independent of the number of jets. When the new setup is applied to a cylinder in a channel and to an airfoil in weakly compressible flow, reinforcement learning finds coordinated jet policies that suppress drag and separation while lowering actuation effort. Best practices from reinforcement learning are added to make the training process faster and more reproducible.

Core claim

The paper claims that replacing the mean-centered multi-jet mapping with a non-injective alternative, together with cost upper bounds that do not grow with jet count, enables reinforcement learning to discover sophisticated coordinated jet strategies; these strategies suppress drag and total force beyond an idealized symmetric case for a cylinder in a channel and reduce the separation region on an airfoil, lifting aerodynamic efficiency from 53 percent to as high as 73 percent depending on the jet layout.

What carries the argument

The proposed non-injective alternative to the mean-centered multi-jet mapping, together with the derived jet-count-independent upper bounds on actuation costs.

If this is right

  • Reinforcement learning can coordinate multiple jets in a sophisticated manner to produce favorable flow outcomes at minimal actuation cost.
  • For the cylinder-in-channel configuration, drag and total-force suppression exceed the idealized symmetric case.
  • For the airfoil, the separation region is minimized and aerodynamic efficiency improves from 53 percent up to 73 percent depending on jet configuration.
  • Incorporating standard reinforcement learning practices reduces upfront training costs while maintaining fast, reproducible, and reliable learning.
  • The overall approach supplies a robust and mathematically grounded method for designing multi-jet active flow control.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The cost bound that stays flat with added jets may allow practitioners to increase the number of actuators without a proportional rise in running expense across other fluid systems.
  • If the non-injectivity fix generalizes, similar redundancy issues in other multi-actuator reinforcement learning problems could be resolved by the same mapping change.
  • Direct numerical or experimental checks of the predicted cost bounds in the airfoil geometry would confirm whether the theoretical guarantees survive in the target flow regime.

Load-bearing premise

The theoretical non-injectivity analysis and cost upper bounds derived in general must remain valid and tight when applied to the specific weakly compressible airfoil flow, and reinforcement learning must discover the claimed complex coordinated strategies reliably without instability or extra tuning.

What would settle it

A side-by-side simulation of the airfoil case that records achieved aerodynamic efficiency and total actuation cost for both the mean-centered and the proposed multi-jet formulations; if the proposed method fails to exceed 53 percent efficiency or if measured costs increase with added jets, the central claims are falsified.

Figures

Figures reproduced from arXiv: 2604.26481 by Andrea Beck, Anna Schwarz, Rohan Kaushik.

Figure 1
Figure 1. Figure 1: Simulation setup for the flow around a two-dimensional cylinder in a channel. The positions of the 11 pressure probes are highlighted by white circles and view at source ↗
Figure 2
Figure 2. Figure 2: Jet setup on the cylinder, with the geometric parameters for the view at source ↗
Figure 3
Figure 3. Figure 3: Simulation setup for the flow around a two-dimensional airfoil in a channel. The field solution shows the velocity magnitude for view at source ↗
Figure 4
Figure 4. Figure 4: Jet setup on the airfoil, with the geometric parameters for the view at source ↗
Figure 6
Figure 6. Figure 6: Probe positions on the airfoil, used to record observations and pass view at source ↗
Figure 7
Figure 7. Figure 7: State recycling for PPO. Here, nPPO is the update iteration in the PPO algorithm, and δ is the fraction of statefiles that get passed on from iteration to iteration. At the beginning of the algorithm, all the statefiles are from the converged unactuated simulations. 0 10 20 30 40 50 10−6 10−5 10−4 10−3 α actor = 10−3 Iterations α Actor learning rate (α actor) 0 100 200 300 400 500 10−2 Iterations α Entropy… view at source ↗
Figure 8
Figure 8. Figure 8: Learning rate and entropy penalty coefficient evolution throughout training. like ADAM, primarily because of the lack of information in the initial update steps for the optimizer to build accurate estimates. This is an issue faced by the authors firsthand, as training was noted to be particularly dependent on network weight initializa￾tion. However, with the learning rate warmup schedule shown in view at source ↗
Figure 9
Figure 9. Figure 9: Training and evaluation metrics for the 2-jets cylinder system (across view at source ↗
Figure 10
Figure 10. Figure 10: Training and evaluation metrics for the non-inverted 4-jets cylinder view at source ↗
Figure 11
Figure 11. Figure 11: Training and evaluation metrics for the inverted 4-jets cylinder sys view at source ↗
Figure 12
Figure 12. Figure 12: Training and evaluation metrics for the mean-centered 4-jets cylin view at source ↗
Figure 15
Figure 15. Figure 15: Training and evaluation metrics for the non-inverted 3-jets airfoil view at source ↗
Figure 14
Figure 14. Figure 14: Jet strengths and their associated costs for the jet setups on the view at source ↗
Figure 19
Figure 19. Figure 19: Training and evaluation metrics for the inverted 6-jets airfoil system view at source ↗
Figure 20
Figure 20. Figure 20: Training and evaluation metrics for the mean-centered 6-jets airfoil view at source ↗
Figure 21
Figure 21. Figure 21: Long-term evolution of the lift, drag and aerodynamic e view at source ↗
Figure 23
Figure 23. Figure 23: Jet strengths and their associated costs for the 3-jet setup on the view at source ↗
Figure 24
Figure 24. Figure 24: Jet strengths and their associated costs for the 6-jet setup on the view at source ↗
Figure 25
Figure 25. Figure 25: The pressure distribution throughout the domain, for the di view at source ↗
Figure 27
Figure 27. Figure 27: The z-direction vorticity distribution throughout the domain  ωz = ∂Uy ∂x − ∂Ux ∂y  , for the different flow-control cases in the cylinder-in￾channel simulations. Domain clipped for visualization. being the number of jets), having found a major overlooked mathematical flaw and derived upper bounds on suitably de￾fined running costs. This maximum cost was observed to scale near-linearly with the number o… view at source ↗
Figure 29
Figure 29. Figure 29: The velocity magnitude distribution throughout the domain, for the view at source ↗
Figure 31
Figure 31. Figure 31: Time averaged pressure distributions (CP in the interval 25 ≤ t ∗ ≤ 50) along the airfoil surfaces with and without the trained AFC agents. The dashed lines (- - -) represent the pressure surface while the solid lines (—) the suction surface. The separated flow can be observed in the flat-top CP distri￾bution on the suction surface in the unactuated case, and the jet positions are visible as kinks in the … view at source ↗
Figure 6
Figure 6. Figure 6: Appendix C. Cost of Multi-Jet Configurations The cost of operating the jet-actuated AFC system is defined as in Eq. (17): C = X N i=1 |Qi | = Q maxX N i=1 | fi(a)|. While this will vary throughout each individual simulation, a good measure to describe a given approach is to check its max￾imum possible cost. We do just this in the following subsec￾tions. 2.5 2.6 2.7 2.8 2.9 3 ⟨C This Work D⟩ 3 4 5 6 7 8 −0.… view at source ↗
read the original abstract

Reinforcement learning has by now become well established in finding excellent flow control strategies for a variety of scenarios. Existing literature has focused on using a simple two-jet solution (and variants there-of) or a straightforward mean-centered multi-jet setup. This mean-centering approach is however non-injective in nature, such that distinct action predictions by the actor network can lead to the same implemented jet-intensities. Thus, the potential of true multi-jet setups still remains unexplored. To this end, in this study we first theoretically analyze multi-jet setups, highlighting the aforementioned pitfall and offer a viable alternative. We also derive upper-bounds on the running costs of these setups, and find the proposed approach to have a jet-count-independent maximum running cost (compared to a near-linear scaling for the traditional setup). The mean-centered and proposed multi-jet setups are applied to a variety of flow-configurations, to test performance and learning capabilities. The new formulation proves effective in learning more complex flow-control strategies, coordinating the jets in a sophisticated manner so as to produce favorable outcomes at minimal actuation cost. For the cylinder-in-channel case, this results in drag and total-force suppression to beyond an idealized symmetric case, whereas for the airfoil the separation region is minimized and significant improvements in aerodynamic efficiency are observed (from 53% up to 73% depending on jet configuration). Additionally, we also incorporate some best practices from traditional RL literature to show fast, reproducible and reliable learning, thereby bringing down the upfront training costs. This study thus provides a robust and mathematically grounded approach to multi-jet design and closes a hitherto overlooked theoretical gap.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a new multi-jet action parameterization for RL-based active flow control that addresses the non-injectivity of mean-centered multi-jet setups. It provides theoretical analysis of multi-jet configurations, derives jet-count-independent upper bounds on running costs for the proposed approach (versus near-linear scaling for the traditional one), and applies both setups to cylinder-in-channel and airfoil cases in weakly compressible flow. The new formulation is claimed to enable learning of complex coordinated jet strategies yielding drag/total-force suppression beyond symmetric baselines for the cylinder and airfoil aerodynamic efficiency gains from 53% to 73%, with additional RL best practices incorporated for fast, reproducible training.

Significance. If the central claims hold, the work supplies a mathematically grounded multi-jet framework with explicit cost bounds that could improve efficiency and reliability of RL flow control in aerodynamics. The derivation of parameterization-independent cost upper bounds and the non-injectivity analysis constitute clear strengths, as does the extension to a realistic airfoil case. The incorporation of RL best practices is noted positively for training reliability, but the overall significance depends on whether performance gains can be attributed to the new formulation rather than the joint training practices.

major comments (2)
  1. [Abstract] Abstract: The efficiency gains (53% to 73%) and drag suppression beyond the idealized symmetric case are reported under the combined application of the proposed parameterization plus RL best practices, yet no controlled ablation (proposed vs. mean-centered, both with identical best practices) is described. This is load-bearing for the central claim that the new formulation enables superior coordinated strategies at minimal cost.
  2. [Abstract] Abstract and results on airfoil: The claim that the theoretical non-injectivity analysis and jet-count-independent cost upper bounds remain valid and tight is transferred to the weakly compressible airfoil without explicit verification that the bounds are achieved by the learned policies or that the reward explicitly leverages bound tightness; post-hoc selection of jet configurations may further affect the reported numbers.
minor comments (1)
  1. [Methods] The notation distinguishing jet intensities and the precise definition of the mean-centered baseline could be clarified for reproducibility, particularly when describing how distinct actor predictions map to implemented actions.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed review. The comments help clarify the attribution of results and the scope of the theoretical claims. We respond point-by-point below and indicate the revisions we will make to address the concerns.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The efficiency gains (53% to 73%) and drag suppression beyond the idealized symmetric case are reported under the combined application of the proposed parameterization plus RL best practices, yet no controlled ablation (proposed vs. mean-centered, both with identical best practices) is described. This is load-bearing for the central claim that the new formulation enables superior coordinated strategies at minimal cost.

    Authors: We agree that a controlled ablation isolating the parameterization effect from the RL best practices is necessary to strengthen attribution of the performance gains. The manuscript already applies and compares both the mean-centered and proposed multi-jet setups on the cylinder and airfoil cases, with the proposed formulation yielding coordinated strategies and lower costs. However, the best practices were incorporated primarily to ensure reliable training across experiments. In the revised manuscript we will add an explicit ablation section (or supplementary table) that retrains or reports results for the mean-centered baseline under the identical best-practice protocol used for the proposed method. This will allow direct comparison and better support the claim that the injective parameterization itself enables the superior strategies. revision: yes

  2. Referee: [Abstract] Abstract and results on airfoil: The claim that the theoretical non-injectivity analysis and jet-count-independent cost upper bounds remain valid and tight is transferred to the weakly compressible airfoil without explicit verification that the bounds are achieved by the learned policies or that the reward explicitly leverages bound tightness; post-hoc selection of jet configurations may further affect the reported numbers.

    Authors: The non-injectivity analysis and the jet-count-independent upper bounds on running cost are derived solely from the mathematical properties of the action parameterization; they hold for any downstream flow solver or reward and do not require flow-specific verification. The bounds limit the worst-case actuation cost of any policy, independent of compressibility. In the airfoil results the learned policies indeed operated at low actuation costs consistent with these bounds, but we did not explicitly measure how close the policies came to the theoretical maximum nor design the reward to enforce tightness. We also acknowledge that the reported jet-configuration numbers reflect post-training selection among several trained agents. In the revision we will (i) restate that the bounds are parameterization-level guarantees and therefore transfer directly to the airfoil, (ii) report the observed actuation costs relative to the derived upper bound for the airfoil policies, and (iii) clarify the selection procedure for the final jet configurations to rule out selective reporting. revision: partial

Circularity Check

1 steps flagged

Cost upper bounds are definitional properties of the new parameterization rather than independent predictions

specific steps
  1. self definitional [Abstract]
    "We also derive upper-bounds on the running costs of these setups, and find the proposed approach to have a jet-count-independent maximum running cost (compared to a near-linear scaling for the traditional setup)."

    The upper bound follows immediately from how the proposed parameterization is defined (total actuation capped independently of jet count), so the 'derivation' and 'finding' restate the construction rather than providing new content or external validation.

full rationale

The paper's theoretical analysis derives jet-count-independent cost upper bounds directly from the definition of the proposed multi-jet action space (as contrasted with mean-centered scaling). This is a mathematical property of the formulation itself, not an external verification or falsifiable prediction. However, the central empirical claims (drag suppression, efficiency gains from 53% to 73%) rest on RL experiments that incorporate external best practices and are not reduced to the bounds. No self-citation chains or uniqueness theorems are invoked as load-bearing; the derivation chain remains partially independent outside the cost analysis.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The framework rests on standard RL convergence assumptions and CFD modeling of weakly compressible flow; the new action mapping is introduced without additional invented physical entities.

free parameters (1)
  • number and placement of jets
    Chosen per flow configuration (cylinder vs airfoil) to achieve reported performance; not derived from first principles.
axioms (2)
  • domain assumption The mean-centered multi-jet mapping is non-injective
    Invoked as the central pitfall to be fixed; treated as given from prior setups.
  • domain assumption RL agent can learn coordinated jet strategies under the new mapping
    Required for the performance claims on cylinder and airfoil.

pith-pipeline@v0.9.0 · 5604 in / 1489 out tokens · 63262 ms · 2026-05-07T10:51:10.161038+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

65 extracted references · 24 canonical work pages · 3 internal anchors

  1. [1]

    Cambridge University Press, 2000

    Mohamed Gad-el Hak.Flow Control: Passive, Active, and Reactive Flow Management. Cambridge University Press, 2000

  2. [2]

    McLellan and Charles L

    Bruce W. McLellan and Charles L. Ladson. A history of suction-type laminar-flow control with emphasis on flight research. Technical Report TM-4080, NASA, 1988

  3. [3]

    Squires, and Elias Balaras

    Nikolaos Beratlis, Kyle D. Squires, and Elias Balaras. Separation control and drag reduction using roughness elements. InProceeding of Tenth International Sympo- sium on Turbulence and Shear Flow Phenomena, pages 199–204. Begellhouse, 2017. Place: Swissotel Chicago, Chicago, Illinois, U.S.A

  4. [4]

    D. W. Bechert and M. Bartenwerfer. The viscous flow on surfaces with longitudinal ribs.Journal of Fluid Mechan- ics, 206:105–129, September 1989. ISSN 1469-7645, 0022-1120

  5. [5]

    Concept to reality: contributions of the langley research center to us civil aircraft of the 1990s

    Joseph R Chambers. Concept to reality: contributions of the langley research center to us civil aircraft of the 1990s. NASA Special Publication, page 59513, 2003

  6. [6]

    Review of research on low-profile vortex gen- erators to control boundary-layer separation.Progress in aerospace sciences, 38(4-5):389–420, 2002

    John C Lin. Review of research on low-profile vortex gen- erators to control boundary-layer separation.Progress in aerospace sciences, 38(4-5):389–420, 2002

  7. [7]

    Aerody- namic flow control using synthetic jet technology

    Michael Amitay, Barton Smith, and Ari Glezer. Aerody- namic flow control using synthetic jet technology. In36th AIAA Aerospace Sciences Meeting and Exhibit, page 208, 1998

  8. [8]

    Wygnanski

    David Greenblatt and Israel J. Wygnanski. The control of flow separation by periodic excitation.Progress in Aerospace Sciences, 36(7):487–545, October 2000. ISSN 0376-0421

  9. [9]

    Direct numerical simulation of spatially developing turbulent boundary lay- ers with uniform blowing or suction.Journal of Fluid Me- chanics, 681:154–172, August 2011

    Yukinori Kametani and Koji Fukagata. Direct numerical simulation of spatially developing turbulent boundary lay- ers with uniform blowing or suction.Journal of Fluid Me- chanics, 681:154–172, August 2011. ISSN 1469-7645, 0022-1120

  10. [10]

    Three-dimensional suction flow control and suction jet length optimization of NACA 0012 wing.Meccanica, 50(6):1481–1494, June

    Kianoosh Yousefi and Reza Saleh. Three-dimensional suction flow control and suction jet length optimization of NACA 0012 wing.Meccanica, 50(6):1481–1494, June

  11. [11]

    A. V . V oevodin, A. A. Kornyakov, A. S. Petrov, D. A. Petrov, and G. G. Sudakov. Improvement of the take-off and landing characteristics of wing using an ejector pump. Thermophysics and Aeromechanics, 26(1):9–18, January

  12. [12]

    High- lift enhancement using active flow control

    Michael Desalvo, Edward Whalen, and Ari Glezer. High- lift enhancement using active flow control. In6th AIAA flow control conference, page 3245, 2012

  13. [13]

    Post-stall flow control on an airfoil by local unsteady forcing.Journal of fluid Mechanics, 371: 21–58, 1998

    Jie-Zhi Wu, Xi-Yun Lu, Andrew G Denny, Meng Fan, and Jain-Ming Wu. Post-stall flow control on an airfoil by local unsteady forcing.Journal of fluid Mechanics, 371: 21–58, 1998

  14. [14]

    Active flow control for high lift with steady blowing.The Aeronautical Journal, 120(1223):171–200, 2016

    Rolf Radespiel, Marco Burnazzi, M Casper, and P Scholz. Active flow control for high lift with steady blowing.The Aeronautical Journal, 120(1223):171–200, 2016

  15. [15]

    Feedback control of vortex shedding from a circular cylinder by cross-flow cylinder oscillations.Experiments in Fluids, 21(1):49–56, 1996

    HM Warui and N Fujisawa. Feedback control of vortex shedding from a circular cylinder by cross-flow cylinder oscillations.Experiments in Fluids, 21(1):49–56, 1996

  16. [16]

    Suboptimal feedback control of vortex shedding at low reynolds numbers.Jour- nal of Fluid Mechanics, 401:123–156, 1999

    Chulhong Min and Haecheon Choi. Suboptimal feedback control of vortex shedding at low reynolds numbers.Jour- nal of Fluid Mechanics, 401:123–156, 1999

  17. [17]

    Sridhar Muddada and B. S. V . Patnaik. An active flow control strategy for the suppression of vortex structures behind a circular cylinder.European Journal of Mechan- ics - B/Fluids, 29(2):93–104, March 2010. ISSN 0997- 7546

  18. [18]

    Closed- loop active flow control of stall separation using synthetic jets

    Byunghyun Lee, Minhee Kim, Byounghun Choi, Chongam Kim, H Jin Kim, and Kyoung Jin Jung. Closed- loop active flow control of stall separation using synthetic jets. In31st AIAA applied aerodynamics conference, page 2925, 2013

  19. [19]

    Loïc Michel, Ingrid Neunaber, Rishabh Mishra, Caroline Braud, Franck Plestan, Jean-Pierre Barbot, and Pol Ha- mon. A novel lift controller for a wind turbine blade sec- tion using an active flow control device including satura- tions: experimental results.IEEE Transactions on Control Systems Technology, 32(5):1590–1601, 2024

  20. [20]

    An active flow control approach for spa- tially growing mixing layer.Journal of Fluids Engineer- ing, 144(6):061110, 2022

    Upender K Kaul. An active flow control approach for spa- tially growing mixing layer.Journal of Fluids Engineer- ing, 144(6):061110, 2022. 22

  21. [21]

    Multivariable closed-loop active flow control of a compressor stator cascade.AIAA Journal, 55(10):3371–3380, 2017

    Matthias Kiesner and Rudibert King. Multivariable closed-loop active flow control of a compressor stator cascade.AIAA Journal, 55(10):3371–3380, 2017. doi: 10.2514/1.J055728. URLhttps://doi.org/10.2514/ 1.J055728

  22. [22]

    Closed-loop active flow control of a non-steady flow field in a highly-loaded compressor cascade.CEAS Aeronauti- cal Journal, 8(1):197–208, 2017

    Marcel Staats, W Nitsche, SJ Steinberg, and R King. Closed-loop active flow control of a non-steady flow field in a highly-loaded compressor cascade.CEAS Aeronauti- cal Journal, 8(1):197–208, 2017

  23. [23]

    Application of active flow control on generic 3d car models

    Daniel Krentel, Rifet Muminovic, André Brunn, Wolf- gang Nitsche, and Rudibert King. Application of active flow control on generic 3d car models. InActive Flow Control II: Papers Contributed to the Conference” Active Flow Control II 2010”, Berlin, Germany, May 26 to 28, 2010, pages 223–239. Springer, 2010

  24. [24]

    Artificial neural networks trained through deep reinforcement learning discover con- trol strategies for active flow control.Journal of Fluid Me- chanics, 865:281–302, apr 2019

    Jean Rabault, Miroslav Kuchta, Atle Jensen, Ulysse Reglade, and Nicolas Cerardi. Artificial neural networks trained through deep reinforcement learning discover con- trol strategies for active flow control.Journal of Fluid Me- chanics, 865:281–302, apr 2019. ISSN 0022-1120, 1469-

  25. [25]

    doi: 10.1017/jfm.2019.62

  26. [26]

    Accelerating deep reinforcement learning strategies of flow control through a multi-environment approach.Physics of Fluids, 31(9):094105, September 2019

    Jean Rabault and Alexander Kuhnle. Accelerating deep reinforcement learning strategies of flow control through a multi-environment approach.Physics of Fluids, 31(9): 094105, sep 2019. ISSN 1070-6631, 1089-7666. doi: 10.1063/1.5116415

  27. [27]

    Deep reinforcement learning in fluid mechanics: a promising method for both active flow control and shape optimization.arXiv preprint arXiv:2001.02464, 2020

    Jean Rabault, Feng Ren, Wei Zhang, Hui Tang, and Hui Xu. Deep reinforcement learning in fluid mechanics: a promising method for both active flow control and shape optimization.arXiv preprint arXiv:2001.02464, 2020

  28. [28]

    Hongwei Tang, Jean Rabault, Alexander Kuhnle, Yan Wang, and Tongguang Wang. Robust active flow con- trol over a range of reynolds numbers using an artificial neural network trained through deep reinforcement learn- ing.Physics of Fluids, 32(5):053605, 05 2020. ISSN 1070-6631. doi: 10.1063/5.0006492. URLhttps:// doi.org/10.1063/5.0006492

  29. [29]

    Deep reinforcement learning based synthetic jet control on disturbed flow over airfoil

    Yi-Zhe Wang, Yu-Fei Mei, Nadine Aubry, Zhihua Chen, Peng Wu, and Wei-Tao Wu. Deep reinforcement learning based synthetic jet control on disturbed flow over airfoil. Physics of Fluids, 34(3), 2022

  30. [30]

    Deep-reinforcement- learning-based separation control in a two-dimensional airfoil.International Journal of Heat and Fluid Flow, 116:109913, 2025

    Xavier Garcia, Arnau Miró, Pol Suárez, Francisco Alcántara-Ávila, Jean Rabault, Bernat Font, Oriol Lehmkuhl, and Ricardo Vinuesa. Deep-reinforcement- learning-based separation control in a two-dimensional airfoil.International Journal of Heat and Fluid Flow, 116:109913, 2025. ISSN 0142-727X. doi: https://doi.org/10.1016/j.ijheatfluidflow.2025.109913. URLh...

  31. [31]

    Active flow control for three-dimensional cylinders through deep re- inforcement learning.arXiv preprint arXiv:2309.02462, 2023

    Pol Suárez, Francisco Alcántara-Ávila, Arnau Miró, Jean Rabault, Bernat Font, and Oriol Lehmkuhl. Active flow control for three-dimensional cylinders through deep re- inforcement learning.arXiv preprint arXiv:2309.02462, 2023

  32. [32]

    Suárez, F

    P. Suárez, F. Álcantara-Ávila, J. Rabault, A. Miró, B. Font, O. Lehmkuhl, and R. Vinuesa. Flow control of three-dimensional cylinders transitioning to turbulence via multi-agent reinforcement learning, may 2024

  33. [33]

    Active flow control for drag reduction through multi-agent reinforcement learning on a turbulent cylin- der at r e d=3900.Flow, Turbulence and Combustion, pages 1–25, 2025

    Pol Suárez, Francisco Alcántara-Ávila, Arnau Miró, Jean Rabault, Bernat Font, Oriol Lehmkuhl, and Ricardo Vin- uesa. Active flow control for drag reduction through multi-agent reinforcement learning on a turbulent cylin- der at r e d=3900.Flow, Turbulence and Combustion, pages 1–25, 2025

  34. [34]

    Shocks under control: Taming transonic compress- ible flow over an rae2822 airfoil with deep reinforcement learning.arXiv preprint arXiv:2511.07564, 2025

    Trishit Mondal, Ricardo Vinuesa, and Ameya D Jag- tap. Shocks under control: Taming transonic compress- ible flow over an rae2822 airfoil with deep reinforcement learning.arXiv preprint arXiv:2511.07564, 2025

  35. [35]

    Active flow control for drag reduction through multi-agent reinforcement learning on a turbulent cylin- der at Re D =3900.Flow, Turbulence and Combus- tion, mar 2025

    Pol Suárez, Francisco Alcántara-Ávila, Arnau Miró, Jean Rabault, Bernat Font, Oriol Lehmkuhl, and Ricardo Vin- uesa. Active flow control for drag reduction through multi-agent reinforcement learning on a turbulent cylin- der at Re D =3900.Flow, Turbulence and Combus- tion, mar 2025. ISSN 1386-6184, 1573-1987. doi: 10.1007/s10494-025-00642-x

  36. [36]

    Invariant control strategies for active flow control using graph neural networks.Com- puters&Fluids, 303:106854, 2025

    Marius Kurz, Rohan Kaushik, Marcel Blind, Patrick Kopper, Anna Schwarz, Felix Rodach, and An- drea Beck. Invariant control strategies for active flow control using graph neural networks.Com- puters&Fluids, 303:106854, 2025. ISSN 0045-

  37. [37]

    doi: https://doi.org/10.1016/j.compfluid.2025. 106854. URLhttps://www.sciencedirect.com/ science/article/pii/S0045793025003147

  38. [38]

    Inflow/outflow boundary condi- tions with application to FUN3D

    Jan-Reneé Carlson. Inflow/outflow boundary condi- tions with application to FUN3D. Technical Report NASA/TM–2011-217181, Langley Research Center, Lan- gley Research Center, Hampton, V A, United States, oct 2011

  39. [39]

    FLEXI: A high order discontinu- ous Galerkin framework for hyperbolic–parabolic con- servation laws.Computers&Mathematics with Appli- cations, 81:186–219, jan 2021

    Nico Krais, Andrea Beck, Thomas Bolemann, Hannes Frank, David Flad, Gregor Gassner, Florian Hindenlang, Malte Hoffmann, Thomas Kuhn, Matthias Sonntag, and Claus-Dieter Munz. FLEXI: A high order discontinu- ous Galerkin framework for hyperbolic–parabolic con- servation laws.Computers&Mathematics with Appli- cations, 81:186–219, jan 2021. ISSN 08981221. doi...

  40. [40]

    Springer Nature Switzerland, Cham, 2024

    Marcel Blind, Patrick Kopper, Daniel Kempf, Marius Kurz, Anna Schwarz, Claus-Dieter Munz, and Andrea Beck.Performance Improvements for Large Scale Sim- ulations Using the Discontinuous Galerkin Framework 23 FLEXI, pages 249–264. Springer Nature Switzerland, Cham, 2024. ISBN 978-3-031-46870-4

  41. [41]

    Blind, Tobias Gibis, Christoph Wenzel, and An- drea Beck

    Marcel P. Blind, Tobias Gibis, Christoph Wenzel, and An- drea Beck. Wall-modeled large eddy simulation of a tan- dem wing configuration in transonic flow.Physics of Fluids, 36(5):055125, 05 2024. ISSN 1070-6631. doi: 10.1063/5.0198271

  42. [42]

    An efficient sliding mesh interface method for high- order discontinuous galerkin schemes.Comput- ers&Fluids, 217:104825, 2021

    Jakob Dürrwächter, Marius Kurz, Patrick Kopper, Daniel Kempf, Claus-Dieter Munz, and Andrea Beck. An efficient sliding mesh interface method for high- order discontinuous galerkin schemes.Comput- ers&Fluids, 217:104825, 2021. ISSN 0045-

  43. [43]

    doi: https://doi.org/10.1016/j.compfluid.2020. 104825. URLhttps://www.sciencedirect.com/ science/article/pii/S0045793020303959

  44. [44]

    Proximal policy optimization algorithms.arXiv preprint, 2017

    John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. Proximal policy optimization algorithms.arXiv preprint, 2017

  45. [45]

    Continuous control with deep reinforcement learning

    Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. Continuous control with deep reinforce- ment learning.arXiv preprint arXiv:1509.02971, 2015

  46. [46]

    Distributed distributional deterministic policy gradients

    Gabriel Barth-Maron, Matthew W Hoffman, David Bud- den, Will Dabney, Dan Horgan, Dhruva Tb, Alistair Mul- dal, Nicolas Heess, and Timothy Lillicrap. Distributed distributional deterministic policy gradients.arXiv preprint arXiv:1804.08617, 2018

  47. [47]

    Continuous deep Q-learning with model- based acceleration

    Shixiang Gu, Timothy Lillicrap, Ilya Sutskever, and Sergey Levine. Continuous deep Q-learning with model- based acceleration. In Maria Florina Balcan and Kil- ian Q. Weinberger, editors,Proceedings of The 33rd International Conference on Machine Learning, vol- ume 48 ofProceedings of Machine Learning Research, pages 2829–2838, New York, New York, USA, 20–...

  48. [48]

    MIT press Cam- bridge, 2020

    Richard S Sutton, Andrew G Barto, et al.Reinforcement learning: An introduction, volume 1. MIT press Cam- bridge, 2020

  49. [49]

    SIAM, 1973

    Peter D Lax.Hyperbolic systems of conservation laws and the mathematical theory of shock waves. SIAM, 1973

  50. [50]

    Springer Science & Business Media, 2013

    Eleuterio F Toro.Riemann solvers and numerical meth- ods for fluid dynamics: a practical introduction. Springer Science & Business Media, 2013

  51. [51]

    Relexi — A scalable open source reinforcement learning framework for high- performance computing.Software Impacts, 14:100422, dec 2022

    Marius Kurz, Philipp Offenhäuser, Dominic Viola, Michael Resch, and Andrea Beck. Relexi — A scalable open source reinforcement learning framework for high- performance computing.Software Impacts, 14:100422, dec 2022. ISSN 2665-9638. doi: 10.1016/j.simpa.2022. 100422

  52. [52]

    Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghe- mawat, Ian Goodfellow, Andrew Harp, Geoffrey Irv- ing, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan- delion Mané, Rajat Monga, Sherry Moore, Derek...

  53. [53]

    TF-Agents: A library for reinforcement learning in tensorflow.https: //github.com/tensorflow/agents, 2018

    Sergio Guadarrama, Anoop Korattikara, Oscar Ramirez, Pablo Castro, Ethan Holly, Sam Fishman, Ke Wang, Eka- terina Gonina, Neal Wu, Efi Kokiopoulou, Luciano Sbaiz, Jamie Smith, Gábor Bartók, Jesse Berent, Chris Harris, Vincent Vanhoucke, and Eugene Brevdo. TF-Agents: A library for reinforcement learning in tensorflow.https: //github.com/tensorflow/agents, ...

  54. [54]

    Shao, Scott Bachman, Gustavo Marques, and Benjamin Robbins

    Sam Partee, Matthew Ellis, Alessandro Rigazzi, An- drew E. Shao, Scott Bachman, Gustavo Marques, and Benjamin Robbins. Using machine learning at scale in numerical simulations with SmartSim: An application to ocean climate modeling.Journal of Computational Science, 62:101707, July 2022. ISSN 18777503. doi: 10.1016/j.jocs.2022.101707

  55. [55]

    Implementation matters in deep RL: A case study on PPO and TRPO

    Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dim- itris Tsipras, Firdaus Janoos, Larry Rudolph, and Alek- sander Madry. Implementation matters in deep RL: A case study on PPO and TRPO. InInternational conference on learning representations, 2019

  56. [56]

    Adam: A Method for Stochastic Optimization

    Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization.arXiv preprint arXiv:1412.6980, 2014

  57. [57]

    Theoretical analysis on how learning rate warmup accel- erates convergence.arXiv preprint arXiv:2509.07972, 2025

    Yuxing Liu, Yuze Ge, Rui Pan, An Kang, and Tong Zhang. Theoretical analysis on how learning rate warmup accel- erates convergence.arXiv preprint arXiv:2509.07972, 2025

  58. [58]

    Revisiting the initial steps in adaptive gradient descent optimization

    Abulikemu Abuduweili and Changliu Liu. Revisiting the initial steps in adaptive gradient descent optimization. arXiv preprint arXiv:2412.02153, 2024

  59. [59]

    Why warmup the learning rate? underlying mechanisms and improve- ments.Advances in Neural Information Processing Sys- tems, 37:111760–111801, 2024

    Dayal Singh Kalra and Maissam Barkeshli. Why warmup the learning rate? underlying mechanisms and improve- ments.Advances in Neural Information Processing Sys- tems, 37:111760–111801, 2024

  60. [60]

    Attention is all you need.Advances in neural information processing systems, 30, 2017

    Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, 24 and Illia Polosukhin. Attention is all you need.Advances in neural information processing systems, 30, 2017

  61. [61]

    Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

    Priya Goyal, Piotr Dollár, Ross Girshick, Pieter Noord- huis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He. Accurate, large mini- batch sgd: Training imagenet in 1 hour.arXiv preprint arXiv:1706.02677, 2017

  62. [62]

    A closer look at deep learning heuris- tics: Learning rate restarts, warmup and distillation.arXiv preprint arXiv:1810.13243, 2018

    Akhilesh Gotmare, Nitish Shirish Keskar, Caiming Xiong, and Richard Socher. A closer look at deep learning heuris- tics: Learning rate restarts, warmup and distillation.arXiv preprint arXiv:1810.13243, 2018

  63. [63]

    Vignon, J

    C. Vignon, J. Rabault, and R. Vinuesa. Recent advances in applying deep reinforcement learning for flow control: Perspectives and future directions.Physics of Fluids, 35 (3):031301, mar 2023. ISSN 1070-6631, 1089-7666. doi: 10.1063/5.0143913

  64. [64]

    Perspectives on predicting and control- ling turbulent flows through deep learning.Physics of Flu- ids, 36(3):031401, March 2024

    Ricardo Vinuesa. Perspectives on predicting and control- ling turbulent flows through deep learning.Physics of Flu- ids, 36(3):031401, March 2024. ISSN 1070-6631, 1089-

  65. [65]

    doi: 10.1063/5.0190452. 25