Neural Network Verification for the Masses (of AI graduates)
Pith reviewed 2026-05-25 11:22 UTC · model grok-4.3
The pith
Shortage of verification tools and language limitations hinder teaching safe AI in MSc programs.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims there is a noticeable shortage of accessible tools, methods and teaching materials for incorporating verification into AI programs. A lab engages AI and Robotics MSc students in verification projects as part of dissertation work, reporting both successes and unexpected difficulties that arise from limitations of existing programming languages used for verification, while discussing future directions for incorporating verification into AI degrees.
What carries the argument
Student verification projects in an AI MSc program, used to surface and illustrate programming language limitations as the core practical barrier.
If this is right
- Verification can be introduced into AI MSc degrees through student dissertation projects.
- Overcoming programming language limitations is required before verification becomes routine in AI teaching.
- New methods and tools will be needed to make verification accessible to AI graduates.
- Wider adoption of verification in education would help address safety vulnerabilities in neural networks.
Where Pith is reading between the lines
- Labs using similar student-project models at other universities could test whether language limitations remain the dominant obstacle once curriculum changes are tried.
- Resolving language barriers for education might also accelerate verified neural network use in deployed AI systems.
- Prioritizing neural network verification in teaching suggests that deep learning safety problems are seen as more urgent than verification for other AI techniques.
Load-bearing premise
The primary barriers to incorporating verification into AI degrees are limitations in existing programming languages used for verification, rather than other factors such as curriculum design, student background, or availability of suitable verification problems.
What would settle it
A survey of multiple AI MSc programs showing that curriculum design or student background explains the lack of verification teaching more than language limitations do.
Figures
read the original abstract
Rapid development of AI applications has stimulated demand for, and has given rise to, the rapidly growing number and diversity of AI MSc degrees. AI and Robotics research communities, industries and students are becoming increasingly aware of the problems caused by unsafe or insecure AI applications. Among them, perhaps the most famous example is vulnerability of deep neural networks to ``adversarial attacks''. Owing to wide-spread use of neural networks in all areas of AI, this problem is seen as particularly acute and pervasive. Despite of the growing number of research papers about safety and security vulnerabilities of AI applications, there is a noticeable shortage of accessible tools, methods and teaching materials for incorporating verification into AI programs. LAIV -- the Lab for AI and Verification -- is a newly opened research lab at Heriot-Watt university that engages AI and Robotics MSc students in verification projects, as part of their MSc dissertation work. In this paper, we will report on successes and unexpected difficulties LAIV faces, many of which arise from limitations of existing programming languages used for verification. We will discuss future directions for incorporating verification into AI degrees.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript is an experience report describing the LAIV lab at Heriot-Watt University and its efforts to involve AI and Robotics MSc students in neural network verification projects as part of dissertation work. It asserts a noticeable shortage of accessible tools, methods and teaching materials for verification in AI programs, reports on project successes and difficulties (many attributed to limitations of existing programming languages), and outlines future directions for incorporating verification into AI degrees.
Significance. As an observational account of lab activities, the paper could usefully inform educators seeking to add verification content to AI curricula by surfacing concrete implementation obstacles. Its significance remains modest because the central observations rest on author experience without supporting measurements, surveys or external references.
major comments (1)
- [Abstract] Abstract: the assertion of a 'noticeable shortage of accessible tools, methods and teaching materials for incorporating verification into AI programs' is presented as fact but is unsupported by any data, statistics, citations to existing tool surveys, or references to AI degree curricula; this claim is load-bearing for the motivation of both the lab and the paper.
minor comments (2)
- [Abstract] Abstract: 'Despite of the growing number' is grammatically incorrect and should read 'Despite the growing number'.
- [Abstract] Abstract: the repeated use of future tense ('we will report', 'We will discuss') is atypical for a completed manuscript; rephrase to present tense to match the reporting style of an experience report.
Simulated Author's Rebuttal
We thank the referee for the review and the suggestion to strengthen the motivation section. We address the major comment below.
read point-by-point responses
-
Referee: [Abstract] Abstract: the assertion of a 'noticeable shortage of accessible tools, methods and teaching materials for incorporating verification into AI programs' is presented as fact but is unsupported by any data, statistics, citations to existing tool surveys, or references to AI degree curricula; this claim is load-bearing for the motivation of both the lab and the paper.
Authors: We agree that the statement is presented without external supporting data, surveys or citations. As this is an experience report grounded in the LAIV lab's direct work with AI and Robotics MSc students, the observation arises from repeated practical difficulties encountered when assigning verification-related dissertation projects. To address the concern we will revise the abstract to frame the claim explicitly as an observation from our lab activities rather than an unsubstantiated general assertion. We will also add a small number of references to existing AI-safety and verification literature where they help contextualise the shortage without changing the paper's observational character. revision: partial
Circularity Check
No significant circularity: purely descriptive experience report
full rationale
The paper is an experience report describing LAIV lab activities, MSc student projects, observed shortages of tools, and practical difficulties with existing programming languages. It contains no equations, derivations, predictions, fitted parameters, or formal claims whose validity depends on hidden assumptions. All content is observational narrative with no load-bearing technical steps, self-citations, or reductions to inputs by construction. The central narrative is presented as direct reporting rather than a derived result, making the paper self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Bagnall, A., Stewart, G.: Certifying the true error: Machine learning in Coq with verified generalisation guarantees. AAAI (2019)
work page 2019
-
[2]
Bahrami, A., de Maria, E., A.Felty: Modelling and verifying dynamic properties of biological neural networks in Coq. In: CSBio 2018 Proceedings of the 9th Interna- tional Conference on Computational Systems-Biology and Bioinformatics (2018)
work page 2018
-
[3]
Cohen, C.: Construction of real algebraic numbers in coq. In: Beringer, L., Felty, A.P. (eds.) Interactive Theorem Proving - Third International Conference, ITP 2012, Princeton, NJ, USA, August 13-15, 2012. Proceedings. Lecture Notes in Computer Science, vol. 7406, pp. 67–82. Springer (2012)
work page 2012
-
[4]
Coq Development Team: Coq reference manual (2015), https://coq.inria.fr/ Neural Network Verification for the Masses (of AI graduates) 15
work page 2015
-
[5]
Hinton, G., Deng, L., Yu, D., Dahl, G.E., rahman Mohamed, A., Jaitly, N., Senior, A., Vanhoucke, V., Nguyen, P., Sainath, T.N., Kingsbury, B.: Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups 29, 82 – 97 (2012), https://ieeexplore.ieee.org/document/6296526
-
[6]
Huang, X., Kwiatkowska, M., Wang, S., Wu, M.: Safety verification of deep neural networks. In: Majumdar, R., Kuncak, V. (eds.) Computer Aided Verification - 29th International Conference, CAV 2017, Heidelberg, Germany, July 24-28, 2017, Proceedings, Part I. Lecture Notes in Computer Science, vol. 10426, pp. 3–29. Springer (2017), https://doi.org/10.1007/978...
-
[7]
Jaderberg, M., Czarnecki, W.M., Dunning, I., Marris, L., Lever, G., Casta˜ neda, A.G., Beattie, C., Rabinowitz, N.C., Morcos, A.S., Ruderman, A., Sonnerat, N., Green, T., Deason, L., Leibo, J.Z., Silver, D., Hassabis, D., Kavukcuoglu, K., Graepel, T.: Human-level performance in first-person multiplayer games with population-based deep reinforcement learnin...
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[8]
Kientiz, D.: Verification of evolved neural networks: MSc progress report (2019), Heriot-Watt University, Supervisor E. Komendantskaya
work page 2019
-
[9]
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep con- volutional neural networks. Commun. ACM 60(6), 84–90 (2017)
work page 2017
-
[10]
LeHen, P.: Verification of neural networks: MSc progress report (2019), Heriot- Watt University, Supervisor E. Komendantskaya
work page 2019
-
[11]
Mart´ ınez, G., Ahman, D., Dumitrescu, V., Giannarakis, N., Hawblitzel, C., Hritcu, C., Narasimhamurthy, M., Paraskevopoulou, Z., Pit-Claudel, C., Protzenko, J., Ramananandro, T., Rastogi, A., Swamy, N.: Meta-fstar : Proof automation with smt, tactics, and metaprograms. In: Caires, L. (ed.) Programming Languages and Systems - 28th European Symposium on Pr...
work page 2019
-
[12]
McCulloch, W., Pitts, W.: A logical calculus of the ideas immanent in nervous activity. Bulletin of Math. Bio. 5, 115–133 (1943)
work page 1943
-
[13]
de Moura, L.M., Bjørner, N.: Z3: an efficient SMT solver. In: TACAS’08. LNCS, vol. 4963, pp. 337–340 (2008)
work page 2008
-
[14]
In: Brand, J., Valenti, M.C., Ak- inpelu, A., Doshi, B.T., Gorsic, B.L
Papernot, N., McDaniel, P.D., Swami, A., Harang, R.E.: Crafting adversarial in- put sequences for recurrent neural networks. In: Brand, J., Valenti, M.C., Ak- inpelu, A., Doshi, B.T., Gorsic, B.L. (eds.) 2016 IEEE Military Communica- tions Conference, MILCOM 2016, Baltimore, MD, USA, November 1-3, 2016. pp. 49–54. IEEE (2016), http://ieeexplore.ieee.org/x...
work page 2016
-
[15]
Detecting Adversarial Examples in Convolutional Neural Networks
Pertigkiozoglou, S., Maragos, P.: Detecting adversarial examples in convolutional neural networks. CoRR abs/1812.03303 (2018), http://arxiv.org/abs/1812. 03303
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[16]
Adversarial Examples - A Complete Characterisation of the Phenomenon
Serban, A.C., Poll, E.: Adversarial examples - A complete characterisation of the phenomenon. CoRR abs/1810.01185 (2018), http://arxiv.org/abs/1810. 01185
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[17]
PACMPL 3(POPL), 41:1–41:30 (2019), https://doi.org/10
Singh, G., Gehr, T., P¨ uschel, M., Vechev, M.T.: An abstract domain for certifying neural networks. PACMPL 3(POPL), 41:1–41:30 (2019), https://doi.org/10. 1145/3290354
work page 2019
-
[18]
Intriguing properties of neural networks
Szegedy, C., Zaremba, W., Sutskever, I., Bruna, J., Erhan, D., Goodfellow, I., Fer- gus, R.: Intriguing properties of neural networks. CoRR abs/1312.6199 (2014), https://arxiv.org/abs/1312.6199
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[19]
Yann LeCun, Corinna Cortes, C.J.B.: The mnist datbase of handwritten digits” http://yann.lecun.com/exdb/mnist/
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.