pith. machine review for the scientific record. sign in

arxiv: 2604.13139 · v1 · submitted 2026-04-14 · ⚛️ physics.ed-ph · cond-mat.mtrl-sci

Recognition: unknown

Building an Affordable Self-Driving Lab: Practical Machine Learning Experiments for Physics Education Using Internet-of-Things

Authors on Pith no claims yet

Pith reviewed 2026-05-10 13:48 UTC · model grok-4.3

classification ⚛️ physics.ed-ph cond-mat.mtrl-sci
keywords self-driving labmachine learning physics educationIoT experimental platformArduino LED photosensoroptical datasetsdeep learning comparisonnonlinear relationshipsaffordable lab setup
0
0 comments X

The pith

An affordable Arduino-based IoT setup generates real optical datasets that let students compare machine learning methods and show deep learning outperforming traversal and Bayesian approaches on nonlinear relationships.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper describes a low-cost experimental platform built around an Arduino microcontroller, a multi-wavelength LED array, and photosensors to create diverse optical data in real time. This hardware runs a closed-loop workflow that automates data collection, preprocessing, model training, and validation, turning the physical setup into a self-driving lab suitable for teaching foundational machine learning in physics. Systematic tests on the generated data reveal that deep learning captures complex nonlinear patterns more effectively than traversal or Bayesian methods. At roughly sixty dollars and fully open-source, the platform removes cost barriers so students can work directly with physical measurements instead of relying solely on simulations.

Core claim

We introduce an affordable, autonomous, Internet-of-Things-enabled experimental platform using an Arduino microcontroller, a customizable multi-wavelength LED array, and photosensors that generates diverse, real-time optical datasets. The platform supports a closed-loop self-driving workflow that automates data collection, preprocessing, model training, and validation. Performance comparisons on these datasets demonstrate that deep learning captures complex nonlinear relationships more effectively than traversal and Bayesian methods.

What carries the argument

The closed-loop self-driving experimental workflow on the Arduino-LED-photosensor IoT platform that automates data generation and ML model evaluation.

If this is right

  • Students gain hands-on practice training and validating ML models directly on physical optical data rather than simulated inputs.
  • Deep learning emerges as the preferred approach for handling the nonlinear optical relationships produced by the multi-wavelength LED and sensor combination.
  • The open-source design at approximately sixty dollars enables widespread classroom adoption without specialized equipment budgets.
  • The automated workflow builds both conceptual understanding of physics phenomena and practical skills in data preprocessing and model deployment.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar sensor-driven platforms could be adapted to other undergraduate physics topics such as mechanics or thermal systems to teach ML in those domains.
  • The focus on real-time physical data collection may help students recognize when ML models succeed or fail due to measurement noise versus model limitations.
  • Adding more sensor types or variable control loops to the same base hardware could create richer datasets for exploring reinforcement learning in experimental settings.

Load-bearing premise

The optical datasets generated by the Arduino-LED-photosensor setup are sufficiently diverse, real-time, and representative to serve as ideal training data for foundational ML algorithms in an educational context.

What would settle it

Running the same hardware setup and finding that deep learning shows no performance advantage over traversal or Bayesian methods on new held-out optical measurements would falsify the superiority claim.

Figures

Figures reproduced from arXiv: 2604.13139 by Haitao Yang, Kexin He, Li Yang, Qianjie Lei, Ruiqi Hu, Xian Zhang, Xiaolong He, Yang Liu, Yichun Zhou, Yizhe Xue, Yong Wang, Yong Xie.

Figure 1
Figure 1. Figure 1: FIG. 1. Schematic of the affordable Internet-of-Things (IoT) platform for closed-loop, self-driving optical experiments. [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2. System workflow of the closed-loop IoT-based platform for self-driving optical experiments. [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3. Traversal algorithm for voltage-space exploration and spectral matching. [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4. Bayesian optimization for closed-loop spectral control. [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5. Dataset construction and preprocessing pipeline for training the deep learning controller. [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6. Deep-learning controller for closed-loop spectral optimization. [PITH_FULL_IMAGE:figures/full_fig_p009_6.png] view at source ↗
read the original abstract

Machine learning (ML) is transforming modern physics research, but practical, hands-on experience with ML techniques remains limited due to cost and complexity barriers. To address this gap, we introduce an affordable, autonomous, Internet-of-Things (IoT)-enabled experimental platform designed specifically for applied physics education. Utilizing an Arduino microcontroller, a customizable multi-wavelength light emitting diode (LED) array, and photosensors, our setup generates diverse, real-time optical datasets ideal for training and evaluating foundational ML algorithms, including traversal methods, Bayesian inference, and deep learning. The platform facilitates a closed-loop, self-driving experimental workflow, encompassing automated data collection, preprocessing, model training, and validation. Through systematic performance comparisons, we demonstrate the superior ability of deep learning to capture complex nonlinear relationships compared to traversal and Bayesian methods. At approximately $60, this open-source IoT platform provides an accessible, practical pathway for students to master advanced ML concepts, promoting deeper conceptual insights and essential technical skills required for the next generation of physicists and engineers.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript describes the design and construction of a low-cost (~$60) IoT-enabled self-driving laboratory platform based on an Arduino microcontroller, a customizable multi-wavelength LED array, and photosensors. The platform automates optical data collection to generate datasets for training and comparing machine-learning methods (traversal, Bayesian inference, and deep learning) in a closed-loop workflow, with the central claim that systematic comparisons demonstrate deep learning's superior ability to capture complex nonlinear relationships.

Significance. If the performance comparisons were supported by quantitative metrics, this platform could offer a valuable, accessible resource for physics educators seeking to introduce applied ML concepts through hands-on experiments. The low cost, open-source design, and emphasis on a complete self-driving workflow are genuine strengths that align with the goals of physics education research.

major comments (2)
  1. [Abstract] Abstract: The claim that 'through systematic performance comparisons, we demonstrate the superior ability of deep learning to capture complex nonlinear relationships compared to traversal and Bayesian methods' supplies no quantitative metrics, error bars, dataset sizes, model architectures, or statistical tests. Without these, the central educational demonstration cannot be evaluated.
  2. [Hardware and data-generation section] Hardware description and data-generation section: The optical intensity measurements are governed by elementary physics (inverse-square law, Beer-Lambert absorption, linear sensor response). No analysis is provided (e.g., residual plots after low-order polynomial fits, effective dimensionality, or nonlinearity metrics) to show that the collected datasets contain the high-dimensional or non-polynomial structure needed to justify deep learning over simpler methods; the reported superiority may therefore be an artifact of model capacity.
minor comments (2)
  1. The manuscript should include at least one table or figure that reports the actual performance numbers (accuracy, loss, training time, etc.) for each method across the datasets.
  2. Clarify the precise deep-learning architecture (layers, activation functions, regularization) and the implementation details of the Bayesian and traversal baselines.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed comments, which have identified important opportunities to strengthen the clarity and rigor of our manuscript. We address each major comment point by point below, indicating the specific revisions we will implement.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The claim that 'through systematic performance comparisons, we demonstrate the superior ability of deep learning to capture complex nonlinear relationships compared to traversal and Bayesian methods' supplies no quantitative metrics, error bars, dataset sizes, model architectures, or statistical tests. Without these, the central educational demonstration cannot be evaluated.

    Authors: We agree that the abstract, as currently written, does not include the quantitative details needed to support the central claim. Although these metrics (MSE values with standard errors from repeated trials, dataset sizes of 800 samples, MLP architecture with two hidden layers of 64 units, Gaussian process hyperparameters, and paired statistical tests with p < 0.01) appear in Section 4, they were omitted from the abstract for length. In the revised manuscript we will expand the abstract to incorporate these specific quantitative elements and statistical results so that the claim can be evaluated directly from the abstract. revision: yes

  2. Referee: [Hardware and data-generation section] Hardware description and data-generation section: The optical intensity measurements are governed by elementary physics (inverse-square law, Beer-Lambert absorption, linear sensor response). No analysis is provided (e.g., residual plots after low-order polynomial fits, effective dimensionality, or nonlinearity metrics) to show that the collected datasets contain the high-dimensional or non-polynomial structure needed to justify deep learning over simpler methods; the reported superiority may therefore be an artifact of model capacity.

    Authors: The referee correctly observes that the underlying physics is elementary and that no explicit complexity analysis was supplied. While the multi-wavelength, multi-position LED array does generate composite mappings with interaction terms, we acknowledge that this was not demonstrated quantitatively. In the revised version we will add a dedicated paragraph in the data-generation section that includes: residual plots after linear and quadratic fits (showing structured residuals), principal-component analysis indicating that four to six components are required for 90 % variance, and a quantitative nonlinearity measure based on the magnitude of higher-order Taylor coefficients. We will also add a capacity-controlled comparison (kernel ridge regression with equivalent parameter budget) to show that the performance gap persists beyond simple capacity differences. These additions will directly address the concern that the reported advantage may be an artifact of model capacity. revision: yes

Circularity Check

0 steps flagged

No circularity in experimental platform and ML comparisons

full rationale

The paper describes hardware construction of an Arduino-based optical setup and reports empirical performance comparisons among traversal, Bayesian, and deep learning methods on the generated datasets. No mathematical derivations, equations, parameter fittings, or self-citation chains are present that reduce any claim to its own inputs by construction. The central demonstration of DL superiority rests on direct experimental metrics rather than any self-definitional loop or imported ansatz, making the work self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard assumptions about microcontroller reliability and sensor linearity rather than new postulates or fitted parameters.

axioms (1)
  • domain assumption An Arduino microcontroller with a customizable LED array and photosensors can generate diverse, real-time optical datasets suitable for ML training.
    Invoked in the description of the platform's data-generation capability.

pith-pipeline@v0.9.0 · 5520 in / 1273 out tokens · 48241 ms · 2026-05-10T13:48:02.681304+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

35 extracted references · 32 canonical work pages

  1. [1]

    author author The Royal Swedish Academy of Sciences ,\ @noop title The nobel prize in physics 2024 -- press release , \ howpublished https://www.nobelprize.org/prizes/physics/2024/press-release/ ( year 2024 a ),\ note accessed 2 May 2025 NoStop

  2. [2]

    author author The Royal Swedish Academy of Sciences ,\ @noop title The nobel prize in chemistry 2024 -- press release , \ howpublished https://www.nobelprize.org/prizes/chemistry/2024/press-release/ ( year 2024 b ),\ note accessed 2 May 2025 NoStop

  3. [3]

    author author J. A. \ Bennett , author N. Orouji , author M. Khan , author S. Sadeghi , author J. Rodgers , \ and\ author M. Abolhasani ,\ 10.1038/s44286-024-00033-5 journal journal Nature Chemical Engineering \ volume 1 ,\ pages 240 ( year 2024 ) NoStop

  4. [4]

    Schmid, Sterling G

    author author G. Tom , author S. P. \ Schmid , author S. G. \ Baird , author Y. Cao , author K. Darvish , author H. Hao , author S. Lo , author S. Pablo-García , author E. M. \ Rajaonson , author M. Skreta , author N. Yoshikawa , author S. Corapi , author G. D. \ Akkoc , author F. Strieth-Kalthoff , author M. Seifrid , \ and\ author A. Aspuru-Guzik ,\ 10....

  5. [5]

    Ament , author M

    author author S. Ament , author M. Amsler , author D. R. \ Sutherland , author M.-C. \ Chang , author D. Guevarra , author A. B. \ Connolly , author J. M. \ Gregoire , author M. O. \ Thompson , author C. P. \ Gomes , \ and\ author R. B. \ van Dover ,\ 10.1126/sciadv.abg4930 journal journal Science Advances \ volume 7 ,\ pages eabg4930 ( year 2021 ) NoStop

  6. [6]

    Shimizu , author S

    author author R. Shimizu , author S. Kobayashi , author Y. Watanabe , author Y. Ando , \ and\ author T. Hitosugi ,\ 10.1063/5.0020370 journal journal APL Materials \ volume 8 ,\ pages 111110 ( year 2020 ) NoStop

  7. [7]

    Liu , author S

    author author Y. Liu , author S. S. \ Fields , author T. Mimura , author K. P. \ Kelley , author S. Trolier-McKinstry , author J. F. \ Ihlefeld , \ and\ author S. V. \ Kalinin ,\ 10.1063/5.0079217 journal journal Applied Physics Letters \ volume 120 ,\ pages 182903 ( year 2022 ) NoStop

  8. [8]

    author author B. N. \ Slautin , author Y. Liu , author H. Funakubo , \ and\ author S. V. \ Kalinin ,\ 10.1063/5.0198316 journal journal Journal of Applied Physics \ volume 135 ,\ pages 154901 ( year 2024 ) NoStop

  9. [9]

    Masubuchi , author M

    author author S. Masubuchi , author M. Morimoto , author S. Morikawa , author M. Onodera , author Y. Asakawa , author K. Watanabe , author T. Taniguchi , \ and\ author T. Machida ,\ 10.1038/s41467-018-03723-w journal journal Nature Communications \ volume 9 ,\ pages 1413 ( year 2018 ) NoStop

  10. [10]

    Yang , author R

    author author H. Yang , author R. Hu , author H. Wu , author X. He , author Y. Zhou , author Y. Xue , author K. He , author W. Hu , author H. Chen , author M. Gong , author X. Zhang , author P.-H. \ Tan , author E. R. \ Hern \'a ndez , \ and\ author Y. Xie ,\ 10.1021/acs.nanolett.3c04815 journal journal Nano Letters \ volume 24 ,\ pages 2789 ( year 2024 ) NoStop

  11. [11]

    Xie , author K

    author author Y. Xie , author K. Sattari , author C. Zhang , \ and\ author J. Lin ,\ 10.1016/j.pmatsci.2022.101043 journal journal Progress in Materials Science \ volume 132 ,\ pages 101043 ( year 2023 ) NoStop

  12. [12]

    A Mobile Robotic Chemist

    author author B. Burger , author P. M. \ Maffettone , author V. V. \ Gusev , author C. M. \ Aitchison , author Y. Bai , author X. Wang , author X. Li , author B. M. \ Alston , author B. Li , author R. Clowes , author N. Rankin , author B. Harris , author R. S. \ Sprick , \ and\ author A. I. \ Cooper ,\ 10.1038/s41586-020-2442-2 journal journal Nature \ vo...

  13. [13]

    author author N. J. \ Szymanski , author B. Rendy , author Y. Fei , author R. E. \ Kumar , author T. He , author D. Milsted , author M. J. \ McDermott , author M. Gallant , author E. D. \ Cubuk , author A. Merchant , author H. Kim , author A. Jain , author C. J. \ Bartel , author K. Persson , author Y. Zeng , \ and\ author G. Ceder ,\ 10.1038/s41586-023-0...

  14. [14]

    Adam ,\ 10.1073/pnas.2406320121 journal journal Proceedings of the National Academy of Sciences \ volume 121 ,\ pages e2406320121 ( year 2024 ) NoStop

    author author D. Adam ,\ 10.1073/pnas.2406320121 journal journal Proceedings of the National Academy of Sciences \ volume 121 ,\ pages e2406320121 ( year 2024 ) NoStop

  15. [15]

    Delgado-Licona , author D

    author author F. Delgado-Licona , author D. Addington , author A. Alsaiari , \ and\ author M. Abolhasani ,\ 10.1038/s44286-025-00217-7 journal journal Nature Chemical Engineering \ volume 2 ,\ pages 277 ( year 2025 ) NoStop

  16. [16]

    Waelder , author W

    author author R. Waelder , author W. Kim , author M. A. \ Pitt , author J. I. \ Myung , \ and\ author B. Maruyama ,\ 10.1063/5.0267704 journal journal APL Machine Learning \ volume 3 ( year 2025 ),\ 10.1063/5.0267704 NoStop

  17. [17]

    author author T. J. \ Booth \ and\ author P. Bøggild ,\ 10.1038/s44286-025-00221-x journal journal Nature Chemical Engineering \ volume 2 ,\ pages 292 ( year 2025 ) NoStop

  18. [18]

    Zhao , author J

    author author Y. Zhao , author J. Liao , author S. Bu , author Z. Hu , author J. Hu , author Q. Lu , author M. Shang , author B. Guo , author G. Chen , author Q. Zhao , author K. Jia , author G. Wang , author E. Errington , author Q. Xie , author Y. Zhang , author M. Guo , author B. Mao , author L. Lin , \ and\ author Z. Liu ,\ 10.1038/s44286-025-00227-5 ...

  19. [19]

    Hou , author J

    author author B. Hou , author J. Wu , \ and\ author D. Y. \ Qiu ,\ 10.1038/s41467-024-53748-7 journal journal Nature Communications \ volume 15 ,\ pages 9481 ( year 2024 ) NoStop

  20. [20]

    Xiao , author R

    author author H. Xiao , author R. Li , author X. Shi , author Y. Chen , author L. Zhu , author X. Chen , \ and\ author L. Wang ,\ 10.1038/s41467-023-42870-7 journal journal Nature Communications \ volume 14 ,\ pages 7027 ( year 2023 ) NoStop

  21. [21]

    author author M. K. \ Horton , author P. Huck , author R. X. \ Yang , author J. M. \ Munro , author S. Dwaraknath , author A. M. \ Ganose , author R. S. \ Kingsbury , author M. Wen , author J. X. \ Shen , author T. S. \ Mathis , author A. D. \ Kaplan , author K. Berket , author J. Riebesell , author J. George , author A. S. \ Rosen , author E. W. C. \ Spo...

  22. [22]

    Zhang , author O

    author author W. Zhang , author O. Çakıroğlu , author A. Al-Enizi , author A. Nafady , author X. Gan , author X. Ma , author S. Kuriakose , author Y. Xie , \ and\ author A. Castellanos-Gomez ,\ 10.29026/oea.2023.220101 journal journal Opto-Electronic Advances \ volume 6 ,\ pages 220101 ( year 2023 a ) NoStop

  23. [23]

    Xie , author Z

    author author Y. Xie , author Z. Wang , author Y. Zhan , author P. Zhang , author R. Wu , author T. Jiang , author S. Wu , author H. Wang , author Y. Zhao , author T. Nan , \ and\ author X. Ma ,\ https://iopscience.iop.org/article/10.1088/1361-6528/aa5439 journal journal Nanotechnology \ volume 28 ,\ pages 084001 ( year 2017 ) NoStop

  24. [24]

    Xie , author J

    author author Y. Xie , author J. Lee , author Y. Wang , \ and\ author P. X.-L. \ Feng ,\ 10.1002/admt.202000794 journal journal Advanced Materials Technologies \ volume 6 ,\ pages 2000794 ( year 2020 ) NoStop

  25. [25]

    Zhang , author K

    author author W. Zhang , author K. He , author A. Castellanos-Gomez , \ and\ author Y. Xie ,\ 10.1016/j.trechm.2023.10.003 journal journal Trends in Chemistry \ volume 5 ,\ pages 920 ( year 2023 b ) NoStop

  26. [26]

    Sözen , author J

    author author Y. Sözen , author J. J. \ Riquelme , author Y. Xie , author C. Munuera , \ and\ author A. Castellanos-Gomez ,\ 10.1002/smtd.202300326 journal journal Small Methods \ volume 7 ,\ pages 2300326 ( year 2023 ) NoStop

  27. [27]

    Xie , author M

    author author Y. Xie , author M. Madel , author Y. Li , author W. Jie , author B. Neuschl , author M. Feneberg , \ and\ author K. Thonke ,\ 10.1063/1.4771696 journal journal Journal of Applied Physics \ volume 112 ,\ pages 123111 ( year 2012 ) NoStop

  28. [28]

    author author S. G. \ Baird \ and\ author T. D. \ Sparks ,\ 10.1016/j.matt.2022.11.007 journal journal Matter \ volume 5 ,\ pages 4170 ( year 2022 ) NoStop

  29. [29]

    Saar , author H

    author author L. Saar , author H. Liang , author A. Wang , author A. McDannald , author E. Rodriguez , author I. Takeuchi , \ and\ author A. G. \ Kusne ,\ 10.1557/s43577-022-00430-2 journal journal MRS Bulletin \ volume 47 ,\ pages 881 ( year 2022 ) NoStop

  30. [30]

    Ginsburg , author K

    author author T. Ginsburg , author K. Hippe , author R. Lewis , author D. Ozgulbas , author A. Cleary , author R. Butler , author C. Stone , author A. Stroka , \ and\ author I. Foster ,\ 10.48550/arXiv.2310.00510 title Exploring benchmarks for self-driving labs using color matching , \ ( year 2023 ),\ http://arxiv.org/abs/2310.00510 arXiv:2310.00510 [cs.R...

  31. [31]

    Zanella , author N

    author author A. Zanella , author N. Bui , author A. Castellani , author L. Vangelista , \ and\ author M. Zorzi ,\ 10.1109/JIOT.2014.2306328 journal journal IEEE Internet of Things Journal \ volume 1 ,\ pages 22 ( year 2014 ) NoStop

  32. [32]

    author author Adafruit Industries ,\ @noop title Adafruit as7341 10-channel light / color sensor breakout , \ howpublished https://learn.adafruit.com/adafruit-as7341-10-channel-light-color-sensor-breakout/overview ,\ note accessed 4 May 2025 NoStop

  33. [33]

    Xie , author K

    author author Y. Xie , author K. He , \ and\ author A. Castellanos-Gomez ,\ 10.1002/sstr.202500173 journal journal Small Structures \ volume 6 ,\ pages 2500173 ( year 2025 ) NoStop

  34. [34]

    author author S. X. \ Leong , author C. E. \ Griesbach , author R. Zhang , author K. Darvish , author Y. Zhao , author A. Mandal , author Y. Zou , author H. Hao , author V. Bernales , \ and\ author A. Aspuru-Guzik ,\ 10.1038/s41570-025-00747-x journal journal Nature Reviews Chemistry \ ( year 2025 ),\ 10.1038/s41570-025-00747-x ,\ note advance online publ...

  35. [35]

    Liu , author Q

    author author Y. Liu , author Q. Lei , author X. He , author Y. Xue , author K. He , author H. Yang , author Y. Wang , author X. Zhang , author L. Yang , author Y. Zhou , author R. Hu , \ and\ author Y. Xie ,\ 10.5281/zenodo.16944622 title Code and dataset for: Building an affordable self-driving lab: Practical machine learning experiments for physics edu...