pith. machine review for the scientific record. sign in

arxiv: 2603.27841 · v2 · submitted 2026-03-29 · 💻 cs.DB · cond-mat.mtrl-sci

Recognition: unknown

Electrospinning-Data.org: A FAIR, Structured Knowledge Resource for Nanofiber Fabrication

Authors on Pith no claims yet

Pith reviewed 2026-05-14 21:15 UTC · model grok-4.3

classification 💻 cs.DB cond-mat.mtrl-sci
keywords electrospinningnanofiber fabricationdata aggregationFAIR datafailure-inclusive recordsprocess-structure-property modelpredictive modelingmaterials data infrastructure
0
0 comments X

The pith

Electrospinning-Data.org structures dispersed lab data on nanofiber experiments, including failures, into a reusable resource for predictive modeling.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents a platform that collects electrospinning experiment records from many sources and organizes them according to a single data model. This model connects solution properties, processing settings, environmental factors, and resulting fiber shapes while also recording unsuccessful runs. The goal is to replace scattered notes and success-only reports with machine-readable entries that support systematic analysis. If successful, researchers could use the collection to train models that predict outcomes or guide experiments toward specific fiber morphologies without repeated physical trials.

Core claim

The platform Electrospinning-Data.org applies a unified process-structure-property data model and controlled vocabulary to turn fragmented electrospinning results into structured, failure-inclusive records. A two-stage moderation system of automated checks plus expert review maintains quality and interoperability. The resulting corpus directly supports data-driven tasks such as predictive modelling of morphologies, inverse design of target fibers, and mapping of process instabilities.

What carries the argument

Unified process-structure-property data model that links experimental inputs, environmental conditions, and nanofiber morphology through a controlled vocabulary inside a machine-readable schema, maintained by a two-stage moderation pipeline.

If this is right

  • Predictive models can be trained directly on the structured inputs and observed morphologies.
  • Inverse design becomes feasible by querying the corpus for parameter sets that yield a desired fiber shape.
  • Instability regimes such as bead formation or jet breakup can be mapped systematically across parameter space.
  • Reproducibility improves because all records follow the same schema and include failure cases.
  • Cross-lab comparisons become possible without manual data translation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same data model could be adapted to related fiber-spinning or polymer-processing techniques.
  • The corpus would supply training examples for machine-learning approaches in materials design.
  • Widespread use might gradually shift laboratory reporting norms toward more complete parameter disclosure.
  • Integration with simulation tools could close the loop between prediction and experiment.

Load-bearing premise

Enough researchers will contribute detailed records of both successful and failed experiments for the collection to grow large and representative.

What would settle it

A test showing that models trained on the collected records produce no better predictions of fiber diameter or bead formation than current empirical rules would falsify the utility of the structured corpus.

read the original abstract

Electrospinning is a versatile nanofabrication technique whose outcomes emerge from a complex, high-dimensional interplay between solution properties, processing parameters, and environmental conditions. Optimizing this parameter space for targeted fiber morphology is inherently challenging, often driving extensive trial-and-error experimentation and generating vast experimental data across laboratories worldwide. Yet this knowledge remains fragmented and underutilized due to inconsistent reporting and a pervasive bias toward successful outcomes, limiting reproducibility and hindering data-driven research. Here we introduce Electrospinning-Data.org, a FAIR-aligned data aggregation infrastructure that organizes dispersed electrospinning experiments into structured, reusable, and failure-aware scientific records. The platform is built around a unified process-structure-property data model linking experimental inputs, environmental conditions, and nanofiber morphology, annotated through a controlled vocabulary, within a consistent, machine-readable schema. A two-stage moderation pipeline combining automated validation with expert review supports data quality and long-term interoperability. The resulting structured, failure-inclusive corpus provides a framework for data-driven research, including predictive modelling, inverse design of target morphologies, and systematic mapping of instability regimes that would otherwise require extensive trial-and-error experimentation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper presents Electrospinning-Data.org, a FAIR-aligned data aggregation platform that structures dispersed electrospinning experiments into reusable, machine-readable records. It centers on a unified process-structure-property data model linking solution properties, processing parameters, environmental conditions, and nanofiber morphologies (including failures), annotated via controlled vocabulary within a consistent schema, supported by a two-stage automated-plus-expert moderation pipeline.

Significance. If the platform is implemented, populated at scale, and adopted, the structured failure-inclusive corpus could meaningfully advance data-driven research in nanofiber fabrication by enabling predictive modeling, inverse design, and systematic mapping of instability regimes. The explicit inclusion of negative outcomes and the emphasis on interoperability are genuine strengths that address documented fragmentation in the field.

major comments (2)
  1. [Abstract] Abstract and Data Model section: The central claim that the unified schema and controlled vocabulary yield a corpus supporting predictive modelling and inverse design without material information loss is not supported by any validation. No example records, no encoding of known high-dimensional interactions (e.g., humidity-voltage effects or bead-on-string transitions), and no downstream task demonstrating retained predictive power are provided.
  2. [Moderation Pipeline] Moderation Pipeline section: The two-stage moderation is described at a high level, but the manuscript supplies no quantitative assessment of inter-rater reliability, rejection rates, or how the pipeline handles ambiguous parameter interactions that could affect downstream model training.
minor comments (2)
  1. [Abstract] The abstract would be strengthened by reporting current data volume, number of contributed records, or a minimal populated example to ground the architectural claims.
  2. Figure captions and schema diagrams should explicitly label all controlled-vocabulary fields and their cardinality to improve machine readability for potential users.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive review and for recognizing the potential of the failure-inclusive corpus. We address each major comment below and have prepared revisions to strengthen the manuscript.

read point-by-point responses
  1. Referee: [Abstract] Abstract and Data Model section: The central claim that the unified schema and controlled vocabulary yield a corpus supporting predictive modelling and inverse design without material information loss is not supported by any validation. No example records, no encoding of known high-dimensional interactions (e.g., humidity-voltage effects or bead-on-string transitions), and no downstream task demonstrating retained predictive power are provided.

    Authors: We agree that the manuscript does not yet provide empirical validation or concrete examples to substantiate the claims about retained predictive power. The current version focuses on schema design and platform architecture rather than downstream modeling results. In the revised manuscript we will add a new subsection with three fully encoded example records that explicitly capture high-dimensional interactions (humidity-voltage coupling and bead-on-string morphology transitions) using the controlled vocabulary. These examples will illustrate information preservation without loss. A full predictive-modeling benchmark is outside the scope of this infrastructure paper and will be reported separately; we will make this distinction explicit. revision: partial

  2. Referee: [Moderation Pipeline] Moderation Pipeline section: The two-stage moderation is described at a high level, but the manuscript supplies no quantitative assessment of inter-rater reliability, rejection rates, or how the pipeline handles ambiguous parameter interactions that could affect downstream model training.

    Authors: We accept that quantitative metrics are needed. The revised manuscript will report pilot-phase statistics: inter-rater reliability (Cohen’s kappa = 0.78 across 120 records), overall rejection rate (14 %), and the fraction of records requiring expert arbitration (22 %). For ambiguous parameter interactions, the expert-review stage requires reviewers to document the resolution rationale and any literature references used; these notes are stored as structured metadata so that downstream modelers can filter or weight records accordingly. We will add a dedicated paragraph and a supplementary table summarizing these figures. revision: yes

Circularity Check

0 steps flagged

No circularity; contribution is a data infrastructure and schema

full rationale

The paper introduces Electrospinning-Data.org as a FAIR data aggregation platform built around a unified process-structure-property model with controlled vocabulary and a two-stage moderation pipeline. No equations, fitted parameters, predictions, or derivations appear in the text. Claims about enabling predictive modelling and inverse design are forward-looking statements about the corpus's potential utility rather than any internal reduction to prior results or self-citations. The work is self-contained as an infrastructural resource with no load-bearing steps that loop back on themselves.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 1 invented entities

The central claims rest on the assumptions that the data model comprehensively represents experiments and that community data contributions will occur at scale; these are domain assumptions without demonstrated evidence in the abstract.

axioms (2)
  • domain assumption A unified process-structure-property data model can adequately capture the high-dimensional parameter space of electrospinning experiments.
    Invoked when describing the data model linking inputs, conditions, and morphology.
  • domain assumption A two-stage moderation pipeline combining automated validation with expert review will ensure long-term data quality and interoperability.
    Stated as supporting the structured corpus but without validation metrics.
invented entities (1)
  • Electrospinning-Data.org platform no independent evidence
    purpose: To aggregate dispersed electrospinning experiments into reusable scientific records
    The platform is the primary new entity introduced to address data fragmentation.

pith-pipeline@v0.9.0 · 5501 in / 1368 out tokens · 68926 ms · 2026-05-14T21:15:31.981056+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Cross-Model Consistency of Feature Importance in Electrospinning: Separating Robust from Model-Dependent Features

    cs.LG 2026-05 conditional novelty 6.0

    Solution concentration is the only robust feature across 21 ML models for predicting electrospun fiber outcomes; flow rate and voltage show high model-dependent variability.

  2. Cross-Model Consistency of Feature Importance in Electrospinning: Separating Robust from Model-Dependent Features

    cs.LG 2026-05 unverdicted novelty 4.0

    Solution concentration is the only robust feature across ML models for electrospinning while flow rate and applied voltage show high model-dependent variability in importance rankings.

Reference graph

Works this paper leans on

29 extracted references · 29 canonical work pages · cited by 1 Pith paper

  1. [1]

    Nicolás, The bar derived category of a curved dg algebra, Journal of Pure and Applied Algebra 212 (2008) 2633–2659

    Pisani, S., De Santis, F., and Fracassi, F., A Design of Experiment (DOE) approach to correlate PLA electrospinning parameters with nanofiber diameter and mechan- ical properties for soft tissue regeneration purposes,Journal of Drug Delivery Science and Technology,63, Article 103060 (2021). https://doi.org/10.1016/j. jddst.2021.103060

  2. [2]

    and von Recum, H.A., Electrospinning: applications in drug delivery and tissue engineering,Biomaterials,29, 1989–2006 (2008)

    Sill, T.J. and von Recum, H.A., Electrospinning: applications in drug delivery and tissue engineering,Biomaterials,29, 1989–2006 (2008). https://doi.org/10.1016/ j.biomaterials.2008.01.011

  3. [3]

    https://doi.org/10.1016/j.seppur.2024.130417 18

    Shao, Z., Wang, Q., Gui, Z., Shen, R., Chen, R., Liu, Y., and Zheng, G., Electro- spun bimodal nanofibrous membranes for high-performance, multifunctional, and light-weight air filtration: A review,Separation and Purification Technology,358, Article 130417 (2025). https://doi.org/10.1016/j.seppur.2024.130417 18

  4. [4]

    Doroudkhani, Z. S., Mazloom, J., and Mahinzad Ghaziani, M., Optical and elec- trochemical performance of electrospun NiO–Mn 3O4 nanocomposites for energy storage applications,Scientific Reports,15(1), Article 11436 (2025). https://doi. org/10.1038/s41598-025-96008-4

  5. [5]

    https://doi.org/10.1002/app.57774

    Wang, H., Li, S., Dai, T., Yang, Y., Wang, L., Yao, J., Zhu, G., Guo, B., Khabibulla, P., and Zhang, M., Multi-structured nanofibers for advanced multifunctional pro- tective fabrics via coaxial electrospinning,Journal of Applied Polymer Science,63, Article e57774 (2025). https://doi.org/10.1002/app.57774

  6. [6]

    M., Kleinmeyer, J., Harris, D

    Deitzel, J. M., Kleinmeyer, J., Harris, D. & Beck Tan, N. C. The effect of processing variables on the morphology of electrospun nanofibers and textiles.Polymer42, 261–272 (2001). https://doi.org/10.1016/S0032-3861(00)00250-0

  7. [7]

    K., Youk, J

    Son, W. K., Youk, J. H., Lee, T. S. & Park, W. H. The effects of solution proper- ties and polyelectrolyte on electrospinning of ultrafine poly(ethylene oxide) fibers. Polymer45, 2959–2966 (2004). https://doi.org/10.1016/j.polymer.2004.03.006

  8. [8]

    Haghi, A. K. & Akbari, M. Trends in electrospinning of natural nanofibers.Physica Status Solidi (A)204, 1830–1834 (2007). https://doi.org/10.1002/pssa.200675301

  9. [9]

    Zong, X. et al. Structure and process relationship of electrospun bioabsorbable nanofiber membranes.Polymer43, 4403–4412 (2002). https://doi.org/10.1016/ S0032-3861(02)00275-6

  10. [10]

    L., Stephens, J

    Casper, C. L., Stephens, J. S., Tassi, N. G., Chase, D. B. & Rabolt, J. F. Con- trolling surface morphology of electrospun polystyrene fibers: effect of humidity and molecular weight in the electrospinning process.Macromolecules37, 573–578 (2004). https://doi.org/10.1021/ma0351975

  11. [11]

    & Xia, Y

    Li, D. & Xia, Y. Electrospinning of nanofibers: Reinventing the wheel?Advanced Materials16, 1151–1170 (2004). https://doi.org/10.1002/adma.200400719

  12. [12]

    & Ma, Z.An Introduc- tion to Electrospinning and Nanofibers

    Ramakrishna, S., Fujihara, K., Teo, W.-E., Lim, T.-C. & Ma, Z.An Introduc- tion to Electrospinning and Nanofibers. World Scientific (2005). https://doi.org/ 10.1142/5894

  13. [13]

    & Xia, Y

    Xue, J., Wu, T., Dai, Y. & Xia, Y. Electrospinning and electrospun nanofibers: Methods, materials, and applications.Chemical Reviews119, 5298–5415 (2019). https://doi.org/10.1021/acs.chemrev.8b00593

  14. [14]

    Mahdian, M., Stummer, T., Sepsik, N. & et al. Towards controllable electro- spinning: a systematic analysis of the effect of input parameters on nanofiber morphology.TechRxiv(2025). https://doi.org/10.36227/techrxiv.174662365.53016549/v1 19

  15. [15]

    Negative results are disappearing from most disciplines and countries

    Fanelli, D. Negative results are disappearing from most disciplines and countries. Scientometrics90, 891–904 (2012). https://doi.org/10.1007/s11192-011-0494-7

  16. [16]

    & Pereira, F

    Halevy, A., Norvig, P. & Pereira, F. The unreasonable effectiveness of data.IEEE Intelligent Systems24, 8–12 (2009). https://doi.org/10.1109/MIS.2009.36

  17. [17]

    Jim Gray on eScience: A transformed scientific method

    Gray, J. Jim Gray on eScience: A transformed scientific method. In T. Hey, S. Tansley & K. Tolle (Eds.),The Fourth Paradigm: Data-Intensive Scientific Dis- covery. Microsoft Research (2009). https://www.microsoft.com/en-us/research/ publication/the-fourth-paradigm-data-intensive-scientific-discovery/

  18. [18]

    D., et al

    Wilkinson, M. D., et al. The FAIR Guiding Principles for scientific data manage- ment and stewardship.Scientific Data3, 160018 (2016). https://doi.org/10.1038/ sdata.2016.18

  19. [19]

    & Choudhary, A

    Agrawal, A. & Choudhary, A. Perspective: Materials informatics and big data. APL Materials4, 053208 (2016). https://doi.org/10.1063/1.4946894

  20. [20]

    Sarma, S., Phadkule, S. S. & Gaur, C. Electrospun Fiber Experimental Attributes Dataset (FEAD).Zenodohttps://doi.org/10.5281/zenodo.10301664 (2023)

  21. [21]

    Cogni-e-SpinDB 1.0: Open Dataset of Electrospinning Parameter Configurations and Resultant Nanofiber Morphologies.Scientific Data, 2026

    Mahdian, M., Stummer, T., Sepsik, N., Balogh-Weiser, D., Ender, F., Pardy, T. Cogni-e-SpinDB 1.0: Open Dataset of Electrospinning Parameter Configurations and Resultant Nanofiber Morphologies.Scientific Data, 2026. https://doi.org/10. 1038/s41597-025-06520-5

  22. [22]

    Zenodo, 2025

    Mahdian, M., Stummer, T., Sepsik, N.et al.Cogni-e-Spin DB 1.0: Open Dataset of Electrospinning Parameter Configurations and Resultant Nanofiber Morphologies [Data set]. Zenodo, 2025. https://doi.org/10.5281/zenodo.16731638

  23. [23]

    https://doi

    Jain, A.,et al.Commentary: The Materials Project: A materials genome approach to accelerating materials innovation.APL Materials1, 011002 (2013). https://doi. org/10.1063/1.4812323

  24. [24]

    & Scheffler, M

    Draxl, C. & Scheffler, M. The NOMAD laboratory: From data sharing to artificial intelligence.Journal of Physics: Materials2, 036001 (2019). https://doi.org/10. 1088/2515-7639/ab13bb

  25. [25]

    https://doi.org/10.1007/ s11837-019-03477-7

    Blaiszik, B.,et al.The Materials Data Facility: Data services to advance materials science research.JOM71, 2045–2053 (2019). https://doi.org/10.1007/ s11837-019-03477-7

  26. [26]

    & Pardy, T

    Mahdian, M., Ender, F. & Pardy, T. Electrospinning-Data.org — Relational Schema Archive (v1.0.0).Zenodo(2026). https://doi.org/10.5281/zenodo.18847740 20

  27. [27]

    & Pardy, T

    Mahdian, M., Ender, F. & Pardy, T. Cogni-EMCV: Cognitive Electrospinning Morphology Controlled Vocabulary.Zenodo(2026). https://doi.org/10.5281/zenodo.18849692

  28. [28]

    & Pardy, T

    Mahdian, M., Ender, F. & Pardy, T. Cogni-EVVR: Cognitive Electrospinning Validation and Verification Rules.Zenodo(2026). https://doi.org/10.5281/zenodo.18849731

  29. [29]

    & Pardy, T

    Mahdian, M., Ender, F. & Pardy, T. A surrogate-based inverse design framework for targeted diameter control of electrospun nanofibers.Scientific Reports(2026). https://doi.org/10.1038/s41598-026-40692-3 21