pith. sign in

arxiv: 2606.09144 · v1 · pith:JXBG25ZEnew · submitted 2026-06-08 · 💻 cs.DB

Containerizing BIDSme : A Reproducible Tool for BIDS Conversion

Pith reviewed 2026-06-27 14:36 UTC · model grok-4.3

classification 💻 cs.DB
keywords BIDSDockercontainerizationneuroimagingdata conversionreproducibilityBIDSme
0
0 comments X

The pith

Containerizing BIDSme with Docker makes the neuroimaging data conversion tool portable and reproducible.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows how wrapping the semi-automated BIDSme tool in Docker containers using Docker Compose overcomes its prior limits on portability and accessibility. This packaging allows the tool to convert raw brain imaging data into the BIDS standard more consistently across different computing setups. A sympathetic reader would care because standardized BIDS datasets become easier to create, share, and analyze without installation barriers. The authors describe design choices, refinements, and validation steps that produce a flexible and lightweight containerized version. They also note improved integration with platforms such as Neurodesk.

Core claim

Packaging BIDSme into Docker containers with Docker Compose produces a portable, reproducible, and user-friendly application that preserves the original tool's semi-automated conversion of raw neuroimaging data into BIDS format while enabling easier integration into existing platforms like Neurodesk.

What carries the argument

Docker and Docker Compose containerization, which bundles BIDSme and its dependencies for consistent execution across environments.

If this is right

  • Users can execute BIDSme conversions without complex local software installations.
  • BIDS conversion results become reproducible across different operating systems and hardware.
  • The tool integrates directly into platforms such as Neurodesk for streamlined workflows.
  • Iterative refinements produce a lightweight container that remains flexible for varied use cases.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same container approach could extend to other semi-automated neuroimaging conversion tools facing adoption barriers.
  • Standardized container access might reduce variability in how different research groups prepare BIDS datasets.
  • Embedding validation scripts inside the container could enable automated checks of output quality during conversion.

Load-bearing premise

The containerized version preserves the full functionality and exact behavior of the original BIDSme tool without compatibility problems.

What would settle it

Running identical raw neuroimaging datasets through both the original BIDSme and the containerized version, then comparing the resulting BIDS folder structures and metadata files for any differences.

Figures

Figures reproduced from arXiv: 2606.09144 by Antoine Jacquemin, Bradley Spitz, Christophe Phillips, Nikita Beliy.

Figure 1
Figure 1. Figure 1: General workflow of BIDSme : from raw neuroimaging data to a BIDS-compliant dataset. Despite its potential, BIDSme was not initially designed for broad distribution or easy deploy￾ment. It required local installation with multiple dependencies and lacked support for standard￾ized, reproducible environments. This limited its accessibility for end-users, particularly those without technical expertise in Pyth… view at source ↗
Figure 2
Figure 2. Figure 2: The multi-stage Dockerfile to optimize the image building process [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Dockerfile’s entry point logic 5.4 Public distribution via GitHub The full containerization project is hosted on GitHub [16], providing open access to the Dockerfile, helper scripts, and example configurations. This ensures transparency, version control, and easy collaboration across users and contributors. The repository also includes documentation to guide new users through installation, setup, and execu… view at source ↗
read the original abstract

The "Brain Imaging Data Structure" (BIDS) has become a widely adopted standard for organizing and sharing neuroimaging datasets of various modalities. However, converting raw brain imaging data into BIDS framework remains a complex and time-consuming task. BIDSme is a semi-automated tool developed to streamline this conversion process, but until recently, it lacked the portability and accessibility needed for widespread adoption. This paper presents the containerization of BIDSme using Docker and Docker Compose, improving usability, reproducibility, and integration into existing platforms like Neurodesk. It also details the design choices, iterative refinements, and validation process that led to a flexible, lightweight, and user-friendly containerized application.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The manuscript describes the containerization of BIDSme, a semi-automated tool for converting raw neuroimaging data to the BIDS standard, using Docker and Docker Compose. It covers the design choices, iterative refinements, and validation process that produced a flexible, lightweight containerized application with improved usability, reproducibility, and integration into platforms such as Neurodesk.

Significance. If the containerized version preserves full original functionality while delivering the stated gains in portability and ease of use, the work would lower practical barriers to BIDS adoption in neuroimaging by enabling reproducible data-conversion workflows across diverse computing environments.

major comments (1)
  1. [validation process] The validation process (described in the abstract and presumably detailed in the methods/results sections) is presented only narratively; no quantitative metrics, test-dataset success rates, error analysis, or before/after comparisons are supplied to support the claims of improved usability and reproducibility.
minor comments (2)
  1. Provide explicit links to the public Dockerfiles, compose files, and any test data in the manuscript so that readers can reproduce the container exactly as described.
  2. Clarify the precise host-system prerequisites (e.g., Docker version, GPU support) required for the Neurodesk integration example.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the positive summary and the recommendation of minor revision. The single major comment is addressed point-by-point below.

read point-by-point responses
  1. Referee: [validation process] The validation process (described in the abstract and presumably detailed in the methods/results sections) is presented only narratively; no quantitative metrics, test-dataset success rates, error analysis, or before/after comparisons are supplied to support the claims of improved usability and reproducibility.

    Authors: We agree that the validation section would be strengthened by quantitative evidence. In the revised manuscript we will add: (i) success rates on a curated set of test neuroimaging datasets (raw DICOM and NIfTI files from multiple modalities), (ii) a brief error analysis of the few conversion failures encountered, and (iii) before/after comparisons of setup time and number of manual steps required. These additions will appear in the Methods and Results sections and will be supported by a supplementary table. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper is a purely descriptive engineering report on Docker containerization of the existing BIDSme tool. It contains no equations, fitted parameters, predictions, uniqueness theorems, or derivation chains of any kind. The central claim (successful containerization with improved usability and reproducibility) is advanced through narrative description of design choices and iterative validation, with no load-bearing step that reduces to a self-citation, ansatz, or input-by-construction. This is self-contained software packaging work with no opportunity for the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is a software packaging report containing no mathematical free parameters, axioms, or postulated scientific entities.

pith-pipeline@v0.9.1-grok · 5643 in / 1015 out tokens · 31329 ms · 2026-06-27T14:36:07.814909+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

16 extracted references · 2 canonical work pages

  1. [1]

    The Brain Imaging Data Structure, a Format for Organizing and Describing Outputs of Neuroimaging Experiments,

    K. J. Gorgolewski, T. Auer, V. D. Calhoun, R. C. Craddock, S. Das, E. P. Duff, G. Flandin, S. S. Ghosh, T. Glatard, Y. O. Halchenko, D. A. Handwerker, M. Hanke, D. Keator, X. Li, Z. Michael, C. Maumet, B. N. Nichols, T. E. Nichols, J. Pellman, J.-B. Poline, A. Rokem, G. Schaefer, V. Sochat, W. Triplett, J. A. Turner, G. Varoquaux, and R. A. Poldrack, “The...

  2. [2]

    The past, present, and future of the brain imaging data structure (BIDS),

    R. A. Poldrack, C. J. Markiewicz, S. Appelhoff, Y. K. Ashar, T. Auer, S. Baillet, S. Bansal, L. Beltrachini, C. G. Benar, G. Bertazzoli, S. Bhogawar, R. W. Blair, M. Bortoletto, M. Boudreau, T. L. Brooks, V. D. Calhoun, F. M. Castelli, P. Clement, A. L. Cohen, J. Cohen-Adad, S. D’Ambrosio, G. de Hollander, M. de la Iglesia-Vayá, A. de la Vega, A. Delorme,...

  3. [3]

    Beliy, G

    N. Beliy, G. Hammad, and C. Phillips, “BIDSme.” [Online]. Available: https: //github.com/CyclotronResearchCentre/BIDSme

  4. [4]

    BIDSme: Expandable BIDS-ifier of Brain Imagery Datasets,

    N. Beliy, C. Guillemin, E. Pommier, G. Hammad, and C. Phillips, “BIDSme: Expandable BIDS-ifier of Brain Imagery Datasets,”Journal of Open Source Software, vol. 8, no. 92, p. 5575, Dec. 2023

  5. [5]

    Docker: lightweight linux containers for consistent development and deploy- ment,

    D. Merkel, “Docker: lightweight linux containers for consistent development and deploy- ment,”Linux journal, vol. 2014, no. 239, p. 2, 2014. 11

  6. [6]

    Neurodesk: an accessible, flexible and portable data analysis environment for reproducible neuroimaging,

    A. I. Renton, T. T. Dao, T. Johnstone, O. Civier, R. P. Sullivan, D. J. White, P. Lyons, B. M. Slade, D. F. Abbott, T. J. Amos, S. Bollmann, A. Botting, M. E. J. Campbell, J. Chang, T. G. Close, M. Dörig, K. Eckstein, G. F. Egan, S. Evas, G. Flandin, K. G. Garner, M. I. Garrido, S. S. Ghosh, M. Grignard, Y. O. Halchenko, A. J. Hannan, A. S. Heinsfeld, L. ...

  7. [7]

    BIDSme tutorial

    N. Beliy, “BIDSme tutorial.” [Online]. Available: https://github.com/ CyclotronResearchCentre/bidsme_tutorial

  8. [8]

    BIDSme example dataset

    ——, “BIDSme example dataset.” [Online]. Available: https://github.com/ CyclotronResearchCentre/bidsme_examples

  9. [9]

    Jupyter Notebooks – a publishing format for reproducible computational workflows,

    T. Kluyver, B. Ragan-Kelley, F. Pérez, B. Granger, M. Bussonnier, J. Frederic, K. Kelley, J. Hamrick, J. Grout, S. Corlay, P. Ivanov, D. Avila, S. Abdalla, and C. Willing, “Jupyter Notebooks – a publishing format for reproducible computational workflows,” inPosition- ing and Power in Academic Publishing: Players, Agents and Agendas, F. Loizides and B. Sch...

  10. [10]

    Data Structures for Statistical Computing in Python,

    Wes McKinney, “Data Structures for Statistical Computing in Python,” inProceedings of the 9th Python in Science Conference, Stéfan van der Walt and Jarrod Millman, Eds., 2010, pp. 56 – 61

  11. [11]

    pyparsing: A Python parsing module,

    P. T. McGuire, “pyparsing: A Python parsing module,” GitHub, 2024, version 3.1.2. [Online]. Available: https://github.com/pyparsing/pyparsing

  12. [12]

    nipy/nibabel: NiBabel Release 5.3.1,

    M. Brett, C. Markiewicz, J. Millman, S. Ghosh, G. Varoquaux, O. Esteban, F. Peréz-García, F. Morency, and C. Burns, “nipy/nibabel: NiBabel Release 5.3.1,” Oct 2024. [Online]. Available: https://nipy.org/nibabel/

  13. [13]

    SU-E-T-33: pydicom: an open source DICOM library,

    D. Mason, “SU-E-T-33: pydicom: an open source DICOM library,”Medical Physics, vol. 38, no. 6, p. 3493, 2011

  14. [14]

    The first step for neuroimaging data analysis: DICOM to NIfTI conversion,

    X. Li, P. S. Morgan, J. Ashburner, J. Smith, and C. Rorden, “The first step for neuroimaging data analysis: DICOM to NIfTI conversion,”Journal of Neuroscience Methods, vol. 264, pp. 47–56, 2016

  15. [15]

    DCM-parser: A Lightning DICOM Parser,

    L. Peng, “DCM-parser: A Lightning DICOM Parser,” https://github.com/PL97/DICOM_ Parser, 2022

  16. [16]

    BIDSme Containerization Project

    B. Spitz, N. Beliy, and C. Phillips, “BIDSme Containerization Project.” [Online]. Available: https://github.com/CyclotronResearchCentre/BIDSme_containerisation 12