Transformer-Based Source Detection and Morphological Classification in LOFAR Deep-Field Continuum Images

Caijuan Yue; Daniel Magro; Fujia Li; Guangwen Chen; John Abela; Kristian Z. Adami; Leah K. Morabito; Weibin Sun; Yogesh Wadadekar; Zhaoting Chen

arxiv: 2605.30500 · v1 · pith:MFTVR7N4new · submitted 2026-05-28 · 🌌 astro-ph.IM · astro-ph.GA

Transformer-Based Source Detection and Morphological Classification in LOFAR Deep-Field Continuum Images

Guangwen Chen , Kristian Z. Adami , John Abela , Caijuan Yue , Weibin Sun , Fujia Li , Zhaoting Chen , Daniel Magro

show 2 more authors

Yogesh Wadadekar Leah K. Morabito

This is my paper

Pith reviewed 2026-06-29 00:14 UTC · model grok-4.3

classification 🌌 astro-ph.IM astro-ph.GA

keywords radio source detectionmorphological classificationtransformer modelLOFAR deep fieldsRF-DETRinterferometric imagingextended radio galaxies

0 comments

The pith

Transformer-based detector detects and classifies radio sources in LOFAR deep fields at 91 percent F1 while treating multi-component galaxies as single objects.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that a transformer model called RF-DETR can perform instance-level detection and assign sources to five morphology classes in 150 MHz LOFAR images. The model is trained only on the ELAIS-N1 field and then run unchanged on the three other LOFAR Deep Fields. It recovers most sources from standard PyBDSF catalogues yet keeps classical extended radio galaxies as single entries instead of splitting them into separate Gaussian components. Artefacts receive their own explicit class, and the same model also matches the majority of visually confirmed giant radio galaxies.

Core claim

RF-DETR, adapted to multi-frequency-synthesis radio images and trained with a morphology-driven scheme on five mutually exclusive classes, achieves F1 approximately 91 percent on the ELAIS-N1 training field; when applied without retraining to the remaining three LOFAR Deep Fields it produces consistent catalogues that recover the majority of PyBDSF sources, represent multi-component galaxies as single detections, flag artefacts explicitly, and recover most visually identified extended and giant radio galaxies.

What carries the argument

RF-DETR, a transformer-based set-prediction detector trained on morphology-driven labels for five classes and adapted to interferometric continuum images.

If this is right

Catalogues distinguish artefact detections from astrophysical sources as separate classes.
Multi-component radio galaxies appear as single source-level entries rather than fragmented components.
The same model recovers the majority of visually identified extended and giant radio galaxies.
Performance remains consistent across fields that differ in depth and calibration.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could be tested on other radio surveys with different frequencies or resolutions to check how much retraining is actually required.
If artefact classes are reliable, downstream statistical studies could exclude or weight sources by class without additional post-processing.
Scaling the method to SKA-Low data volumes would require checking whether inference speed remains practical on the larger image sizes.

Load-bearing premise

A model trained solely on the ELAIS-N1 field can be applied directly to the other three fields without retraining and still maintain consistent performance despite differences in survey depth and calibration quality.

What would settle it

A clear drop in F1 score or a large increase in missed giant radio galaxies when the same trained model is evaluated on a new LOFAR field with measurably different noise or calibration properties.

Figures

Figures reproduced from arXiv: 2605.30500 by Caijuan Yue, Daniel Magro, Fujia Li, Guangwen Chen, John Abela, Kristian Z. Adami, Leah K. Morabito, Weibin Sun, Yogesh Wadadekar, Zhaoting Chen.

**Figure 1.** Figure 1: Examples of LOFAR 150 MHz and ancillary imaging for a 132 × 132 pixel cutout (3.3 × 3.3 arcmin at 1.5 arcsec pixel−1 ), displayed with a ZScale stretch (contrast = 0.2). Left to right: LOFAR 150 MHz, Spitzer 4.5 𝜇m, UKIDSS–DXS 𝐾, and Pan-STARRS 𝑖. Upper panels: native ancillary images (no PSF convolution). Lower panels: ancillary images PSF-matched to the LOFAR synthesized beam (FWHM ≃ 6 ′′) to enable dire… view at source ↗

**Figure 2.** Figure 2: Examples of the annotation classes shown on 132 × 132 pixel cutouts from the LOFAR 150 MHz continuum images, with a ZScale stretch (contrast = 0.2). Bounding boxes are distinguished using both colour and line style: Compact (blue, solid), Single-Extended (green, dashed), Multi-Extended (orange, dash–dot), Flagged (black, dotted), and Spurious (red, dash–dot–dotted). Left: predominantly Compact sources and … view at source ↗

**Figure 3.** Figure 3: Schematic overview of the RF-DETR architecture as used in this work. A LOFAR 150 MHz radio image cutout is processed by a ViT-based backbone with interleaved windowed and non-windowed encoder layers to extract image features, which are then passed to a Transformer encoder–decoder. A fixed set of object queries interacts with the encoded features and produces a set of predictions through the detection head,… view at source ↗

**Figure 4.** Figure 4: Detection completeness (left) and reliability (right) as a function of the IoU threshold for the full sample (All) and the five morphological classes. The curves correspond to the reference score threshold of 0.5, while shaded regions indicate the variation of score thresholds in the range 0.25–0.75. AI Model Predictions 0.96 0.96 0.95 0.95 0.94 0.94 0.91 0.91 0.90 0.90 0.89 0.86 0.86 0.85 0.84 0.97 0.96 0… view at source ↗

**Figure 5.** Figure 5: RF-DETR predictions on the validation set for the same image cutouts shown in [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

**Figure 6.** Figure 6: RF-DETR detections in randomly selected 15′ × 15′ regions from the four LoTSS Deep Fields (ELAIS-N1, Lockman Hole, Boötes, and EDFN). Predicted bounding boxes follow the same colour and line-style scheme as defined in [PITH_FULL_IMAGE:figures/full_fig_p010_6.png] view at source ↗

**Figure 7.** Figure 7: Venn diagrams illustrating the catalogue overlap between PyBDSF (blue) and RF-DETR (orange) in the four LoTSS Deep Fields. Numbers inside each circle indicate the number of detections in the corresponding catalogue. The overlap region shows the number of Py-matched sources, defined as PyBDSF sources that are associated with at least one RF-DETR bounding box, as well as the corresponding number of RF-matche… view at source ↗

**Figure 8.** Figure 8: Comparison of RF-DETR and PyBDSF flux densities for matched sources in the ELAIS–N1 field. Left panel: comparison of total flux densities obtained via direct pixel summation within RF-DETR bounding boxes versus PyBDSF islands. Right panel: comparison of peak flux densities, where RF-DETR values correspond to the maximum pixel intensity within the bounding boxes, while PyBDSF measurements are derived from G… view at source ↗

**Figure 9.** Figure 9: Total flux distributions for sources detected in the ELAIS-N1 field. Left: RF-DETR catalogue decomposed by morphological class (histograms with different colours and hatch patterns: Compact, Single-Extended, Multi-Extended, Flagged and Spurious), overlaid with the PyBDSF total flux distribution (black open histograms). Middle: Comparison between RF-DETR Compact, Single-Extended and PyBDSF single-Gaussian s… view at source ↗

**Figure 10.** Figure 10: Left: RF-DETR detections overlaid on the 15′ × 15′ cutouts of ELAIS-N1 mosaic. Right: PyBDSF catalogue entries on the same regions. Blue and green labels represent single Gaussian (S) and multiple Gaussians (M) sources, respectively. tent classifications in a fully automated and scalable manner, underscoring its potential for large-scale, systematic searches for extended and giant radio galaxies in forth… view at source ↗

read the original abstract

Radio source detection and morphological classification are fundamental for exploiting the scientific potential of modern radio continuum surveys. However, the rapidly increasing data volumes and the wide diversity of radio morphologies make traditional visual inspection infeasible and pose significant challenges for automated source finding. We apply a transformer-based set-prediction detector (RF-DETR) to 150\,MHz continuum images from the LOFAR Deep Fields for instance-level source detection and morphological classification. The method is adapted to multi-frequency-synthesis images of interferometric data and trained with a morphology-driven scheme using five mutually exclusive classes. The model is trained on the ELAIS-N1 Deep Field, where it achieves high detection and classification performance ($\mathrm{F1}\simeq 91$ per cent), and is then applied without retraining to the other three LOFAR Deep Fields. Across all four fields, the model yields consistent catalogues with modest field-to-field differences arising from survey depth and calibration. Compared with widely used PyBDSF catalogues, RF-DETR recovers the majority of PyBDSF sources while representing classical multi-component radio galaxies as single source-level detections rather than fragmented Gaussian components. Artefact-affected and spurious detections are identified as explicit classes, allowing these detections to be distinguished from general astrophysical sources in the resulting catalogues. As external validation, RF-DETR recovers the majority of visually identified extended and giant radio galaxies in the LOFAR Deep Fields and assigns them predominantly to extended morphological classes. These results indicate that transformer-based detectors provide a practical, scalable, morphology-aware approach to source finding in deep radio surveys, with clear relevance for forthcoming facilities such as SKA-Low.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

RF-DETR works on LOFAR images for detection plus five-class morphology including artefacts, but the cross-field generalization rests on qualitative claims rather than per-field metrics.

read the letter

The paper takes an existing transformer detector (RF-DETR) and trains it on one LOFAR deep field to do instance detection and assign sources to five mutually exclusive morphology classes. It then runs the same model on the other three fields without retraining. On the training field it reaches F1 around 91 percent, recovers most PyBDSF sources, and keeps multi-component galaxies as single detections instead of fragmenting them. It also flags artefacts as their own class, which is useful.

What stands out is the practical handling of radio-specific issues: the five-class scheme, the artefact category, and the fact that it treats extended sources as single objects. That matches real survey needs better than standard Gaussian fitting.

The main gap is the generalization story. The abstract and results describe the other fields as producing consistent catalogues with only modest differences, but they do not report precision, recall, or F1 broken down by field. Without those numbers or an explicit cross-validation setup, it is hard to judge how well the model actually travels across varying depth and calibration quality. The training details on splits and class balance are also light.

This is the kind of applied work that radio survey teams will want to test themselves. It is not a new architecture or theoretical advance, but the concrete pipeline and the artefact handling make it worth a referee's time. I would send it for review with the expectation that the authors add the missing per-field metrics and training protocol details.

Referee Report

2 major / 2 minor

Summary. The paper applies a transformer-based set-prediction detector (RF-DETR) to 150 MHz LOFAR Deep Field continuum images for instance-level source detection and five-class morphological classification. The model is trained on the ELAIS-N1 field (reported F1 ≃ 91 %) and applied without retraining to the remaining three LOFAR Deep Fields; the resulting catalogues are described as consistent with modest field-to-field differences, recover most PyBDSF sources while treating multi-component galaxies as single detections, and identify artefacts as explicit classes. External validation against visually identified extended sources is also presented, with the conclusion that the approach is scalable and relevant for SKA-Low.

Significance. If the cross-field generalization and performance claims hold with quantitative support, the work would demonstrate a practical, morphology-aware alternative to traditional source finders for large radio surveys. The explicit handling of artefacts and multi-component sources as distinct classes, together with the set-prediction formulation, would be a clear methodological advance over component-based tools such as PyBDSF.

major comments (2)

[Abstract / cross-field results] Abstract and the section on cross-field application: the central scalability claim requires that a single model trained on ELAIS-N1 maintains high performance on the other three fields without retraining. Only the F1 ≃ 91 % figure is supplied for the training field; the other fields are described only qualitatively as yielding “consistent catalogues with modest field-to-field differences” and recovering “the majority” of PyBDSF sources. No per-field precision, recall, or F1 values, nor any explicit cross-validation protocol or error bars, are reported. This absence directly weakens the quantitative basis for the generalization assertion.
[Methods] Methods / training description: the manuscript states that the model is “trained with a morphology-driven scheme using five mutually exclusive classes” on multi-frequency-synthesis images, yet provides no quantitative details on training/validation splits, class-imbalance handling, loss weighting, or how the interferometric image properties affect the input representation. These omissions make it impossible to assess whether the reported F1 score is robust or sensitive to the specific data characteristics of the training field.

minor comments (2)

[Abstract] The abstract and results text would benefit from explicit table or figure references for the per-class performance metrics and the catalogue comparison statistics.
[Methods] Notation for the five morphological classes should be defined once in a dedicated table or subsection rather than introduced only in passing.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. The comments highlight important areas where additional quantitative support and methodological transparency would strengthen the presentation of our results. We address each major comment below and indicate the revisions we will make.

read point-by-point responses

Referee: [Abstract / cross-field results] Abstract and the section on cross-field application: the central scalability claim requires that a single model trained on ELAIS-N1 maintains high performance on the other three fields without retraining. Only the F1 ≃ 91 % figure is supplied for the training field; the other fields are described only qualitatively as yielding “consistent catalogues with modest field-to-field differences” and recovering “the majority” of PyBDSF sources. No per-field precision, recall, or F1 values, nor any explicit cross-validation protocol or error bars, are reported. This absence directly weakens the quantitative basis for the generalization assertion.

Authors: We agree that the absence of per-field quantitative metrics limits the strength of the generalization claim. Ground-truth labels for the five morphological classes exist only for ELAIS-N1; the other fields lack equivalent annotations, precluding direct computation of precision/recall/F1 without new labeling. In revision we will add quantitative cross-field statistics that are available without labels, including per-field source counts, overlap fractions with PyBDSF detections, class-distribution histograms, and artefact rates, together with the 5-fold cross-validation protocol and validation error bars already used on ELAIS-N1. These additions will provide a more rigorous basis for the consistency statement while remaining within the scope of existing data. revision: partial
Referee: [Methods] Methods / training description: the manuscript states that the model is “trained with a morphology-driven scheme using five mutually exclusive classes” on multi-frequency-synthesis images, yet provides no quantitative details on training/validation splits, class-imbalance handling, loss weighting, or how the interferometric image properties affect the input representation. These omissions make it impossible to assess whether the reported F1 score is robust or sensitive to the specific data characteristics of the training field.

Authors: We accept that the current Methods section is insufficiently detailed for reproducibility. The revised manuscript will expand this section to report: (i) the exact training/validation split (80/20) and any stratified sampling used, (ii) the class-imbalance mitigation strategy (weighted loss terms derived from inverse class frequencies), (iii) the full RF-DETR loss formulation with the relative weighting of classification, bounding-box, and set-prediction terms, and (iv) the image preprocessing pipeline, including per-image normalization, noise clipping, and handling of the synthesized-beam properties of the LOFAR multi-frequency-synthesis images. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical ML application with independent training/evaluation

full rationale

The manuscript describes training an RF-DETR transformer detector on labeled ELAIS-N1 images, reporting F1≈91% there, then applying the fixed model to three other fields and comparing outputs to PyBDSF catalogues. No equations, derivations, or fitted-parameter predictions appear; performance metrics are computed directly from held-out or external labels rather than being redefined by the model's own outputs. No self-citation chains, ansatzes, or uniqueness theorems are invoked to justify the central claims. The reported generalization is an empirical claim whose quantitative support may be incomplete, but that is an evidence gap, not a circular reduction of the result to its inputs.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the empirical generalization of a supervised model across fields and on the assumption that the chosen five-class taxonomy adequately captures source morphology without significant overlap or omission. No new physical entities are introduced.

free parameters (1)

RF-DETR training hyperparameters and loss weights
Standard ML training choices that affect reported F1 but are not enumerated in the abstract.

axioms (1)

domain assumption The five morphology classes are mutually exclusive and collectively cover the relevant source population in the LOFAR images.
Explicitly stated as the training scheme in the abstract.

pith-pipeline@v0.9.1-grok · 5869 in / 1369 out tokens · 28188 ms · 2026-06-29T00:14:49.281993+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

2 extracted references · 2 canonical work pages

[1]

R., Vaccari M., 2018, MNRAS, 480, 2085 Alonso D., Bellini E., Hale C., Jarvis M

Alhassan W., Taylor A. R., Vaccari M., 2018, MNRAS, 480, 2085 Alonso D., Bellini E., Hale C., Jarvis M. J., Schwarz D. J., 2021, MNRAS, 502, 876 Banfield J. K., et al., 2015, MNRAS, 453, 2326 Baron Perez N., Brüggen M., Kasieczka G., Lucie-Smith L., 2025, A&A, 699, A302 Bendo G. J., Beswick R. J., D’Cruze M. J., Dickinson C., Fuller G. A., Muxlow T. W. B....

work page doi:10.1109/iccv.2017.322 2018
[2]

K., et al., 2025, MNRAS, 536, L32 Ndung’u S., Grobler T., Wijnholds S

Finder, Astrophysics Source Code Library, record ascl:1502.007 Morabito L. K., et al., 2025, MNRAS, 536, L32 Ndung’u S., Grobler T., Wijnholds S. J., Karastoyanova D., Azzopardi G., 2023, New Astron. Rev., 97, 101685 Ndung’u S., Grobler T., Wijnholds S. J., Karastoyanova D., Azzopardi G., 2024, MNRAS, 530, 783 Oquab M., et al., 2024, Transactions on Machi...

work page arXiv 2025

[1] [1]

R., Vaccari M., 2018, MNRAS, 480, 2085 Alonso D., Bellini E., Hale C., Jarvis M

Alhassan W., Taylor A. R., Vaccari M., 2018, MNRAS, 480, 2085 Alonso D., Bellini E., Hale C., Jarvis M. J., Schwarz D. J., 2021, MNRAS, 502, 876 Banfield J. K., et al., 2015, MNRAS, 453, 2326 Baron Perez N., Brüggen M., Kasieczka G., Lucie-Smith L., 2025, A&A, 699, A302 Bendo G. J., Beswick R. J., D’Cruze M. J., Dickinson C., Fuller G. A., Muxlow T. W. B....

work page doi:10.1109/iccv.2017.322 2018

[2] [2]

K., et al., 2025, MNRAS, 536, L32 Ndung’u S., Grobler T., Wijnholds S

Finder, Astrophysics Source Code Library, record ascl:1502.007 Morabito L. K., et al., 2025, MNRAS, 536, L32 Ndung’u S., Grobler T., Wijnholds S. J., Karastoyanova D., Azzopardi G., 2023, New Astron. Rev., 97, 101685 Ndung’u S., Grobler T., Wijnholds S. J., Karastoyanova D., Azzopardi G., 2024, MNRAS, 530, 783 Oquab M., et al., 2024, Transactions on Machi...

work page arXiv 2025