pith. machine review for the scientific record. sign in

arxiv: 2605.04167 · v1 · submitted 2026-05-05 · 🌌 astro-ph.IM

Recognition: 3 theorem links

Overview of the New Hubble Spectroscopic Legacy Archive

Adrian Lucy, Alec Hirschauer, Andrew Cortese, Anna Payne, Ben Falk, Brian Charlow, Christopher Rahmani, Dan Welty, David Rodriguez, Elaine Frazer, Fred Romelfanger, Jennifer Kotler, John Debes, Joleen Carlberg, Kate Rowlands, Lauren Miller, Leonardo Dos Santos, Lisa Sherbert, Marc Rafelski, Matthew Burger, Peter Forshay, Ravi Sankrit, Rich Kidwell, Robert Jedrzejewski, Robert Swaters, Sara Anderson, Scott Fleming, Sunita Malla, Svea Hernandez, Syed Gilani, Talya Kelley, Thomas Bair, Thomas Wevers, Tim Kimball, Tracy Ellis, Travis Fischer, Van Dixon

Authors on Pith no claims yet

Pith reviewed 2026-05-08 18:25 UTC · model grok-4.3

classification 🌌 astro-ph.IM
keywords Hubble Spectroscopic Legacy ArchiveCOSSTIScoadded spectraultraviolet spectroscopydata archiveMAST
0
0 comments X

The pith

The Hubble Spectroscopic Legacy Archive automatically generates coadded spectra for every target observed with COS and STIS.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents the Hubble Spectroscopic Legacy Archive as a resource that combines all available observations of the same astronomical targets from the COS and STIS instruments into coadded spectra. This process happens automatically when new data are released or recalibrated, using the Mikulski Archive as the source. Targets are identified through their sky coordinates, corrected for proper motion, and classified with help from SIMBAD, NED, and proposal information. The archive supplies not only the coadded spectra for each observing mode but also a broad-wavelength version and detailed metadata files. It further offers the underlying code publicly so users can perform custom coadds when needed.

Core claim

The HSLA automatically produces coadded spectra for individual targets from COS and STIS data over the instruments' lifetimes, grouping observations by coordinates with proper motion accounting, classifying them via SIMBAD, NED and Phase II proposals, and creating mode-specific coadds along with an abutting full-range spectrum for each target.

What carries the argument

The coaddition engine that groups observations into targets using coordinate matching and proper motion corrections, then generates one coadd per observing mode with lifetime-position splits for COS FUV and an abutting combined spectrum.

If this is right

  • Coadded spectra are generated and updated automatically as new data arrive.
  • Each target gets a metadata file containing key information for searches.
  • The coadding code is released publicly for users to adapt in special cases.
  • A single spectrum spanning all observed wavelengths is created by abutting selected modes.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • This could make it easier for astronomers to study time-variable phenomena or weak features by using the combined data directly.
  • Similar legacy archives for other instruments might follow the same model of automatic coaddition and metadata provision.
  • The public tools open the door for community-driven improvements to the coaddition algorithms.
  • Researchers studying objects with multiple observations could discover inconsistencies in the automatic groupings that manual review might miss.

Load-bearing premise

Automatic target identification via coordinates, proper-motion corrections, SIMBAD, NED, and Phase II proposals will correctly group observations and assign classifications without significant mismatches or missing data.

What would settle it

A verification where a significant fraction of targets have coadded spectra that fail to match the expected combination from their individual input observations due to grouping errors.

Figures

Figures reproduced from arXiv: 2605.04167 by Adrian Lucy, Alec Hirschauer, Andrew Cortese, Anna Payne, Ben Falk, Brian Charlow, Christopher Rahmani, Dan Welty, David Rodriguez, Elaine Frazer, Fred Romelfanger, Jennifer Kotler, John Debes, Joleen Carlberg, Kate Rowlands, Lauren Miller, Leonardo Dos Santos, Lisa Sherbert, Marc Rafelski, Matthew Burger, Peter Forshay, Ravi Sankrit, Rich Kidwell, Robert Jedrzejewski, Robert Swaters, Sara Anderson, Scott Fleming, Sunita Malla, Svea Hernandez, Syed Gilani, Talya Kelley, Thomas Bair, Thomas Wevers, Tim Kimball, Tracy Ellis, Travis Fischer, Van Dixon.

Figure 1
Figure 1. Figure 1: Completeness, accuracy and contamination of name-based target association as a function of cross-match radius. The cross-match completeness is defined as the number of apertures that are identified as cross-match divided by the total number of apertures that match one of the target SIMBAD aliases as a function of radius. Coordinate accuracy is defined as the number of apertures that, based on name and coor… view at source ↗
Figure 2
Figure 2. Figure 2: Flow chart detailing the decision tree of the final object classification. The databases SIMBAD, NED, and the NASA Exoplanet Archive are queried and assigned as the classification in that order of object type availability, but if the target is not present in any, then the Phase II keywords are assigned as the classification. The red asterisks denote the location in the decision tree when an additional quer… view at source ↗
Figure 3
Figure 3. Figure 3: Example metadata file for the LMC O star SK -69 212. 4.4 Metadata Files In addition to the spectra, each HSLA target association has a companion metadata text file that compiles information about the target and the input data in a simple human￾readable format. We present two examples and describe the contents of these files. The first is the file for SK -69 212, an O star in the Large Magellanic Cloud (LMC; view at source ↗
Figure 4
Figure 4. Figure 4: Example metadata file for the exoplanet hosting A star β-Pictoris. been observed in 11 programs and with several COS and STIS modes. As it is an exoplanet host, NExScI was queried to obtain the classifcation in addition to SIMBAD (see view at source ↗
Figure 5
Figure 5. Figure 5: The top panels show the HSLA abutted spectrum of G 191-B2B overlaid on a synthetic spectrum of the star. The plot on the left spans about 1100 A and that on ˚ the right spans the remaining ≈8000 A covered by COS and STIS. The bottom panels ˚ show residuals obtained after binning the observed and model spectra. The bin sizes are different for the two sections of the spectrum, and were chosen solely for clar… view at source ↗
Figure 6
Figure 6. Figure 6: Same as view at source ↗
Figure 7
Figure 7. Figure 7: Same as view at source ↗
Figure 8
Figure 8. Figure 8: Selected regions of the HSLA quick look spectrum of WD0380-565, which spans from 920-10,000 A. Top: Spectrum near 950 ˚ A that shows several ISM H lines. ˚ Bottom: Several ISM Si and Photospheric C lines are seen in a very high SNR region of the FUV spectrum, obtained by dozens of individual observations of this flux standard. Residual OI airglow lines are present around 1301 and 1306 A. ˚ limited by fixed… view at source ↗
Figure 9
Figure 9. Figure 9: Full UV SED of β Pictoris derived from the HSLA quicklook aspec file, covering three separate COS and STIS gratings. Abundant stellar and circumstellar absorption features are seen in the spectrum. Instrument Science Report COS 2025-18(v1) Page 32 view at source ↗
Figure 10
Figure 10. Figure 10: An aitoff projection of galactic coordinates for HSLA white dwarfs. The grey triangles are WDs that have sufficient wavelength coverage and resolution to resolve and detect the FUV Si lines at 1261 A and 1265 ˚ A. Light blue circles represent ˚ HSLA WDs with a significant Si 1261 A line, indicative of detectable local ISM ˚ absorption, while red squares represent HSLA WDs with a significant Si 1265 A line… view at source ↗
read the original abstract

The new Hubble Spectroscopic Legacy Archive (HSLA) provides coadded spectra of individual targets that have been observed with the Cosmic Origins Spectrograph (COS) and the Space Telescope Imaging Spectrograph (STIS) over their operating lifetime. HSLA uses data available in the Mikulski Archive for Space Telescopes (MAST). It automatically produces coadds whenever new data become publicly available or when there is newly recalibrated data. HSLA defines individual targets by their associated coordinates, accounting for proper motions, and uses SIMBAD, NED and the Phase II observing proposals to obtain astronomical classifications for each object. Coadded spectra are produced for each observing mode. In the case of COS far-ultraviolet observations there is one coadded spectrum for each lifetime position (LP). Additionally, a spectrum spanning the entire wavelength range covered by the observations is produced by abutting the spectra from a selection of individual modes. For each individual target, HSLA also provides a human-readable metadata file with key information that can be used in searches or for further exploration of the data. The HSLA project also makes the code used for coadding spectra publicly available along with several other tools (using Jupyter notebooks) for custom coaddition required in special cases. In this report we will describe the main components of HSLA and provide a brief description of how the data and metadata can be accessed.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 2 minor

Summary. The manuscript provides an overview of the Hubble Spectroscopic Legacy Archive (HSLA), which automatically generates coadded spectra for individual targets observed with COS and STIS using publicly available data from the Mikulski Archive for Space Telescopes (MAST). Targets are defined via coordinates with proper-motion corrections and classified using SIMBAD, NED, and Phase II proposals. Coadds are produced per observing mode (with separate handling for each COS FUV lifetime position), a broad-wavelength spectrum is formed by abutting selected modes, and human-readable metadata files are supplied for each target. The coaddition code and Jupyter notebooks for custom processing are released publicly.

Significance. If the described automated workflow operates as outlined, HSLA constitutes a useful public data product that consolidates multi-epoch UV spectroscopy from Hubble, lowering the barrier for users to access combined spectra. The release of the coaddition code and notebooks is a clear strength, supporting reproducibility and enabling community-driven extensions. The paper's descriptive nature means the automatic target-grouping procedure is presented as the implemented method rather than a claim of zero mismatches; therefore the reader's noted concern about potential identification errors does not undermine the central claim of the archive's existence and basic operation.

minor comments (2)
  1. [Coaddition procedure] The description of how individual mode spectra are selected and abutted to form the full-wavelength coverage spectrum lacks technical specifics (e.g., overlap region handling, normalization, or wavelength stitching algorithm). Adding a short paragraph or pseudocode would improve clarity without altering the overview character of the paper.
  2. [Overview of data holdings] No quantitative summary (e.g., total number of targets, spectra, or coadds produced to date) is provided. A simple table or sentence giving scale would help readers gauge the archive's current scope.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their careful reading of the manuscript and for the positive assessment of the Hubble Spectroscopic Legacy Archive (HSLA) as a useful public data product. The referee's summary accurately reflects the scope and implementation of the archive. We note the recommendation for minor revision and will incorporate any editorial improvements in the revised version.

Circularity Check

0 steps flagged

Descriptive overview with no derivations or predictions

full rationale

The paper is a factual description of the HSLA data product and its implementation for coadding COS/STIS spectra from MAST. It details coordinate-based target grouping, metadata sources (SIMBAD/NED/Phase II), and coaddition procedures without any equations, fitted parameters, model predictions, or claims that reduce to self-defined quantities. No derivation chain exists to inspect for circularity; the content is self-contained as an archive overview.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is a descriptive paper about a data archive and processing pipeline. It introduces no free parameters, mathematical axioms, or new physical entities.

pith-pipeline@v0.9.0 · 5673 in / 1067 out tokens · 28883 ms · 2026-05-08T18:25:19.183515+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

10 extracted references

  1. [1]

    2024 COS Instrument Science Report 2024-01(v2)

    Debes, J., et al. 2024 COS Instrument Science Report 2024-01(v2)

  2. [2]

    2018, ApJS, 236, 24F

    Frey, K., & Accomazzi, A. 2018, ApJS, 236, 24F

  3. [3]

    Joyce, S. R. G., Barstow, M. A., Holberg, J. B., et al. 2018, MNRAS, 481, 2361

  4. [4]

    T., & Farihi, J

    Koester, D., G¨ansicke, B. T., & Farihi, J. 2014, A&a, 566, A34

  5. [5]

    et al., 2017, COS Instrument Science Report 2017-4

    Peeples, M. et al., 2017, COS Instrument Science Report 2017-4

  6. [6]

    D., Weinberger, A

    Roberge, A., Feldman, P. D., Weinberger, A. J., et al. 2006, Nature, 441, 724

  7. [7]

    2025 ApJ, 985, 109

    Roman-Duval, J., et al. 2025 ApJ, 985, 109

  8. [8]

    A., Kerr, R., Lecavelier des Etangs, A., et al

    Wilson, P. A., Kerr, R., Lecavelier des Etangs, A., et al. 2019, A&A, 621, A121

  9. [9]

    A., Lecavelier des Etangs, A., Vidal-Madjar, A., et al

    Wilson, P. A., Lecavelier des Etangs, A., Vidal-Madjar, A., et al. 2017, A&A, 599, A75

  10. [10]

    2007, ApJ, 671, 872 Instrument Science Report COS 2025-18(v1) Page 35

    Zuckerman, B., Koester, D., Melis, C., et al. 2007, ApJ, 671, 872 Instrument Science Report COS 2025-18(v1) Page 35