pith. machine review for the scientific record. sign in

arxiv: 2605.03389 · v1 · submitted 2026-05-05 · 📡 eess.SP

Recognition: unknown

Near-field Wideband Multi-User Localization using NFMR-Net

Authors on Pith no claims yet

Pith reviewed 2026-05-07 14:55 UTC · model grok-4.3

classification 📡 eess.SP
keywords near-field localizationmulti-userwidebanddeep learningMUSIC algorithmZadoff-Chu pilotsNFMR-Netchannel estimation
0
0 comments X

The pith

NFMR-Net refines coarse near-field localization estimates from Zadoff-Chu pilots to outperform 2D MUSIC in multi-user scenarios.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper develops a deep learning approach for localizing multiple users in the near-field region using wideband signals. It starts by using Zadoff-Chu sequences to reduce interference and estimate the channel matrix, from which the line-of-sight array response is extracted. Coarse range and angle estimates are obtained via parabolic interpolation and 2D MUSIC, then fed into NFMR-Net for refinement. A sympathetic reader would care because accurate near-field localization matters for applications like 6G communications and radar where far-field assumptions fail.

Core claim

The paper claims that the near-field music refinement network (NFMR-Net), consisting of separate sub-networks for range and angle, can refine coarse estimates derived from the LoS array response to achieve better localization performance than the conventional 2D MUSIC algorithm, as shown through numerical analysis.

What carries the argument

NFMR-Net, a neural network with separate sub-networks that takes the extracted LoS array response and coarse estimates as input to refine range and angle parameters.

Load-bearing premise

The line-of-sight array response extracted from the delay-tap energy profile of the multi-tap channel matrix estimated via Zadoff-Chu pilots is sufficiently accurate to serve as input for both coarse estimation and NFMR-Net refinement.

What would settle it

A simulation or measurement where the channel estimation has significant interference residuals or model mismatch, showing if the localization error remains lower than 2D MUSIC or not.

Figures

Figures reproduced from arXiv: 2605.03389 by Pearl Hetul Shah, Praful D. Mankar, Srikar Sharma Sadhu.

Figure 1
Figure 1. Figure 1: An Illustration of the System Model to detect a new path and obtain coarse estimates of its range, azimuth and elevation AoAs, and finally refines these estimates using damped Newton refinement method. Further, a two￾stage beamspace MUSIC method is proposed in [11] for near￾field localization using XL-MIMO systems. An optimally pre￾compensated distance based codebook is applied enable 1D se￾quential search… view at source ↗
Figure 2
Figure 2. Figure 2: Proposed NFMR-Net pipeline. TABLE I SIMULATION PARAMETERS Parameter Symbol Value Carrier frequency fc 2.4 GHz Sample time Ts 5 × 10−9 s Delay resolution cTs 1.5 m Antenna spacing dA 0.5λ ULA size M 41 Fraunhofer distance dF 100 m Number of UEs K 3 Number of obstacles P 5 Absorption loss β 0.5 Pilot sequence length N 256 Number of transmissions T 4 Local range window ∆r 8 m Local angle window ∆w 10◦ SNR ran… view at source ↗
Figure 3
Figure 3. Figure 3: Scatter plots of range (left) and angle (right) parameters. view at source ↗
Figure 4
Figure 4. Figure 4: RMSE vs. SNR for range (left) and angle (right). view at source ↗
Figure 5
Figure 5. Figure 5: RMSE vs. N, for range (left) and angle (right), at SNR = 10 dB view at source ↗
read the original abstract

This paper proposes a deep learning based method for wideband near-field multi-user localization. In particular, the proposed approach utilizes the Zadoff-Chu (ZC) sequence based pilots to mitigate the inter-user interference, which in turn aids the estimation of the multi-tap channel matrix. From this channel matrix, we extract the line-of-sight (LoS) array response based on the delay-tap energy profile. The LoS delay-tap is further refined using parabolic interpolation to obtain the coarse estimate of range parameter. Next, the extracted LoS array response is used to obtain the coarse angle estimate using 2D MUSIC algorithm. These coarse estimates are further refined using the near-field music refinement network (NFMR-Net), which involves separate sub-networks for range and angle estimations. Through numerical analysis, the proposed NFMR-Net is demonstrated to outperform conventional 2D MUSIC algorithm.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes NFMR-Net, a hybrid deep-learning method for wideband near-field multi-user localization. Zadoff-Chu pilots are used to estimate the multi-tap channel matrix while mitigating inter-user interference; the LoS array response is extracted from the delay-tap energy profile, refined by parabolic interpolation for a coarse range estimate, and fed to 2D MUSIC for a coarse angle estimate. Separate sub-networks then refine the range and angle estimates. Numerical simulations are reported to show that NFMR-Net outperforms conventional 2D MUSIC.

Significance. If the numerical gains prove robust, the hybrid coarse-estimate-plus-refinement architecture could offer a practical route to improved accuracy in near-field wideband localization without abandoning established signal-processing primitives. The explicit use of ZC sequences for interference control and the separation of range and angle sub-networks are concrete design choices that could be reused in related 5G/6G positioning tasks.

major comments (2)
  1. [Abstract / Numerical results] Abstract and numerical-results section: the central performance claim rests on unspecified numerical simulations that provide neither error bars, Monte-Carlo trial counts, dataset generation details, nor ablation studies on the LoS-extraction step. Without these, it is impossible to determine whether the reported improvement over 2D MUSIC is statistically significant or an artifact of idealized channel conditions.
  2. [Proposed method] Proposed-method description (LoS array-response extraction): the pipeline assumes that the delay-tap energy profile obtained from ZC-based multi-tap channel estimates yields an accurate LoS array response even in the presence of finite-length ZC correlation sidelobes, delay spread, and possible timing/Doppler residuals. No analytic bound or sensitivity analysis is supplied to quantify how residual interference or tap-selection errors propagate into both the 2D MUSIC coarse estimate and the subsequent NFMR-Net inputs.
minor comments (1)
  1. [Proposed method] Notation for the parabolic-interpolation step and the exact definition of the input feature vector to each NFMR-Net sub-network should be stated explicitly (e.g., as an equation) rather than described only in prose.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments and the positive assessment of the potential practical value of the hybrid architecture. We address each major comment below and indicate planned revisions.

read point-by-point responses
  1. Referee: [Abstract / Numerical results] Abstract and numerical-results section: the central performance claim rests on unspecified numerical simulations that provide neither error bars, Monte-Carlo trial counts, dataset generation details, nor ablation studies on the LoS-extraction step. Without these, it is impossible to determine whether the reported improvement over 2D MUSIC is statistically significant or an artifact of idealized channel conditions.

    Authors: We agree that the current description of the numerical results lacks sufficient detail for independent assessment of statistical significance. In the revised manuscript we will report the exact number of Monte-Carlo trials, include error bars (standard deviation across trials) on all curves, provide complete dataset-generation parameters (carrier frequency, bandwidth, array size, user locations, noise variance, and channel model), and add an ablation study isolating the contribution of the LoS-extraction and parabolic-interpolation steps. These additions will allow readers to judge whether the observed gains are robust. revision: yes

  2. Referee: [Proposed method] Proposed-method description (LoS array-response extraction): the pipeline assumes that the delay-tap energy profile obtained from ZC-based multi-tap channel estimates yields an accurate LoS array response even in the presence of finite-length ZC correlation sidelobes, delay spread, and possible timing/Doppler residuals. No analytic bound or sensitivity analysis is supplied to quantify how residual interference or tap-selection errors propagate into both the 2D MUSIC coarse estimate and the subsequent NFMR-Net inputs.

    Authors: ZC sequences are selected precisely because their ideal periodic autocorrelation minimizes sidelobes and inter-user interference; the multi-tap estimation step further exploits this property. We nevertheless acknowledge that the manuscript supplies neither an analytic error-propagation bound nor a dedicated sensitivity study. In the revision we will add a new subsection that derives a first-order bound on the perturbation of the extracted LoS vector due to finite-length correlation sidelobes and residual timing/Doppler errors, and we will complement it with Monte-Carlo sensitivity curves showing how these perturbations affect both the coarse 2D-MUSIC estimates and the final NFMR-Net accuracy under controlled levels of delay spread and synchronization error. revision: yes

Circularity Check

0 steps flagged

No circularity: NFMR-Net pipeline uses standard estimation tools plus independent neural refinement, with outperformance shown numerically

full rationale

The paper describes a processing chain starting from ZC-pilot channel estimation, LoS tap extraction via energy profile and parabolic interpolation, coarse 2D MUSIC angle estimation, and refinement by NFMR-Net sub-networks. The central claim of outperformance over 2D MUSIC is supported solely by numerical simulations rather than any closed-form derivation or fitted parameter that reduces to the input by construction. No self-citations, uniqueness theorems, or ansatzes are invoked as load-bearing steps; the method remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 1 invented entities

The approach rests on standard near-field wideband channel assumptions and the effectiveness of ZC sequences for interference mitigation; the neural network itself introduces many implicit parameters whose values are learned from simulation data.

free parameters (1)
  • NFMR-Net weights and hyperparameters
    Deep network parameters fitted during training on simulated channel data; exact architecture and training details not provided in abstract.
axioms (2)
  • domain assumption Zadoff-Chu sequences enable accurate multi-tap channel matrix estimation by mitigating inter-user interference
    Invoked to justify extraction of LoS array response from the estimated channel.
  • domain assumption Delay-tap energy profile reliably identifies the LoS component in near-field wideband channels
    Used to select the array response for subsequent MUSIC and refinement steps.
invented entities (1)
  • NFMR-Net no independent evidence
    purpose: Refine coarse range and angle estimates from traditional methods
    New deep learning architecture with separate sub-networks for range and angle; no independent evidence provided beyond numerical tests.

pith-pipeline@v0.9.0 · 5453 in / 1644 out tokens · 80527 ms · 2026-05-07T14:55:10.712127+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

17 extracted references · 4 canonical work pages · 1 internal anchor

  1. [1]

    6G near-field technologies white paper 2.0,

    Y . Zhao and et al., “6G near-field technologies white paper 2.0,” 2025. [Online]. Available: https://eprints.gla.ac.uk/354426/

  2. [2]

    Near-field communications: A tutorial review,

    Y . Liu, Z. Wang, and et. al, “Near-field communications: A tutorial review,”IEEE Open Journal of the Communications Society, vol. 4, p. 1999–2049, 2023

  3. [3]

    A survey of enabling technologies for network localization, tracking, and navigation,

    C. Laoudias, A. Moreira, and et. al, “A survey of enabling technologies for network localization, tracking, and navigation,”IEEE Communications Surveys & Tutorials, vol. 20, no. 4, pp. 3607–3644, 2018

  4. [4]

    Recent advances in indoor localization: A survey on theoretical approaches and applications,

    A. Yassin, Y . Nasser, and et. al, “Recent advances in indoor localization: A survey on theoretical approaches and applications,”IEEE Communica- tions Surveys & Tutorials, vol. 19, no. 2, pp. 1327–1346, 2017

  5. [5]

    Near-field localization and sensing with large-aperture arrays: From signal modeling to processing,

    Z. Wang, P. Ramezani, and et. al, “Near-field localization and sensing with large-aperture arrays: From signal modeling to processing,”IEEE Signal Processing Magazine, vol. 42, no. 1, pp. 74–87, 2025

  6. [6]

    Beam squint assisted user localization in near-field integrated sensing and communications systems,

    H. Luo and et. al, “Beam squint assisted user localization in near-field integrated sensing and communications systems,”IEEE Transactions on Wireless Communications, vol. 23, no. 5, pp. 4504–4517, 2023

  7. [7]

    Near field localization via AI-aided subspace methods,

    A. Gast, L. L. Magoarou, and N. Shlezinger, “Near field localization via AI-aided subspace methods,” 2025. [Online]. Available: https://arxiv.org/abs/2504.00599

  8. [8]

    Deep learning-based near-field user localization with beam squint in wideband XL-MIMO systems,

    H. Lei, , and et. al, “Deep learning-based near-field user localization with beam squint in wideband XL-MIMO systems,”IEEE Transactions on Wireless Communications, vol. 24, no. 2, pp. 1568–1583, 2025

  9. [9]

    Machine learning for wideband localization,

    T. Van Nguyen, Y . Jeong, H. Shin, and M. Z. Win, “Machine learning for wideband localization,”IEEE Journal on Selected Areas in Commu- nications, vol. 33, no. 7, pp. 1357–1380, 2015

  10. [10]

    Near-field localization and channel reconstruction for ELAA systems,

    Z. Lu, Y . Han, S. Jin, and M. Matthaiou, “Near-field localization and channel reconstruction for ELAA systems,”IEEE Transactions on Wireless Communications, vol. 23, no. 7, pp. 6938–6953, 2024

  11. [11]

    Two-stage beamspace MUSIC-based near-field channel estimation for hybrid XL-MIMO,

    K. Qu, S. Guo, J. Ye, and H. Zhao, “Two-stage beamspace MUSIC-based near-field channel estimation for hybrid XL-MIMO,”IEEE Communica- tions Letters, vol. 28, no. 8, pp. 1949–1953, 2024

  12. [12]

    A primer on Zadoff Chu sequences,

    J. G. Andrews, “A primer on Zadoff Chu sequences,” 2025. [Online]. Available: https://arxiv.org/abs/2211.05702

  13. [13]

    DOA refinement through com- plex parabolic interpolation of a sparse recovered signal,

    L. Pallotta, G. Giunta, and A. Farina, “DOA refinement through com- plex parabolic interpolation of a sparse recovered signal,”IEEE Signal Processing Letters, vol. 28, pp. 274–278, 2021

  14. [14]

    Learning representations by back-propagating errors,

    D. E. Rumelhart and et. al, “Learning representations by back-propagating errors,”Nature, vol. 323, no. 6088, pp. 533–536, 1986

  15. [15]

    Layer Normalization

    J. L. Ba, J. R. Kiros, and G. E. Hinton, “Layer normalization,” 2016. [Online]. Available: https://arxiv.org/abs/1607.06450

  16. [16]

    GELU activation function in deep learning: A comprehensive mathematical analysis and performance,

    M. Lee, “GELU activation function in deep learning: A comprehensive mathematical analysis and performance,” 2023. [Online]. Available: https://arxiv.org/abs/2305.12073

  17. [17]

    Wing loss for robust facial landmark localisation with convolutional neural networks,

    Z.-H. Feng, J. Kittler, and W. Christmas, “Wing loss for robust facial landmark localisation with convolutional neural networks,” inProceed- ings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 2235–2245