Recognition: unknown
Near-field Wideband Multi-User Localization using NFMR-Net
Pith reviewed 2026-05-07 14:55 UTC · model grok-4.3
The pith
NFMR-Net refines coarse near-field localization estimates from Zadoff-Chu pilots to outperform 2D MUSIC in multi-user scenarios.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that the near-field music refinement network (NFMR-Net), consisting of separate sub-networks for range and angle, can refine coarse estimates derived from the LoS array response to achieve better localization performance than the conventional 2D MUSIC algorithm, as shown through numerical analysis.
What carries the argument
NFMR-Net, a neural network with separate sub-networks that takes the extracted LoS array response and coarse estimates as input to refine range and angle parameters.
Load-bearing premise
The line-of-sight array response extracted from the delay-tap energy profile of the multi-tap channel matrix estimated via Zadoff-Chu pilots is sufficiently accurate to serve as input for both coarse estimation and NFMR-Net refinement.
What would settle it
A simulation or measurement where the channel estimation has significant interference residuals or model mismatch, showing if the localization error remains lower than 2D MUSIC or not.
Figures
read the original abstract
This paper proposes a deep learning based method for wideband near-field multi-user localization. In particular, the proposed approach utilizes the Zadoff-Chu (ZC) sequence based pilots to mitigate the inter-user interference, which in turn aids the estimation of the multi-tap channel matrix. From this channel matrix, we extract the line-of-sight (LoS) array response based on the delay-tap energy profile. The LoS delay-tap is further refined using parabolic interpolation to obtain the coarse estimate of range parameter. Next, the extracted LoS array response is used to obtain the coarse angle estimate using 2D MUSIC algorithm. These coarse estimates are further refined using the near-field music refinement network (NFMR-Net), which involves separate sub-networks for range and angle estimations. Through numerical analysis, the proposed NFMR-Net is demonstrated to outperform conventional 2D MUSIC algorithm.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes NFMR-Net, a hybrid deep-learning method for wideband near-field multi-user localization. Zadoff-Chu pilots are used to estimate the multi-tap channel matrix while mitigating inter-user interference; the LoS array response is extracted from the delay-tap energy profile, refined by parabolic interpolation for a coarse range estimate, and fed to 2D MUSIC for a coarse angle estimate. Separate sub-networks then refine the range and angle estimates. Numerical simulations are reported to show that NFMR-Net outperforms conventional 2D MUSIC.
Significance. If the numerical gains prove robust, the hybrid coarse-estimate-plus-refinement architecture could offer a practical route to improved accuracy in near-field wideband localization without abandoning established signal-processing primitives. The explicit use of ZC sequences for interference control and the separation of range and angle sub-networks are concrete design choices that could be reused in related 5G/6G positioning tasks.
major comments (2)
- [Abstract / Numerical results] Abstract and numerical-results section: the central performance claim rests on unspecified numerical simulations that provide neither error bars, Monte-Carlo trial counts, dataset generation details, nor ablation studies on the LoS-extraction step. Without these, it is impossible to determine whether the reported improvement over 2D MUSIC is statistically significant or an artifact of idealized channel conditions.
- [Proposed method] Proposed-method description (LoS array-response extraction): the pipeline assumes that the delay-tap energy profile obtained from ZC-based multi-tap channel estimates yields an accurate LoS array response even in the presence of finite-length ZC correlation sidelobes, delay spread, and possible timing/Doppler residuals. No analytic bound or sensitivity analysis is supplied to quantify how residual interference or tap-selection errors propagate into both the 2D MUSIC coarse estimate and the subsequent NFMR-Net inputs.
minor comments (1)
- [Proposed method] Notation for the parabolic-interpolation step and the exact definition of the input feature vector to each NFMR-Net sub-network should be stated explicitly (e.g., as an equation) rather than described only in prose.
Simulated Author's Rebuttal
We thank the referee for the constructive comments and the positive assessment of the potential practical value of the hybrid architecture. We address each major comment below and indicate planned revisions.
read point-by-point responses
-
Referee: [Abstract / Numerical results] Abstract and numerical-results section: the central performance claim rests on unspecified numerical simulations that provide neither error bars, Monte-Carlo trial counts, dataset generation details, nor ablation studies on the LoS-extraction step. Without these, it is impossible to determine whether the reported improvement over 2D MUSIC is statistically significant or an artifact of idealized channel conditions.
Authors: We agree that the current description of the numerical results lacks sufficient detail for independent assessment of statistical significance. In the revised manuscript we will report the exact number of Monte-Carlo trials, include error bars (standard deviation across trials) on all curves, provide complete dataset-generation parameters (carrier frequency, bandwidth, array size, user locations, noise variance, and channel model), and add an ablation study isolating the contribution of the LoS-extraction and parabolic-interpolation steps. These additions will allow readers to judge whether the observed gains are robust. revision: yes
-
Referee: [Proposed method] Proposed-method description (LoS array-response extraction): the pipeline assumes that the delay-tap energy profile obtained from ZC-based multi-tap channel estimates yields an accurate LoS array response even in the presence of finite-length ZC correlation sidelobes, delay spread, and possible timing/Doppler residuals. No analytic bound or sensitivity analysis is supplied to quantify how residual interference or tap-selection errors propagate into both the 2D MUSIC coarse estimate and the subsequent NFMR-Net inputs.
Authors: ZC sequences are selected precisely because their ideal periodic autocorrelation minimizes sidelobes and inter-user interference; the multi-tap estimation step further exploits this property. We nevertheless acknowledge that the manuscript supplies neither an analytic error-propagation bound nor a dedicated sensitivity study. In the revision we will add a new subsection that derives a first-order bound on the perturbation of the extracted LoS vector due to finite-length correlation sidelobes and residual timing/Doppler errors, and we will complement it with Monte-Carlo sensitivity curves showing how these perturbations affect both the coarse 2D-MUSIC estimates and the final NFMR-Net accuracy under controlled levels of delay spread and synchronization error. revision: yes
Circularity Check
No circularity: NFMR-Net pipeline uses standard estimation tools plus independent neural refinement, with outperformance shown numerically
full rationale
The paper describes a processing chain starting from ZC-pilot channel estimation, LoS tap extraction via energy profile and parabolic interpolation, coarse 2D MUSIC angle estimation, and refinement by NFMR-Net sub-networks. The central claim of outperformance over 2D MUSIC is supported solely by numerical simulations rather than any closed-form derivation or fitted parameter that reduces to the input by construction. No self-citations, uniqueness theorems, or ansatzes are invoked as load-bearing steps; the method remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (1)
- NFMR-Net weights and hyperparameters
axioms (2)
- domain assumption Zadoff-Chu sequences enable accurate multi-tap channel matrix estimation by mitigating inter-user interference
- domain assumption Delay-tap energy profile reliably identifies the LoS component in near-field wideband channels
invented entities (1)
-
NFMR-Net
no independent evidence
Reference graph
Works this paper leans on
-
[1]
6G near-field technologies white paper 2.0,
Y . Zhao and et al., “6G near-field technologies white paper 2.0,” 2025. [Online]. Available: https://eprints.gla.ac.uk/354426/
2025
-
[2]
Near-field communications: A tutorial review,
Y . Liu, Z. Wang, and et. al, “Near-field communications: A tutorial review,”IEEE Open Journal of the Communications Society, vol. 4, p. 1999–2049, 2023
1999
-
[3]
A survey of enabling technologies for network localization, tracking, and navigation,
C. Laoudias, A. Moreira, and et. al, “A survey of enabling technologies for network localization, tracking, and navigation,”IEEE Communications Surveys & Tutorials, vol. 20, no. 4, pp. 3607–3644, 2018
2018
-
[4]
Recent advances in indoor localization: A survey on theoretical approaches and applications,
A. Yassin, Y . Nasser, and et. al, “Recent advances in indoor localization: A survey on theoretical approaches and applications,”IEEE Communica- tions Surveys & Tutorials, vol. 19, no. 2, pp. 1327–1346, 2017
2017
-
[5]
Near-field localization and sensing with large-aperture arrays: From signal modeling to processing,
Z. Wang, P. Ramezani, and et. al, “Near-field localization and sensing with large-aperture arrays: From signal modeling to processing,”IEEE Signal Processing Magazine, vol. 42, no. 1, pp. 74–87, 2025
2025
-
[6]
Beam squint assisted user localization in near-field integrated sensing and communications systems,
H. Luo and et. al, “Beam squint assisted user localization in near-field integrated sensing and communications systems,”IEEE Transactions on Wireless Communications, vol. 23, no. 5, pp. 4504–4517, 2023
2023
-
[7]
Near field localization via AI-aided subspace methods,
A. Gast, L. L. Magoarou, and N. Shlezinger, “Near field localization via AI-aided subspace methods,” 2025. [Online]. Available: https://arxiv.org/abs/2504.00599
-
[8]
Deep learning-based near-field user localization with beam squint in wideband XL-MIMO systems,
H. Lei, , and et. al, “Deep learning-based near-field user localization with beam squint in wideband XL-MIMO systems,”IEEE Transactions on Wireless Communications, vol. 24, no. 2, pp. 1568–1583, 2025
2025
-
[9]
Machine learning for wideband localization,
T. Van Nguyen, Y . Jeong, H. Shin, and M. Z. Win, “Machine learning for wideband localization,”IEEE Journal on Selected Areas in Commu- nications, vol. 33, no. 7, pp. 1357–1380, 2015
2015
-
[10]
Near-field localization and channel reconstruction for ELAA systems,
Z. Lu, Y . Han, S. Jin, and M. Matthaiou, “Near-field localization and channel reconstruction for ELAA systems,”IEEE Transactions on Wireless Communications, vol. 23, no. 7, pp. 6938–6953, 2024
2024
-
[11]
Two-stage beamspace MUSIC-based near-field channel estimation for hybrid XL-MIMO,
K. Qu, S. Guo, J. Ye, and H. Zhao, “Two-stage beamspace MUSIC-based near-field channel estimation for hybrid XL-MIMO,”IEEE Communica- tions Letters, vol. 28, no. 8, pp. 1949–1953, 2024
1949
-
[12]
A primer on Zadoff Chu sequences,
J. G. Andrews, “A primer on Zadoff Chu sequences,” 2025. [Online]. Available: https://arxiv.org/abs/2211.05702
-
[13]
DOA refinement through com- plex parabolic interpolation of a sparse recovered signal,
L. Pallotta, G. Giunta, and A. Farina, “DOA refinement through com- plex parabolic interpolation of a sparse recovered signal,”IEEE Signal Processing Letters, vol. 28, pp. 274–278, 2021
2021
-
[14]
Learning representations by back-propagating errors,
D. E. Rumelhart and et. al, “Learning representations by back-propagating errors,”Nature, vol. 323, no. 6088, pp. 533–536, 1986
1986
-
[15]
J. L. Ba, J. R. Kiros, and G. E. Hinton, “Layer normalization,” 2016. [Online]. Available: https://arxiv.org/abs/1607.06450
work page internal anchor Pith review arXiv 2016
-
[16]
GELU activation function in deep learning: A comprehensive mathematical analysis and performance,
M. Lee, “GELU activation function in deep learning: A comprehensive mathematical analysis and performance,” 2023. [Online]. Available: https://arxiv.org/abs/2305.12073
-
[17]
Wing loss for robust facial landmark localisation with convolutional neural networks,
Z.-H. Feng, J. Kittler, and W. Christmas, “Wing loss for robust facial landmark localisation with convolutional neural networks,” inProceed- ings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 2235–2245
2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.