Recognition: unknown
Data driven approach for Outdoor Channel Prediction in 5G and Beyond
Pith reviewed 2026-05-09 17:02 UTC · model grok-4.3
The pith
Linear regression predicts 5G outdoor channel coefficients from transmitter and user locations alone
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
For a given user location, channel estimation at 7GHz frequency band is done using data generated by ray tracing mechanism where feature variables are transmitter location and user location and target variable is channel coefficient. Among Linear Regression, Support Vector Regression and Decision Tree Regression, Linear Regression performs better with MAE of 7.5155×10^{-5} and RMSE of 9.2861×10^{-5}.
What carries the argument
Linear regression model that maps ray-traced transmitter and user location coordinates directly to the complex channel coefficient value.
Load-bearing premise
Ray-tracing simulations at 7 GHz produce channel coefficients that are sufficiently representative of real outdoor propagation, and transmitter and user locations alone contain enough information to predict the channel coefficient accurately.
What would settle it
Compare the model's predicted channel coefficients against actual field measurements collected at 7 GHz in an outdoor environment with the same known transmitter and user locations.
Figures
read the original abstract
An evolution of Wireless Communications towards 5G and beyond provides improved user experience in terms of quality of services. Understanding and estimating Channel information plays crucial role in providing better user experience. Traditional methods of channel estimation involves periodically sending pilots (known signals), estimating channel and send back estimated channel information to the BS which increases computational complexity and communication complexity. Hence, we focus on data driven approach for channel estimation. This work can be deployed as Digital twin in 5G and beyond wireless networks. In this work, we explore a channel estimation mechanism at 7GHz frequency band for a given user location. This work involves data generation using Ray tracing mechanism and Machine learning model training that contains feature variables such as transmitter location, user location and target variable as channel coefficient . We explored Linear Regression, Support Vector Regression and Decision Tree Regression. We found via simulations that Linear Regression performs (with MAE of $\mathbf{7.5155\times10^{-5}}$ and RMSE of $\mathbf{9.2861\times10^{-5}}$) better than Support Vector Regression and Decision Tree Regression.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a data-driven channel prediction method for outdoor 5G scenarios at 7 GHz. Ray-tracing simulations generate data with transmitter and user locations as input features and channel coefficients as the target. Linear Regression, Support Vector Regression, and Decision Tree Regression are trained and compared on this data, with the abstract claiming Linear Regression achieves the lowest errors (MAE 7.5155×10^{-5}, RMSE 9.2861×10^{-5}) and suggesting the approach can serve as a digital twin to reduce pilot overhead.
Significance. If the performance claims are substantiated with proper validation protocols, the work could contribute to low-overhead channel estimation in 5G networks by exploiting location-based prediction. The reported error levels on deterministic ray-tracing data highlight that simple regressors may capture propagation effects in controlled simulations, supporting the digital-twin concept in principle. However, the simulation-only scope restricts immediate significance for real deployments.
major comments (3)
- [Data generation and model training] The data generation and model training description supplies no information on dataset size, location sampling ranges or distributions, train/test split, hyperparameter selection for SVR and DTR, or cross-validation. These omissions are load-bearing because the central claim that Linear Regression outperforms the others with the stated MAE/RMSE rests on the empirical results; without them the metrics cannot be assessed for overfitting, generalization, or statistical reliability.
- [Results and validation] No real-world channel measurements, sensitivity analysis on ray-tracing parameters (e.g., material properties, diffuse scattering), or out-of-distribution location tests are presented. The weakest assumption—that ray-tracing at 7 GHz with location inputs alone produces representative outdoor channels—is therefore untested, directly affecting whether the low errors reflect predictive power or the simplicity of the deterministic forward model.
- [Results] The abstract and results report specific error values for the three models but contain no variance estimates, confidence intervals, or formal significance test for the performance gap. This leaves the headline comparison unsupported by visible evidence of robustness.
minor comments (1)
- [Abstract] The abstract contains minor grammatical and phrasing issues (e.g., 'provides improved user experience' and 'contains feature variables such as...') that reduce readability.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments. These have highlighted important aspects of reproducibility, validation scope, and statistical presentation that we will address in the revision. Below we respond point-by-point to the major comments.
read point-by-point responses
-
Referee: The data generation and model training description supplies no information on dataset size, location sampling ranges or distributions, train/test split, hyperparameter selection for SVR and DTR, or cross-validation. These omissions are load-bearing because the central claim that Linear Regression outperforms the others with the stated MAE/RMSE rests on the empirical results; without them the metrics cannot be assessed for overfitting, generalization, or statistical reliability.
Authors: We agree that these methodological details are essential for evaluating the reliability of the reported results. The original manuscript omitted them primarily due to brevity. In the revised version we will add a dedicated subsection under 'Data Generation and Model Training' that specifies the total number of generated samples, the spatial ranges and sampling distribution for transmitter and user locations, the train/test split ratio employed, the hyperparameter tuning procedure (including any grid or random search) used for SVR and DTR, and whether cross-validation was applied. These additions will allow readers to assess potential overfitting and generalization. revision: yes
-
Referee: No real-world channel measurements, sensitivity analysis on ray-tracing parameters (e.g., material properties, diffuse scattering), or out-of-distribution location tests are presented. The weakest assumption—that ray-tracing at 7 GHz with location inputs alone produces representative outdoor channels—is therefore untested, directly affecting whether the low errors reflect predictive power or the simplicity of the deterministic forward model.
Authors: We acknowledge that the study is confined to deterministic ray-tracing simulations and does not include real-world measurements or sensitivity analyses. This limits direct claims about real deployments. In the revision we will expand the 'Discussion' and 'Limitations' sections to explicitly articulate the assumptions of the ray-tracing model, note that real outdoor channels contain additional stochastic and site-specific effects not captured here, and outline future work involving field measurements and parameter sensitivity studies. However, acquiring and incorporating actual measurement data or performing new sensitivity experiments lies outside the scope of the present revision. revision: partial
-
Referee: The abstract and results report specific error values for the three models but contain no variance estimates, confidence intervals, or formal significance test for the performance gap. This leaves the headline comparison unsupported by visible evidence of robustness.
Authors: We agree that reporting variability strengthens the comparison. In the revised manuscript we will re-run the three regression models across multiple independent random seeds (or data shuffles) to obtain mean and standard-deviation values for MAE and RMSE. These statistics, together with the original point estimates, will be presented in the results tables and text. We will also add a short discussion of the observed consistency of the performance ordering across runs. revision: yes
- Incorporating real-world channel measurements or conducting sensitivity analyses on ray-tracing parameters, as these require new experimental campaigns and data collection that are not available within the current study.
Circularity Check
No significant circularity; empirical ML performance metrics on simulated data
full rationale
The paper generates channel coefficient data via ray-tracing at 7 GHz using transmitter and user locations as inputs, then trains and evaluates Linear Regression, SVR, and Decision Tree models to predict those coefficients. The reported MAE (7.5155×10^{-5}) and RMSE (9.2861×10^{-5}) are direct empirical outputs of standard model fitting and error computation on the generated dataset. No derivation step reduces these metrics to inputs by construction, no self-citations or uniqueness theorems are load-bearing, and the chain is self-contained as a simulation-based regression study without circular reduction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Ray-tracing simulation at 7 GHz produces channel coefficients representative of real outdoor propagation
Reference graph
Works this paper leans on
-
[1]
A survey of artificial in- telligence enabled channel estimation methods: Recent advance, performance, and outlook,
B. Li, Q. Zheng, X. Tian, M. Yang, G. Gui, W. Jiang, H. Lei, J. Jiang, F. Shu, A. Elhanashiet al., “A survey of artificial in- telligence enabled channel estimation methods: Recent advance, performance, and outlook,”Artificial Intelligence Review, vol. 58, no. 6, p. 187, 2025
2025
-
[2]
6g wireless communications in 7–24 ghz band: Opportunities, techniques, and challenges,
Z. Cui, P. Zhang, and S. Pollin, “6g wireless communications in 7–24 ghz band: Opportunities, techniques, and challenges,” in2025 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN). IEEE, 2025, pp. 1–8
2025
-
[3]
Review on ray tracing channel simulation accuracy in sub-6 ghz outdoor deployment scenarios,
A. W. Mbugua, Y . Chen, L. Raschkowski, L. Thiele, S. Jaeckel, and W. Fan, “Review on ray tracing channel simulation accuracy in sub-6 ghz outdoor deployment scenarios,”IEEE Open Journal of Antennas and Propagation, vol. 2, pp. 22–37, 2020
2020
-
[4]
Ray tracing for radio propagation modeling: Principles and applications,
Z. Yun and M. F. Iskander, “Ray tracing for radio propagation modeling: Principles and applications,”IEEE Access, vol. 3, pp. 1089–1100, 2015
2015
-
[5]
Deeprt: A hybrid framework combining large model architectures and ray tracing principles for 6g digital twin channels,
M. Li, T. Wu, Z. Dong, X. Liu, Y . Lu, S. Zhang, Z. Wu, Y . Zhang, L. Yu, and J. Zhang, “Deeprt: A hybrid framework combining large model architectures and ray tracing principles for 6g digital twin channels,”Electronics, vol. 14, no. 9, p. 1849, 2025
2025
-
[6]
A light-weighted machine learning approach to channel estimation for new-radio systems,
H. W. Lee and S. W. Choi, “A light-weighted machine learning approach to channel estimation for new-radio systems,”Electron- ics, vol. 12, no. 23, p. 4740, November 2023
2023
-
[7]
DeepMIMO: A Generic Deep Learning Dataset for Millimeter Wave and Massive MIMO Applications
A. Alkhateeb, “Deepmimo: A generic deep learning dataset for millimeter wave and massive mimo applications,”arXiv preprint arXiv:1902.06435, 2019
work page Pith review arXiv 1902
-
[8]
Ai-assisted agile propagation modeling for real-time digital twin wireless networks,
A. Saeizadeh, M. Tehrani Moayyed, D. Villa, J. G. Beattie, I. C. Wong, P. Johari, E. W. Anderson, S. Basagni, and T. Melodia, “Ai-assisted agile propagation modeling for real-time digital twin wireless networks,” inProc. IEEE 29th Int. Workshop on Computer Aided Modeling and Design of Communication Links and Networks (CAMAD), 2024, pp. 1–6
2024
-
[9]
Deep learning for physical-layer wireless communications,
J. Hoydis, S. ten Brink, and M. Debbah, “Deep learning for physical-layer wireless communications,”IEEE Signal Process- ing Magazine, vol. 35, no. 5, pp. 24–45, 2018
2018
-
[10]
Machine learning- based path loss prediction in suburban environment in the sub-6 ghz band,
F. Tarhouni, M. Al-Zubi, and M.-S. Alouini, “Machine learning- based path loss prediction in suburban environment in the sub-6 ghz band,”IEEE Transactions on Vehicular Technology, 2025, early Access
2025
-
[11]
Path loss prediction in urban environments using ray tracing and machine learning: A comparative evaluation,
Y . Egi, “Path loss prediction in urban environments using ray tracing and machine learning: A comparative evaluation,” 10 2025
2025
-
[12]
Predicting wireless channel features using neural networks,
S. Navabi, C. Wang, O. Y . Bursalioglu, and H. Papadopoulos, “Predicting wireless channel features using neural networks,” in 2018 IEEE International Conference on Communications (ICC), 2018, pp. 1–6
2018
-
[13]
A study on path loss modeling using resnet and pre-training with free space path loss,
T. Nagao and T. Hayashi, “A study on path loss modeling using resnet and pre-training with free space path loss,”2022 IEEE 33rd Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), 2022
2022
-
[14]
Millimeter wave wireless communications (rappaport, t., et al; 2014)[book review],
S. Misra, “Millimeter wave wireless communications (rappaport, t., et al; 2014)[book review],”IEEE Wireless Communications, vol. 22, no. 5, pp. 6–7, 2015
2014
-
[15]
Creating rf scenarios for large-scale, real-time wire- less channel emulators,
M. Tehrani-Moayyed, L. Bonati, P. Johari, T. Melodia, and S. Basagni, “Creating rf scenarios for large-scale, real-time wire- less channel emulators,” in2021 19th Mediterranean Commu- nication and Computer Networking Conference (MedComNet), 2021, pp. 1–8
2021
-
[16]
K. P. Murphy,Probabilistic machine learning: an introduction. MIT press, 2022
2022
-
[17]
F. A ¨ıt Aoudia, J. Hoydis, M. Nimier-David, B. Nicolet, S. Cam- merer, and A. Keller, “Sionna rt: Technical report,”arXiv preprint, vol. arXiv:2504.21719, 2025
-
[18]
Study on channel model for frequencies from 0.5 to 100 ghz (release 17),
3rd Generation Partnership Project (3GPP), “Study on channel model for frequencies from 0.5 to 100 ghz (release 17),” 3GPP Technical Specification Group Radio Access Network (TSG RAN), Tech. Rep. TR 38.901, 2022
2022
-
[19]
A light-weighted machine learning approach to channel estimation for new-radio systems,
H. W. Lee and S. W. Choi, “A light-weighted machine learning approach to channel estimation for new-radio systems,” Electronics, vol. 12, no. 23, 2023. [Online]. Available: https://www.mdpi.com/2079-9292/12/23/4740
2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.