Rivaling Transformers: Multi-Scale Structured State-Space Mixtures for Agentic 6G O-RAN

Farhad Rezazadeh; Hatim Chergui; Houbing Song; Lingjia Liu; Merouane Debbah

arxiv: 2510.05255 · v2 · pith:BAQVW6RHnew · submitted 2025-10-06 · 💻 cs.NI

Rivaling Transformers: Multi-Scale Structured State-Space Mixtures for Agentic 6G O-RAN

Farhad Rezazadeh , Hatim Chergui , Merouane Debbah , Houbing Song , Lingjia Liu This is my paper

classification 💻 cs.NI

keywords latencyo-ranmodelsstate-spaceaccuracyagenticconstraintscontrol

0 comments

read the original abstract

In sixth-generation (6G) Open Radio Access Networks (O-RAN), proactive control is preferable. A key open challenge is delivering control-grade predictions within Near-Real-Time (Near-RT) latency and computational constraints under multi-timescale dynamics. We therefore cast RAN Intelligent Controller (RIC) analytics as an agentic perceive-predict xApp that turns noisy, multivariate RAN telemetry into short-horizon per-User Equipment (UE) key performance indicator (KPI) forecasts to drive anticipatory control. In this regard, Transformers are powerful for sequence learning and time-series forecasting, but they are memory-intensive, which limits Near-RT RIC use. Therefore, we need models that maintain accuracy while reducing latency and data movement. To this end, we propose a lightweight Multi-Scale Structured State-Space Mixtures (MS3M) forecaster that mixes HiPPO-LegS kernels to capture multi-timescale radio dynamics. We develop stable discrete state-space models (SSMs) via bilinear (Tustin) discretization and apply their causal impulse responses as per-feature depthwise convolutions. Squeeze-and-Excitation gating dynamically reweights KPI channels as conditions change, and a compact gated channel-mixing layer models cross-feature nonlinearities without Transformer-level cost. The model is KPI-agnostic -- Reference Signal Received Power (RSRP) serves as a canonical use case -- and is trained on sliding windows to predict the immediate next step. Empirical evaluations conducted using our bespoke O-RAN testbed KPI time-series dataset (59,441 windows across 13 KPIs). Crucially for O-RAN constraints, MS3M achieves a 0.057 s per-inference latency with 0.70M parameters, yielding 3-10x lower latency than the Transformer baselines evaluated on the same hardware, while maintaining competitive accuracy.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Tutorial on Cognitive Biases in Agentic AI-Driven 6G Autonomous Networks
cs.NI 2025-10 unverdicted novelty 5.0

Randomized Weibull anchors and debiased collective memory with decay and inflection bonuses let agentic AI in 6G cut anchoring, temporal, and confirmation biases, doubling energy savings to 25% and reducing latency by...