Adaptive Speech-to-Spike Encoding for Spiking Neural Networks

· 2026 · cs.NE · arXiv 2606.19039

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

The mismatch between continuous acoustic signals and discrete event-driven processing remains a fundamental bottleneck for neuromorphic speech processing. Current systems typically rely on fixed spike encoders, forcing downstream Spiking Neural Networks (SNNs) to compensate for non-adaptive input representations. To address this, we present a learnable residual speech-to-spike encoder jointly trained end-to-end with a Recurrent Leaky Integrate-and-Fire (R-LIF) backbone. We validate this approach on the Google Speech Commands v2 (GSC-v2) benchmark, achieving up to 94.97% accuracy. Notably, the learned encoder remains highly parameter-efficient with a compact 35k-parameter variant that reaches 89.8%, matching or exceeding prior baselines that require an order of magnitude more parameters. Our encoder-focused analysis, including linear probing and gradient-residual inspection, indicates that the encoder does not target faithful signal reconstruction but instead learns task-aligned spike representations that enhance class separability. Finally, we benchmark bio-inspired, hardware-friendly credit assignment by comparing Direct Feedback Alignment (DFA) with surrogate-gradient BPTT under identical architectures and training conditions. We find that DFA reaches 91.5% accuracy, quantifying the performance trade-off of bio-inspired learning rules for modern neuromorphic audio.

representative citing papers

Adaptive Speech-to-Spike Encoding for Spiking Neural Networks

cs.NE · 2026-06-17 · unverdicted · novelty 6.0

A learnable residual speech-to-spike encoder jointly trained with an R-LIF SNN achieves up to 94.97% accuracy on GSC-v2 with a 35k-parameter model and supports DFA credit assignment at 91.5%.

citing papers explorer

Showing 1 of 1 citing paper.

Adaptive Speech-to-Spike Encoding for Spiking Neural Networks cs.NE · 2026-06-17 · unverdicted · none · ref 4 · internal anchor
A learnable residual speech-to-spike encoder jointly trained with an R-LIF SNN achieves up to 94.97% accuracy on GSC-v2 with a 35k-parameter model and supports DFA credit assignment at 91.5%.

Adaptive Speech-to-Spike Encoding for Spiking Neural Networks

fields

years

verdicts

representative citing papers

citing papers explorer