Sigmoid-weighted linear units for neural network function approximation in reinforcement learning
7 Pith papers cite this work.
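For reference, the activation this paper is named for, the sigmoid-weighted linear unit (SiLU), is simply the input multiplied by the logistic sigmoid. A minimal NumPy sketch:

```python
import numpy as np

def silu(x):
    """Sigmoid-weighted linear unit (SiLU): x * sigmoid(x)."""
    return x * (1.0 / (1.0 + np.exp(-x)))

# Near-linear for large positive x; smoothly gates small and negative inputs.
print(silu(np.array([-2.0, 0.0, 2.0])))
```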
citing papers explorer
-
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Medusa augments LLMs with multiple decoding heads and tree-based attention to predict and verify several tokens in parallel, yielding 2.2-3.6x inference speedup via two fine-tuning regimes.
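Stripped of the extra heads and tree attention, Medusa-style acceleration rests on a verify step: the base model checks a drafted continuation and keeps the longest agreeing prefix, emitting several tokens per pass. A toy greedy-acceptance sketch (Medusa itself uses a looser "typical acceptance" criterion, and `base_next_token` here is a stand-in, not the paper's API):

```python
def accept_prefix(base_next_token, context, draft):
    """Speculative verification: walk the drafted tokens left to right and
    keep the longest prefix the base model would itself have produced."""
    accepted = []
    for tok in draft:
        if base_next_token(context + accepted) == tok:
            accepted.append(tok)
        else:
            break
    # The base model's own next token is appended regardless, so at least
    # one token is generated per verification pass.
    accepted.append(base_next_token(context + accepted))
    return accepted

# Toy "base model": deterministically continues the sequence 0, 1, 2, ...
base = lambda seq: (seq[-1] + 1) if seq else 0
print(accept_prefix(base, [0, 1], [2, 3, 9]))  # → [2, 3, 4]
```

Three tokens land in one pass here: the drafts 2 and 3 are accepted, 9 is rejected, and the base model's own next token 4 is appended.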
-
On the global convergence of gradient descent for wide shallow models with bounded nonlinearities
Gradient descent on wide shallow models with bounded nonlinearities converges globally in the mean-field limit because non-global critical points are unstable under the dynamics.
-
ANTIC: Adaptive Neural Temporal In-situ Compressor
ANTIC reduces storage for large-scale PDE simulations by orders of magnitude through adaptive temporal snapshot selection combined with continual neural-field residual compression while preserving physics accuracy.
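The adaptive-snapshot half of this idea can be illustrated without the neural-field compressor: greedily keep a timestep only when it drifts too far from the last kept one, so slowly varying stretches are stored sparsely. The decaying-wave "simulation" and tolerance below are my own stand-ins, not the paper's setup:

```python
import math

def select_snapshots(frames, tol=0.1):
    """Greedy adaptive temporal selection: keep a frame only when its max
    abs deviation from the last kept frame exceeds tol."""
    kept = [0]
    for t in range(1, len(frames)):
        residual = max(abs(a - b) for a, b in zip(frames[t], frames[kept[-1]]))
        if residual > tol:
            kept.append(t)
    return kept

# Hypothetical "simulation": a decaying wave on 8 grid points, 100 timesteps.
frames = [[math.exp(-0.05 * t) * math.sin(0.3 * t + x) for x in range(8)]
          for t in range(100)]
kept = select_snapshots(frames)
print(len(kept), "of", len(frames), "frames kept")
```

By construction every dropped frame stays within `tol` of its preceding kept frame, so the error of nearest-kept-snapshot reconstruction is bounded.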
-
Bolek: A Multimodal Language Model for Molecular Reasoning
Bolek injects Morgan fingerprint embeddings into an instruction-tuned text model, then fine-tunes on molecular alignment and synthetic chain-of-thought tasks to improve performance and grounding on 15 TDC binary classification endpoints while generalizing to unseen tasks.
-
Training-inference input alignment outweighs framework choice in longitudinal retinal image prediction
Training-inference input alignment outweighs framework choice for longitudinal retinal image prediction, with deterministic regression matching complex models when acquisition variability dominates disease progression.
-
Internally triggered retrospective learning in neural networks
Neural networks learn via sparse retrospective updates triggered internally when prediction error exceeds a threshold derived from recent error statistics, leading to stepwise parameter changes in simulations.
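The triggering rule described above can be sketched with a toy one-parameter learner. Everything concrete here (the mean + k·std threshold form, the noise scale, the single environment shift) is my assumption for illustration, not the paper's experiment:

```python
import random

def run(steps=400, shift_at=200, window=30, k=3.0, seed=0):
    """Toy learner that updates its single parameter only when the current
    prediction error exceeds mean + k*std of its recent errors."""
    random.seed(seed)
    w, updates, errors = 0.0, 0, []
    for t in range(steps):
        target = 0.0 if t < shift_at else 1.0      # environment shifts once
        obs = target + random.gauss(0.0, 0.02)     # noisy training signal
        err = abs(obs - w)
        if len(errors) >= window:
            recent = errors[-window:]
            mean = sum(recent) / window
            std = (sum((e - mean) ** 2 for e in recent) / window) ** 0.5
            if err > mean + k * std:               # internally triggered
                w = obs                            # stepwise parameter change
                updates += 1
        errors.append(err)
    return w, updates

w, updates = run()
print(w, updates)
```

The error spike at the shift exceeds the statistics of the quiet preceding window, so the parameter jumps in a few discrete steps while the hundreds of unsurprising timesteps trigger almost no updates.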
-
Developing a Strong Pre-Trained Base Model for Plant Leaf Disease Classification
A DenseNet201 base model trained on a purpose-built plant leaf disease dataset outperforms baselines and enables faster, more robust transfer learning from less data than general-purpose pretrained models.