Sigmoid-weighted linear units for neural network function approximation in reinforcement learning.Neural Networks, 107:3–11, November 2018

Elfwing, S · 2018 · DOI 10.1016/j.neunet.2017.12.012

11 Pith papers cite this work. Polarity classification is still indexing.

11 Pith papers citing it

open at publisher browse 11 citing papers

citation-role summary

background 2 method 1

citation-polarity summary

background 2 use method 1

representative citing papers

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

cs.LG · 2024-01-19 · conditional · novelty 7.0

Medusa augments LLMs with multiple decoding heads and tree-based attention to predict and verify several tokens in parallel, yielding 2.2-3.6x inference speedup via two fine-tuning regimes.

Neutrino Fingerprints: Image-Based Encodings of IceCube Events for CNN Direction Reconstruction

astro-ph.IM · 2026-06-01 · unverdicted · novelty 6.0

IceCube events are encoded as 72x72x3 images and processed by ResNet18 to reach 1.10 rad mean angular error in neutrino direction reconstruction.

MoE-dqINR: A Unified Mixture-of-Experts Implicit Neural Representation Framework for Scan-Specific Dynamic and Quantitative MRI Reconstruction

eess.IV · 2026-05-29 · unverdicted · novelty 6.0

MoE-dqINR factorizes INR-based MRI reconstruction into shared spatial experts plus state-conditioned routing to unify dynamic and quantitative reconstruction at roughly 30 seconds per scan.

Double Metric Learning for Building Directed Graphs with Chain Connections for the ATLAS ITk Detector

physics.data-an · 2026-05-13 · unverdicted · novelty 6.0

Double metric learning learns two embeddings per node to build directed graphs with chain connections, yielding better performance than single metric learning for high-pT particles and accurate edge direction prediction in ATLAS ITk simulations.

On the global convergence of gradient descent for wide shallow models with bounded nonlinearities

math.OC · 2026-05-11 · unverdicted · novelty 6.0

Gradient descent on wide shallow models with bounded nonlinearities converges globally in the mean-field limit as non-global critical points are unstable under the dynamics.

ANTIC: Adaptive Neural Temporal In-situ Compressor

cs.LG · 2026-04-10 · unverdicted · novelty 6.0 · 3 refs

ANTIC reduces storage for large-scale PDE simulations by orders of magnitude through adaptive temporal snapshot selection combined with continual neural-field residual compression while preserving physics accuracy.

Bolek: A Multimodal Language Model for Molecular Reasoning

cs.LG · 2026-05-04 · unverdicted · novelty 5.0

Bolek injects Morgan fingerprint embeddings into an instruction-tuned text model, then fine-tunes on molecular alignment and synthetic chain-of-thought tasks to improve performance and grounding on 15 TDC binary classification endpoints while generalizing to unseen tasks.

Training-inference input alignment outweighs framework choice in longitudinal retinal image prediction

cs.CV · 2026-04-18 · unverdicted · novelty 5.0

Training-inference input alignment outweighs framework choice for longitudinal retinal image prediction, with deterministic regression matching complex models when acquisition variability dominates disease progression.

Kolmogorov--Arnold Networks as Implicit Regularizers: Noise Robustness and Interpretability for Stellar Classification

astro-ph.IM · 2026-05-27 · unverdicted · novelty 4.0

KAN noise robustness in star/galaxy/quasar classification arises from implicit C2-spline regularization rather than architecture, as weight-decay-tuned MLPs match performance on SDSS and DESI data.

Internally triggered retrospective learning in neural networks

q-bio.NC · 2026-05-09 · unverdicted · novelty 4.0

Neural networks learn via sparse retrospective updates triggered internally when prediction error exceeds a threshold derived from recent error statistics, leading to stepwise parameter changes in simulations.

Developing a Strong Pre-Trained Base Model for Plant Leaf Disease Classification

cs.CV · 2026-05-02 · unverdicted · novelty 3.0

A DenseNet201 base model trained on a constructed plant leaf disease dataset outperforms baselines and enables faster, more robust transfer learning with less data than general models.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Bolek: A Multimodal Language Model for Molecular Reasoning cs.LG · 2026-05-04 · unverdicted · none · ref 64
Bolek injects Morgan fingerprint embeddings into an instruction-tuned text model, then fine-tunes on molecular alignment and synthetic chain-of-thought tasks to improve performance and grounding on 15 TDC binary classification endpoints while generalizing to unseen tasks.
Internally triggered retrospective learning in neural networks q-bio.NC · 2026-05-09 · unverdicted · none · ref 7
Neural networks learn via sparse retrospective updates triggered internally when prediction error exceeds a threshold derived from recent error statistics, leading to stepwise parameter changes in simulations.

Sigmoid-weighted linear units for neural network function approximation in reinforcement learning.Neural Networks, 107:3–11, November 2018

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer