hub Mixed citations

Title resolution pending

David Ha, Andrew Dai, Quoc V. Le · 2016 · cs.LG · arXiv 1609.09106

Mixed citation behavior. Most common role is background (62%).

43 Pith papers citing it

Background 62% of classified citations

open full Pith review browse 43 citing papers arXiv PDF

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

abstract

This work explores hypernetworks: an approach of using a one network, also known as a hypernetwork, to generate the weights for another network. Hypernetworks provide an abstraction that is similar to what is found in nature: the relationship between a genotype - the hypernetwork - and a phenotype - the main network. Though they are also reminiscent of HyperNEAT in evolution, our hypernetworks are trained end-to-end with backpropagation and thus are usually faster. The focus of this work is to make hypernetworks useful for deep convolutional networks and long recurrent networks, where hypernetworks can be viewed as relaxed form of weight-sharing across layers. Our main result is that hypernetworks can generate non-shared weights for LSTM and achieve near state-of-the-art results on a variety of sequence modelling tasks including character-level language modelling, handwriting generation and neural machine translation, challenging the weight-sharing paradigm for recurrent networks. Our results also show that hypernetworks applied to convolutional networks still achieve respectable results for image recognition tasks compared to state-of-the-art baseline models while requiring fewer learnable parameters.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 8 method 5

citation-polarity summary

background 8 use method 5

representative citing papers

Neural Ordinary Differential Equations

cs.LG · 2018-06-19 · accept · novelty 8.0

Neural networks are redefined as continuous dynamical systems by learning the derivative of the hidden state with a neural network and integrating it with an ODE solver.

OnlyDense: Reduced-Order Modeling for Lagrangian simulation

cs.LG · 2026-06-08 · unverdicted · novelty 7.0

OnlyDense learns neural basis functions to approximate particle system states in a low-dimensional linear Hilbert subspace, unifying projection-based ROM with deep learning for accurate SPH dynamics modeling with 32 bases at R²>0.99.

CoMetaPNS: Continually Meta-learning Personalized Neural Surrogates for Cardiac Electrophysiology Simulations

cs.LG · 2026-06-05 · unverdicted · novelty 7.0

CoMetaPNS combines meta-learned neural surrogates with a continual Bayesian Gaussian Mixture Model to adapt cardiac electrophysiology simulations to new data while avoiding catastrophic forgetting.

DISC: Decoupling Instruction from State-Conditioned Control via Policy Generation

cs.RO · 2026-05-20 · unverdicted · novelty 7.0

A hypernetwork generates complete task-specific visuomotor policy parameters from instructions alone to structurally eliminate observation leakage in language-conditioned robotic control.

Good Agentic Friends Do Not Just Give Verbal Advice: They Can Update Your Weights

cs.CL · 2026-05-13 · unverdicted · novelty 7.0

TFlow enables multi-agent LLMs to collaborate via transient low-rank LoRA perturbations derived from sender activations, yielding up to 8.5 accuracy gains and 83% token reduction versus text-based baselines on Qwen3-4B models.

Stylized Text-to-Motion Generation via Hypernetwork-Driven Low-Rank Adaptation

cs.CV · 2026-05-13 · unverdicted · novelty 7.0

A hypernetwork maps style motion embeddings to LoRA updates that stylize text-driven motion diffusion models with improved generalization to unseen styles via contrastive structuring of the style space.

Events as Triggers for Behavioral Diversity in Multi-Agent Reinforcement Learning

cs.MA · 2026-05-12 · unverdicted · novelty 7.0 · 2 refs

Events trigger on-the-fly LoRA module generation via hypernetworks over a shared team policy in MARL, paired with a Neural Manifold Diversity metric, enabling sequential role reassignment while preserving reward maximization.

NonZero: Interaction-Guided Exploration for Multi-Agent Monte Carlo Tree Search

cs.LG · 2026-05-01 · unverdicted · novelty 7.0

NonZero introduces an interaction score and bandit-formalized proposal rule for local agent deviations in multi-agent MCTS, delivering a sublinear local-regret guarantee and improved sample efficiency on game benchmarks without full joint-action enumeration.

Wireless Communication Enhanced Value Decomposition for Multi-Agent Reinforcement Learning

cs.LG · 2026-04-09 · unverdicted · novelty 7.0

CLOVER augments value decomposition with a GNN mixer whose weights depend on the realized wireless communication graph, proving permutation invariance, monotonicity, and greater expressiveness than QMIX while showing gains on Predator-Prey and Lumberjacks under p-CSMA channels.

Instance-Adaptive Parametrization for Amortized Variational Inference

cs.LG · 2026-04-08 · unverdicted · novelty 7.0

IA-VAE augments amortized variational inference with hypernetwork-generated instance-adaptive modulations, strictly containing the standard variational family and improving held-out ELBO on synthetic and image data.

SLE-FNO: Single-Layer Extensions for Task-Agnostic Continual Learning in Fourier Neural Operators

cs.LG · 2026-03-20 · unverdicted · novelty 7.0

SLE-FNO achieves zero forgetting and strong plasticity-stability balance in continual learning for FNO surrogate models of pulsatile blood flow by adding minimal single-layer extensions across four out-of-distribution tasks.

ReWeaver: Towards Simulation-Ready and Topology-Accurate Garment Reconstruction

cs.CV · 2026-01-23 · unverdicted · novelty 7.0

ReWeaver reconstructs topology-accurate 3D garments and sewing patterns from sparse multi-view images by predicting seams and panels in 2D UV and 3D space using a new 100k-sample synthetic dataset.

UniReg: A Universal Model for Controllable CT Image Registration

cs.CV · 2025-03-17 · unverdicted · novelty 7.0

UniReg introduces a conditional unified neural model for multi-scenario CT registration that conditions on anatomical priors, inter/intra-subject type, and instance features to achieve higher accuracy and cross-scenario generalization than task-specific networks.

Searching for Activation Functions

cs.NE · 2017-10-16 · conditional · novelty 7.0

Automated search discovers Swish activation f(x) = x * sigmoid(βx) that improves top-1 ImageNet accuracy over ReLU by 0.9% on Mobile NASNet-A and 0.6% on Inception-ResNet-v2.

DroneFINE: Domain-Aware Parameter-Efficient Fine-Tuning of Vision-Language Detectors for Drone Images

cs.CV · 2026-07-01 · unverdicted · novelty 6.0

DroneFINE is a domain-aware PEFT approach for VLM-based drone detectors using foreground-aware multi-path adaptation and text-conditioned background suppression, outperforming standard PEFT and matching full fine-tuning on VisDrone and UAVDT with fewer trainable parameters.

Prompt2Effect: Training-Free Image-to-Video Model Specialization via LoRA Generation

cs.CV · 2026-06-11 · unverdicted · novelty 6.0

Prompt2Effect is a weight-driven hypernetwork that synthesizes LoRA adapters for I2V models from prompts and base weights via SVD parameterization, matching fine-tuned quality at 3.3s inference instead of 56 GPU hours.

Polaris: Scaling Up Instruction-Guided Image Generation Towards Millions of Personalized Style Needs

cs.CV · 2026-06-01 · unverdicted · novelty 6.0

Polaris retrieves and integrates relevant models from a large library of checkpoints and adapters to enable scalable instruction-guided image generation and editing without additional training.

Rethinking Amortized Neural Representations for High-Resolution Terrain Elevation Data

cs.CV · 2026-05-29 · unverdicted · novelty 6.0

Introduces a terrain-specific benchmark showing cross-domain gaps in INR methods and demonstrates that HUVR+SIREN achieves superior height and derivative fidelity in a compact quantized format.

InfoAtlas: A Foundation Model for Zero-Shot Statistical Dependence Estimate

cs.LG · 2026-05-29 · unverdicted · novelty 6.0

InfoAtlas is a pretrained neural model for zero-shot mutual information estimation that matches state-of-the-art accuracy with 100x speedup and handles varying dimensions via a single model.

Hyper-V2X: Hypernetworks for Estimating Epistemic and Aleatoric Uncertainty in Cooperative Bird's-Eye-View Semantic Segmentation

cs.CV · 2026-05-20 · unverdicted · novelty 6.0

Hyper-V2X uses a Bayesian hypernetwork with partial weight generation and V2X context embedding to produce calibrated epistemic and aleatoric uncertainty estimates for multi-agent BEV segmentation on the OPV2V benchmark.

MULTI: Disentangling Camera Lens, Sensor, View, and Domain for Novel Image Generation

cs.CV · 2026-05-12 · unverdicted · novelty 6.0

MULTI uses two-stage textual inversion to disentangle camera lens, sensor, view, and domain factors for novel image generation, supporting dataset extension and ControlNet modifications on the new DF-RICO benchmark.

Hystar: Hypernetwork-driven Style-adaptive Retrieval via Dynamic SVD Modulation

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

Hystar adapts CLIP-like models to unseen query styles by generating per-input singular-value perturbations with a hypernetwork for attention layers and a new StyleNCE contrastive loss.

Environment-Conditioned Diffusion Meta-Learning for Data-Efficient WiFi Localization

eess.SP · 2026-05-11 · unverdicted · novelty 6.0

EnvCoLoc combines environment-conditioned diffusion meta-learning with 3D point cloud descriptors to reduce mean localization error by up to 20% in NLOS WiFi scenarios using only 10 support samples.

RareCP: Regime-Aware Retrieval for Efficient Conformal Prediction

cs.LG · 2026-05-09 · unverdicted · novelty 6.0

RareCP improves interval efficiency for time series conformal prediction by retrieving and weighting regime-specific calibration examples while adapting to drift and maintaining coverage.

citing papers explorer

Showing 5 of 5 citing papers after filters.

Hystar: Hypernetwork-driven Style-adaptive Retrieval via Dynamic SVD Modulation cs.CV · 2026-05-11 · unverdicted · none · ref 7 · internal anchor
Hystar adapts CLIP-like models to unseen query styles by generating per-input singular-value perturbations with a hypernetwork for attention layers and a new StyleNCE contrastive loss.
Hyperfastrl: Hypernetwork-based reinforcement learning for unified control of parametric chaotic PDEs cs.CE · 2026-04-07 · unverdicted · none · ref 71 · internal anchor
Hypernetworks map a forcing parameter directly to policy weights in an RL framework, enabling unified stabilization of the Kuramoto-Sivashinsky equation across regimes with KAN architectures showing strongest extrapolation.
HyperFitS -- Hypernetwork Fitting Spectra for metabolic quantification of ${}^1$H MR spectroscopic imaging cs.LG · 2026-04-03 · unverdicted · none · ref 35 · internal anchor
HyperFitS is a hypernetwork for configurable spectral fitting in 1H MRSI that matches conventional LCModel results while processing whole-brain data in seconds instead of hours and adapting to varied protocols without retraining.
Why Invariance is Not Enough for Biomedical Domain Generalization and How to Fix It eess.IV · 2026-04-02 · unverdicted · none · ref 19 · internal anchor
MaskGen improves domain generalization for biomedical image segmentation by using source intensities plus domain-stable foundation model representations with minimal added complexity.
Adaptive Learned State Estimation based on KalmanNet cs.RO · 2026-04-02 · unverdicted · none · ref 19 · internal anchor
AM-KNet adds sensor-specific modules, hypernetwork conditioning on target type and pose, and Joseph-form covariance estimation to KalmanNet, yielding better accuracy and stability than base KalmanNet on nuScenes and View-of-Delft data.

Title resolution pending

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer