Stop wasting my time! saving days of ima- genet and bert training with latest weight averaging

Jean Kaddour · 2022 · arXiv 2209.14981

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

background 1 method 1

citation-polarity summary

background 1 use method 1

representative citing papers

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

cs.LG · 2025-02-07 · unverdicted · novelty 7.0

A recurrent-depth architecture enables language models to improve reasoning performance by iterating computation in latent space, achieving gains equivalent to much larger models on benchmarks.

MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications

cs.CV · 2026-04-03 · unverdicted · novelty 5.0

MOMO merges sensor-specific models from three Mars orbital instruments at matched validation loss stages to form a foundation model that outperforms ImageNet, Earth observation, sensor-specific, and supervised baselines on nine Mars-Bench tasks.

citing papers explorer

Showing 1 of 1 citing paper after filters.

MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications cs.CV · 2026-04-03 · unverdicted · none · ref 35
MOMO merges sensor-specific models from three Mars orbital instruments at matched validation loss stages to form a foundation model that outperforms ImageNet, Earth observation, sensor-specific, and supervised baselines on nine Mars-Bench tasks.

Stop wasting my time! saving days of ima- genet and bert training with latest weight averaging

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer