NTR adds a self-distillation masked latent reconstruction objective that uses only scene tokens to reconstruct masked patch features, improving visual representation quality and planning performance in end-to-end autonomous driving.
Coto-Elena, J.E
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it