Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35:27730–27744

Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, et al. · 2022

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Breaking the Illusion: When Positive Meets Negative in Multimodal Decoding

cs.LG · 2026-04-22 · unverdicted · novelty 7.0

PND reduces object hallucination in VLMs via a dual-path contrast during decoding that amplifies visual features and penalizes linguistic priors, achieving reported SOTA results on POPE, MME, and CHAIR without retraining.

Turning Generators into Retrievers: Unlocking MLLMs for Natural Language-Guided Geo-Localization

cs.CV · 2026-04-12 · unverdicted · novelty 6.0

Parameter-efficient fine-tuning lets MLLMs serve as effective retrievers for natural-language-guided cross-view geo-localization, beating dual-encoder baselines on GeoText-1652 and CVG-Text while using far fewer trainable parameters.

3DrawAgent: Teaching LLM to Draw in 3D with Early Contrastive Experience

cs.CV · 2026-04-09 · unverdicted · novelty 6.0

3DrawAgent lets LLMs create complex 3D sketches from text prompts by using pairwise comparisons of their own outputs to self-improve spatial drawing skills without parameter updates.

GeoWorld: Geometric World Models

cs.CV · 2026-02-26 · unverdicted · novelty 6.0

GeoWorld applies hyperbolic geometry to JEPA world models and introduces geometric reinforcement learning, reporting modest success-rate gains of ~3% and ~2% on 3- and 4-step planning tasks versus V-JEPA 2.

MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications

cs.CV · 2026-04-03 · unverdicted · novelty 5.0

MOMO merges sensor-specific models from three Mars orbital instruments at matched validation loss stages to form a foundation model that outperforms ImageNet, Earth observation, sensor-specific, and supervised baselines on nine Mars-Bench tasks.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Breaking the Illusion: When Positive Meets Negative in Multimodal Decoding

cs.LG · 2026-04-22 · unverdicted · none · ref 28

MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications

cs.CV · 2026-04-03 · unverdicted · none · ref 54
