A single safety demonstration appended at inference time mitigates many-shot jailbreak attacks by counteracting the implicit malicious fine-tuning induced by the harmful in-context examples.
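The mechanics can be illustrated with a minimal sketch. This is an assumption-laden illustration of the prompt-construction idea, not the paper's exact setup: a many-shot jailbreak prompt stacks many harmful question/answer demonstrations before the final query, and the mitigation appends one safety demonstration (a refusal) last, closest to the query. The function and example strings below are hypothetical.

```python
# Illustrative sketch (assumed setup, not the paper's implementation):
# a many-shot jailbreak stacks harmful Q/A demos; the mitigation appends
# a single safety demonstration (a refusal) just before the final query.

def build_prompt(demos, query, safety_demo=None):
    """Assemble an in-context prompt from (question, answer) demo pairs."""
    turns = [f"Q: {q}\nA: {a}" for q, a in demos]
    if safety_demo is not None:
        # The single safety demonstration goes last, nearest the query,
        # where it most directly counteracts the harmful demonstrations.
        turns.append(f"Q: {safety_demo[0]}\nA: {safety_demo[1]}")
    turns.append(f"Q: {query}\nA:")
    return "\n\n".join(turns)

# Hypothetical many-shot attack context plus one appended refusal demo.
harmful_demos = [(f"harmful question {i}", f"harmful answer {i}") for i in range(128)]
refusal = ("another harmful question", "I can't help with that.")

prompt = build_prompt(harmful_demos, "final harmful query", safety_demo=refusal)
```

The point of the sketch is only the ordering: the defense costs one extra demonstration at inference time and requires no weight updates.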
Three papers indexed by Pith cite this work.
Citing papers

- Fine-tuning vs. In-context Learning in Large Language Models: A Formal Language Learning Perspective. Fine-tuning shows higher proficiency than in-context learning on in-distribution generalization in formal languages, with equal out-of-distribution performance and diverging inductive biases at high proficiency.
- A Survey on In-context Learning. This paper surveys definitions, techniques, applications, and challenges in in-context learning for large language models.