pith. sign in

hub Mixed citations

The Prompt Report: A Systematic Survey of Prompt Engineering Techniques

Mixed citation behavior. Most common role is background (60%).

41 Pith papers citing it
Background 60% of classified citations
abstract

Generative Artificial Intelligence (GenAI) systems are increasingly being deployed across diverse industries and research domains. Developers and end-users interact with these systems through the use of prompting and prompt engineering. Although prompt engineering is a widely adopted and extensively researched area, it suffers from conflicting terminology and a fragmented ontological understanding of what constitutes an effective prompt due to its relatively recent emergence. We establish a structured understanding of prompt engineering by assembling a taxonomy of prompting techniques and analyzing their applications. We present a detailed vocabulary of 33 vocabulary terms, a taxonomy of 58 LLM prompting techniques, and 40 techniques for other modalities. Additionally, we provide best practices and guidelines for prompt engineering, including advice for prompting state-of-the-art (SOTA) LLMs such as ChatGPT. We further present a meta-analysis of the entire literature on natural language prefix-prompting. As a culmination of these efforts, this paper presents the most comprehensive survey on prompt engineering to date.

hub tools

citation-role summary

background 4 method 1

citation-polarity summary

clear filters

representative citing papers

Self-Harness: Harnesses That Improve Themselves

cs.CL · 2026-06-08 · unverdicted · novelty 7.0

Self-Harness lets LLM agents autonomously refine their interaction harnesses through weakness mining, proposal generation, and validation, raising held-out pass rates on Terminal-Bench-2.0 from 40.5% to 61.9%, 23.8% to 38.1%, and 42.9% to 57.1% across three models.

AtelierEval: Agentic Evaluation of Humans & LLMs as Text-to-Image Prompters

cs.AI · 2026-05-21 · unverdicted · novelty 7.0

AtelierEval is the first unified benchmark that quantifies prompting proficiency of humans and MLLMs across 360 tasks using a cognitive taxonomy, with AtelierJudge providing scalable evaluation that correlates 0.79 with experts and shows mimicry outperforming planning.

PRIMETIME : Limits of LLMs in Temporal Primitives

cs.NE · 2025-04-22 · unverdicted · novelty 7.0

PRIMETIME generator reveals that LLM datetime parsing and arithmetic primitives are individually unreliable but fully learnable via fine-tuning, enabling frontier-level accuracy on event planning with small LoRA models.

Automated Design of Agentic Systems

cs.AI · 2024-08-15 · conditional · novelty 7.0

Meta Agent Search uses a meta-agent to iteratively program novel agentic systems in code, producing agents that outperform state-of-the-art hand-designed ones across coding, science, and math while transferring across domains and models.

Analogies between Transformer Layers and Power Method

cs.LG · 2026-05-25 · unverdicted · novelty 6.0

Transformer layers are analogous to power method steps, tilting tokens toward the principal eigenvector of the output-value weight product, with stronger analytical and empirical alignment in shared-weight models and a proposed steering method.

Alignment has a Fantasia Problem

cs.AI · 2026-04-23 · unverdicted · novelty 6.0

AI alignment must move beyond assuming users have fully formed goals and instead provide active cognitive support to help form and refine intent over time.

LLM Prompt Duel Optimizer: Efficient Label-Free Prompt Optimization

cs.CL · 2025-10-14 · unverdicted · novelty 6.0

Prompt Duel Optimizer uses dueling bandits and LLM-as-judge pairwise feedback with Double Thompson Sampling and top-performer mutation to find stronger prompts than label-free baselines on BBH and MS MARCO under limited comparison budgets.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.