pith. sign in

An analysis and survey of the development of mutation test- ing,

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

citation-role summary

background 3 other 1

citation-polarity summary

years

2026 6 2025 1

polarities

background 3 unclear 1

representative citing papers

Overeager Coding Agents: Measuring Out-of-Scope Actions on Benign Tasks

cs.SE · 2026-05-18 · conditional · novelty 7.0

The paper presents OverEager-Gen, a 500-scenario benchmark showing that removing consent declarations from prompts increases overeager actions by 11.9-17.2 percentage points across models, with agent framework choice dominating base-model effects.

Robust Mutation Analysis of Quantum Programs Under Noise

cs.SE · 2026-05-13 · conditional · novelty 6.0

Noise from quantum hardware simulators significantly alters mutant detection distances, making equivalent mutants harder to separate from faults, with output-distribution metrics reaching 73.03% accuracy and 74.89% F1-score under device-specific thresholds.

Probabilistic Condition, Decision and Path Coverage of Circuit-based Quantum Programs

quant-ph · 2026-04-29 · unverdicted · novelty 6.0

Quantum circuits show high average condition (97.56%) and decision (97.63%) coverage but lower path coverage (71.84%), with probabilistic versions adding confidence levels (averages 88.87%, 88.65%, 37.18%); mutation testing reveals weak or no correlation between structural coverage and fault finding

Quality-Driven Selective Mutation for Deep Learning

cs.SE · 2026-04-24 · unverdicted · novelty 6.0

A dual-axis quality framework ranks DL mutation operators by statistical resistance and Jaccard-based realism to real faults, enabling up to 55.6% fewer mutants on held-out validation data without dropping baseline performance.

citing papers explorer

Showing 7 of 7 citing papers.

  • Overeager Coding Agents: Measuring Out-of-Scope Actions on Benign Tasks cs.SE · 2026-05-18 · conditional · none · ref 9

    The paper presents OverEager-Gen, a 500-scenario benchmark showing that removing consent declarations from prompts increases overeager actions by 11.9-17.2 percentage points across models, with agent framework choice dominating base-model effects.

  • Articulate but Wrong: Self-Review Failures in LLM-Based Code Modernization cs.SE · 2026-05-20 · conditional · none · ref 8

    LLM code modernizers produce semantic drift in 39.7% of legacy-Python-2 cases and endorse 31.7% of those drifts in self-review, with rates varying widely across models but not tracking capability.

  • Robust Mutation Analysis of Quantum Programs Under Noise cs.SE · 2026-05-13 · conditional · none · ref 38

    Noise from quantum hardware simulators significantly alters mutant detection distances, making equivalent mutants harder to separate from faults, with output-distribution metrics reaching 73.03% accuracy and 74.89% F1-score under device-specific thresholds.

  • Probabilistic Condition, Decision and Path Coverage of Circuit-based Quantum Programs quant-ph · 2026-04-29 · unverdicted · none · ref 26

    Quantum circuits show high average condition (97.56%) and decision (97.63%) coverage but lower path coverage (71.84%), with probabilistic versions adding confidence levels (averages 88.87%, 88.65%, 37.18%); mutation testing reveals weak or no correlation between structural coverage and fault finding

  • Quality-Driven Selective Mutation for Deep Learning cs.SE · 2026-04-24 · unverdicted · none · ref 21

    A dual-axis quality framework ranks DL mutation operators by statistical resistance and Jaccard-based realism to real faults, enabling up to 55.6% fewer mutants on held-out validation data without dropping baseline performance.

  • QuanForge: A Mutation Testing Framework for Quantum Neural Networks cs.SE · 2026-04-22 · unverdicted · none · ref 26

    QuanForge introduces statistical mutation killing and nine post-training mutation operators for QNNs to distinguish test suites and localize vulnerable circuit regions.

  • MutDafny: A Mutation-Based Approach to Assess Dafny Specifications cs.SE · 2025-11-19 · conditional · none · ref 30

    MutDafny uses 40 mutation operators on 794 real-world Dafny programs to detect weak specifications, manually confirming five such cases at a rate of one per 241 lines.