Beyond Pattern Matching: Seven Cross-Domain Techniques for Prompt Injection Detection

Thamilvendhan Munirathinam

arxiv: 2604.18248 · v2 · pith:IFMVKYUVnew · submitted 2026-04-20 · 💻 cs.CR · cs.CL

Beyond Pattern Matching: Seven Cross-Domain Techniques for Prompt Injection Detection

Thamilvendhan Munirathinam This is my paper

Pith reviewed 2026-05-21 00:46 UTC · model grok-4.3

classification 💻 cs.CR cs.CL

keywords prompt injectiondetection techniquescross-domainlocal alignmentstylometryLLM securityforensic linguisticssequence alignment

0 comments

The pith

Porting mechanisms from seven outside fields improves prompt injection detection over regex and fine-tuned models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper establishes that detection techniques borrowed from forensic linguistics, bioinformatics, materials science, and four other domains can address the weaknesses of current prompt injection detectors. Current methods like regular expressions fail on paraphrased attacks, and fine-tuned classifiers can be bypassed by adaptive adversaries. By implementing and testing three of these cross-domain techniques on multiple benchmarks, the work shows measurable gains in F1 scores without increasing false positives. A reader would care because robust detection is essential for securing LLM applications against manipulation.

Core claim

The paper's central claim is that seven specific techniques—local sequence alignment from bioinformatics, stylometry from forensic linguistics, fatigue tracking from materials science, plus others from network security deception, economics mechanism design, epidemiology spectral analysis, and compiler taint tracking—can be ported to LLM prompt streams to detect injections more effectively than standard approaches.

What carries the argument

Cross-domain porting of analytical mechanisms, such as the local-alignment detector which applies sequence alignment to identify injection patterns in prompts.

If this is right

The local-alignment detector increases F1 score on the deepset dataset from 0.033 to 0.378 with zero added false positives.
The stylometric detector improves F1 by 11.1 percentage points on an indirect-injection benchmark.
The fatigue tracker passes validation through a probing-campaign integration test.
Three of the techniques are released in prompt-shield v0.4.1 under Apache 2.0.
These methods offer potential resilience against adaptive attacks that bypass fine-tuned classifiers.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If successful, this cross-domain strategy could be extended to detect other LLM vulnerabilities like data poisoning or model extraction.
Combining these ported detectors with existing ones might create more robust multi-layer defenses.
Further testing on real-world deployment scenarios could identify which imported mechanisms transfer most reliably.
The approach opens the door to using economic game theory or epidemiological models for broader AI safety problems.

Load-bearing premise

The mechanisms from other disciplines will continue to detect injections effectively when applied to LLM prompts without needing major retraining or adaptation for the new domain.

What would settle it

A set of new adaptive attacks that evade the local-alignment and stylometric detectors while maintaining high success rates on the evaluated benchmarks would falsify the claim of improved detection power.

read the original abstract

Current open-source prompt-injection detectors converge on two architectural choices: regular-expression pattern matching and fine-tuned transformer classifiers. Both share failure modes that recent work has made concrete. Regular expressions miss paraphrased attacks. Fine-tuned classifiers are vulnerable to adaptive adversaries: a 2025 NAACL Findings study reported that eight published indirect-injection defenses were bypassed with greater than fifty percent attack success rates under adaptive attacks. This work proposes seven detection techniques that each port a specific mechanism from a discipline outside large-language-model security: forensic linguistics, materials-science fatigue analysis, deception technology from network security, local-sequence alignment from bioinformatics, mechanism design from economics, spectral signal analysis from epidemiology, and taint tracking from compiler theory. Three of the seven techniques are implemented in the prompt-shield v0.4.1 release (Apache 2.0) and evaluated in a four-configuration ablation across six datasets including deepset/prompt-injections, NotInject, LLMail-Inject, AgentHarm, and AgentDojo. The local-alignment detector lifts F1 on deepset from 0.033 to 0.378 with zero additional false positives. The stylometric detector adds 11.1 percentage points of F1 on an indirect-injection benchmark. The fatigue tracker is validated via a probing-campaign integration test. All code, data, and reproduction scripts are released under Apache 2.0.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper ports seven mechanisms from other fields to prompt injection detection, releases code with some static F1 gains, but skips the adaptive attack tests it cites as a key failure mode for prior work.

read the letter

The main point is that this paper tries to improve prompt injection detection by borrowing seven specific techniques from outside LLM security, including local sequence alignment from bioinformatics, stylometry from forensic linguistics, and fatigue tracking from materials science. They implement three of them in the prompt-shield release and report concrete improvements on static benchmarks across datasets like deepset and an indirect-injection set. The local-alignment approach raises F1 from 0.033 to 0.378 with no extra false positives, and stylometry adds 11.1 points. Releasing the code, data, and scripts under Apache 2.0 is useful for anyone who wants to test or extend the ideas themselves.

Referee Report

1 major / 2 minor

Summary. The paper proposes seven prompt-injection detection techniques ported from outside LLM security (forensic linguistics/stylometry, bioinformatics/local sequence alignment, materials-science fatigue tracking, deception technology, mechanism design, spectral analysis, and taint tracking). Three are implemented in prompt-shield v0.4.1 and evaluated via four-configuration ablations on six datasets (deepset/prompt-injections, NotInject, LLMail-Inject, AgentHarm, AgentDojo). Reported results include an F1 lift from 0.033 to 0.378 on deepset with the local-alignment detector (zero added false positives) and an 11.1 pp F1 gain on an indirect-injection benchmark with the stylometric detector; the fatigue tracker is validated via probing integration. All code, data, and scripts are released under Apache 2.0.

Significance. If the robustness claims hold, the work meaningfully expands the detection toolkit beyond regex and fine-tuned classifiers by importing mechanisms that may evade the adaptive-attack failure modes documented in the cited 2025 NAACL Findings paper. The explicit release of reproducible code, data, and reproduction scripts is a clear strength that supports community verification and extension.

major comments (1)

[§4] §4 (Evaluation) and the abstract: the central motivation is the >50% adaptive-attack bypass rate for eight prior indirect-injection defenses (2025 NAACL Findings). Yet all reported F1 lifts (local-alignment on deepset, stylometric on indirect benchmark) derive from static held-out dataset ablations only. No adaptive-adversary experiments are described, which directly undermines the claim that the cross-domain ports deliver practically useful detection.

minor comments (2)

The abstract states concrete F1 numbers but the manuscript provides neither error bars nor statistical significance tests for the reported lifts; adding these would strengthen the ablation claims.
[§3] Details on the precise adaptation steps for each cross-domain mechanism (e.g., how local alignment from bioinformatics is parameterized for token streams) are only sketched; a short appendix with pseudocode or hyper-parameter tables would improve reproducibility.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on our evaluation design. We address the major comment point by point below and commit to revisions that clarify the scope of our claims.

read point-by-point responses

Referee: [§4] §4 (Evaluation) and the abstract: the central motivation is the >50% adaptive-attack bypass rate for eight prior indirect-injection defenses (2025 NAACL Findings). Yet all reported F1 lifts (local-alignment on deepset, stylometric on indirect benchmark) derive from static held-out dataset ablations only. No adaptive-adversary experiments are described, which directly undermines the claim that the cross-domain ports deliver practically useful detection.

Authors: We agree that the motivation section and abstract reference the >50% bypass rates from the 2025 NAACL Findings paper to motivate the need for new detection approaches. Our evaluations are indeed limited to static held-out ablations across the six datasets, showing F1 improvements such as the lift from 0.033 to 0.378 on deepset/prompt-injections for the local-alignment detector. We do not claim that these results prove robustness to adaptive adversaries; rather, the cross-domain techniques are presented as alternatives that may avoid the documented failure modes of regex and fine-tuned classifiers. We will revise the abstract, §4, and a new limitations subsection to explicitly note that adaptive-adversary testing is absent from the current work and is planned as future research. This revision will temper any implication of immediate practical robustness while preserving the value of the static benchmark results as an initial demonstration. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical measurements on held-out data with no self-referential definitions or fitted predictions

full rationale

The paper ports mechanisms from external disciplines and reports direct F1 improvements measured on static held-out datasets (deepset, indirect-injection benchmark, etc.). No equations, parameters fitted to subsets then renamed as predictions, or self-citations that bear the load of the central claims appear in the abstract or described results. The F1 lifts are presented as empirical observations rather than quantities defined in terms of the paper's own constructs. The cited NAACL result serves only as motivation for prior detector weaknesses and does not reduce the new detectors' reported performance to a self-referential input.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The abstract supplies no explicit free parameters, axioms, or invented entities; the central claims rest on the unstated premise that cross-domain porting preserves signal without introducing new fitting steps.

pith-pipeline@v0.9.0 · 5776 in / 1231 out tokens · 25429 ms · 2026-05-21T00:46:13.780809+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

The stylometric approach recasts indirect-injection detection as a well-studied style-change-boundary problem... Jensen-Shannon divergence... The d028 detector maintains a curated database... Smith-Waterman algorithm... substitution matrix
IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean J_uniquely_calibrated_via_higher_derivative unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

adversarial fatigue tracking... exponentially-weighted moving average... Basquin's stress-life curves

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

26 extracted references · 26 canonical work pages · 6 internal anchors

[1]

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Andriushchenko, M., Souly, A., et al. "AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents." ICLR 2025. arXiv:2410.09024

work page internal anchor Pith review Pith/arXiv arXiv 2025
[2]

The Exponential Law of Endurance Tests

Basquin, O. H. "The Exponential Law of Endurance Tests." Proceedings of the American Society for Testing and Materials 10:625-630, 1910

work page 1910
[3]

Overview of PAN 2024: Multi-author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification

Bevendorff, J., et al. "Overview of PAN 2024: Multi-author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification." CLEF 2024. DOI 10.1007/978-3-031-71908-0_11

work page doi:10.1007/978-3-031-71908-0_11 2024
[4]

Securing AI Agents with Information-Flow Control

Costa, M., Köpf, B., et al. "Securing AI Agents with Information-Flow Control." Microsoft Research, arXiv:2505.23643, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025
[5]

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

Debenedetti, E., Zhang, J., Balunovic, M., Beurer-Kellner, L., Fischer, M., Tramèr, F. "AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents." NeurIPS 2024 Datasets and Benchmarks. arXiv:2406.13352

work page internal anchor Pith review Pith/arXiv arXiv 2024
[6]

TaintP2X: Detecting Taint-Style Prompt-to-Anything Injection Vulnerabilities in LLM-Integrated Applications

He, X., Wang, B., Zhao, Y., Hou, X., Liu, J., Zou, H., Wang, H. "TaintP2X: Detecting Taint-Style Prompt-to-Anything Injection Vulnerabilities in LLM-Integrated Applications." ICSE 2026 Research Track

work page 2026
[7]

& Henikoff, J

Henikoff, S., Henikoff, J. G. "Amino acid substitution matrices from protein blocks." Proceedings of the National Academy of Sciences 89(22):10915-10919, 1992. DOI 10.1073/pnas.89.22.10915

work page doi:10.1073/pnas.89.22.10915 1992
[8]

Defending Against Indirect Prompt Injection Attacks With Spotlighting

Hines, K., Lopez, G., Hall, M., Zarfati, F., Zunger, Y., Kiciman, E. "Defending Against Indirect Prompt Injection Attacks With Spotlighting." arXiv:2403.14720, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[9]

Logarithmic Market Scoring Rules for Modular Combinatorial Information Aggregation

Hanson, R. "Logarithmic Market Scoring Rules for Modular Combinatorial Information Aggregation." Journal of Prediction Markets 1(1):3-15, 2007

work page 2007
[10]

PIGuard: Prompt Injection Guardrail via Mitigating Overdefense for Free

Li, H., Liu, Y., Zhang, C., Xiao, Y. "PIGuard: Prompt Injection Guardrail via Mitigating Overdefense for Free." ACL 2025 Long Papers. aclanthology.org/2025.acl-long.1468

work page 2025
[11]

1991, IEEE Transactions on Information Theory, 37, 145, 10.1109/18.61115

Lin, J. "Divergence Measures Based on the Shannon Entropy." IEEE Transactions on Information Theory 37(1):145-151, 1991. DOI 10.1109/18.61115

work page doi:10.1109/18.61115 1991
[12]

Formalizing and benchmarking prompt injection attacks and defenses,

Liu, Y., Jia, Y., Geng, R., Jia, J., Gong, N. Z. "Formalizing and Benchmarking Prompt Injection Attacks and Defenses." USENIX Security 2024. arXiv:2310.12815

work page arXiv 2024
[13]

The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against Llm Jailbreaks and Prompt Injections

Nasr, M., Carlini, N., Sitawarin, C., et al. "The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against LLM Jailbreaks and Prompt Injections." arXiv:2510.09023,

work page internal anchor Pith review Pith/arXiv arXiv
[14]

[Submission status pending; OpenReview 7B9mTg7z25.]

work page
[15]

StyloAI: Distinguishing AI-Generated Content with Stylometric Analysis

Opara, C. "StyloAI: Distinguishing AI-Generated Content with Stylometric Analysis." arXiv:2405.10129, 2024

work page arXiv 2024
[16]

Continuous Inspection Schemes

Page, E. S. "Continuous Inspection Schemes." Biometrika 41(1/2):100-115, 1954

work page 1954
[17]

Hacking back the ai-hacker: Prompt injection as a defense against llm-driven cyberattacks,

Pasquini, D., Corti, E., Ateniese, G. "Hacking Back the AI-Hacker: Prompt Injection as a Defense Against LLM-driven Cyberattacks." arXiv:2410.20911, 2024

work page arXiv 2024
[18]

https://doi.org/10.48550/arXiv.2410.13919, arXiv:2410.13919 [cs.CR] 18 Authors Suppressed Due to Excessive Length

Reworr, Volkov, D. "LLM Agent Honeypot: Monitoring AI Hacking Agents in the Wild." Palisade Research, arXiv:2410.13919, 2024

work page arXiv 2024
[19]

and Waterman, M.S

Smith, T. F., Waterman, M. S. "Identification of Common Molecular Subsequences." Journal of Molecular Biology 147:195-197, 1981. DOI 10.1016/0022-2836(81)90087-5

work page doi:10.1016/0022-2836(81)90087-5 1981
[20]

Fatigue of Materials

Suresh, S. Fatigue of Materials. Cambridge University Press, second edition, 1998. ISBN 9780521578479

work page 1998
[21]

Using WPCA and EWMA Control Chart to Construct a Network Intrusion Detection Model

Tsai, C.-W., et al. "Using WPCA and EWMA Control Chart to Construct a Network Intrusion Detection Model." IET Information Security, 2024. DOI 10.1049/2024/3948341

work page doi:10.1049/2024/3948341 2024
[22]

Assessing 3 Outbreak Detection Algorithms in an Electronic Syndromic Surveillance System

Vial, F., et al. "Assessing 3 Outbreak Detection Algorithms in an Electronic Syndromic Surveillance System." Emerging Infectious Diseases 26(9), US Centers for Disease Control and Prevention, 2020

work page 2020
[23]

Selfdefend: Llms can defend themselves against jailbreaking in a practical manner,

Wu, X., Wang, R., et al. "SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner." arXiv:2406.05498, 2024

work page arXiv 2024
[24]

Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on

Zhan, Q., Fang, H., Panchal, A., Kang, D. "Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents." NAACL 2025 Findings. arXiv:2503.00061

work page arXiv 2025
[25]

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents

Zhang, J., Yu, R., et al. "Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents." ICLR 2025. arXiv:2410.02644

work page internal anchor Pith review Pith/arXiv arXiv 2025
[26]

MELON: Provable defense against indirect prompt injection attacks in AI agents,

Zhu, K., Yang, Y., Wang, R., Guo, Y., Wang, H. "MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents." ICML 2025. arXiv:2502.05174. Appendix A. Released Artifacts All artifacts released alongside this paper are in the public repository at github.com/mthamil107/prompt-shield under the Apache 2.0 license. Relevant paths: • src/prom...

work page arXiv 2025

[1] [1]

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Andriushchenko, M., Souly, A., et al. "AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents." ICLR 2025. arXiv:2410.09024

work page internal anchor Pith review Pith/arXiv arXiv 2025

[2] [2]

The Exponential Law of Endurance Tests

Basquin, O. H. "The Exponential Law of Endurance Tests." Proceedings of the American Society for Testing and Materials 10:625-630, 1910

work page 1910

[3] [3]

Overview of PAN 2024: Multi-author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification

Bevendorff, J., et al. "Overview of PAN 2024: Multi-author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification." CLEF 2024. DOI 10.1007/978-3-031-71908-0_11

work page doi:10.1007/978-3-031-71908-0_11 2024

[4] [4]

Securing AI Agents with Information-Flow Control

Costa, M., Köpf, B., et al. "Securing AI Agents with Information-Flow Control." Microsoft Research, arXiv:2505.23643, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025

[5] [5]

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

Debenedetti, E., Zhang, J., Balunovic, M., Beurer-Kellner, L., Fischer, M., Tramèr, F. "AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents." NeurIPS 2024 Datasets and Benchmarks. arXiv:2406.13352

work page internal anchor Pith review Pith/arXiv arXiv 2024

[6] [6]

TaintP2X: Detecting Taint-Style Prompt-to-Anything Injection Vulnerabilities in LLM-Integrated Applications

He, X., Wang, B., Zhao, Y., Hou, X., Liu, J., Zou, H., Wang, H. "TaintP2X: Detecting Taint-Style Prompt-to-Anything Injection Vulnerabilities in LLM-Integrated Applications." ICSE 2026 Research Track

work page 2026

[7] [7]

& Henikoff, J

Henikoff, S., Henikoff, J. G. "Amino acid substitution matrices from protein blocks." Proceedings of the National Academy of Sciences 89(22):10915-10919, 1992. DOI 10.1073/pnas.89.22.10915

work page doi:10.1073/pnas.89.22.10915 1992

[8] [8]

Defending Against Indirect Prompt Injection Attacks With Spotlighting

Hines, K., Lopez, G., Hall, M., Zarfati, F., Zunger, Y., Kiciman, E. "Defending Against Indirect Prompt Injection Attacks With Spotlighting." arXiv:2403.14720, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[9] [9]

Logarithmic Market Scoring Rules for Modular Combinatorial Information Aggregation

Hanson, R. "Logarithmic Market Scoring Rules for Modular Combinatorial Information Aggregation." Journal of Prediction Markets 1(1):3-15, 2007

work page 2007

[10] [10]

PIGuard: Prompt Injection Guardrail via Mitigating Overdefense for Free

Li, H., Liu, Y., Zhang, C., Xiao, Y. "PIGuard: Prompt Injection Guardrail via Mitigating Overdefense for Free." ACL 2025 Long Papers. aclanthology.org/2025.acl-long.1468

work page 2025

[11] [11]

1991, IEEE Transactions on Information Theory, 37, 145, 10.1109/18.61115

Lin, J. "Divergence Measures Based on the Shannon Entropy." IEEE Transactions on Information Theory 37(1):145-151, 1991. DOI 10.1109/18.61115

work page doi:10.1109/18.61115 1991

[12] [12]

Formalizing and benchmarking prompt injection attacks and defenses,

Liu, Y., Jia, Y., Geng, R., Jia, J., Gong, N. Z. "Formalizing and Benchmarking Prompt Injection Attacks and Defenses." USENIX Security 2024. arXiv:2310.12815

work page arXiv 2024

[13] [13]

The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against Llm Jailbreaks and Prompt Injections

Nasr, M., Carlini, N., Sitawarin, C., et al. "The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against LLM Jailbreaks and Prompt Injections." arXiv:2510.09023,

work page internal anchor Pith review Pith/arXiv arXiv

[14] [14]

[Submission status pending; OpenReview 7B9mTg7z25.]

work page

[15] [15]

StyloAI: Distinguishing AI-Generated Content with Stylometric Analysis

Opara, C. "StyloAI: Distinguishing AI-Generated Content with Stylometric Analysis." arXiv:2405.10129, 2024

work page arXiv 2024

[16] [16]

Continuous Inspection Schemes

Page, E. S. "Continuous Inspection Schemes." Biometrika 41(1/2):100-115, 1954

work page 1954

[17] [17]

Hacking back the ai-hacker: Prompt injection as a defense against llm-driven cyberattacks,

Pasquini, D., Corti, E., Ateniese, G. "Hacking Back the AI-Hacker: Prompt Injection as a Defense Against LLM-driven Cyberattacks." arXiv:2410.20911, 2024

work page arXiv 2024

[18] [18]

https://doi.org/10.48550/arXiv.2410.13919, arXiv:2410.13919 [cs.CR] 18 Authors Suppressed Due to Excessive Length

Reworr, Volkov, D. "LLM Agent Honeypot: Monitoring AI Hacking Agents in the Wild." Palisade Research, arXiv:2410.13919, 2024

work page arXiv 2024

[19] [19]

and Waterman, M.S

Smith, T. F., Waterman, M. S. "Identification of Common Molecular Subsequences." Journal of Molecular Biology 147:195-197, 1981. DOI 10.1016/0022-2836(81)90087-5

work page doi:10.1016/0022-2836(81)90087-5 1981

[20] [20]

Fatigue of Materials

Suresh, S. Fatigue of Materials. Cambridge University Press, second edition, 1998. ISBN 9780521578479

work page 1998

[21] [21]

Using WPCA and EWMA Control Chart to Construct a Network Intrusion Detection Model

Tsai, C.-W., et al. "Using WPCA and EWMA Control Chart to Construct a Network Intrusion Detection Model." IET Information Security, 2024. DOI 10.1049/2024/3948341

work page doi:10.1049/2024/3948341 2024

[22] [22]

Assessing 3 Outbreak Detection Algorithms in an Electronic Syndromic Surveillance System

Vial, F., et al. "Assessing 3 Outbreak Detection Algorithms in an Electronic Syndromic Surveillance System." Emerging Infectious Diseases 26(9), US Centers for Disease Control and Prevention, 2020

work page 2020

[23] [23]

Selfdefend: Llms can defend themselves against jailbreaking in a practical manner,

Wu, X., Wang, R., et al. "SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner." arXiv:2406.05498, 2024

work page arXiv 2024

[24] [24]

Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on

Zhan, Q., Fang, H., Panchal, A., Kang, D. "Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents." NAACL 2025 Findings. arXiv:2503.00061

work page arXiv 2025

[25] [25]

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents

Zhang, J., Yu, R., et al. "Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents." ICLR 2025. arXiv:2410.02644

work page internal anchor Pith review Pith/arXiv arXiv 2025

[26] [26]

MELON: Provable defense against indirect prompt injection attacks in AI agents,

Zhu, K., Yang, Y., Wang, R., Guo, Y., Wang, H. "MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents." ICML 2025. arXiv:2502.05174. Appendix A. Released Artifacts All artifacts released alongside this paper are in the public repository at github.com/mthamil107/prompt-shield under the Apache 2.0 license. Relevant paths: • src/prom...

work page arXiv 2025