arXiv preprint arXiv:2311.13647

Language model inversion · 2023 · arXiv 2311.13647

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Black-Box Inference of LLM Architectural Properties with Restrictive API Access

cs.LG · 2026-07-01 · unverdicted · novelty 6.0

NightVision recovers LLM hidden dimension to within 23% average relative error (9% on MoE models) and depth and parameter count to within 53% on models with more than 3B parameters, using common-set prompting, spectral analysis, and time to first token (TTFT) under single-logit black-box access.

Why Trust Your Agent? Empirical Security Gains from TRiSM-Guided Agentic Workflows in Healthcare

cs.CR · 2026-06-27 · unverdicted · novelty 4.0

TRiSM-guided agentic workflows reduced RAG poisoning attack success from 31% to 10%, data-field injection from 42% to 25%, eliminated network injection, and raised report accuracy from 72.5% to 86.5% across five LLMs and 800 generations.

citing papers explorer

Showing 2 of 2 citing papers.

Black-Box Inference of LLM Architectural Properties with Restrictive API Access · cs.LG · 2026-07-01 · unverdicted · none · ref 36
Why Trust Your Agent? Empirical Security Gains from TRiSM-Guided Agentic Workflows in Healthcare · cs.CR · 2026-06-27 · unverdicted · none · ref 11
