Title resolution pending

Because ⊛ is chosen non-deterministically in the AI- Control Game, Safety, Use are measured conditional on the mode, the probability (up to Red’s actions) of a given ⊕ trajectory in the ZSPOSG is half the probability of the corre · 1997

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols

cs.AI · 2024-09-12 · unverdicted · novelty 7.0

Introduces AI-Control Games as a formal decision model for AI deployment red-teaming, with reductions to zero-sum POSGs for synthesizing Pareto-optimal trusted monitoring protocols and analysis of modeling assumptions.

citing papers explorer

Showing 1 of 1 citing paper.

Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols cs.AI · 2024-09-12 · unverdicted · none · ref 17
Introduces AI-Control Games as a formal decision model for AI deployment red-teaming, with reductions to zero-sum POSGs for synthesizing Pareto-optimal trusted monitoring protocols and analysis of modeling assumptions.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer