
arxiv: 2601.08648 · v2 · pith:65PLZADTnew · submitted 2026-01-13 · 💻 cs.CL · cs.LG

Safe Language Generation in the Limit

classification 💻 cs.CL cs.LG
keywords language, generation, safe, identification, impossible, limit, learning, tractable

Recent results in learning a language in the limit have shown that, although language identification is impossible, language generation is tractable. As this foundational area expands, we need to consider the implications of language generation in real-world settings. This work offers the first theoretical treatment of safe language generation. Building on the computational paradigm of learning in the limit, we formalize the tasks of safe language identification and generation. We prove that under this model, safe language identification is impossible, and that safe language generation is at least as hard as (vanilla) language identification, which is also impossible. Last, we discuss several intractable and tractable cases.
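As a rough illustration of what "generation in the limit" means (not the paper's construction, and not its safety model), the sketch below uses a toy finite collection in which each candidate language is the set of positive multiples of some integer k. The collection, the function names `consistent` and `generate`, and the choice of languages are all assumptions made for this example. The generator never tries to name the true language; it only outputs an unseen string that lies in every candidate still consistent with the samples, which is enough to guarantee the output belongs to the true language.

```python
from functools import reduce
from math import gcd


def lcm(a: int, b: int) -> int:
    """Least common multiple of two positive integers."""
    return a * b // gcd(a, b)


def consistent(candidates: list[int], seen: set[int]) -> list[int]:
    """Keep each k whose language (multiples of k) contains every sample seen so far."""
    return [k for k in candidates if all(s % k == 0 for s in seen)]


def generate(candidates: list[int], seen: set[int]) -> int:
    """Return an unseen element that lies in every still-consistent language.

    Toy assumption: the true language is one of the candidates, so it is
    always among the consistent ones, and any common multiple of the
    consistent k values is therefore a valid string of the true language.
    """
    alive = consistent(candidates, seen)
    step = reduce(lcm, alive, 1)
    x = step
    while x in seen:  # skip strings the adversary has already shown
        x += step
    return x


# Hypothetical run: the true language is the multiples of 6.
candidates = [2, 3, 4, 6]
seen: set[int] = set()
outputs = []
for sample in (6, 12, 30):
    seen.add(sample)
    outputs.append(generate(candidates, seen))
```

In this run the generator outputs valid unseen multiples of 6 at every step while three candidates (2, 3 and 6) remain consistent, so it generates correctly without ever identifying the target. The toy works only because the consistent languages share an infinite intersection; general collections need more care.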



Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Contrastive Identification and Generation in the Limit

    cs.LG 2026-05 unverdicted novelty 8.0

    Contrastive pair presentations yield exact identifiability characterizations via a geometric refinement of Angluin's condition, a new contrastive closure dimension for generation, mutual incomparability with text iden...

  2. On Language Generation in the Limit with Bounded Memory

    cs.DS 2026-05 unverdicted novelty 7.0

    Memoryless generation succeeds for any countable collection of infinite languages under an enumeration restriction, with optimal minimax densities for finite collections via Sperner's theorem; sliding windows add no w...

  3. Mistake-Bounded Language Generation

    cs.LG 2026-05 unverdicted novelty 6.0

    Defines mistake-bounded generation and gives an algorithm for finite classes achieving optimal last-mistake time Cdim(L) with floor(log2 |L|) mistakes, plus a trade-off for infinite classes and noisy extensions.