Safe Language Generation in the Limit

Antonios Anastasopoulos; Evgenios M. Kornaropoulos; Giuseppe Ateniese

arxiv: 2601.08648 · v2 · pith:65PLZADTnew · submitted 2026-01-13 · 💻 cs.CL · cs.LG

Safe Language Generation in the Limit

Antonios Anastasopoulos , Giuseppe Ateniese , Evgenios M. Kornaropoulos This is my paper

classification 💻 cs.CL cs.LG

keywords languagegenerationsafeidentificationimpossiblelimitlearningtractable

0 comments

read the original abstract

Recent results in learning a language in the limit have shown that, although language identification is impossible, language generation is tractable. As this foundational area expands, we need to consider the implications of language generation in real-world settings. This work offers the first theoretical treatment of safe language generation. Building on the computational paradigm of learning in the limit, we formalize the tasks of safe language identification and generation. We prove that under this model, safe language identification is impossible, and that safe language generation is at least as hard as (vanilla) language identification, which is also impossible. Last, we discuss several intractable and tractable cases.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Contrastive Identification and Generation in the Limit
cs.LG 2026-05 unverdicted novelty 8.0

Contrastive pair presentations yield exact identifiability characterizations via a geometric refinement of Angluin's condition, a new contrastive closure dimension for generation, mutual incomparability with text iden...
On Language Generation in the Limit with Bounded Memory
cs.DS 2026-05 unverdicted novelty 7.0

Memoryless generation succeeds for any countable collection of infinite languages under an enumeration restriction, with optimal minimax densities for finite collections via Sperner's theorem; sliding windows add no w...
Mistake-Bounded Language Generation
cs.LG 2026-05 unverdicted novelty 6.0

Defines mistake-bounded generation and gives an algorithm for finite classes achieving optimal last-mistake time Cdim(L) with floor(log2 |L|) mistakes, plus a trade-off for infinite classes and noisy extensions.