M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection

Wang, Yuxia, Mansurov, Jonibek, Ivanov, Petar, Su, Jinyan, Shelmanov, Artem, Tsvigun, Akim · 2024 · DOI 10.18653/v1/2024.eacl-long.83

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Hitting a Moving Target: Test-Time Adaptation for AI Text Detection under Continual Distribution Shift

cs.CL · 2026-06-23 · unverdicted · novelty 6.0

Test-time adaptation with semi-supervised learning leverages inference-time homogeneity to maintain AI text detection performance under adversarial humanization, new LLMs, and temporal drift.

Adversarial Creation and Detection of AI-Generated Social Bot Content

cs.CL · 2026-06-05 · unverdicted · novelty 6.0

An adversarial methodology generates a multilingual cross-platform dataset of paired human-AI social messages, and models trained on it outperform prior detectors on real-world out-of-distribution data.

Inverse Turing Bench: Evaluating Language Models as Judges of Human vs. AI Dialogue

cs.CL · 2026-06-20 · unverdicted · novelty 5.0

Inverse Turing Bench evaluates LLMs on distinguishing human-human from human-AI dialogues, with GPTZero at 89.41%, Claude Opus-4.6 at 77.92%, and GPT-5.5 at 75.94% accuracy.

citing papers explorer

Showing 3 of 3 citing papers.

Hitting a Moving Target: Test-Time Adaptation for AI Text Detection under Continual Distribution Shift cs.CL · 2026-06-23 · unverdicted · none · ref 103
Test-time adaptation with semi-supervised learning leverages inference-time homogeneity to maintain AI text detection performance under adversarial humanization, new LLMs, and temporal drift.
Adversarial Creation and Detection of AI-Generated Social Bot Content cs.CL · 2026-06-05 · unverdicted · none · ref 63
An adversarial methodology generates a multilingual cross-platform dataset of paired human-AI social messages, and models trained on it outperform prior detectors on real-world out-of-distribution data.
Inverse Turing Bench: Evaluating Language Models as Judges of Human vs. AI Dialogue cs.CL · 2026-06-20 · unverdicted · none · ref 69
Inverse Turing Bench evaluates LLMs on distinguishing human-human from human-AI dialogues, with GPTZero at 89.41%, Claude Opus-4.6 at 77.92%, and GPT-5.5 at 75.94% accuracy.

M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection

fields

years

verdicts

representative citing papers

citing papers explorer