Are large language models really good logical reasoners? A comprehensive evaluation and beyond

Xu, F · 2025 · arXiv 2025.353600

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

support 1

representative citing papers

LGMT: Logic-Grounded Metamorphic Testing for Evaluating the Reasoning Reliability of LLMs

cs.AI · 2026-05-12 · unverdicted · novelty 7.0

LGMT applies metamorphic testing derived from first-order logic equivalences to detect reasoning inconsistencies in LLMs that static benchmarks miss.

Wiring the 'Why': A Unified Taxonomy and Survey of Abductive Reasoning in LLMs

cs.AI · 2026-04-09 · accept · novelty 7.0

The paper delivers the first survey of abductive reasoning in LLMs, a unified two-stage taxonomy, a compact benchmark, and an analysis of gaps relative to deductive and inductive reasoning.

From Intention to Text: AI-Supported Goal Setting in Academic Writing

cs.HC · 2026-04-17 · unverdicted · novelty 6.0

WriteFlow is a voice-based AI system that scaffolds metacognitive regulation in academic writing by enabling iterative goal refinement, goal-text alignment, and evaluation of goal fulfillment, as demonstrated in user studies.

citing papers explorer

Showing 3 of 3 citing papers.

LGMT: Logic-Grounded Metamorphic Testing for Evaluating the Reasoning Reliability of LLMs

cs.AI · 2026-05-12 · unverdicted · none · ref 55

LGMT applies metamorphic testing derived from first-order logic equivalences to detect reasoning inconsistencies in LLMs that static benchmarks miss.

Wiring the 'Why': A Unified Taxonomy and Survey of Abductive Reasoning in LLMs

cs.AI · 2026-04-09 · accept · none · ref 103

The paper delivers the first survey of abductive reasoning in LLMs, a unified two-stage taxonomy, a compact benchmark, and an analysis of gaps relative to deductive and inductive reasoning.

From Intention to Text: AI-Supported Goal Setting in Academic Writing

cs.HC · 2026-04-17 · unverdicted · none · ref 26

WriteFlow is a voice-based AI system that scaffolds metacognitive regulation in academic writing by enabling iterative goal refinement, goal-text alignment, and evaluation of goal fulfillment, as demonstrated in user studies.
