AI-Generated Smells: An Analysis of Code and Architecture in LLM and Agent-Driven Development

· 2026 · cs.SE · arXiv 2605.02741

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

The promise of Large Language Models in automated software engineering is often measured by functional correctness, overlooking the critical issue of long term maintainability. This paper presents a systematic audit of technical debt in AI-generated software, revealing that AI does not eliminate flaws but rather introduces a distinct machine signature of defects. Our multi-scale analysis, spanning single-file algorithmic tasks and complex, agent generated systems, identifies a fundamental Reasoning-Complexity Trade-off: as models become more capable, they generate increasingly bloated and coupled code. This architectural decay is so pronounced that we establish a Volume-Quality Inverse Law, where code volume is a near perfect predictor of structural degradation. Crucially, we demonstrate that neither functional correctness nor detailed prompting mitigates this decay. These findings challenge the current paradigm of prompt-driven generation, reframing the central problem of AI-based software engineering from one of code generation to one of architectural complexity management. We conclude that future progress depends on equipping agents with explicit architectural foresight to ensure the software they build is not just functional, but also maintainable.

representative citing papers

Rethinking Complexity Metrics for LLM-Integrated Applications: Beyond Source Code

cs.AI · 2026-07-02 · unverdicted · novelty 7.0

HECATE generates and validates ten complexity metrics (seven new) for LLM apps by treating prompts as behavioral specifications and filtering against maintenance activity from version history, showing prompt complexity as an independent factor.

Microskill Architecture: A Modular Skill-Driven Framework for AI-Native Code Generation

cs.SE · 2026-06-04 · unverdicted · novelty 4.0

MicroSkill Architecture partitions knowledge into atomic skill capsules selected via constrained optimization to cut token use over 90% and improve code generation metrics in one enterprise case study.

citing papers explorer

Showing 2 of 2 citing papers.

Rethinking Complexity Metrics for LLM-Integrated Applications: Beyond Source Code cs.AI · 2026-07-02 · unverdicted · none · ref 36 · internal anchor
HECATE generates and validates ten complexity metrics (seven new) for LLM apps by treating prompts as behavioral specifications and filtering against maintenance activity from version history, showing prompt complexity as an independent factor.
Microskill Architecture: A Modular Skill-Driven Framework for AI-Native Code Generation cs.SE · 2026-06-04 · unverdicted · none · ref 2 · internal anchor
MicroSkill Architecture partitions knowledge into atomic skill capsules selected via constrained optimization to cut token use over 90% and improve code generation metrics in one enterprise case study.

AI-Generated Smells: An Analysis of Code and Architecture in LLM and Agent-Driven Development

fields

years

verdicts

representative citing papers

citing papers explorer