Title resolution pending

Gengyi Sun · 2025 · DOI 10.1109/icse-

10 Pith papers cite this work. Polarity classification is still indexing.

10 Pith papers citing it

open at publisher browse 10 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 3

citation-polarity summary

background 2 support 1

representative citing papers

How Do Developers Interact with AI? An Exploratory Study on Modeling Developer Programming Behavior

cs.SE · 2026-03-28 · unverdicted · novelty 7.0

Developers using AI assistants exhibit more stable emotions and greater focus on code creation, evaluation, and verification, captured in a new four-dimensional S-IASE model from retrospective labeling of screen recordings, surveys, and interviews.

Can Language Models Go Beyond Coding? Assessing the Capability of Language Models to Build Real-World Systems

cs.SE · 2025-11-02 · unverdicted · novelty 7.0

Build-bench is the first architecture-aware benchmark that evaluates LLMs on repairing cross-ISA build failures via iterative tool-augmented reasoning, with the best model reaching 63.19% success.

ClarifySTL: An Interactive LLM Agent Framework for STL Transformation through Requirements Clarification

cs.SE · 2026-05-02 · unverdicted · novelty 6.0

ClarifySTL uses LLM agents to interactively detect and resolve vagueness and ambiguity in natural language requirements via clarification queries before generating STL formulas, with evaluations on existing and new benchmarks showing effectiveness.

Quality-Driven Selective Mutation for Deep Learning

cs.SE · 2026-04-24 · unverdicted · novelty 6.0

A dual-axis quality framework ranks DL mutation operators by statistical resistance and Jaccard-based realism to real faults, enabling up to 55.6% fewer mutants on held-out validation data without dropping baseline performance.

Ethics Testing: Proactive Identification of Generative AI System Harms

cs.SE · 2026-04-23 · unverdicted · novelty 6.0

Ethics testing is introduced as a systematic approach to generate tests that identify software harms induced by unethical behavior in generative AI outputs.

LDMDroid: Leveraging LLMs for Detecting Data Manipulation Errors in Android Apps

cs.SE · 2026-04-01 · conditional · novelty 6.0

LDMDroid applies LLMs in a state-aware process to trigger data manipulation functions and uses visual cues to detect errors, finding 17 bugs across 24 Android apps with 14 developer confirmations.

Knowledge-Graph-Driven Data Synthesis for Low-Resource Software Development: A HarmonyOS Case Study

cs.SE · 2025-11-29 · unverdicted · novelty 6.0

APIKG4Syn synthesizes API-oriented training data via knowledge graphs and Monte Carlo search to fine-tune a 7B model that reaches 25% pass@1 on HarmonyOS code generation, beating untuned GPT-4o at 17.59%.

How Do Software Engineering Students Use Generative AI in Real-World Capstone Projects? An Empirical Baseline Study

cs.SE · 2026-04-27 · unverdicted · novelty 5.0

This empirical baseline study characterizes generative AI usage across the software lifecycle in capstone projects, student-recommended responsible practices, and client expectations for understanding and quality.

From Helpful to Trustworthy: LLM Agents for Pair Programming

cs.SE · 2026-04-11 · unverdicted · novelty 3.0

A research proposal for three studies on multi-agent LLM pair programming that externalizes intent and uses automated validation to increase trustworthiness.

Assessing REST API Test Generation Strategies with Log Coverage

cs.SE · 2026-04-08

citing papers explorer

Showing 10 of 10 citing papers.

How Do Developers Interact with AI? An Exploratory Study on Modeling Developer Programming Behavior cs.SE · 2026-03-28 · unverdicted · none · ref 5
Developers using AI assistants exhibit more stable emotions and greater focus on code creation, evaluation, and verification, captured in a new four-dimensional S-IASE model from retrospective labeling of screen recordings, surveys, and interviews.
Can Language Models Go Beyond Coding? Assessing the Capability of Language Models to Build Real-World Systems cs.SE · 2025-11-02 · unverdicted · none · ref 53
Build-bench is the first architecture-aware benchmark that evaluates LLMs on repairing cross-ISA build failures via iterative tool-augmented reasoning, with the best model reaching 63.19% success.
ClarifySTL: An Interactive LLM Agent Framework for STL Transformation through Requirements Clarification cs.SE · 2026-05-02 · unverdicted · none · ref 16
ClarifySTL uses LLM agents to interactively detect and resolve vagueness and ambiguity in natural language requirements via clarification queries before generating STL formulas, with evaluations on existing and new benchmarks showing effectiveness.
Quality-Driven Selective Mutation for Deep Learning cs.SE · 2026-04-24 · unverdicted · none · ref 18
A dual-axis quality framework ranks DL mutation operators by statistical resistance and Jaccard-based realism to real faults, enabling up to 55.6% fewer mutants on held-out validation data without dropping baseline performance.
Ethics Testing: Proactive Identification of Generative AI System Harms cs.SE · 2026-04-23 · unverdicted · none · ref 4
Ethics testing is introduced as a systematic approach to generate tests that identify software harms induced by unethical behavior in generative AI outputs.
LDMDroid: Leveraging LLMs for Detecting Data Manipulation Errors in Android Apps cs.SE · 2026-04-01 · conditional · none · ref 33
LDMDroid applies LLMs in a state-aware process to trigger data manipulation functions and uses visual cues to detect errors, finding 17 bugs across 24 Android apps with 14 developer confirmations.
Knowledge-Graph-Driven Data Synthesis for Low-Resource Software Development: A HarmonyOS Case Study cs.SE · 2025-11-29 · unverdicted · none · ref 62
APIKG4Syn synthesizes API-oriented training data via knowledge graphs and Monte Carlo search to fine-tune a 7B model that reaches 25% pass@1 on HarmonyOS code generation, beating untuned GPT-4o at 17.59%.
How Do Software Engineering Students Use Generative AI in Real-World Capstone Projects? An Empirical Baseline Study cs.SE · 2026-04-27 · unverdicted · none · ref 2
This empirical baseline study characterizes generative AI usage across the software lifecycle in capstone projects, student-recommended responsible practices, and client expectations for understanding and quality.
From Helpful to Trustworthy: LLM Agents for Pair Programming cs.SE · 2026-04-11 · unverdicted · none · ref 1
A research proposal for three studies on multi-agent LLM pair programming that externalizes intent and uses automated validation to increase trustworthiness.
Assessing REST API Test Generation Strategies with Log Coverage cs.SE · 2026-04-08 · unreviewed · ref 18

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer