Title resolution pending

Majdinasab, Vahid; Bishop, Michael Joshua; Rasheed, Shawn; Moradidakhel, Arghavan; Tahir, Amjed; Khomh, Foutse · 2024 · arXiv 0148.2024

13 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 4

citation-polarity summary

background 4

representative citing papers

Demystifying the Silence of Correctness Bugs in PyTorch Compiler

cs.SE · 2026-04-09 · conditional · novelty 8.0

First empirical study of correctness bugs in torch.compile characterizes their patterns and proposes AlignGuard, which found 23 confirmed new bugs via LLM-guided test mutation.

Method-level Change-proneness: A Better Metric for Black-box Test Suite Minimization

cs.SE · 2026-05-13 · unverdicted · novelty 7.0

MCTM applies method-level change-proneness from version history and call-graph analysis to minimize black-box test suites, reporting 0.93 accuracy and 0.94 fault detection rate on 15 Java projects with 635 buggy versions.

Are We Lost in the Woods? Detecting Silent Semantic Faults for Random Forest Classifiers with Data-informed Static Analysis

cs.SE · 2026-06-05 · unverdicted · novelty 6.0

dille detects silent semantic faults in random forest ML pipelines with 91% precision via data-informed static analysis on Kaggle notebooks, finding 12-18% of scripts affected.

Beyond the Tip of the Iceberg: Understanding SATD in Dockerfiles through the Lens of Co-evolution

cs.SE · 2026-05-20 · unverdicted · novelty 6.0

Analysis of SATD in Dockerfiles shows 27% of admissions and 40% of repayments are coupled to non-Dockerfile artifacts, with coupled events repaid faster overall and external dependencies as a key trigger.

SecureForge: Finding and Preventing Vulnerabilities in LLM-Generated Code via Prompt Optimization

cs.CR · 2026-05-08 · unverdicted · novelty 6.0

SecureForge audits LLM code for vulnerabilities, builds a synthetic prompt corpus via Markovian sampling, and optimizes system prompts to cut security issues by up to 48% while preserving unit test performance, with zero-shot transfer to real prompts.

DynamicsLLM: a Dynamic Analysis-based Tool for Generating Intelligent Execution Traces Using LLMs to Detect Android Behavioural Code Smells

cs.SE · 2026-04-12 · unverdicted · novelty 6.0 · 2 refs

DynamicsLLM uses LLMs to generate execution traces that cover three times more code smell-related events than the prior Dynamics tool on 333 F-Droid Android apps, with a hybrid method adding 25.9% coverage for low-activity apps.

A Comparative Study of Semantic Log Representations for Software Log-based Anomaly Detection

cs.SE · 2026-04-09 · unverdicted · novelty 6.0

QTyBERT matches or exceeds BERT-based log anomaly detection effectiveness while reducing embedding generation time to near static word embedding levels.

TreeRanker: Fast and Model-agnostic Ranking System for Code Suggestions in IDEs

cs.SE · 2025-08-04 · unverdicted · novelty 6.0

TreeRanker ranks static code completions by organizing candidates in a prefix tree and collecting token scores via a single greedy language-model decoding pass.

XOXO: Stealthy Cross-Origin Context Poisoning Attacks against AI Coding Assistants

cs.CR · 2025-03-18 · unverdicted · novelty 6.0

XOXO is a cross-origin context poisoning attack on AI coding assistants that uses a Cayley Graph search algorithm (GCGS) to find stealthy perturbations, achieving 75.72% average success rate across five tasks and eleven models.

How Do Developers Use Migration Guides? A Case Study of Log4j

cs.SE · 2026-04-27 · unverdicted · novelty 5.0

Developers most frequently reference the full Log4j migration guide in pull request descriptions (82.81% of cases) and continue consulting it during post-update maintenance tasks.

Towards Better Static Code Analysis Reports: Sentence Transformer-based Filtering of Non-Actionable Alerts

cs.SE · 2026-04-20 · conditional · novelty 5.0

STAF applies sentence embeddings from transformers to classify SCA findings, reaching 89% F1 and beating prior filters by 11% within projects and 6% across projects.

The Case for Model Science: Verify, Explore, Steer, Refine

cs.AI · 2026-05-31 · unverdicted · novelty 4.0

Position paper proposing Model Science as a discipline to systematically analyze AI model behavior beyond benchmarks, drawing analogies from cognitive science, neuroscience, medicine, and agriculture.

Security of LLM-generated Code: A Comparative Analysis

cs.SE · 2026-05-21 · unverdicted · novelty 4.0

Empirical evaluation shows that code generated by all seven tested LLMs contains vulnerabilities, most of them of critical or high severity.

citing papers explorer

Showing 13 of 13 citing papers.

Demystifying the Silence of Correctness Bugs in PyTorch Compiler · cs.SE · 2026-04-09 · conditional · none · ref 16
Method-level Change-proneness: A Better Metric for Black-box Test Suite Minimization · cs.SE · 2026-05-13 · unverdicted · none · ref 32
Are We Lost in the Woods? Detecting Silent Semantic Faults for Random Forest Classifiers with Data-informed Static Analysis · cs.SE · 2026-06-05 · unverdicted · none · ref 48
Beyond the Tip of the Iceberg: Understanding SATD in Dockerfiles through the Lens of Co-evolution · cs.SE · 2026-05-20 · unverdicted · none · ref 25
SecureForge: Finding and Preventing Vulnerabilities in LLM-Generated Code via Prompt Optimization · cs.CR · 2026-05-08 · unverdicted · none · ref 22
DynamicsLLM: a Dynamic Analysis-based Tool for Generating Intelligent Execution Traces Using LLMs to Detect Android Behavioural Code Smells · cs.SE · 2026-04-12 · unverdicted · none · ref 6 · 2 links
A Comparative Study of Semantic Log Representations for Software Log-based Anomaly Detection · cs.SE · 2026-04-09 · unverdicted · none · ref 24
TreeRanker: Fast and Model-agnostic Ranking System for Code Suggestions in IDEs · cs.SE · 2025-08-04 · unverdicted · none · ref 16
XOXO: Stealthy Cross-Origin Context Poisoning Attacks against AI Coding Assistants · cs.CR · 2025-03-18 · unverdicted · none · ref 41
How Do Developers Use Migration Guides? A Case Study of Log4j · cs.SE · 2026-04-27 · unverdicted · none · ref 15
Towards Better Static Code Analysis Reports: Sentence Transformer-based Filtering of Non-Actionable Alerts · cs.SE · 2026-04-20 · conditional · none · ref 46
The Case for Model Science: Verify, Explore, Steer, Refine · cs.AI · 2026-05-31 · unverdicted · none · ref 220
Security of LLM-generated Code: A Comparative Analysis · cs.SE · 2026-05-21 · unverdicted · none · ref 41
