AI governance and accountability: An analysis of Anthropic’s Claude. arXiv preprint arXiv:2407.01557, 2024

Aman Priyanshu, Yash Maurya, Zuofei Hong · 2024 · arXiv 2407.01557

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

CodegenBench: Can LLMs Write Efficient Code Across Architectures?

cs.SE · 2026-06-01 · unverdicted · novelty 6.0

CodegenBench shows LLMs generate optimized code well for x86_64 but exhibit significant performance degradation on Sunway and Kunpeng due to limited documentation and training data.

Chain-of-Procedure: Hierarchical Visual-Language Reasoning for Procedural QA

cs.CL · 2026-05-14 · unverdicted · novelty 6.0

Introduces the ProcedureVQA benchmark and the Chain-of-Procedure framework, which improves VLM next-step prediction in procedures by up to 13% over baselines.

citing papers explorer

CodegenBench: Can LLMs Write Efficient Code Across Architectures?

cs.SE · 2026-06-01 · unverdicted · none · ref 38

Chain-of-Procedure: Hierarchical Visual-Language Reasoning for Procedural QA

cs.CL · 2026-05-14 · unverdicted · none · ref 32
