Introduces SolidityBench benchmark and SolidityScore metric for repository-level Solidity code generation, finding supervised fine-tuning outperforms prompting, CoT, ICL, and RAG methods on evaluated LLMs.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Input/output constraints boost LLM-generated decision model structural similarity to gold standards by 37-54%, with models matching gold outcomes on 51-53% of test scenarios while removing redundant logic.
citing papers explorer
-
Repository-Level Solidity Code Generation with Large Language Models: From Prompting to Fine-Tuning
Introduces SolidityBench benchmark and SolidityScore metric for repository-level Solidity code generation, finding supervised fine-tuning outperforms prompting, CoT, ICL, and RAG methods on evaluated LLMs.
-
From Legal Text to Executable Decision Models: Evaluating Structured Representations for Legal Decision Model Generation
Input/output constraints boost LLM-generated decision model structural similarity to gold standards by 37-54%, with models matching gold outcomes on 51-53% of test scenarios while removing redundant logic.