Code LLMs generate substantially worse comments outside English, and no tested automatic metric or LLM judge reliably matches human assessment of those outputs.
R ob BERT : a D utch R o BERT a-based L anguage M odel
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
years
2026 3verdicts
UNVERDICTED 3representative citing papers
LLaMA 3.1 extracts visual rating scores from Dutch neuroradiology reports with 87-96% balanced accuracy but only 66-80% on numerical counts, with few-shot prompting raising the latter to 81-92%.
PortBERT releases two RoBERTa models for Portuguese that match or beat prior monolingual and multilingual models on translated GLUE/SuperGLUE tasks while reporting training and inference times.
citing papers explorer
-
Automatic Extraction of Structured Information from Brain MRI Reports Using an Open-Weight Large Language Model
LLaMA 3.1 extracts visual rating scores from Dutch neuroradiology reports with 87-96% balanced accuracy but only 66-80% on numerical counts, with few-shot prompting raising the latter to 81-92%.