FormInv supplies an audit protocol, Semantic Consistency Rate (SCR), and a No-Free-Benchmark corollary showing that any target model ranking over nine frontier models can be realized by weighting paraphrase families.
A review and synthesis of the measurement invariance literature: Suggestions, practices, and recommendations for organizational research [J]
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Converting percentage scores to A/B/C/D grades reduces information entropy by 69 percent, makes optimal student clusters sensitive to single data points, and drops temporal diagnostic consistency from 93-96 percent to 52-96 percent.
citing papers explorer
-
FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks
FormInv supplies an audit protocol, Semantic Consistency Rate (SCR), and a No-Free-Benchmark corollary showing that any target model ranking over nine frontier models can be realized by weighting paraphrase families.
-
Data Aphasia: An Institutional Counterfactual Study of the Stability of Academic Cognition Under Letter-Grade Evaluation Systems
Converting percentage scores to A/B/C/D grades reduces information entropy by 69 percent, makes optimal student clusters sensitive to single data points, and drops temporal diagnostic consistency from 93-96 percent to 52-96 percent.