Title resolution pending

Ji, Y · 2025 · DOI 10.1038/s41746-025-01576-4

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver will retry the DOI and OpenAlex lookups on its next pass.

representative citing papers

Beyond Accuracy: Measuring Bias Acknowledgment in Chain-of-Thought Reasoning for Responsible AI Evaluation

cs.LG · 2026-06-13 · unverdicted · novelty 6.0

GPT-4o and Claude Sonnet 4 show similar susceptibility to bias on GSM8K (1.3% vs 1.2%) but differ sharply in acknowledgment rates (13% vs 75%) under a rubric-defined metric.

SafetyRepro: Configuration-Conditional Rank Instability on Alignment Benchmarks

cs.LG · 2026-05-25 · unverdicted · novelty 6.0

Configuration choices alone flip pairwise safety verdicts on every tested alignment benchmark, isolated via a finite-envelope proposition linking disagreement rate to strict ordering reversal.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Beyond Accuracy: Measuring Bias Acknowledgment in Chain-of-Thought Reasoning for Responsible AI Evaluation

cs.LG · 2026-06-13 · unverdicted · none · ref 6

SafetyRepro: Configuration-Conditional Rank Instability on Alignment Benchmarks

cs.LG · 2026-05-25 · unverdicted · none · ref 24
