Bigbench: A unified benchmark for evaluating multi-dimensional social biases in text-to-image models.arXiv preprint arXiv:2407.15240, 2024

Hanjun Luo, Haoyu Huang, Ziye Deng, Xinfeng Li, Hewei Wang, Yingbin Jin, Yang Liu, Wenyuan Xu, Zuozhu Liu · 2024 · arXiv 2407.15240

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

Gender Artifacts from Art History to Text-to-Image Generation

cs.CV · 2026-06-04 · unverdicted · novelty 7.0

Introduces the StyleGender dataset and PixelSGA/MaskSGA metrics showing that text-to-image models amplify gender artifacts present in artistic styles beyond historical baselines.

AInstein: Can LLMs Solve Research Problems From Parametric Memory Alone?

cs.AI · 2025-10-06 · unverdicted · novelty 6.0

LLMs generate valid solutions to over 70% of AI research problems from parametric memory alone but rediscover the exact published approach less than 19% of the time, with performance limited by cross-domain analogical transfer.

citing papers explorer

Showing 1 of 1 citing paper after filters.

AInstein: Can LLMs Solve Research Problems From Parametric Memory Alone? cs.AI · 2025-10-06 · unverdicted · none · ref 7
LLMs generate valid solutions to over 70% of AI research problems from parametric memory alone but rediscover the exact published approach less than 19% of the time, with performance limited by cross-domain analogical transfer.

Bigbench: A unified benchmark for evaluating multi-dimensional social biases in text-to-image models.arXiv preprint arXiv:2407.15240, 2024

fields

years

verdicts

representative citing papers

citing papers explorer