arXiv preprint arXiv:2503.04013
2 Pith papers cite this work. Polarity classification is still being indexed.
Citing papers by year: 2026 (2)
Verdicts: UNVERDICTED (2)
Citing papers
- AssayBench: An Assay-Level Virtual Cell Benchmark for LLMs and Agents
  AssayBench is a new gene-ranking benchmark for phenotypic CRISPR screens. It shows that zero-shot generalist LLMs outperform both biology-specific LLMs and trainable baselines on adjusted nDCG.
- GenomeQA: Benchmarking General Large Language Models for Genome Sequence Understanding
  The GenomeQA benchmark shows that general LLMs outperform random guessing on raw DNA sequences by detecting local patterns such as GC content, but struggle with multi-step or indirect genomic inferences.