pith. sign in

Yujiong Shen

Identifiers

  • name variant Yujiong Shen 0.60 · backfill

Papers (4)

  1. LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening cs.CL · 2026 · author #4
  2. CL-bench Life: Can Language Models Learn from Real-Life Context? cs.CL · 2026 · author #2
  3. Can Deep Research Agents Retrieve and Organize? Evaluating the Synthesis Gap with Expert Taxonomies cs.CL · 2026 · author #7
  4. LLMEval-Fair: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models cs.CL · 2025 · author #2

Mentions

  • 2605.19597 #4 · arxiv_oai · confidence 0.70 Yujiong Shen
  • 2601.12369 #7 · arxiv_oai · confidence 0.70 Yujiong Shen

Frequent Coauthors