Jade: A linguistics-based safety evaluation platform for large language models. arXiv preprint arXiv:2311.00286, 2023a

Mi Zhang, Xudong Pan, Min Yang · 2023 · arXiv 2311.00286

2 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

JT-SAFE-V2: Safety-by-Design Foundation Model with World-Context Data

cs.AI · 2026-05-23 · unverdicted · novelty 4.0

JT-Safe-V2 is a safety-by-design LLM that reports SOTA scores on both capability and safety benchmarks, while Safe-MoMA cuts inference cost by over 30 percent.

A Survey on LLM-as-a-Judge

cs.CL · 2024-11-23 · unverdicted · novelty 4.0

A survey on LLM-as-a-Judge that reviews reliability strategies, proposes evaluation methods, and introduces a novel benchmark for assessing such systems.

citing papers explorer

Showing 2 of 2 citing papers.

JT-SAFE-V2: Safety-by-Design Foundation Model with World-Context Data · cs.AI · 2026-05-23 · unverdicted · none · ref 14
A Survey on LLM-as-a-Judge · cs.CL · 2024-11-23 · unverdicted · none · ref 209
