CS3 strengthens two-tower retrievers via cycle-adaptive feature denoising, cross-tower mutual awareness, and cascade knowledge reuse, delivering consistent gains on public datasets and up to 8.36% revenue lift in production advertising at millisecond latency.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
A cognitive-uncertainty guided two-stage KD framework filters to 10.3% of samples to reach 0.9585 MAP@3 and 84.38% accuracy with a 4B model, beating larger LLMs on misconception classification.
citing papers explorer
-
Cognitive-Uncertainty Guided Knowledge Distillation for Accurate Classification of Student Misconceptions
A cognitive-uncertainty guided two-stage KD framework filters to 10.3% of samples to reach 0.9585 MAP@3 and 84.38% accuracy with a 4B model, beating larger LLMs on misconception classification.