The MRC protocol, combined with multi-plane Clos and SRv6, enables large AI training clusters to continue jobs through network failures that previously halted training.
victim” traffic as discussed in Section 5.2.8. Here we present some additional results for the same cross-spine 7-to-1 incast traffic pattern run in parallel to a “victim
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.NI (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
- Resilient AI Supercomputer Networking using MRC and SRv6