Review history
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation
-
2026-05-15 UNVERDICTED
-
2026-05-07 UNVERDICTED
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation