Understanding int4 quantization for transformer models: Latency speedup, composability, and failure cases

2 Pith papers cite this work. Polarity classification is still indexing.

citation summary

fields: cs.CL 1 · cs.CV 1
years: 2026 1 · 2024 1
verdicts: unverdicted 2
roles: method 1
polarities: use method 1

representative citing papers

Yi: Open Foundation Models by 01.AI

cs.CL · 2024-03-07 · unverdicted · novelty 4.0

Yi models are 6B and 34B open foundation models pretrained on 3.1T curated tokens that achieve strong benchmark results through data quality and targeted extensions like long context and vision alignment.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Yi: Open Foundation Models by 01.AI cs.CL · 2024-03-07 · unverdicted · none · ref 83