TIGER delivers the first GPU-accelerated high-precision TFHE implementations for LLM nonlinear layers, with measured speedups of 7.17x for GELU, 16.68x for Softmax, and 17.05x for LayerNorm over CPU baselines.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2representative citing papers
citing papers explorer
-
GPU Acceleration of TFHE-Based High-Precision Nonlinear Layers for Encrypted LLM Inference
TIGER delivers the first GPU-accelerated high-precision TFHE implementations for LLM nonlinear layers, with measured speedups of 7.17x for GELU, 16.68x for Softmax, and 17.05x for LayerNorm over CPU baselines.
- PROBE: Probabilistic Occupancy BEV Encoding with Analytical Translation Robustness for 3D Place Recognition