ML

Pushing the Limits of Large Language Model Quantization via the Linearity Theorem.
PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression.