SPINQUANT - LLM QUANTIZATION WITH LEARNED ROTATIONS

November 6, 2025

QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language Models

October 30, 2025