OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models

Publication
International Conference on Learning Representations (ICLR) 2024

Related