LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Publication
ICLR, 2024