LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation

Publication
The 40th International Conference on Machine Learning (ICML), 2023