Chen Liang
Publications
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Yixiao Li*, Yifan Yu*, Chen Liang, Pengcheng He, Nikos Karampatziakis, Weizhu Chen, Tuo Zhao
Arxiv
Microsoft Blog
Ckpt
Github
Module-wise Adaptive Distillation for Multimodality Foundation Models
Chen Liang, Jiahui Yu, Ming-Hsuan Yang, Matthew Brown, Yin Cui, Tuo Zhao, Boqing Gong, Tianyi Zhou
Arxiv
LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation
Yixiao Li*, Yifan Yu*, Qingru Zhang, Chen Liang, Pengcheng He, Weizhu Chen, Tuo Zhao
Arxiv
Github
Less is More: Task-aware Layer-wise Distillation for Language Model Compression
Chen Liang, Simiao Zuo, Qingru Zhang, Pengcheng He, Weizhu Chen, Tuo Zhao
Arxiv
Github
Ckpt
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers
Chen Liang, Haoming Jiang, Zheng Li, Xianfeng Tang, Bing Yin, Tuo Zhao
Arxiv
PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance
Qingru Zhang, Simiao Zuo, Chen Liang, Alexander Bukharin, Pengcheng He, Weizhu Chen, Tuo Zhao
Arxiv
Github
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
Chen Liang, Haoming Jiang, Simiao Zuo, Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen, Tuo Zhao
Arxiv
Github
CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing
Chen Liang, Pengcheng He, Yelong Shen, Weizhu Chen, Tuo Zhao
Arxiv
Github
Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach
Simiao Zuo, Chen Liang, Haoming Jiang, Xiaodong Liu, Pengcheng He, Jianfeng Gao, Weizhu Chen, Tuo Zhao
Arxiv
Github
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Chen Liang, Simiao Zuo, Minshuo Chen, Haoming Jiang, Xiaodong Liu, Pengcheng He, Tuo Zhao, Weizhu Chen
Arxiv
Github