Chen Liang
Chen Liang
Home
Publications
Experience
Teaching & Services
assa
Publications
Type
Conference paper
Date
2025
2023
2022
2021
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation
Zichong Li
+
,
Chen Liang
,
Zixuan Zhang
,
Ilgee Hong
,
Young Jin Kim
,
Weizhu Chen
,
Tuo Zhao
Arxiv
Phi-tiny-MoE-instruct
Phi-mini-MoE-instruct
Phi-mini-MoE-instruct-GGUF
GRIN-MoE Paper
Phi-3 Technical Report
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
Liliang Ren
,
Yang Liu
,
Yadong Lu
,
Yelong Shen
,
Chen Liang
,
Weizhu Chen
Arxiv
Github
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Yixiao Li*
,
Yifan Yu*
+
,
Chen Liang
,
Pengcheng He
,
Nikos Karampatziakis
,
Weizhu Chen
,
Tuo Zhao
Arxiv
Blog
Ckpt
Github
Module-wise Adaptive Distillation for Multimodality Foundation Models
Chen Liang
,
Jiahui Yu
,
Ming-Hsuan Yang
,
Matthew Brown
,
Yin Cui
,
Tuo Zhao
,
Boqing Gong
,
Tianyi Zhou
Arxiv
Less is More: Task-aware Layer-wise Distillation for Language Model Compression
Chen Liang
,
Simiao Zuo
,
Qingru Zhang
,
Pengcheng He
,
Weizhu Chen
,
Tuo Zhao
Arxiv
Github
Ckpt
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers
Chen Liang
,
Haoming Jiang
,
Zheng Li
,
Xianfeng Tang
,
Bing Yin
,
Tuo Zhao
Arxiv
PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance
Qingru Zhang
,
Simiao Zuo
,
Chen Liang
,
Alexander Bukharin
,
Pengcheng He
,
Weizhu Chen
,
Tuo Zhao
Arxiv
Github
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
Chen Liang
,
Haoming Jiang
,
Simiao Zuo
,
Pengcheng He
,
Xiaodong Liu
,
Jianfeng Gao
,
Weizhu Chen
,
Tuo Zhao
Arxiv
Github
CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing
Chen Liang
,
Pengcheng He
,
Yelong Shen
,
Weizhu Chen
,
Tuo Zhao
Arxiv
Github
Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach
Simiao Zuo
,
Chen Liang
,
Haoming Jiang
,
Xiaodong Liu
,
Pengcheng He
,
Jianfeng Gao
,
Weizhu Chen
,
Tuo Zhao
Arxiv
Github
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Chen Liang
,
Simiao Zuo
,
Minshuo Chen
,
Haoming Jiang
,
Xiaodong Liu
,
Pengcheng He
,
Tuo Zhao
,
Weizhu Chen
Arxiv
Github
Cite
×