Shuffle the Context: RoPE-Perturbed Self-Distillation for Long-Context Adaptation

Publication
The 43rd International Conference on Machine Learning (ICML), 2026