Zhenghua Wang, Yiran Ding, Changze Lv, Zhibo Xu, Tianlong Li, Tianyuan Shi, Xiaoqing Zheng, Xuanjing Huang
Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling
https://arxiv.org/abs/2503.04355
Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling
https://arxiv.org/abs/2503.04355
Comments