Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective

Category
Year/Month
2023
Status
Publications
Findings of EACL