SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Category: Network Quantization
Year/Month: 2022-11
Status: Done
Publications: ICML