LLM-QAT: Data-Free Quantization Aware Training for Large Language Models

Category
Network Quantization
Year/Month
2023-05
Status
Publications
Preprint