QGen: On the Ability to Generalize in Quantization Aware Training

AskariHemmat, MohammadHossein; Jeddi, Ahmadreza; Hemmat, Reyhane Askari; Lazarevich, Ivan; Hoffman, Alexander; Sah, Sudhakar; Saboori, Ehsan; Savaria, Yvon; David, Jean-Pierre

Computer Science > Machine Learning

arXiv:2404.11769 (cs)

[Submitted on 17 Apr 2024 (v1), last revised 19 Apr 2024 (this version, v2)]

Title:QGen: On the Ability to Generalize in Quantization Aware Training

Authors:MohammadHossein AskariHemmat, Ahmadreza Jeddi, Reyhane Askari Hemmat, Ivan Lazarevich, Alexander Hoffman, Sudhakar Sah, Ehsan Saboori, Yvon Savaria, Jean-Pierre David

View PDF HTML (experimental)

Abstract:Quantization lowers memory usage, computational requirements, and latency by utilizing fewer bits to represent model weights and activations. In this work, we investigate the generalization properties of quantized neural networks, a characteristic that has received little attention despite its implications on model performance. In particular, first, we develop a theoretical model for quantization in neural networks and demonstrate how quantization functions as a form of regularization. Second, motivated by recent work connecting the sharpness of the loss landscape and generalization, we derive an approximate bound for the generalization of quantized models conditioned on the amount of quantization noise. We then validate our hypothesis by experimenting with over 2000 models trained on CIFAR-10, CIFAR-100, and ImageNet datasets on convolutional and transformer-based models.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.11769 [cs.LG]
	(or arXiv:2404.11769v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2404.11769

Submission history

From: MohammadHossein AskariHemmat [view email]
[v1] Wed, 17 Apr 2024 21:52:21 UTC (2,631 KB)
[v2] Fri, 19 Apr 2024 16:50:05 UTC (2,631 KB)

Computer Science > Machine Learning

Title:QGen: On the Ability to Generalize in Quantization Aware Training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:QGen: On the Ability to Generalize in Quantization Aware Training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators