Integer-Only CNNs with 4 Bit Weights and Bit-Shift Quantization Scales at Full-Precision Accuracy
Quantization of neural networks has been one of the most popular techniques to compress models for embedded (IoT) hardware platforms with highly constrained latency, storage, memory-bandwidth, and energy specifications. Limiting the number of bits per weight and activation has been the main focus in...
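As a rough illustration of the idea named in the title (not the paper's actual algorithm, which is not detailed in this record), the sketch below shows how a weight tensor might be quantized to signed 4-bit integers with a power-of-two scale, so that rescaling can be done with an integer bit-shift instead of a floating-point multiply. The function names and the simple round-to-nearest scheme are assumptions for illustration only.

```python
import numpy as np

def quantize_weights_pow2(w, num_bits=4):
    """Quantize a float weight tensor to signed integers with a
    power-of-two scale (illustrative sketch, not the paper's method).

    Returns the integer weights and the shift s such that the
    dequantized approximation is w_int * 2**(-s).
    """
    qmax = 2 ** (num_bits - 1) - 1          # e.g. +7 for 4-bit signed
    max_abs = np.max(np.abs(w)) + 1e-12
    # Largest power-of-two scale that still covers max_abs:
    # max_abs * 2**s <= qmax  =>  s = floor(log2(qmax / max_abs)).
    s = int(np.floor(np.log2(qmax / max_abs)))
    w_int = np.clip(np.round(w * (2.0 ** s)), -qmax - 1, qmax).astype(np.int8)
    return w_int, s

def dequantize(w_int, s):
    """Recover a floating-point approximation of the original weights."""
    return w_int.astype(np.float32) * (2.0 ** (-s))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.normal(scale=0.1, size=(8, 8)).astype(np.float32)
    w_int, s = quantize_weights_pow2(w, num_bits=4)
    w_hat = dequantize(w_int, s)
    print("shift s =", s, " max abs error =", np.max(np.abs(w - w_hat)))
```

In an integer-only pipeline, the multiplication by 2**(-s) would be realized as an arithmetic right shift by s bits of the integer accumulator, which is the appeal of restricting quantization scales to powers of two.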
Saved in:

| Field | Value |
|---|---|
| Main Authors | |
| Format | article |
| Language | EN |
| Published | MDPI AG, 2021 |
| Subjects | |
| Online Access | https://doaj.org/article/9b83f42050394e609be6a8c4a4b79011 |