“Practical DNN Quantization Techniques and Tools,” a Presentation from Facebook

Raghuraman Krishnamoorthi, Software Engineer at Facebook, presents the “Practical DNN Quantization Techniques and Tools” tutorial at the September 2020 Embedded Vision Summit.

Quantization is a key technique to enable the efficient deployment of deep neural networks. In this talk, Krishnamoorthi presents an overview of techniques for quantizing convolutional neural networks for inference with integer weights and activations.

Krishnamoorthi explores simple and advanced quantization approaches and examine their effects on latency and accuracy on various target processors. He also presents best practices for quantization-aware training to obtain high accuracy with quantized weights and activations.

See here for a PDF of the slides.

Here you’ll find a wealth of practical technical insights and expert advice to help you bring AI and visual intelligence into your products without flying blind.

Contact

Address

Berkeley Design Technology, Inc.
PO Box #4446
Walnut Creek, CA 94596

Phone
Phone: +1 (925) 954-1411
Scroll to Top