Inconsistency of CMSIS-NN Quantization Method(Q-format) with ARM Documentation #116

42bPhD · 2024-03-04T15:24:51Z

Hello.

I am currently in the process of developing using the Q-Format (Qm.n) for quantization. However, upon reviewing the revision history, I noticed that starting from version 4.1.0, the q-format approach is no longer being followed. My current approach aligns with the methods outlined in the following ARM documentation links:

While TensorFlow Lite for Microcontrollers employs Zero Point and Scale Factor for quantization, which necessitates additional memory and floating-point operations, it appears that Q-format based quantization would be more suitable for Cortex-M processors due to these constraints.

Could you kindly provide a clear explanation for the necessity of this change? The absence of discussion regarding its impact on speed and accuracy has left me somewhat perplexed. Any insight into the rationale behind this decision would be greatly appreciated, as it would aid in understanding the best practices for quantization within the context of TensorFlow Lite for Microcontrollers and CMSIS-NN.

Thank you for your time and consideration.

JonatanAntoni · 2024-03-04T15:29:06Z

Hi @LEE-SEON-WOO, please recognise I have transferred your question to the new home for CMSIS-NN. This dups #115, hence I close it right away.

42bPhD · 2024-03-04T15:32:21Z

Hello. @JonatanAntoni Thank you for your attention.

Hi @LEE-SEON-WOO, please recognise I have transferred your question to the new home for CMSIS-NN. This dups #115, hence I close it right away.

JonatanAntoni transferred this issue from ARM-software/CMSIS_5 Mar 4, 2024

JonatanAntoni closed this as completed Mar 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inconsistency of CMSIS-NN Quantization Method(Q-format) with ARM Documentation #116

Inconsistency of CMSIS-NN Quantization Method(Q-format) with ARM Documentation #116

42bPhD commented Mar 4, 2024

JonatanAntoni commented Mar 4, 2024

42bPhD commented Mar 4, 2024

Inconsistency of CMSIS-NN Quantization Method(Q-format) with ARM Documentation #116

Inconsistency of CMSIS-NN Quantization Method(Q-format) with ARM Documentation #116

Comments

42bPhD commented Mar 4, 2024

JonatanAntoni commented Mar 4, 2024

42bPhD commented Mar 4, 2024