Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Int4 Depthwise performance improvement #117

Merged

Conversation

ArmRyan
Copy link
Collaborator

@ArmRyan ArmRyan commented Mar 4, 2024

  • Fix unit test generation for depthwise
  • Add new unit tests for arm_depthwise_conv_s4_generic
  • Improved performance for arm_depthwise_conv_s4_generic
  • Fix buffer allocation for arm_depthwise_conv_s4_generic unit tests

Change-Id: I87543d055e936481f406f1d0872debcf87efdbdd

 * Fix unit test generation for depthwise
 * Add new unit tests for arm_depthwise_conv_s4_generic
 * Improved performance for arm_depthwise_conv_s4_generic
 * Fix buffer allocation for arm_depthwise_conv_s4_generic unit tests

Change-Id: I87543d055e936481f406f1d0872debcf87efdbdd
Signed-off-by: Ryan O'Shea <[email protected]>
@ArmRyan ArmRyan merged commit 6cc31fb into ARM-software:main Mar 5, 2024
1 of 2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants