Skip to content

Commit

Permalink
[matmul] fix single core design (#1721)
Browse files Browse the repository at this point in the history
Co-authored-by: Joseph Melber <[email protected]>
  • Loading branch information
andrej and jgmelber committed Sep 10, 2024
1 parent 3903d97 commit 89d1544
Show file tree
Hide file tree
Showing 2 changed files with 14 additions and 0 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -297,6 +297,9 @@ def sequence(A, B, C):
)
bd_id_base = 8 * pingpong
num_tile_rows = min([rows_per_block // 2, M_div_m - row_base])
if num_tile_rows <= 0:
# At the very last iteration, we may not need a 'pong' iteration
break
npu_dma_memcpy_nd(
metadata="outC",
bd_id=bd_id_base,
Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
// (c) Copyright 2024 Advanced Micro Devices, Inc.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//
// REQUIRES: ryzen_ai, chess
//
// RUN: mkdir -p %S/test_2
// RUN: cd %S/test_2
// RUN: make -f %S/Makefile clean
// RUN: env M=768 K=512 N=512 m=64 k=64 n=64 dtype_in=i16 dtype_out=i16 make -f %S/Makefile
// RUN: %run_on_npu env M=768 K=512 N=512 m=64 k=64 n=64 dtype_in=i16 dtype_out=i16 make -f %S/Makefile run | FileCheck %s
// CHECK: PASS!

0 comments on commit 89d1544

Please sign in to comment.