[ blas/bugfix ] Fix irrelevant function call
- Since the current function implementations do not take CBLAS parameters, call the function from cblas.h (cblas_sgemm) directly.

**Self evaluation:**
1. Build test:	 [X]Passed [ ]Failed [ ]Skipped
2. Run test:	 [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: skykongkong8 <[email protected]>
skykongkong8 authored and myungjoo committed Aug 26, 2024
1 parent a8993a2 commit f373d88
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion nntrainer/tensor/blas_interface.cpp
@@ -343,7 +343,8 @@ static void sgemm_FP16(const unsigned int TStorageOrder, bool TransA,
  scopy(M * K, A, 1, A_, 1);
  scopy(N * K, B, 1, B_, 1);
  scopy(M * N, C, 1, C_, 1);
- sgemm(order, transA, transB, M, N, K, alpha, A_, lda, B_, ldb, beta, C_, ldc);
+ cblas_sgemm(order, transA, transB, M, N, K, alpha, A_, lda, B_, ldb, beta, C_,
+             ldc);
  scopy(M * N, C_, 1, C, 1);

  delete[] A_;