Skip to content

Commit

Permalink
Documentation to rely on correct URLs
Browse files Browse the repository at this point in the history
  • Loading branch information
hfp committed Sep 19, 2023
1 parent 05d4004 commit 578bf0e
Show file tree
Hide file tree
Showing 2 changed files with 3 additions and 3 deletions.
4 changes: 2 additions & 2 deletions docs/guide/2-user-guide/4-gpu/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@ Users interested to tune kernels for the CUDA/HIP backend and LIBSMM_ACC, can ta

# OpenCL Backend

This section shows how to auto-tune a kernel for the OpenCL based LIBSMM library. The process builds a stand-alone driver program which is then driven by an [OpenTuner](https://opentuner.org/) based script guiding the auto-tuning of the desired kernel. The [Developer Guide](../../3-developer-guide/3-programming/2-accelerator-backend/4-opencl-libsmm.html) provides more information, e.g., about constraining execution time or parallelizing the tuning-process as well as how to select and tune an entire set of kernels.
This section shows how to auto-tune a kernel for the OpenCL based LIBSMM library. The process builds a stand-alone driver program which is then driven by an [OpenTuner](https://opentuner.org/) based script guiding the auto-tuning of the desired kernel. The [Developer Guide](../../3-developer-guide/3-programming/2-accelerator-backend/3-libsmm_ocl/1-autotune.html) provides more information, e.g., about constraining execution time or parallelizing the tuning-process as well as how to select and tune an entire set of kernels.

For simplicity, the GNU Compiler is used to build the afore mentioned driver program, both DBCSR and LIBXSMM are Git-cloned into the same common directory, e.g., the user's `HOME` directory, and the driver is built for tuning double-precision kernels (DP).

Expand All @@ -37,7 +37,7 @@ cd ${HOME}/dbcsr/src/acc/opencl/smm
./tune_multiply.py 23x23x23
```

Beside of interactive termination, above process would also terminate based on OpenTuner's default or can be constrained by the number of steps (experiments), time to be spent, or a combination of both. Details can be found in the [Developer Guide](../../3-developer-guide/3-programming/2-accelerator-backend/4-opencl-libsmm.html).
Beside of interactive termination, above process would also terminate based on OpenTuner's default or can be constrained by the number of steps (experiments), time to be spent, or a combination of both. Details can be found in the [Developer Guide](../../3-developer-guide/3-programming/2-accelerator-backend/2-libsmm_acc/3-tune.html).

Suppose the 23x23x23-kernel was tuned for some time (e.g., 5-10 minutes), tuned parameters can be incorporated into the backend. The aggregated parameters (`tune_multiply.csv`) are automatically embedded when rebuilding the library and driver.

Expand Down
2 changes: 1 addition & 1 deletion src/acc/opencl/smm/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -29,4 +29,4 @@ The full list of tunable parameters and some explanation can be received with `s

# Auto Tuning

To tune and optimize a kernel and generating kernel parameters, please refer to the [Auto Tuning](README-autotune.md) guide. To update or retune an entire set of kernels (optimized parameters), please refer to the [Bulk Tuning](README-bulktune.md) guide.
To tune and optimize a kernel and generating kernel parameters, please refer to the [Auto Tuning](https://cp2k.github.io/dbcsr/develop/page/3-developer-guide/3-programming/2-accelerator-backend/3-libsmm_ocl/1-autotune.html) guide. To update or retune an entire set of kernels (optimized parameters), please refer to the [Bulk Tuning](https://cp2k.github.io/dbcsr/develop/page/3-developer-guide/3-programming/2-accelerator-backend/3-libsmm_ocl/2-bulktune.html) guide.

0 comments on commit 578bf0e

Please sign in to comment.