Skip to content

Actions: alpaka-group/llama

Publish amalgamated llama.hpp to single-header

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
102 workflow runs
102 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Drop support for g++-9
Publish amalgamated llama.hpp to single-header #244: Commit c9df133 pushed by bernhardmgruber
January 10, 2024 16:38 23s develop
January 10, 2024 16:38 23s
Improve layout dump when all computed fields are resolved
Publish amalgamated llama.hpp to single-header #243: Commit 0e3ecc4 pushed by bernhardmgruber
January 9, 2024 18:19 26s develop
January 9, 2024 18:19 26s
Add more layouts and disable others
Publish amalgamated llama.hpp to single-header #242: Commit 244e632 pushed by bernhardmgruber
January 9, 2024 16:12 31s develop
January 9, 2024 16:12 31s
Use 20 repetitions for LHCb HEP benchmark
Publish amalgamated llama.hpp to single-header #241: Commit b570821 pushed by bernhardmgruber
January 7, 2024 17:36 25s develop
January 7, 2024 17:36 25s
Print analysis time SEM and add warmup run
Publish amalgamated llama.hpp to single-header #240: Commit 2c25211 pushed by bernhardmgruber
January 7, 2024 10:50 25s develop
January 7, 2024 10:50 25s
Fix wrong shared memory address computation
Publish amalgamated llama.hpp to single-header #239: Commit 1e2c4bc pushed by bernhardmgruber
January 5, 2024 01:49 23s develop
January 5, 2024 01:49 23s
Only run GM layout variations when running update
Publish amalgamated llama.hpp to single-header #238: Commit 59243ff pushed by bernhardmgruber
January 4, 2024 22:27 26s develop
January 4, 2024 22:27 26s
Implement SIMD load/store between different record dimensions
Publish amalgamated llama.hpp to single-header #237: Commit bd95c53 pushed by bernhardmgruber
January 3, 2024 19:26 28s develop
January 3, 2024 19:26 28s
Update CUDA 12.3 download link to latest release
Publish amalgamated llama.hpp to single-header #236: Commit 61848c4 pushed by bernhardmgruber
January 3, 2024 18:55 25s develop
January 3, 2024 18:55 25s
Align manual AoSoA SIMD blocks to SIMD width
Publish amalgamated llama.hpp to single-header #235: Commit c7014f3 pushed by bernhardmgruber
January 2, 2024 21:49 21s develop
January 2, 2024 21:49 21s
Add fastpath for loading/storing SimdN<T, 1, ...>
Publish amalgamated llama.hpp to single-header #234: Commit 40c07af pushed by bernhardmgruber
January 2, 2024 20:32 27s develop
January 2, 2024 20:32 27s
Automatically run LLAMA SIMD versions with 1 lane
Publish amalgamated llama.hpp to single-header #233: Commit e44b23e pushed by bernhardmgruber
January 2, 2024 17:29 24s develop
January 2, 2024 17:29 24s
Remove dead variables
Publish amalgamated llama.hpp to single-header #232: Commit 9a0dc60 pushed by bernhardmgruber
January 2, 2024 00:40 22s develop
January 2, 2024 00:40 22s
Consistently multiply scalars first in SIMD code
Publish amalgamated llama.hpp to single-header #231: Commit 9680e3d pushed by bernhardmgruber
December 29, 2023 19:13 28s develop
December 29, 2023 19:13 28s
Rename allocViewStack to allocScalarView
Publish amalgamated llama.hpp to single-header #230: Commit 6e9ea66 pushed by bernhardmgruber
December 27, 2023 09:49 24s develop
December 27, 2023 09:49 24s
Fix clang-tidy warning
Publish amalgamated llama.hpp to single-header #229: Commit 6825e0d pushed by bernhardmgruber
December 17, 2023 02:37 22s develop
December 17, 2023 02:37 22s
Downgrade g++-13 for clang-16 CUDA CI
Publish amalgamated llama.hpp to single-header #228: Commit 1ec195c pushed by bernhardmgruber
December 16, 2023 23:04 7m 48s develop
December 16, 2023 23:04 7m 48s
Generalize blob copying functions
Publish amalgamated llama.hpp to single-header #227: Commit 26bfa0a pushed by bernhardmgruber
December 16, 2023 22:54 1m 33s develop
December 16, 2023 22:54 1m 33s
Use SoA implementation in Copy specialization directly
Publish amalgamated llama.hpp to single-header #226: Commit 7375de2 pushed by bernhardmgruber
December 16, 2023 19:59 24s develop
December 16, 2023 19:59 24s
Relicense all LGPL-3.0+ content to MPL-2.0
Publish amalgamated llama.hpp to single-header #225: Commit 635cdce pushed by bernhardmgruber
December 16, 2023 17:02 22s develop
December 16, 2023 17:02 22s
Make PIC example a CXX project
Publish amalgamated llama.hpp to single-header #224: Commit 23ba485 pushed by bernhardmgruber
December 13, 2023 15:40 7m 2s develop
December 13, 2023 15:40 7m 2s
Update CUDA download URLs
Publish amalgamated llama.hpp to single-header #223: Commit 97a0681 pushed by bernhardmgruber
December 13, 2023 15:37 27s develop
December 13, 2023 15:37 27s
Repeat compile-time benchmark 20 times
Publish amalgamated llama.hpp to single-header #222: Commit dcd643d pushed by bernhardmgruber
December 12, 2023 15:17 8m 16s develop
December 12, 2023 15:17 8m 16s
Small refactoring
Publish amalgamated llama.hpp to single-header #221: Commit aa30cc6 pushed by bernhardmgruber
November 23, 2023 11:08 28s develop
November 23, 2023 11:08 28s
Improve benchmark plots
Publish amalgamated llama.hpp to single-header #220: Commit 5a7e6e6 pushed by bernhardmgruber
November 22, 2023 16:52 8m 4s develop
November 22, 2023 16:52 8m 4s