Publish amalgamated llama.hpp to single-header

Actions

Publish amalgamated llama.hpp to single-header

Actions

Loading...
Loading

single-header.yml

102 workflow runs

Drop support for g++-9 Publish amalgamated llama.hpp to single-header #244: Commit c9df133 pushed by bernhardmgruber

January 10, 2024 16:38

23s develop

develop

January 10, 2024 16:38

23s

Improve layout dump when all computed fields are resolved Publish amalgamated llama.hpp to single-header #243: Commit 0e3ecc4 pushed by bernhardmgruber

January 9, 2024 18:19

26s develop

develop

January 9, 2024 18:19

26s

Add more layouts and disable others Publish amalgamated llama.hpp to single-header #242: Commit 244e632 pushed by bernhardmgruber

January 9, 2024 16:12

31s develop

develop

January 9, 2024 16:12

31s

Use 20 repetitions for LHCb HEP benchmark Publish amalgamated llama.hpp to single-header #241: Commit b570821 pushed by bernhardmgruber

January 7, 2024 17:36

25s develop

develop

January 7, 2024 17:36

25s

Print analysis time SEM and add warmup run Publish amalgamated llama.hpp to single-header #240: Commit 2c25211 pushed by bernhardmgruber

January 7, 2024 10:50

25s develop

develop

January 7, 2024 10:50

25s

Fix wrong shared memory address computation Publish amalgamated llama.hpp to single-header #239: Commit 1e2c4bc pushed by bernhardmgruber

January 5, 2024 01:49

23s develop

develop

January 5, 2024 01:49

23s

Only run GM layout variations when running update Publish amalgamated llama.hpp to single-header #238: Commit 59243ff pushed by bernhardmgruber

January 4, 2024 22:27

26s develop

develop

January 4, 2024 22:27

26s

Implement SIMD load/store between different record dimensions Publish amalgamated llama.hpp to single-header #237: Commit bd95c53 pushed by bernhardmgruber

January 3, 2024 19:26

28s develop

develop

January 3, 2024 19:26

28s

Update CUDA 12.3 download link to latest release Publish amalgamated llama.hpp to single-header #236: Commit 61848c4 pushed by bernhardmgruber

January 3, 2024 18:55

25s develop

develop

January 3, 2024 18:55

25s

Align manual AoSoA SIMD blocks to SIMD width Publish amalgamated llama.hpp to single-header #235: Commit c7014f3 pushed by bernhardmgruber

January 2, 2024 21:49

21s develop

develop

January 2, 2024 21:49

21s

Add fastpath for loading/storing SimdN<T, 1, ...> Publish amalgamated llama.hpp to single-header #234: Commit 40c07af pushed by bernhardmgruber

January 2, 2024 20:32

27s develop

develop

January 2, 2024 20:32

27s

Automatically run LLAMA SIMD versions with 1 lane Publish amalgamated llama.hpp to single-header #233: Commit e44b23e pushed by bernhardmgruber

January 2, 2024 17:29

24s develop

develop

January 2, 2024 17:29

24s

Remove dead variables Publish amalgamated llama.hpp to single-header #232: Commit 9a0dc60 pushed by bernhardmgruber

January 2, 2024 00:40

22s develop

develop

January 2, 2024 00:40

22s

Consistently multiply scalars first in SIMD code Publish amalgamated llama.hpp to single-header #231: Commit 9680e3d pushed by bernhardmgruber

December 29, 2023 19:13

28s develop

develop

December 29, 2023 19:13

28s

Rename allocViewStack to allocScalarView Publish amalgamated llama.hpp to single-header #230: Commit 6e9ea66 pushed by bernhardmgruber

December 27, 2023 09:49

24s develop

develop

December 27, 2023 09:49

24s

Fix clang-tidy warning Publish amalgamated llama.hpp to single-header #229: Commit 6825e0d pushed by bernhardmgruber

December 17, 2023 02:37

22s develop

develop

December 17, 2023 02:37

22s

Downgrade g++-13 for clang-16 CUDA CI Publish amalgamated llama.hpp to single-header #228: Commit 1ec195c pushed by bernhardmgruber

December 16, 2023 23:04

7m 48s develop

develop

December 16, 2023 23:04

7m 48s

Generalize blob copying functions Publish amalgamated llama.hpp to single-header #227: Commit 26bfa0a pushed by bernhardmgruber

December 16, 2023 22:54

1m 33s develop

develop

December 16, 2023 22:54

1m 33s

Use SoA implementation in Copy specialization directly Publish amalgamated llama.hpp to single-header #226: Commit 7375de2 pushed by bernhardmgruber

December 16, 2023 19:59

24s develop

develop

December 16, 2023 19:59

24s

Relicense all LGPL-3.0+ content to MPL-2.0 Publish amalgamated llama.hpp to single-header #225: Commit 635cdce pushed by bernhardmgruber

December 16, 2023 17:02

22s develop

develop

December 16, 2023 17:02

22s

Make PIC example a CXX project Publish amalgamated llama.hpp to single-header #224: Commit 23ba485 pushed by bernhardmgruber

December 13, 2023 15:40

7m 2s develop

develop

December 13, 2023 15:40

7m 2s

Update CUDA download URLs Publish amalgamated llama.hpp to single-header #223: Commit 97a0681 pushed by bernhardmgruber

December 13, 2023 15:37

27s develop

develop

December 13, 2023 15:37

27s

Repeat compile-time benchmark 20 times Publish amalgamated llama.hpp to single-header #222: Commit dcd643d pushed by bernhardmgruber

December 12, 2023 15:17

8m 16s develop

develop

December 12, 2023 15:17

8m 16s

Small refactoring Publish amalgamated llama.hpp to single-header #221: Commit aa30cc6 pushed by bernhardmgruber

November 23, 2023 11:08

28s develop

develop

November 23, 2023 11:08

28s

Improve benchmark plots Publish amalgamated llama.hpp to single-header #220: Commit 5a7e6e6 pushed by bernhardmgruber

November 22, 2023 16:52

8m 4s develop

develop

November 22, 2023 16:52

8m 4s

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actions

Workflows

Management