Skip to content

Popular repositories Loading

  1. triton-kernels triton-kernels Public

    Triton kernels for Flux

    Python 12

  2. minRF-ONNX minRF-ONNX Public

    Forked from cloneofsimo/minRF

    Minimal implementation of scalable rectified flow transformers, based on SD3's approach

    Jupyter Notebook 5

  3. flux flux Public

    Forked from black-forest-labs/flux

    Official inference repo for FLUX.1 models

    Python 5

  4. quanto quanto Public

    Python 1

  5. nexfort nexfort Public

    OneDiff compiler infrastructure using torch Inductor

    Python 1

  6. quant_dit_models quant_dit_models Public

    Python

Repositories

Showing 10 of 12 repositories
  • test_attn Public Forked from littsk/test_attn

    Testing and benchmarking different attention implementations and backends

    ai-compiler-study/test_attn’s past year of commit activity
    Python 0 2 0 0 Updated Oct 10, 2024
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    ai-compiler-study/flash-attention’s past year of commit activity
    Python 0 BSD-3-Clause 1,265 0 0 Updated Oct 10, 2024
  • flux Public Forked from black-forest-labs/flux

    Official inference repo for FLUX.1 models

    ai-compiler-study/flux’s past year of commit activity
    Python 5 Apache-2.0 1,073 0 0 Updated Oct 9, 2024
  • triton-kernels Public

    Triton kernels for Flux

    ai-compiler-study/triton-kernels’s past year of commit activity
    Python 12 MIT 0 0 0 Updated Oct 8, 2024
  • nexfort Public

    OneDiff compiler infrastructure using torch Inductor

    ai-compiler-study/nexfort’s past year of commit activity
    Python 1 0 0 0 Updated Sep 26, 2024
  • ai-compiler-study/kernels’s past year of commit activity
    Python 0 13 0 0 Updated Sep 9, 2024
  • xDiT Public Forked from xdit-project/xDiT

    xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

    ai-compiler-study/xDiT’s past year of commit activity
    Python 0 Apache-2.0 50 0 0 Updated Aug 30, 2024
  • quanto Public
    ai-compiler-study/quanto’s past year of commit activity
    Python 1 0 1 0 Updated Aug 14, 2024
  • minRF-ONNX Public Forked from cloneofsimo/minRF

    Minimal implementation of scalable rectified flow transformers, based on SD3's approach

    ai-compiler-study/minRF-ONNX’s past year of commit activity
    Jupyter Notebook 5 Apache-2.0 29 0 0 Updated Jul 27, 2024
  • ai-compiler-study/quant_dit_models’s past year of commit activity
    Python 0 0 0 0 Updated Jul 25, 2024

Top languages

Loading…

Most used topics

Loading…