Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sync release to main for rhoai-2.14 #172

Open
wants to merge 420 commits into
base: release
Choose a base branch
from
This pull request is big! We’re only showing the most recent 250 commits.

Commits on Sep 11, 2024

  1. Configuration menu
    Copy the full SHA
    775f00f View commit details
    Browse the repository at this point in the history
  2. Pixtral (vllm-project#8377)

    Co-authored-by: Roger Wang <[email protected]>
    patrickvonplaten and ywang96 committed Sep 11, 2024
    Configuration menu
    Copy the full SHA
    d394787 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    3fd2b0d View commit details
    Browse the repository at this point in the history

Commits on Sep 12, 2024

  1. Configuration menu
    Copy the full SHA
    a65cb16 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f842a7a View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    b71c956 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    b6c75e1 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    5a60699 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    1bf2dd9 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    295c473 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    42ffba1 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    7de49aa View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    520ca38 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    e56bf27 View commit details
    Browse the repository at this point in the history
  12. [Model] Support multiple images for qwen-vl (vllm-project#8247)

    Signed-off-by: Alex-Brooks <[email protected]>
    Co-authored-by: Cyrus Leung <[email protected]>
    Co-authored-by: DarkLight1337 <[email protected]>
    3 people committed Sep 12, 2024
    Configuration menu
    Copy the full SHA
    c6202da View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    8a23e93 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    1f0c75a View commit details
    Browse the repository at this point in the history
  15. [Bugfix] Offline mode fix (vllm-project#8376)

    Signed-off-by: Joe Runde <[email protected]>
    joerunde committed Sep 12, 2024
    Configuration menu
    Copy the full SHA
    f2e263b View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    a6c0f36 View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    551ce01 View commit details
    Browse the repository at this point in the history
  18. Configuration menu
    Copy the full SHA
    0198772 View commit details
    Browse the repository at this point in the history
  19. Configuration menu
    Copy the full SHA
    c163694 View commit details
    Browse the repository at this point in the history
  20. Configuration menu
    Copy the full SHA
    b61bd98 View commit details
    Browse the repository at this point in the history
  21. Configuration menu
    Copy the full SHA
    d31174a View commit details
    Browse the repository at this point in the history
  22. Configuration menu
    Copy the full SHA
    a480939 View commit details
    Browse the repository at this point in the history

Commits on Sep 13, 2024

  1. Configuration menu
    Copy the full SHA
    360ddbd View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    8f44a92 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    5ec9c0f View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    40c3965 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    3f79bc3 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    8427550 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    6821020 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    ba77527 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    acda0b3 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    9b4a3b2 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    cab69a1 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    06311e2 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    a246912 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    ecd7a1d View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    0a4806f View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    a84e598 View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    f57092c View commit details
    Browse the repository at this point in the history
  18. Configuration menu
    Copy the full SHA
    18e9e1f View commit details
    Browse the repository at this point in the history
  19. Configuration menu
    Copy the full SHA
    9ba0817 View commit details
    Browse the repository at this point in the history
  20. Configuration menu
    Copy the full SHA
    8517252 View commit details
    Browse the repository at this point in the history

Commits on Sep 14, 2024

  1. Configuration menu
    Copy the full SHA
    1ef0d2e View commit details
    Browse the repository at this point in the history
  2. [Model] support minicpm3 (vllm-project#8297)

    Co-authored-by: DarkLight1337 <[email protected]>
    SUDA-HLT-ywfang and DarkLight1337 committed Sep 14, 2024
    Configuration menu
    Copy the full SHA
    8a0cf1d View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    a36e070 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    47790f3 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    50e9ec4 View commit details
    Browse the repository at this point in the history

Commits on Sep 15, 2024

  1. Configuration menu
    Copy the full SHA
    3724d5f View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    fc990f9 View commit details
    Browse the repository at this point in the history

Commits on Sep 16, 2024

  1. Configuration menu
    Copy the full SHA
    a091e2d View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    837c196 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    acd5511 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    781e3b9 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    5d73ae4 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    2759a43 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    47f5e03 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    5478c4b View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    5ce45eb View commit details
    Browse the repository at this point in the history

Commits on Sep 17, 2024

  1. [Bugfix] Fix 3.12 builds on main (vllm-project#8510)

    Signed-off-by: Joe Runde <[email protected]>
    joerunde committed Sep 17, 2024
    Configuration menu
    Copy the full SHA
    cca6164 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    546034b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    1c1bb38 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    ee2bcea View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    99aa4ed View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    cbdb252 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    1b6de83 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    1009e93 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    9855b99 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    a54ed80 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    56c3de0 View commit details
    Browse the repository at this point in the history
  12. [Bugfix] Fix TP > 1 for new granite (vllm-project#8544)

    Signed-off-by: Joe Runde <[email protected]>
    joerunde committed Sep 17, 2024
    Configuration menu
    Copy the full SHA
    98f9713 View commit details
    Browse the repository at this point in the history
  13. [doc] improve installation doc (vllm-project#8550)

    Co-authored-by: Andy Dai <[email protected]>
    youkaichao and Imss27 committed Sep 17, 2024
    Configuration menu
    Copy the full SHA
    fa0c114 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    09deb47 View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    8110e44 View commit details
    Browse the repository at this point in the history

Commits on Sep 18, 2024

  1. Configuration menu
    Copy the full SHA
    95965d3 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e351572 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    6ffa3f3 View commit details
    Browse the repository at this point in the history
  4. [CI/Build] Update Ruff version (vllm-project#8469)

    Signed-off-by: Aaron Pham <[email protected]>
    Co-authored-by: Cyrus Leung <[email protected]>
    aarnphm and DarkLight1337 committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    9d104b5 View commit details
    Browse the repository at this point in the history
  5. [Core][Bugfix][Perf] Introduce MQLLMEngine to avoid asyncio OH (v…

    …llm-project#8157)
    
    Co-authored-by: Nick Hill <[email protected]>
    Co-authored-by: [email protected] <[email protected]>
    Co-authored-by: Robert Shaw <[email protected]>
    Co-authored-by: Simon Mo <[email protected]>
    5 people committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    7c7714d View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    a8c1d16 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    d65798f View commit details
    Browse the repository at this point in the history
  8. [Model] Support Solar Model (vllm-project#8386)

    Co-authored-by: Michael Goin <[email protected]>
    shing100 and mgoin committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    e18749f View commit details
    Browse the repository at this point in the history
  9. [AMD][ROCm]Quantization methods on ROCm; Fix _scaled_mm call (vllm-pr…

    …oject#8380)
    
    Co-authored-by: Alexei-V-Ivanov-AMD <[email protected]>
    Co-authored-by: Michael Goin <[email protected]>
    3 people committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    b3195bc View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    db9120c View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    d9cd78e View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    0d47bf3 View commit details
    Browse the repository at this point in the history

Commits on Sep 19, 2024

  1. Configuration menu
    Copy the full SHA
    4c34ce8 View commit details
    Browse the repository at this point in the history
  2. [Bugfix] [Encoder-Decoder] Bugfix for encoder specific metadata const…

    …ruction during decode of encoder-decoder models. (vllm-project#8545)
    sroy745 committed Sep 19, 2024
    Configuration menu
    Copy the full SHA
    3118f63 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    02c9afa View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    c52ec5f View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    855c8ae View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    76515f3 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    9cc373f View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    e42c634 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    ea4647b View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    9e99407 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    6cb748e View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    de6f90a View commit details
    Browse the repository at this point in the history

Commits on Sep 20, 2024

  1. Configuration menu
    Copy the full SHA
    18ae428 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9e5ec35 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    260d40b View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    3b63de9 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    2940afa View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    b28298f View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    035fa89 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    2874bac View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    b4e4eda View commit details
    Browse the repository at this point in the history
  10. [Doc] neuron documentation update (vllm-project#8671)

    Signed-off-by: omrishiv <[email protected]>
    omrishiv committed Sep 20, 2024
    Configuration menu
    Copy the full SHA
    7c8566a View commit details
    Browse the repository at this point in the history
  11. [Hardware][AWS] update neuron to 2.20 (vllm-project#8676)

    Signed-off-by: omrishiv <[email protected]>
    omrishiv committed Sep 20, 2024
    Configuration menu
    Copy the full SHA
    7f9c890 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    0f961b3 View commit details
    Browse the repository at this point in the history

Commits on Sep 21, 2024

  1. Configuration menu
    Copy the full SHA
    0057894 View commit details
    Browse the repository at this point in the history
  2. [MISC] add support custom_op check (vllm-project#8557)

    Co-authored-by: youkaichao <[email protected]>
    jikunshang and youkaichao committed Sep 21, 2024
    Configuration menu
    Copy the full SHA
    d4bf085 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    0455c46 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    0faab90 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    71c6049 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    5e85f4f View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    4dfdf43 View commit details
    Browse the repository at this point in the history
  8. [Kernel][Triton][AMD] Remove tl.atomic_add from awq_gemm_kernel, 2-5x…

    … speedup MI300, minor improvement for MI250 (vllm-project#8646)
    rasmith committed Sep 21, 2024
    Configuration menu
    Copy the full SHA
    ec4aaad View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    9dc7c6c View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    d66ac62 View commit details
    Browse the repository at this point in the history

Commits on Sep 22, 2024

  1. Configuration menu
    Copy the full SHA
    13d88d4 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0e40ac9 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    06ed281 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    8ca5051 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    ca2b628 View commit details
    Browse the repository at this point in the history
  6. [Model][VLM] Add LLaVA-Onevision model support (vllm-project#8486)

    Co-authored-by: litianjian <[email protected]>
    Co-authored-by: Cyrus Leung <[email protected]>
    Co-authored-by: Roger Wang <[email protected]>
    Co-authored-by: DarkLight1337 <[email protected]>
    5 people committed Sep 22, 2024
    Configuration menu
    Copy the full SHA
    5b59532 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    c6bd70d View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    d4a2ac8 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    92ba7e7 View commit details
    Browse the repository at this point in the history

Commits on Sep 23, 2024

  1. Configuration menu
    Copy the full SHA
    3dda7c2 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    57a0702 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    d23679e View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    9b8c8ba View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    e551ca1 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    3e83c12 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    a79e522 View commit details
    Browse the repository at this point in the history
  8. [VLM] Fix paligemma, fuyu and persimmon with transformers 4.45 : use …

    …config.text_config.vocab_size (vllm-project#8707)
    janimo committed Sep 23, 2024
    Configuration menu
    Copy the full SHA
    f2bd246 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    ee5f34b View commit details
    Browse the repository at this point in the history
  10. [Kernel] (2/N) Machete - Integrate into CompressedTensorsWNA16 and GP…

    …TQMarlin (vllm-project#7701)
    
    Co-authored-by: mgoin <[email protected]>
    Co-authored-by: Divakar Verma <[email protected]>
    Co-authored-by: Tyler Michael Smith <[email protected]>
    4 people committed Sep 23, 2024
    Configuration menu
    Copy the full SHA
    86e9c8d View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    9b0e3ec View commit details
    Browse the repository at this point in the history
  12. [Core] Allow IPv6 in VLLM_HOST_IP with zmq (vllm-project#8575)

    Signed-off-by: Russell Bryant <[email protected]>
    russellb committed Sep 23, 2024
    Configuration menu
    Copy the full SHA
    b05f5c9 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    5f7bb58 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    1a2aef3 View commit details
    Browse the repository at this point in the history

Commits on Sep 24, 2024

  1. Configuration menu
    Copy the full SHA
    530821d View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    88577ac View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    0250dd6 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    3185fb0 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    b8747e8 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    3f06bae View commit details
    Browse the repository at this point in the history
  7. [Model] Expose Phi3v num_crops as a mm_processor_kwarg (vllm-project#…

    …8658)
    
    Signed-off-by: Alex-Brooks <[email protected]>
    Co-authored-by: Cyrus Leung <[email protected]>
    Co-authored-by: DarkLight1337 <[email protected]>
    3 people committed Sep 24, 2024
    Configuration menu
    Copy the full SHA
    8ff7ced View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    cc4325b View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    a928ded View commit details
    Browse the repository at this point in the history
  10. [Frontend] Batch inference for llm.chat() API (vllm-project#8648)

    Co-authored-by: Cyrus Leung <[email protected]>
    Co-authored-by: Cyrus Leung <[email protected]>
    Co-authored-by: Roger Wang <[email protected]>
    Co-authored-by: Roger Wang <[email protected]>
    5 people committed Sep 24, 2024
    Configuration menu
    Copy the full SHA
    2529d09 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    72fc97a View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    2467b64 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    1e7d5c0 View commit details
    Browse the repository at this point in the history

Commits on Sep 25, 2024

  1. Configuration menu
    Copy the full SHA
    13f9f7a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    01b6f9e View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    6da1ab6 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    6e0c9d6 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    ee777d9 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    b452247 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    fc3afc2 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    e3dd069 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    c239536 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    3e073e6 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    1ac3de0 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    3368c3a View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    8fae5ed View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    1c04644 View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    300da09 View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    c6f2485 View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    0c4d2ad View commit details
    Browse the repository at this point in the history
  18. Configuration menu
    Copy the full SHA
    28e1299 View commit details
    Browse the repository at this point in the history
  19. Configuration menu
    Copy the full SHA
    64840df View commit details
    Browse the repository at this point in the history
  20. Configuration menu
    Copy the full SHA
    873edda View commit details
    Browse the repository at this point in the history
  21. Configuration menu
    Copy the full SHA
    4f1ba08 View commit details
    Browse the repository at this point in the history
  22. [Model] Add support for the multi-modal Llama 3.2 model (vllm-project…

    …#8811)
    
    Co-authored-by: simon-mo <[email protected]>
    Co-authored-by: Chang Su <[email protected]>
    Co-authored-by: Simon Mo <[email protected]>
    Co-authored-by: Roger Wang <[email protected]>
    Co-authored-by: Roger Wang <[email protected]>
    6 people committed Sep 25, 2024
    Configuration menu
    Copy the full SHA
    770ec60 View commit details
    Browse the repository at this point in the history
  23. Configuration menu
    Copy the full SHA
    e2c6e0a View commit details
    Browse the repository at this point in the history
  24. Configuration menu
    Copy the full SHA
    7193774 View commit details
    Browse the repository at this point in the history

Commits on Sep 27, 2024

  1. chore: add fork OWNERS

    z103cb authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    bfa692e View commit details
    Browse the repository at this point in the history
  2. add ubi Dockerfile

    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    b96ffe3 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    acbab07 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    d54bfce View commit details
    Browse the repository at this point in the history
  5. gha: add sync workflow

    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    8065d82 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    a5b5eb0 View commit details
    Browse the repository at this point in the history
  7. Dockerfile.ubi: remove vllm-nccl workaround

    Fixed upstream in vllm-project#5091
    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    cab1bac View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    7c65254 View commit details
    Browse the repository at this point in the history
  9. add triton CustomCacheManger

    fixes RHOAIENG-8043
    
    Co-authored-by: Chih-Chieh-Yang <[email protected]>
    Signed-off-by: Thomas Parnell <[email protected]>
    2 people authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    9f11204 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    8914a32 View commit details
    Browse the repository at this point in the history
  11. add smoke/unit tests scripts

    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    1f8b826 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    1beb801 View commit details
    Browse the repository at this point in the history
  13. Dockerfile.ubi: misc improvements

    - get rid cuda-devel stage, use cuda 12.4
    - add build flags
    - remove useless installs
    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    b102823 View commit details
    Browse the repository at this point in the history
  14. update OWNERS

    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    c5e1313 View commit details
    Browse the repository at this point in the history
  15. Dockerfile.ubi: use tensorizer (opendatahub-io#64)

    add libsodium for tensorizer encryption
    
    ---------
    
    Signed-off-by: Prashant Gupta <[email protected]>
    Co-authored-by: Daniele <[email protected]>
    prashantgupta24 and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    ff1cc50 View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    129720a View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    ee779e6 View commit details
    Browse the repository at this point in the history
  18. Configuration menu
    Copy the full SHA
    160ddb8 View commit details
    Browse the repository at this point in the history
  19. Configuration menu
    Copy the full SHA
    2478277 View commit details
    Browse the repository at this point in the history
  20. Dockerfile.ubi: get rid of --distributed-executor-backend=mp

    this is the default when `--worker-use-ray` is not provided and
    world-size > 1
    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    9dc4dd3 View commit details
    Browse the repository at this point in the history
  21. Configuration menu
    Copy the full SHA
    96b598b View commit details
    Browse the repository at this point in the history
  22. pin adapter to 2.0.0

    prashantgupta24 authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    4d6fd09 View commit details
    Browse the repository at this point in the history
  23. Configuration menu
    Copy the full SHA
    ef7738d View commit details
    Browse the repository at this point in the history
  24. Update OWNERS with IBM folks

    heyselbi authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    3972a7d View commit details
    Browse the repository at this point in the history
  25. Configuration menu
    Copy the full SHA
    9bac7b9 View commit details
    Browse the repository at this point in the history
  26. gha: remove reminder_comment

    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    97d24e4 View commit details
    Browse the repository at this point in the history
  27. Configuration menu
    Copy the full SHA
    c4aa1e3 View commit details
    Browse the repository at this point in the history
  28. fix: update setup.py to differentiate between fork and upstream

    Signed-off-by: Nathan Weinberg <[email protected]>
    nathan-weinberg authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    e856dd3 View commit details
    Browse the repository at this point in the history
  29. Configuration menu
    Copy the full SHA
    8ac5afb View commit details
    Browse the repository at this point in the history
  30. Configuration menu
    Copy the full SHA
    58cbebb View commit details
    Browse the repository at this point in the history
  31. Configuration menu
    Copy the full SHA
    d5373dd View commit details
    Browse the repository at this point in the history
  32. Configuration menu
    Copy the full SHA
    013813d View commit details
    Browse the repository at this point in the history
  33. Configuration menu
    Copy the full SHA
    53f9489 View commit details
    Browse the repository at this point in the history
  34. Configuration menu
    Copy the full SHA
    0a20a57 View commit details
    Browse the repository at this point in the history
  35. Configuration menu
    Copy the full SHA
    d67d8f7 View commit details
    Browse the repository at this point in the history
  36. Configuration menu
    Copy the full SHA
    dfe980d View commit details
    Browse the repository at this point in the history
  37. Configuration menu
    Copy the full SHA
    c002b3f View commit details
    Browse the repository at this point in the history
  38. Configuration menu
    Copy the full SHA
    857e618 View commit details
    Browse the repository at this point in the history
  39. Configuration menu
    Copy the full SHA
    75adb8a View commit details
    Browse the repository at this point in the history
  40. feat: allow long max seq length

    Signed-off-by: Travis Johnson <[email protected]>
    tjohnson31415 authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    ce5c1bb View commit details
    Browse the repository at this point in the history
  41. Configuration menu
    Copy the full SHA
    94625bd View commit details
    Browse the repository at this point in the history
  42. Configuration menu
    Copy the full SHA
    5c90a8b View commit details
    Browse the repository at this point in the history
  43. fix: enable logprobs during spec decoding by default

    Signed-off-by: Travis Johnson <[email protected]>
    tjohnson31415 authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    fe77683 View commit details
    Browse the repository at this point in the history
  44. Configuration menu
    Copy the full SHA
    3c5d24c View commit details
    Browse the repository at this point in the history
  45. Disable usage tracking

    This turns off tracking by default. If someone wants to, they
    can simply override this in yaml.
    stevegrubb authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    a24dfae View commit details
    Browse the repository at this point in the history
  46. Start by updating the image

    A review showed that nowhere in the Dockerfile.ubi do we do a
    microdnf -y update to pickup any known CVE and bugfixes. This patch
    adds this to the build process.
    stevegrubb authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    6acc54f View commit details
    Browse the repository at this point in the history
  47. Update ROCm build for UBI

    Xaenalt authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    6d60952 View commit details
    Browse the repository at this point in the history
  48. Configuration menu
    Copy the full SHA
    d218479 View commit details
    Browse the repository at this point in the history
  49. Harden build of libsodium

    Libsodium is being built with default CFLAGS. This adds optimization on par
    with cmake release builds. It also adds security hardening flags suggested
    for RHEL 9 to protect against various issues.
    stevegrubb authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    c7872e9 View commit details
    Browse the repository at this point in the history
  50. Update Dockerfile.ubi

    Remove debug code.
    RH-steve-grubb authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    09d0994 View commit details
    Browse the repository at this point in the history
  51. Update OWNERS file

    vaibhavjainwiz authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    9041883 View commit details
    Browse the repository at this point in the history
  52. Dockerfile.rocm.ubi: cleanup

    - get rid of non-essential dependencies
    - consolidate package installs
    - do not copy wheels in final stage
    - fix ccache usage
    - use flashattention with triton backend by default:
        - clone main_perf branch
        - build rocm target
        - set up triton rocm env var
    - configure numba, outlines and triton cache directory
    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    d3f06b5 View commit details
    Browse the repository at this point in the history
  53. add vllm-tgis-adapter layer

    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    3cc6ea4 View commit details
    Browse the repository at this point in the history
  54. Configuration menu
    Copy the full SHA
    b30f9ed View commit details
    Browse the repository at this point in the history
  55. Configuration menu
    Copy the full SHA
    8238489 View commit details
    Browse the repository at this point in the history
  56. Configuration menu
    Copy the full SHA
    67080c2 View commit details
    Browse the repository at this point in the history
  57. Configuration menu
    Copy the full SHA
    010c1bd View commit details
    Browse the repository at this point in the history
  58. Configuration menu
    Copy the full SHA
    f91432f View commit details
    Browse the repository at this point in the history
  59. Configuration menu
    Copy the full SHA
    b1b179f View commit details
    Browse the repository at this point in the history
  60. Dockerfile.rocm.ubi: get rid of build triton stage

    this is a torch dependency when installed from the pytorch/rocm6.1
    index: https://download.pytorch.org/whl/nightly/rocm6.1
    dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    cd4b748 View commit details
    Browse the repository at this point in the history
  61. Configuration menu
    Copy the full SHA
    38247ba View commit details
    Browse the repository at this point in the history
  62. Configuration menu
    Copy the full SHA
    451470c View commit details
    Browse the repository at this point in the history
  63. Configuration menu
    Copy the full SHA
    81a8400 View commit details
    Browse the repository at this point in the history
  64. This sets the vllm build to a Release build type, builds libsodium,

    fixes bad permissions on a logging directory, installs libsodium,
    and adds a LD_LIBRARY_PATH to fixup python bindings for 2 packages.
    stevegrubb authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    ec1f663 View commit details
    Browse the repository at this point in the history
  65. Move libsodium install

    This moves the install up a couple lines.
    stevegrubb authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    d16bf47 View commit details
    Browse the repository at this point in the history
  66. Correct logging directory permissions

    The /var/log/rocm_smi_lib/ directory was world writable. It
    is fixed now so that it is world readable. Similarly,
    the file in it's directory is also world writable, it is
    now world readable.
    stevegrubb authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    f63fbdd View commit details
    Browse the repository at this point in the history
  67. bump tgis adapter to v0.5.0

    NickLucche authored and dtrifiro committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    902985d View commit details
    Browse the repository at this point in the history
  68. Configuration menu
    Copy the full SHA
    9d9bb9c View commit details
    Browse the repository at this point in the history