Sync release to main for rhoai-2.14 #172
base: release
Commits on Sep 11, 2024
- 775f00f [Speculative Decoding] Test refactor (vllm-project#8317)
  Co-authored-by: youkaichao <[email protected]>
- d394787
  Co-authored-by: Roger Wang <[email protected]>
- 3fd2b0d
Commits on Sep 12, 2024
- a65cb16
- f842a7a
- b71c956
- b6c75e1
- 5a60699
- 1bf2dd9
- 295c473
- 42ffba1
- 7de49aa
- 520ca38
- e56bf27 [Bugfix] Fix InternVL2 inference with various num_patches (vllm-project#8375)
  Co-authored-by: DarkLight1337 <[email protected]>
- c6202da [Model] Support multiple images for qwen-vl (vllm-project#8247)
  Signed-off-by: Alex-Brooks <[email protected]>
  Co-authored-by: Cyrus Leung <[email protected]>
  Co-authored-by: DarkLight1337 <[email protected]>
- 8a23e93
- 1f0c75a
- f2e263b [Bugfix] Offline mode fix (vllm-project#8376)
  Signed-off-by: Joe Runde <[email protected]>
- a6c0f36
- 551ce01
- 0198772
- c163694 [Hotfix][Core][VLM] Disable chunked prefill by default and prefix caching for multimodal models (vllm-project#8425)
- b61bd98
- d31174a
- a480939
Commits on Sep 13, 2024
- 360ddbd
- 8f44a92
- 5ec9c0f
- 40c3965
- 3f79bc3
- 8427550
- 6821020
- ba77527
- acda0b3
- 9b4a3b2
- cab69a1
- 06311e2
- a246912
- ecd7a1d
- 0a4806f
- a84e598
- f57092c
- 18e9e1f
- 9ba0817
- 8517252 [Hardware][intel GPU] bump up ipex version to 2.3 (vllm-project#8365)
  Co-authored-by: Yan Ma <[email protected]>
Commits on Sep 14, 2024
- 1ef0d2e
- 8a0cf1d [Model] support minicpm3 (vllm-project#8297)
  Co-authored-by: DarkLight1337 <[email protected]>
- a36e070
- 47790f3
- 50e9ec4
Commits on Sep 15, 2024
- 3724d5f [Bugfix][Model] Fix Python 3.8 compatibility in Pixtral model by updating type annotations (vllm-project#8490)
- fc990f9
Commits on Sep 16, 2024
- a091e2d [Kernel] Enable 8-bit weights in Fused Marlin MoE (vllm-project#8032)
  Co-authored-by: Dipika <[email protected]>
- 837c196
- acd5511
- 781e3b9
- 5d73ae4
- 2759a43
- 47f5e03
- 5478c4b
- 5ce45eb
Commits on Sep 17, 2024
- cca6164 [Bugfix] Fix 3.12 builds on main (vllm-project#8510)
  Signed-off-by: Joe Runde <[email protected]>
- 546034b
- 1c1bb38 [Frontend] Improve Nullable kv Arg Parsing (vllm-project#8525)
  Signed-off-by: Alex-Brooks <[email protected]>
- ee2bcea
- 99aa4ed
- cbdb252 [Misc] Limit to ray[adag] 2.35 to avoid backward incompatible change (vllm-project#8509)
  Signed-off-by: Rui Qiao <[email protected]>
- 1b6de83 [Benchmark] Support sample from HF datasets and image input for benchmark_serving (vllm-project#8495)
- 1009e93 [Encoder decoder] Add cuda graph support during decoding for encoder-decoder models (vllm-project#7631)
- 9855b99
- a54ed80 [Model] Add mistral function calling format to all models loaded with "mistral" format (vllm-project#8515)
  Co-authored-by: Cyrus Leung <[email protected]>
- 56c3de0
- 98f9713 [Bugfix] Fix TP > 1 for new granite (vllm-project#8544)
  Signed-off-by: Joe Runde <[email protected]>
- fa0c114 [doc] improve installation doc (vllm-project#8550)
  Co-authored-by: Andy Dai <[email protected]>
- 09deb47
- 8110e44 [Kernel] Change interface to Mamba causal_conv1d_update for continuous batching (vllm-project#8012)
Commits on Sep 18, 2024
- 95965d3
- e351572
- 6ffa3f3
- 9d104b5 [CI/Build] Update Ruff version (vllm-project#8469)
  Signed-off-by: Aaron Pham <[email protected]>
  Co-authored-by: Cyrus Leung <[email protected]>
- 7c7714d [Core][Bugfix][Perf] Introduce MQLLMEngine to avoid asyncio OH (vllm-project#8157)
  Co-authored-by: Nick Hill <[email protected]>
  Co-authored-by: [email protected] <[email protected]>
  Co-authored-by: Robert Shaw <[email protected]>
  Co-authored-by: Simon Mo <[email protected]>
- a8c1d16
- d65798f [Core] zmq: bind only to 127.0.0.1 for local-only usage (vllm-project#8543)
  Signed-off-by: Russell Bryant <[email protected]>
- e18749f [Model] Support Solar Model (vllm-project#8386)
  Co-authored-by: Michael Goin <[email protected]>
- b3195bc [AMD][ROCm]Quantization methods on ROCm; Fix _scaled_mm call (vllm-project#8380)
  Co-authored-by: Alexei-V-Ivanov-AMD <[email protected]>
  Co-authored-by: Michael Goin <[email protected]>
- db9120c [Kernel] Change interface to Mamba selective_state_update for continuous batching (vllm-project#8039)
- d9cd78e
- 0d47bf3 [Bugfix] add dead_error property to engine client (vllm-project#8574)
  Signed-off-by: Joe Runde <[email protected]>
Commits on Sep 19, 2024
- 4c34ce8 [Kernel] Remove marlin moe templating on thread_m_blocks (vllm-project#8573)
  Co-authored-by: [email protected]
- 3118f63 [Bugfix] [Encoder-Decoder] Bugfix for encoder specific metadata construction during decode of encoder-decoder models. (vllm-project#8545)
- 02c9afa
- c52ec5f
- 855c8ae
- 76515f3
- 9cc373f
- e42c634
- ea4647b
- 9e99407
- 6cb748e
- de6f90a
Commits on Sep 20, 2024
- 18ae428
- 9e5ec35
- 260d40b
- 3b63de9
- 2940afa [CI/Build] Removing entrypoints/openai/test_embedding.py test from ROCm build (vllm-project#8670)
- b28298f
- 035fa89
- 2874bac
- b4e4eda
- 7c8566a [Doc] neuron documentation update (vllm-project#8671)
  Signed-off-by: omrishiv <[email protected]>
- 7f9c890 [Hardware][AWS] update neuron to 2.20 (vllm-project#8676)
  Signed-off-by: omrishiv <[email protected]>
- 0f961b3
Commits on Sep 21, 2024
- 0057894
- d4bf085 [MISC] add support custom_op check (vllm-project#8557)
  Co-authored-by: youkaichao <[email protected]>
- 0455c46
- 0faab90
- 71c6049
- 5e85f4f
- 4dfdf43
- ec4aaad [Kernel][Triton][AMD] Remove tl.atomic_add from awq_gemm_kernel, 2-5x speedup MI300, minor improvement for MI250 (vllm-project#8646)
- 9dc7c6c
- d66ac62
Commits on Sep 22, 2024
- 13d88d4
- 0e40ac9
- 06ed281
- 8ca5051 [Misc] Use NamedTuple in Multi-image example (vllm-project#8705)
  Signed-off-by: Alex-Brooks <[email protected]>
- ca2b628
- 5b59532 [Model][VLM] Add LLaVA-Onevision model support (vllm-project#8486)
  Co-authored-by: litianjian <[email protected]>
  Co-authored-by: Cyrus Leung <[email protected]>
  Co-authored-by: Roger Wang <[email protected]>
  Co-authored-by: DarkLight1337 <[email protected]>
- c6bd70d
- d4a2ac8
- 92ba7e7
Commits on Sep 23, 2024
- 3dda7c2
- 57a0702 [Bugfix] Fix CPU CMake build (vllm-project#8723)
  Co-authored-by: Yuan <[email protected]>
- d23679e
- 9b8c8ba [Core][Frontend] Support Passing Multimodal Processor Kwargs (vllm-project#8657)
  Signed-off-by: Alex-Brooks <[email protected]>
- e551ca1
- 3e83c12
- a79e522
- f2bd246 [VLM] Fix paligemma, fuyu and persimmon with transformers 4.45 : use config.text_config.vocab_size (vllm-project#8707)
- ee5f34b [CI/Build] use setuptools-scm to set __version__ (vllm-project#4738)
  Co-authored-by: youkaichao <[email protected]>
- 86e9c8d [Kernel] (2/N) Machete - Integrate into CompressedTensorsWNA16 and GPTQMarlin (vllm-project#7701)
  Co-authored-by: mgoin <[email protected]>
  Co-authored-by: Divakar Verma <[email protected]>
  Co-authored-by: Tyler Michael Smith <[email protected]>
- 9b0e3ec
- b05f5c9 [Core] Allow IPv6 in VLLM_HOST_IP with zmq (vllm-project#8575)
  Signed-off-by: Russell Bryant <[email protected]>
- 5f7bb58
- 1a2aef3 Add output streaming support to multi-step + async while ensuring RequestOutput obj reuse (vllm-project#8335)
Commits on Sep 24, 2024
- 530821d
- 88577ac
- 0250dd6 re-implement beam search on top of vllm core (vllm-project#8726)
  Co-authored-by: Brendan Wong <[email protected]>
- 3185fb0
- b8747e8
- 3f06bae
- 8ff7ced [Model] Expose Phi3v num_crops as a mm_processor_kwarg (vllm-project#8658)
  Signed-off-by: Alex-Brooks <[email protected]>
  Co-authored-by: Cyrus Leung <[email protected]>
  Co-authored-by: DarkLight1337 <[email protected]>
- cc4325b
- a928ded [Kernel] Split Marlin MoE kernels into multiple files (vllm-project#8661)
  Co-authored-by: mgoin <[email protected]>
- 2529d09 [Frontend] Batch inference for llm.chat() API (vllm-project#8648)
  Co-authored-by: Cyrus Leung <[email protected]>
  Co-authored-by: Cyrus Leung <[email protected]>
  Co-authored-by: Roger Wang <[email protected]>
  Co-authored-by: Roger Wang <[email protected]>
- 72fc97a
- 2467b64
- 1e7d5c0
Commits on Sep 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 13f9f7a - Browse repository at this point
Copy the full SHA 13f9f7aView commit details -
[Core][Bugfix] Support prompt_logprobs returned with speculative deco…
…ding (vllm-project#8047) Signed-off-by: Travis Johnson <[email protected]>
Commit 01b6f9e
Commit 6da1ab6
Commit 6e0c9d6
Commit ee777d9
Commit b452247
Commit fc3afc2
Commit e3dd069
Commit c239536
Commit 3e073e6
[Frontend] OpenAI server: propagate usage accounting to FastAPI middleware layer (vllm-project#8672)
Commit 1ac3de0
[Bugfix] Ray 2.9.x doesn't expose available_resources_per_node (vllm-project#8767)
Signed-off-by: darthhexx <[email protected]>
Commit 3368c3a
Commit 8fae5ed
Commit 1c04644
Commit 300da09
Commit c6f2485
Commit 0c4d2ad
Commit 28e1299
Commit 64840df
Commit 873edda
Commit 4f1ba08
[Model] Add support for the multi-modal Llama 3.2 model (vllm-project#8811)
Co-authored-by: simon-mo <[email protected]>
Co-authored-by: Chang Su <[email protected]>
Co-authored-by: Simon Mo <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Commit 770ec60
Commit e2c6e0a
Commit 7193774
Commits on Sep 27, 2024
Commit bfa692e
Commit b96ffe3
Commit acbab07
Commit d54bfce
Commit 8065d82
Commit a5b5eb0
Dockerfile.ubi: remove vllm-nccl workaround
Fixed upstream in vllm-project#5091
Commit cab1bac
Commit 7c65254
fixes RHOAIENG-8043
Co-authored-by: Chih-Chieh-Yang <[email protected]>
Signed-off-by: Thomas Parnell <[email protected]>
Commit 9f11204
Commit 8914a32
Commit 1f8b826
Commit 1beb801
Dockerfile.ubi: misc improvements
- get rid of cuda-devel stage, use cuda 12.4
- add build flags
- remove useless installs
Commit b102823
Commit c5e1313
Dockerfile.ubi: use tensorizer (opendatahub-io#64)
add libsodium for tensorizer encryption
Signed-off-by: Prashant Gupta <[email protected]>
Co-authored-by: Daniele <[email protected]>
Commit ff1cc50
Commit 129720a
Commit ee779e6
Commit 160ddb8
Commit 2478277
Dockerfile.ubi: get rid of --distributed-executor-backend=mp
this is the default when `--worker-use-ray` is not provided and world-size > 1
Commit 9dc4dd3
Commit 96b598b
Commit 4d6fd09
Commit ef7738d
Commit 3972a7d
Commit 9bac7b9
Commit 97d24e4
Commit c4aa1e3
fix: update setup.py to differentiate between fork and upstream
Signed-off-by: Nathan Weinberg <[email protected]>
Commit e856dd3
Commit 8ac5afb
Commit 58cbebb
Commit d5373dd
Commit 013813d
Commit 53f9489
Dockerfile.ubi: get rid of custom cache manager
fixed in vllm-project#6140
fixes https://issues.redhat.com/browse/RHOAIENG-8043
Commit 0a20a57
Commit d67d8f7
Commit dfe980d
Commit c002b3f
Commit 857e618
Commit 75adb8a
feat: allow long max seq length
Signed-off-by: Travis Johnson <[email protected]>
Commit ce5c1bb
Commit 94625bd
Commit 5c90a8b
fix: enable logprobs during spec decoding by default
Signed-off-by: Travis Johnson <[email protected]>
Commit fe77683
Commit 3c5d24c
This turns off tracking by default. Anyone who wants tracking can override this setting in YAML.
Commit a24dfae
A review showed that nowhere in Dockerfile.ubi do we run `microdnf -y update` to pick up fixes for known CVEs and bugs. This patch adds that step to the build process.
Commit 6acc54f
Commit 6d60952
Commit d218479
Libsodium is being built with default CFLAGS. This adds optimization on par with cmake release builds. It also adds security hardening flags suggested for RHEL 9 to protect against various issues.
Commit c7872e9
Remove debug code.
Commit 09d0994
Commit 9041883
- get rid of non-essential dependencies
- consolidate package installs
- do not copy wheels in final stage
- fix ccache usage
- use flashattention with triton backend by default:
  - clone main_perf branch
  - build rocm target
  - set up triton rocm env var
- configure numba, outlines and triton cache directory
Commit d3f06b5
Commit 3cc6ea4
Commit b30f9ed
Commit 8238489
Commit 67080c2
Commit 010c1bd
Commit f91432f
Commit b1b179f
Dockerfile.rocm.ubi: get rid of build triton stage
this is a torch dependency when installed from the pytorch/rocm6.1 index: https://download.pytorch.org/whl/nightly/rocm6.1
Commit cd4b748
Commit 38247ba
Commit 451470c
Commit 81a8400
This sets the vllm build to the Release build type, builds libsodium, fixes bad permissions on a logging directory, installs libsodium, and adds an LD_LIBRARY_PATH entry to fix up the Python bindings for two packages.
Commit ec1f663
This moves the install up a couple lines.
Commit d16bf47
Correct logging directory permissions
The /var/log/rocm_smi_lib/ directory was world writable. It is fixed now so that it is world readable. Similarly, the file in that directory was also world writable; it is now world readable.
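As an illustration of what a permission fix like this amounts to, here is a minimal Python sketch that clears the world-write bit on a directory and on the regular files directly inside it. The helper names `drop_world_write` and `fix_log_dir` are hypothetical, and the commit itself may well do this with a plain `chmod` in the Dockerfile; this only sketches the idea under that assumption.

```python
import os
import stat


def drop_world_write(path: str) -> int:
    """Clear the other-write bit on path; return the resulting permission bits."""
    mode = stat.S_IMODE(os.stat(path).st_mode)
    new_mode = mode & ~stat.S_IWOTH  # leave every other permission bit untouched
    os.chmod(path, new_mode)
    return new_mode


def fix_log_dir(log_dir: str) -> None:
    """Clear other-write on log_dir and on each regular file directly inside it."""
    drop_world_write(log_dir)
    for entry in os.scandir(log_dir):
        if entry.is_file(follow_symlinks=False):
            drop_world_write(entry.path)
```

Calling `fix_log_dir("/var/log/rocm_smi_lib")` would leave read and execute access for other users in place and remove only their write access.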
Commit f63fbdd
Commit 902985d
Commit 9d9bb9c