Skip to content

Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q3 2024
#5805 opened Jun 25, 2024 by simon-mo
Open 41
vLLM's V2 Engine Architecture
#8779 opened Sep 24, 2024 by simon-mo
Open 5
Hardware Backend Deprecation Policy
#8932 opened Sep 29, 2024 by youkaichao
Open 2
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Bug]: vllm serve --config.yaml - Order of arguments matters? bug Something isn't working
#8947 opened Sep 29, 2024 by FloWsnr
1 task done
[Feature]: Qwen2.5 bitsandbytes support feature request
#8941 opened Sep 29, 2024 by hanan9m
1 task done
[New Model]: Molmo support new model Requests to new models
#8940 opened Sep 29, 2024 by win4r
1 task done
[Usage]: caching with different batches usage How to use vllm
#8939 opened Sep 29, 2024 by KevinZeng08
1 task done
[Bug]: Error when using tensor_parallel in v0.6.1.post1 or 0.6.2 bug Something isn't working
#8937 opened Sep 29, 2024 by ruleGreen
1 task done
Hardware Backend Deprecation Policy misc
#8932 opened Sep 29, 2024 by youkaichao
1 task done
[Performance] TTFT regression from v0.5.4 to 0.6.2 performance Performance-related issues
#8918 opened Sep 27, 2024 by rickyyx
1 task done
[Usage]: LLM with tensor_parallel_size larger than n. gpus in one node usage How to use vllm
#8908 opened Sep 27, 2024 by gpucce
1 task done
[Usage]: guided_regex in offline model usage How to use vllm
#8907 opened Sep 27, 2024 by RonanKMcGovern
1 task done
[Bug]: Tokenization Mismatch Between HuggingFace and vLLM bug Something isn't working
#8904 opened Sep 27, 2024 by rafapi
1 task done
[Performance]: Talk about the model parallelism performance Performance-related issues
#8898 opened Sep 27, 2024 by baifanxxx
1 task done
[Bug]: Variance Between Mutiple Prefix Cache Example runs bug Something isn't working
#8890 opened Sep 27, 2024 by Imss27
1 task done
[Bug]: assert len(self._async_stopped) == 0 bug Something isn't working
#8881 opened Sep 27, 2024 by sfc-gh-zhwang
1 task done
[Bug]: Server - aqlm fails with --cpu-offload-gb bug Something isn't working
#8873 opened Sep 26, 2024 by JMPSequeira
1 task done
[Performance]: Slowdown compared to Gradio performance Performance-related issues
#8866 opened Sep 26, 2024 by theoren
1 task done
ProTip! Exclude everything labeled bug with -label:bug.