Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add runtimeclass nvidia as a default option for nimcache #177

Open
jxdn opened this issue Oct 5, 2024 · 4 comments
Open

add runtimeclass nvidia as a default option for nimcache #177

jxdn opened this issue Oct 5, 2024 · 4 comments

Comments

@jxdn
Copy link

jxdn commented Oct 5, 2024

Hi ,

can help to add runtimeclass on the nimcache and all others crd ?

got this error

Traceback (most recent call last):
File "/usr/local/bin/download-to-cache", line 5, in
from vllm_nvext.hub.pre_download import download_to_cache
File "/usr/local/lib/python3.10/dist-packages/vllm_nvext/hub/pre_download.py", line 20, in
from vllm_nvext.hub.ngc_injector import get_optimal_manifest_config
File "/usr/local/lib/python3.10/dist-packages/vllm_nvext/hub/ngc_injector.py", line 22, in
from vllm.engine.arg_utils import AsyncEngineArgs
File "/usr/local/lib/python3.10/dist-packages/vllm/init.py", line 3, in
from vllm.engine.arg_utils import AsyncEngineArgs, EngineArgs
File "/usr/local/lib/python3.10/dist-packages/vllm/engine/arg_utils.py", line 6, in
from vllm.config import (CacheConfig, DecodingConfig, DeviceConfig,
File "/usr/local/lib/python3.10/dist-packages/vllm/config.py", line 12, in
from vllm.model_executor.layers.quantization import QUANTIZATION_METHODS
File "/usr/local/lib/python3.10/dist-packages/vllm/model_executor/layers/quantization/init.py", line 3, in
from vllm.model_executor.layers.quantization.aqlm import AQLMConfig
File "/usr/local/lib/python3.10/dist-packages/vllm/model_executor/layers/quantization/aqlm.py", line 11, in
from vllm._C import ops
ImportError: libcuda.so.1: cannot open shared object file: No such file or directory

@jxdn
Copy link
Author

jxdn commented Oct 6, 2024

Image

i add this line and rebuild the nim-operator, and it works

@jxdn jxdn changed the title add runtimeclass option for nimcache add runtimeclass nvidia as a default option for nimcache Oct 6, 2024
@jxdn
Copy link
Author

jxdn commented Oct 6, 2024

this happened also on nimservices
need to patch with
kubectl patch deployment meta-llama3-8b-instruct --type='merge' -p='{"spec": {"template": {"spec": {"runtimeClassName": "nvidia"}}}}' -n nim

@mkhaas
Copy link
Collaborator

mkhaas commented Oct 6, 2024

Thanks for the suggestion. We'll add it to our backlog. In the meantime, recommend adding webhooks to add runtimeclass.

@kirson-git
Copy link

Can i use the patch command for NIMCACHE ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants