Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue converting mpt-7b-instruct to caikit format #182

Open
dagrayvid opened this issue Sep 7, 2023 · 1 comment
Open

Issue converting mpt-7b-instruct to caikit format #182

dagrayvid opened this issue Sep 7, 2023 · 1 comment
Labels
bug Something isn't working

Comments

@dagrayvid
Copy link

Describe the bug

Unable to load model mpt-7b-instruct from the files generated during converting/bootstrapping to caikit-format.

Platform

Using caikit-nlp pip installed from latest version in opendatahub-io/caikit-nlp

git clone https://github.com/opendatahub-io/caikit-nlp  && \
python3 -m pip install ./caikit-nlp/

Sample Code

Conversion script: https://github.com/opendatahub-io/caikit-tgis-serving/blob/main/utils/convert.py

Expected behavior

Should be able to load model from the output files from the convert.py script (config.yml and artifacts/). This seems to work with other models such as flan-t5-large and gpt-neox-20b.

Observed behavior

During model loading I see this error:

Shard 0: OSError: /mnt/models/mpt-7b-instruct/artifacts does not appear to have a file named configuration_mpt.py. Checkout 'https://huggingface.co//mnt/models/mpt-7b-instruct/artifacts/None' for available files.

Additional context

I tried copying all .py files from the huggingface repo for this model into the artifacts/ dir, but model loading failed with different errors related to missing triton modules: ModuleNotFoundError: No module named 'triton_pre_mlir'

I was able to load the model by just replacing the artifacts/ directory with the full huggingface repo contents.
/cc @gkumbhat

@chakrn chakrn added the bug Something isn't working label Sep 20, 2023
@chakrn
Copy link
Collaborator

chakrn commented Nov 13, 2023

@dagrayvid does this still reproduce on latest tuning code?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: ToDo
Development

No branches or pull requests

2 participants