The CPU image runs the 'Meta-Llama-3.1-8B-Instruct.Q4_K_M.gguf' model normally, but running the same model with the GPU image reports an rpc error. Can you help point out the cause? #3169

961815748 started this conversation in General
Replies: 0 comments
