[CLI] bake run.yaml file inside docker container #122

Draft · wants to merge 5 commits into main
Conversation

@yanxi0830 (Contributor) commented Sep 26, 2024

Changes

  • Motivation: We should not need to install the llama CLI and run llama stack configure / llama stack run outside of docker containers. Downloading the docker image should be sufficient to start the Llama Stack server.

  • [RFC] Only use docker commands when running with a docker container. Proposed new developer flow for interacting with the docker image.

Developer Flow

  1. Download the docker image from Docker Hub.
docker image pull llamastack/llamastack-local-gpu
  2. Run with the built-in default config:
podman run --network host -it -p 5000:5000 -v ~/.llama:/root/.llama --gpus=all llamastack/llamastack-local-gpu --port 5000
  3. (Advanced Option 1) Run with a custom config kept outside docker:
podman run --network host -it \
  -p 5000:5000 \
  -v path/to/run.yaml:/app/run.yaml \
  -v ~/.llama:/root/.llama \
  --gpus=all \
  llamastack-d1 \
  --port 5000 \
  --config /app/run.yaml

where path/to/run.yaml is the absolute path to the run config on the host, outside the container (see the note after this list on resolving the absolute path).

  4. (Advanced Option 2) Configure with the wizard inside the docker image, then run:
podman run --network host -it --entrypoint "/bin/bash" llamastack-d0

$ (inside container) llama stack configure llamastack-build.yaml
...
Run configuration saved to d0-run.yaml
podman run --network host -it -p 5001:5001 -v ~/.llama:/root/.llama --gpus=all llamastack-d0 --port 5001 --config /app/d0-run.yaml
  5. Add templated local-gpu run.yaml and local-cpu run.yaml files for easier configuration in (3) (a rough sketch follows this list).
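
A note on step (3): the -v mount expects an absolute host path, so a relative path can be expanded with the shell (a generic shell idiom, nothing Llama Stack specific), e.g.

-v "$(pwd)/run.yaml:/app/run.yaml"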
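
To illustrate step (5), here is a rough sketch of the kind of settings a templated local-gpu run.yaml could carry, mirroring the wizard prompts in the build transcript below (the key names here are illustrative, not necessarily the exact template schema):

# illustrative sketch only - the shipped template's keys may differ
apis_to_serve:
- inference
- safety
- agents
- memory
- telemetry
providers:
  inference:
    provider_type: meta-reference
    config:
      model: Llama3.1-8B-Instruct
      quantization: null
      torch_seed: null
      max_seq_len: 4096
      max_batch_size: 1
  agents:
    provider_type: meta-reference
    config:
      persistence_store:
        type: sqlite
        db_path: /root/.llama/runtime/kvstore.db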

Distribution Owner: Building the Docker Image

$ llama stack build

> Enter a name for your Llama Stack (e.g. my-local-stack): d7
> Enter the image type you want your Llama Stack to be built as (docker or conda): docker

 Llama Stack is composed of several APIs working together. Let's configure the providers (implementations) you want to use for these APIs.
> Enter provider for the inference API: (default=meta-reference): meta-reference
> Enter provider for the safety API: (default=meta-reference): meta-reference
> Enter provider for the agents API: (default=meta-reference): meta-reference
> Enter provider for the memory API: (default=meta-reference): meta-reference
> Enter provider for the telemetry API: (default=meta-reference): meta-reference
 
 > (Optional) Enter a short description for your Llama Stack:
Build spec configuration saved at /data/users/xiyan/llama-stack/tmp/configs/d7-build.yaml
Configuring API `inference`...
=== Configuring provider `meta-reference` for API inference...
Enter value for model (default: Llama3.1-8B-Instruct) (required): 
Do you want to configure quantization? (y/n): n
Enter value for torch_seed (optional): 
Enter value for max_seq_len (default: 4096) (required): 
Enter value for max_batch_size (default: 1) (required): 

Configuring API `safety`...
=== Configuring provider `meta-reference` for API safety...
Do you want to configure llama_guard_shield? (y/n): n
Do you want to configure prompt_guard_shield? (y/n): n

Configuring API `agents`...
=== Configuring provider `meta-reference` for API agents...
Enter `type` for persistence_store (options: redis, sqlite, postgres) (default: sqlite): 

Configuring SqliteKVStoreConfig:
Enter value for namespace (optional): 
Enter value for db_path (default: /home/xiyan/.llama/runtime/kvstore.db) (required): 

Configuring API `memory`...
=== Configuring provider `meta-reference` for API memory...
> Please enter the supported memory bank type your provider has for memory: vector

Configuring API `telemetry`...
=== Configuring provider `meta-reference` for API telemetry...

> YAML configuration has been written to `/data/users/xiyan/llama-stack/tmp/configs/d7-run.yaml`.
Dockerfile created successfully in /tmp/tmp.4Mfy6zpfb2/Dockerfile

FROM python:3.10-slim
WORKDIR /app
...

...
Success! You can run it with: podman run -p 8000:8000 llamastack-d7
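
For reference, the build spec written to d7-build.yaml above simply records these prompt answers; a rough sketch of what it could look like (key names are illustrative and may not match the actual schema):

# illustrative sketch only - actual key names may differ
name: d7
image_type: docker
distribution_spec:
  description: <the optional short description>
  providers:
    inference: meta-reference
    safety: meta-reference
    agents: meta-reference
    memory: meta-reference
    telemetry: meta-reference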

@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Sep 26, 2024