fix model name
Signed-off-by: Dan Sun <[email protected]>
yuzisun committed Dec 25, 2024
1 parent 2bc7f62 commit 6531ada
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions docs/modelserving/storage/modelcache/localmodel.md
@@ -121,7 +121,7 @@ After `LocalModelCache` is created, KServe creates the download jobs on each nod
```bash
kubectl get jobs meta-llama3-8b-instruct-kind-worker -n kserve-localmodel-jobs
NAME STATUS COMPLETIONS DURATION AGE
-meta-llama3-8b-instruct-gptq-kind-worker Complete 1/1 4m21s 5d17h
+meta-llama3-8b-instruct-kind-worker Complete 1/1 4m21s 5d17h
```

The download job is created using the provisioned PV/PVC.
@@ -142,7 +142,7 @@ kubectl get localmodelcache meta-llama3-8b-instruct -oyaml
apiVersion: serving.kserve.io/v1alpha1
kind: LocalModelCache
metadata:
-  name: meta-llama3-8b-instruct-gptq
+  name: meta-llama3-8b-instruct
spec:
  modelSize: 10Gi
  nodeGroup: workers
@@ -209,4 +209,4 @@ The model cache is currently disabled by default. To enable, you need to modify
memory: 24Gi
nvidia.com/gpu: "1"
EOF
-```
+```
