Pārlūkot izejas kodu

add

Signed-off-by: Roger Wang <hey@rogerw.io>
Roger Wang 1 mēnesi atpakaļ
vecāks
revīzija
1fda6f1ba7
1 mainītis faili ar 1 papildinājumiem un 1 dzēšanām
  1. 1 1
      README.md

+ 1 - 1
README.md

@@ -1028,7 +1028,7 @@ We highly recommend using vLLM for deployment and inference. All of our evaluati
 
 
 ```shell
 ```shell
 # Launch vLLM model server
 # Launch vLLM model server
-vllm serve rednote-hilab/dots.ocr --trust-remote-code --async-scheduling
+vllm serve rednote-hilab/dots.ocr --trust-remote-code --async-scheduling --gpu-memory-utilization 0.95
 
 
 # vLLM API Demo
 # vLLM API Demo
 # See demo/demo_vllm.py for details on parameter and prompt settings that help achieve the best output quality.
 # See demo/demo_vllm.py for details on parameter and prompt settings that help achieve the best output quality.