昇腾环境300v pro 搭建qwen3 vl

发布时间:2026/5/23 7:40:47

昇腾环境300v pro 搭建qwen3 vl 1.启动dockerdocker run -itd \--name qwen-vl-serve \--nethost \--device/dev/davinci0 \--device/dev/davinci_manager \--device/dev/devmm_svm \--device/dev/hisi_hdc \-v /home/zhouty/Qwen3-VL-8B-Instruct:/workspace/models \-v /usr/local/Ascend/driver:/usr/local/Ascend/driver \quay.io/ascend/vllm-ascend:v0.18.0-310p-openeuler \/bin/bash2.启动服务export TORCH_COMPILE_DISABLE1export VLLM_USE_V10export VLLM_ASCEND_DISABLE_DYNAMIC_QUANT1vllm serve /workspace/models \--dtype float16 \--host 0.0.0.0 \--port 8000 \--tensor-parallel-size 1 \--trust-remote-code \--max-model-len 81923.双卡的启动docker run -itd \--name qwen-vl-serve \--nethost \--device/dev/davinci0 \--device/dev/davinci2 \--device/dev/davinci_manager \--device/dev/devmm_svm \--device/dev/hisi_hdc \-e ASCEND_RT_VISIBLE_DEVICES0,1 \-v /home/zhouty/Qwen3-VL-8B-Instruct:/workspace/models \-v /usr/local/Ascend/driver:/usr/local/Ascend/driver \quay.io/ascend/vllm-ascend:v0.18.0-310p-openeuler \/bin/bashexport TORCH_COMPILE_DISABLE1export VLLM_USE_V10export VLLM_ASCEND_DISABLE_DYNAMIC_QUANT1vllm serve /workspace/models \--dtype float16 \--host 0.0.0.0 \--port 8000 \--tensor-parallel-size 1 \--trust-remote-code \--max-model-len 8192

相关新闻