We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent a78a055 commit 33cae36Copy full SHA for 33cae36
articles/gpt-oss/run-vllm.md
@@ -26,7 +26,10 @@ Both models are **MXFP4 quantized** out of the box.
26
```shell
27
uv venv --python 3.12 --seed
28
source .venv/bin/activate
29
-uv pip install vllm --torch-backend=auto
+uv pip install --pre vllm==0.10.1+gptoss \
30
+ --extra-index-url https://wheels.vllm.ai/gpt-oss/ \
31
+ --extra-index-url https://download.pytorch.org/whl/nightly/cu128 \
32
+ --index-strategy unsafe-best-match
33
```
34
35
2. **Start up a server and download the model**
0 commit comments