| Nemotron3 Nano 4B | NVIDIA Nemotron | — | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | ✓ | — | Nemotron3 Nano 4B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| FunctionGemma | Google Gemma3 | — | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | ✓ | — | FunctionGemma Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Cosmos Reason 1 7B | NVIDIA Cosmos Reason | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | — | Cosmos Reason 1 7B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Gemma 3 270M | Google Gemma3 | — | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Gemma 3 270M Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Gemma 4 E2B New | Google Gemma4 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | ✓ | — | Gemma 4 E2B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| GPT OSS 20B | OpenAI GPT OSS | — | ✓ | ✓ | ✓ | — | — | ✓ | — | — | — | GPT OSS 20B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Llama 3.2 3B | Meta Llama 3 | — | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Llama 3.2 3B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Ministral 3 3B Instruct | Mistral AI Ministral 3 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Ministral 3 3B Instruct Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Nemotron3 Nano 30B-A3B | NVIDIA Nemotron | — | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Nemotron3 Nano 30B-A3B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3 4B | Alibaba Qwen3 | — | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | — | Qwen3 4B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3.5 35B-A3B (MoE) | Alibaba Qwen3.5 | — | ✓ | ✓ | ✓ | — | — | ✓ | — | — | — | Qwen3.5 35B-A3B (MoE) Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3.6 35B-A3B (MoE) New | Alibaba Qwen3.6 | — | ✓ | ✓ | ✓ | — | — | ✓ | — | — | — | Qwen3.6 35B-A3B (MoE) Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Gemma 3 1B | Google Gemma3 | — | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Gemma 3 1B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Gemma 4 E4B | Google Gemma4 | VLM | ✓ | ✓ | ✓ | ✓ | — | ✓ | — | ✓ | — | Gemma 4 E4B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| GPT OSS 120B | OpenAI GPT OSS | — | ✓ | ✓ | — | — | — | ✓ | — | — | — | GPT OSS 120B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Llama 3.1 8B | Meta Llama 3 | — | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Llama 3.1 8B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Ministral 3 8B Instruct | Mistral AI Ministral 3 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Ministral 3 8B Instruct Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Nemotron Nano 9B v2 | NVIDIA Nemotron | — | ✓ | ✓ | — | — | — | ✓ | — | — | — | Nemotron Nano 9B v2 Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3.5 27B | Alibaba Qwen3.5 | — | ✓ | ✓ | ✓ | — | — | ✓ | — | — | — | Qwen3.5 27B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3.6 27B New | Alibaba Qwen3.6 | — | ✓ | ✓ | — | — | — | ✓ | — | — | — | Qwen3.6 27B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3 8B | Alibaba Qwen3 | — | ✓ | ✓ | ✓ | ✓ | — | ✓ | — | — | — | Qwen3 8B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Gemma 3 4B | Google Gemma3 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Gemma 3 4B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Gemma 4 26B-A4B New | Google Gemma4 | VLM | ✓ | ✓ | ✓ | — | — | ✓ | — | ✓ | — | Gemma 4 26B-A4B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Cosmos Reason 2 2B | NVIDIA Cosmos Reason | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | ✓ | — | |
| Llama 3.1 70B | Meta Llama 3 | — | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Llama 3.1 70B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Ministral 3 14B Instruct | Mistral AI Ministral 3 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Ministral 3 14B Instruct Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Nemotron Nano 12B VL | NVIDIA Nemotron | VLM | ✓ | ✓ | — | — | — | ✓ | — | — | — | Nemotron Nano 12B VL Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3 30B-A3B (MoE) | Alibaba Qwen3 | — | ✓ | ✓ | ✓ | — | — | ✓ | — | — | — | Qwen3 30B-A3B (MoE) Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3.5 9B | Alibaba Qwen3.5 | VLM | ✓ | ✓ | ✓ | ✓ | — | ✓ | — | — | — | Qwen3.5 9B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Gemma 3 12B | Google Gemma3 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Gemma 3 12B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Cosmos Reason 2 8B | NVIDIA Cosmos Reason | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | ✓ | — | |
| Gemma 4 31B | Google Gemma4 | VLM | ✓ | ✓ | ✓ | — | — | ✓ | — | ✓ | — | Gemma 4 31B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Ministral 3 3B Reasoning | Mistral AI Ministral 3 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Ministral 3 3B Reasoning Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3 32B | Alibaba Qwen3 | — | ✓ | ✓ | — | — | — | ✓ | — | — | — | Qwen3 32B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3.5 4B | Alibaba Qwen3.5 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | — | Qwen3.5 4B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Gemma 3 27B | Google Gemma3 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Gemma 3 27B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Ministral 3 8B Reasoning | Mistral AI Ministral 3 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Ministral 3 8B Reasoning Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3.5 0.8B | Alibaba Qwen3.5 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | — | Qwen3.5 0.8B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3 VL 4B | Alibaba Qwen3 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | — | Qwen3 VL 4B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Ministral 3 14B Reasoning | Mistral AI Ministral 3 | VLM | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | — | — | Ministral 3 14B Reasoning Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|
| Qwen3 VL 8B | Alibaba Qwen3 | VLM | ✓ | ✓ | ✓ | ✓ | — | ✓ | — | — | — | Qwen3 VL 8B Quick Start Runner Inference Engine Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Details
|