Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.sglang.io/llms.txt

Use this file to discover all available pages before exploring further.

This guide explains the parallelism configuration fields used in SGLang model configurations and how they map to SGLang server command-line arguments.

Quick Reference

Config FieldSGLang CLI ArgumentDescription
tp--tp-size, --tensor-parallel-sizeTensor Parallelism - splits model across GPUs
dp--dp-size, --data-parallel-sizeData Parallelism - runs multiple model replicas
ep--ep-size, --expert-parallel-size, --epExpert Parallelism - distributes MoE experts
enable_dp_attention--enable-dp-attentionDP for attention, TP for FFN (hybrid)