Llama 3.1 405B
Run 405B (fp16) on Two NodesCommand
Command
DeepSeek V3/R1
Please refer to DeepSeek documents for reference.Multi-Node Inference on SLURM
This example showcases how to serve SGLang server across multiple nodes by SLURM. Submit the following job to the SLURM cluster.Command
