AI Engineer World's Fair, June 3, 2025 | Yineng Zhang and Philip Kiely
Welcome to the SGLang workshop at AI Engineer World's Fair! We are very excited to have you here and to spend some time this morning talking about model performance optimization with SGLang.
In this hands-on workshop, you'll have the ability to deploy models with SGLang yourself. To follow along, please complete the following steps:
- Fork and clone this repository.
- Create a Baseten account.
  - Baseten will provide compute credits for this workshop.
  - If you're stuck in a "waiting room" for more than a couple of minutes, please flag Philip.
- Install Truss with `pip install --upgrade truss`.
These workshop examples are based on Llama 3.1 8B.
- Accept the terms and conditions for Llama 3.1 8B.
- Create an access token with `READ` permissions on Hugging Face.
- Add it as a secret on your Baseten account with the name `hf_access_token`.
Create a file `~/.trussrc` and paste in the following (using your actual API key):

```
[baseten]
remote_provider = baseten
api_key = abcdefgh.1234567890ABCDEFGHIJKL1234567890
remote_url = https://app.baseten.co
```
Add your API key to your environment variables in your shell profile of choice:

```
export BASETEN_API_KEY=abcdefgh.1234567890ABCDEFGHIJKL1234567890
```
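To confirm the variable is visible to your tools before deploying, a tiny helper like this works (the function is illustrative, not part of the workshop repo):

```python
import os


def check_baseten_key(env=os.environ):
    """Return True if BASETEN_API_KEY is set to a non-empty value."""
    return bool(env.get("BASETEN_API_KEY"))


if __name__ == "__main__":
    print("BASETEN_API_KEY set:", check_baseten_key())
```

If this prints `False`, re-source your shell profile or re-run the `export` line above.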
You are now ready to complete the workshop.
Each folder has an example SGLang configuration along with instructions for deployment.
Use `call.ipynb` to call individual deployments. Models deployed with SGLang are compatible with the OpenAI SDK -- just pass your model ID and API key.
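Because the endpoint is OpenAI-compatible, any OpenAI SDK client can call it. As a minimal sketch (the model ID and the SDK parameters shown in comments are placeholder assumptions, not values from this workshop), the call and the request body it produces look like:

```python
import json

# With the OpenAI SDK (not imported here), the call would be roughly:
#   from openai import OpenAI
#   client = OpenAI(api_key=BASETEN_API_KEY, base_url=DEPLOYMENT_URL)  # placeholders
#   resp = client.chat.completions.create(model=MODEL_ID, messages=[...])
#
# Under the hood the SDK sends a standard chat-completions payload:
payload = {
    "model": "llama-3.1-8b-instruct",  # placeholder model ID (assumption)
    "messages": [
        {"role": "user", "content": "Hello from the SGLang workshop!"},
    ],
    "max_tokens": 64,
}
body = json.dumps(payload)
```

`call.ipynb` wraps this same pattern for each deployment in the repo.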
- SGLang GitHub
- SGLang documentation
- SGLang contribution guide
- SGLang code architecture (foundation)
- SGLang code walkthrough (advanced)