Renting a cloud GPU from RunPod, running a large language model via vLLM's OpenAI-compatible endpoint, and load testing it with k6.
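The workflow above can be sketched end to end as a few shell commands. This is a minimal, hedged sketch: the model name, host, port, virtual-user count, and test duration are illustrative assumptions, not values from the original, and the commands assume a GPU host with vLLM installed and the k6 binary available.

```shell
# 1. On the rented GPU pod: start vLLM's OpenAI-compatible server.
#    The model name below is an example; substitute whichever model you rented the GPU for.
vllm serve meta-llama/Llama-3.1-8B-Instruct --host 0.0.0.0 --port 8000

# 2. Sanity-check the endpoint with a single chat completion request.
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "meta-llama/Llama-3.1-8B-Instruct",
        "messages": [{"role": "user", "content": "Say hello in one sentence."}],
        "max_tokens": 64
      }'

# 3. Write a k6 load-test script (k6 scripts are JavaScript).
#    10 virtual users for 60 seconds is an arbitrary example configuration.
cat > load-test.js <<'EOF'
import http from 'k6/http';
import { check } from 'k6';

export const options = { vus: 10, duration: '60s' };

export default function () {
  const payload = JSON.stringify({
    model: 'meta-llama/Llama-3.1-8B-Instruct',
    messages: [{ role: 'user', content: 'Say hello in one sentence.' }],
    max_tokens: 64,
  });
  const res = http.post('http://localhost:8000/v1/chat/completions', payload, {
    headers: { 'Content-Type': 'application/json' },
  });
  check(res, { 'status is 200': (r) => r.status === 200 });
}
EOF

# 4. Run the load test against the endpoint.
k6 run load-test.js
```

k6 reports per-request latency percentiles and throughput at the end of the run, which is what makes it useful for seeing how the vLLM server behaves under concurrent load.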