Building an LLM-powered writing assistant in Roo Code. A meta exercise: this post was written by an LLM based on an analysis of the author’s posts.
Posts for: #LLMs
Deploying an OpenAI Compatible Endpoint on Runpod with vLLM and K6 Load Testing
Renting a cloud GPU from RunPod, serving a large language model through vLLM’s OpenAI-compatible endpoint, and load testing it with K6.
Converting a Pytorch Model to Safetensors Format and Quantising to Exl2
Notes on converting a transformer model from PyTorch to Safetensors format and quantising it to ExLlamaV2’s EXL2 format, using a code-based calibration dataset.
A Conversation with Q on the Nature of Time
A short example of using LLMs to converse with a fictional character from a seminal science fiction universe.