A look at the rumored NVIDIA RTX 5090 specs for local LLM inference
This is a quick look at the rumored specs of the upcoming RTX 5090 that are most relevant for local inference with LLMs, compared against the RTX 4090 in a table with the columns Spec, 5090, 4090, and Delta ...
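To see why the memory specs dominate such a comparison: single-stream token generation is usually memory-bandwidth bound, so a rough ceiling on decode speed is bandwidth divided by the bytes read per token (roughly the size of the model weights). The numbers below are placeholders for illustration, not the rumored specs:

```bash
# Back-of-the-envelope decode estimate: tokens/s ≈ bandwidth / model size.
BANDWIDTH_GBPS=1000   # hypothetical GPU memory bandwidth in GB/s
MODEL_GB=16           # hypothetical weight size in GB (e.g. a 4-bit ~30B model)
echo "scale=1; $BANDWIDTH_GBPS / $MODEL_GB" | bc   # ≈ 62.5 tokens/s ceiling
```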
The new ChatGPT Search feature, which is meant to “get fast, timely answers with links to relevant web sources,” can apparently use information from internal sites, too. A first hint was given by S...
Yann LeCun made an interesting statement in a recent talk. I couldn’t find the original source, but here is a snippet from Twitter: Predictions aside, I think the realization that AI...
Here is a quick tip to make working with Dokku a bit easier. You can create an alias for the ssh command to save some typing when working with your Dokku server. Add the following line to your .b...
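The excerpt is cut off, but a minimal sketch of such an alias, assuming the server is reachable as dokku.example.com (a placeholder hostname) and the line goes into your shell config:

```bash
# Hypothetical host; replace with your Dokku server's address.
# With this alias, `dokku apps:list` on your machine runs
# `ssh -t dokku@dokku.example.com apps:list` on the server.
alias dokku='ssh -t dokku@dokku.example.com'
```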
Today, I wanted to try a little experiment: in reaction to a tweet by Pieter Levels about preferring builders over talkers, I created and launched a dead simple website called Builder Habit....
Quick tip: It’s much better to launch VS Code from the command line. Why? Because this way, the session’s environment configuration is used by VS Code, too, and you don’t need to configure the Pyth...
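A minimal sketch of this workflow, assuming a Python virtualenv in .venv (my assumption; the excerpt is cut off before the details):

```bash
# Activate the project's environment first, then launch VS Code from it.
# VS Code inherits PATH and VIRTUAL_ENV, so the integrated terminal and
# extensions see the same interpreter as your shell.
cd ~/projects/my-app       # hypothetical project path
source .venv/bin/activate
code .                     # opens the current folder in VS Code
```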
I am working on a SaaS app. Although I started it with FastAPI, I decided to go back to Django: I have been using Django on and off since the end of 2005 (really!), so I know it much better than FastAPI....
For documentation, I prefer Sphinx over static site builders like Hugo and Jekyll, as I can have one source for HTML, EPUB, and PDF, create docs from Jupyter notebooks with Jupyter Book, wr...
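As a sketch of the one-source, many-formats point, Sphinx’s standard Makefile targets build each output from the same doc tree (the paths are the sphinx-quickstart defaults, not necessarily this blog’s setup):

```bash
# All three outputs come from the same reStructuredText/Markdown sources.
make html      # browsable docs in _build/html/
make epub      # e-book in _build/epub/
make latexpdf  # PDF via LaTeX in _build/latex/ (needs a TeX installation)
```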
I wanted to load Ollama models onto a system with restricted internet access. Unfortunately, Ollama doesn’t yet support private registries, nor does it have a command for exporting models. As a workaround, ...
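The excerpt cuts off before the actual workaround; one common approach (my sketch, not necessarily the post’s) is to copy the local model store, which Ollama keeps under ~/.ollama/models by default:

```bash
# On the machine with internet access: pull the model, then pack the store.
ollama pull llama3.2                       # model name is just an example
tar czf ollama-models.tar.gz -C ~ .ollama/models

# Transfer the archive (USB stick, scp, etc.), then on the offline machine:
tar xzf ollama-models.tar.gz -C ~
ollama list                                # the model should now show up
```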
Ollama is one of the easiest ways to run large language models locally. Thanks to llama.cpp, it can run models on CPUs or GPUs, even older ones like my RTX 2070 Super. It provides a CLI and an Ope...
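As a quick illustration of the API side (the model name is just an example; any pulled model works):

```bash
# Ask a question via Ollama's REST endpoint on its default port 11434;
# "stream": false returns a single JSON response instead of chunks.
ollama pull llama3.2
curl http://localhost:11434/api/generate -d '{
  "model": "llama3.2",
  "prompt": "Why is the sky blue?",
  "stream": false
}'
```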