Self-Hosting

The Rise of LLM ASICs: Why Inference Hardware Matters

The future of AI isn’t just about smarter models - it’s about smarter silicon. Specialized hardware for LLM inference is driving a revolution similar to Bitcoin mining’s shift to ASICs.

Indie Web: Reclaiming Digital Independence

The web was originally designed as a decentralized network where anyone could publish and connect. Over time, corporate platforms consolidated control, creating walled gardens where users are products and content is locked in. The Indie Web movement aims to restore the original promise of the web: personal ownership, creative freedom, and genuine connection.

DGX Spark vs. Mac Studio: Price-Checked Look at NVIDIA's Personal AI Supercomp

NVIDIA DGX Spark is real, on sale Oct 15, 2025, and targeted at CUDA developers needing local LLM work with an integrated NVIDIA AI stack. US MSRP $3,999; UK/DE/JP retail is higher due to VAT and channel. AUD/KRW public sticker prices are not yet widely posted.

Gemini Protocol: A Minimalist Alternative to the Web

The Gemini protocol represents a return to the fundamentals of internet communication-a lightweight, secure, and privacy-respecting alternative to the increasingly complex modern web.

Go clients for Ollama: SDK comparison and Qwen3/GPT-OSS examples

This guide provides a comprehensive overview of available Go SDKs for Ollama and compares their feature sets.

Here is a comparison between Qwen3:30b and GPT-OSS:20b focusing on instruction following and performance parameters, specs and speed:

Writefreely Federated Blogging Platform - selfhosting vs managed costs

Here’s a quick info on Write.as / WriteFreely - how it fits into the fediverse, where to get managed hosting, what the usage trend looks like, and how to self-host (plus rough costings).

Integrating Ollama with Python: REST API and Python Client Examples

In this post, we’ll explore two ways to connect your Python application to Ollama: 1. Via HTTP REST API; 2. Via the official Ollama Python library.

Proxmox in 2025: A Practical, All-In-One Virtualization Stack

Proxmox Virtual Environment (Proxmox VE) is an open-source, type-1 hypervisor and datacenter orchestration platform built on Debian.

NVidia RTX 5080 and RTX 5090 prices in Australia - October 2025

Let’s compare prices for top-level consumer GPUs, that are suitable for LLMs in particular and AI in general. Specifically I’m looking at RTX-5080 and RTX-5090 prices. They have slightly dropped.

Ollama’s GPT-OSS models have recurring issues handling structured output, especially when used with frameworks like LangChain, OpenAI SDK, vllm, and others.

Constraining LLMs with Structured Output: Ollama, Qwen3 & Python or Go

Large Language Models (LLMs) are powerful, but in production we rarely want free-form paragraphs. Instead, we want predictable data: attributes, facts, or structured objects you can feed into an app. That’s LLM Structured Output.

Kubuntu vs KDE Neon: A Technical Deep Dive

For KDE Plasma fans, two Linux distributions frequently come up in discussion: Kubuntu and KDE Neon. They may appear similar - both ship with KDE Plasma as the default desktop, both are based on Ubuntu, and both are friendly to newcomers.

Memory allocation and model scheduling in Ollama new version - v0.12.1

Here I am comparing how much VRAM new version of Ollama allocating for the model vs previous Ollama version. The new version is worse.

How to Change a Static IP Address in Ubuntu Server

This guide will walk you through the process of changing the static IP address on an Ubuntu Server.

Ollama Enshittification - the Early Signs

Ollama has quickly become one of the most popular tools for running LLMs locally. Its simple CLI, and streamlined model management have made it a go-to option for developers who want to work with AI models outside the cloud. But as with many promising platforms, there are already signs of Enshittification:

Self-Hosting

The Rise of LLM ASICs: Why Inference Hardware Matters

Indie Web: Reclaiming Digital Independence

DGX Spark vs. Mac Studio: Price-Checked Look at NVIDIA's Personal AI Supercomp

Gemini Protocol: A Minimalist Alternative to the Web

Go clients for Ollama: SDK comparison and Qwen3/GPT-OSS examples

Comparison: Qwen3:30b vs GPT-OSS:20b

Writefreely Federated Blogging Platform - selfhosting vs managed costs

Integrating Ollama with Python: REST API and Python Client Examples

Proxmox in 2025: A Practical, All-In-One Virtualization Stack

NVidia RTX 5080 and RTX 5090 prices in Australia - October 2025

Ollama GPT-OSS Structured Output Issues

Constraining LLMs with Structured Output: Ollama, Qwen3 & Python or Go

Kubuntu vs KDE Neon: A Technical Deep Dive

Memory allocation and model scheduling in Ollama new version - v0.12.1

How to Change a Static IP Address in Ubuntu Server

Ollama Enshittification - the Early Signs