Ollama

Ollama is at its happiest when it is treated like a local daemon: the CLI and your apps talk to a loopback HTTP API, and the rest of the network never finds out it exists.

Ollama in Docker Compose with GPU and Persistent Model Storage

Ollama works great on bare metal. It gets even more interesting when you treat it like a service: a stable endpoint, pinned versions, persistent storage, and a GPU that is either available or it is not.

Ollama behind a reverse proxy with Caddy or Nginx for HTTPS streaming

Running Ollama behind a reverse proxy is the simplest way to get HTTPS, optional access control, and predictable streaming behaviour.

Text embeddings for RAG and search - Python, Ollama, OpenAI-compatible APIs

If you are working through retrieval-augmented generation (RAG), this section walks through text embeddings in plain terms — what they are, how they fit search and retrieval, and how to call two common local setups from Python using Ollama or an OpenAI-compatible HTTP API (as many llama.cpp-based servers expose).

I have tested how OpenCode works with several locally hosted on Ollama LLMs, and for comparison added some Free models from OpenCode Zen.

OpenClaw Quickstart: Install with Docker (Ollama GPU or Claude + CPU)

OpenClaw is a self-hosted AI assistant designed to run with local LLM runtimes like Ollama or with cloud-based models such as Claude Sonnet.

LLM Hosting in 2026: Local, Self-Hosted & Cloud Infrastructure Compared

Strategic guide to hosting large language models locally with Ollama, llama.cpp, vLLM, or in the cloud. Compare tools, performance trade-offs, and cost considerations.

LLM Performance in 2026: Benchmarks, Bottlenecks & Optimization

A performance engineering hub for running LLMs efficiently: runtime behavior, bottlenecks, benchmarks, and the real constraints that shape throughput and latency.

Self-hosting LLMs keeps data, models, and inference under your control-a practical path to AI sovereignty for teams, enterprises, nations.

Comparing LLMs performance on Ollama on 16GB VRAM GPU

Running large language models locally gives you privacy, offline capability, and zero API costs. This benchmark reveals exactly what one can expect from 14 popular LLMs on Ollama on an RTX 4080.

Top 19 Trending Go Projects on GitHub - January 2026

The Go ecosystem continues to thrive with innovative projects spanning AI tooling, self-hosted applications, and developer infrastructure. This overview analyzes the top trending Go repositories on GitHub this month.

Open WebUI is a powerful, extensible, and feature-rich self-hosted web interface for interacting with large language models.

DGX Spark AU Pricing: $6,249-$7,999 at Major Retailers

The NVIDIA DGX Spark (GB10 Grace Blackwell) is now available in Australia at major PC retailers with local stock. If you’ve been following the global DGX Spark pricing and availability, you’ll be interested to know that Australian pricing ranges from $6,249 to $7,999 AUD depending on storage configuration and retailer.

Self-Hosting Cognee: Choosing LLM on Ollama

Cognee is a Python framework for building knowledge graphs from documents using LLMs. But does it work with self-hosted models?

BAML vs Instructor: Structured LLM Outputs

When working with Large Language Models in production, getting structured, type-safe outputs is critical. Two popular frameworks - BAML and Instructor - take different approaches to solving this problem.

Choosing the Right LLM for Cognee: Local Ollama Setup

Choosing the Best LLM for Cognee demands balancing graph-building quality, hallucination rates, and hardware constraints. Cognee excels with larger, low-hallucination models (32B+) via Ollama but mid-size options work for lighter setups.

Ollama

Remote Ollama access via Tailscale or WireGuard, no public ports

Ollama in Docker Compose with GPU and Persistent Model Storage

Ollama behind a reverse proxy with Caddy or Nginx for HTTPS streaming

Text embeddings for RAG and search - Python, Ollama, OpenAI-compatible APIs

Best LLMs for OpenCode - Tested Locally

OpenClaw Quickstart: Install with Docker (Ollama GPU or Claude + CPU)

LLM Hosting in 2026: Local, Self-Hosted & Cloud Infrastructure Compared

LLM Performance in 2026: Benchmarks, Bottlenecks & Optimization

LLM Self-Hosting and AI Sovereignty

Comparing LLMs performance on Ollama on 16GB VRAM GPU

Top 19 Trending Go Projects on GitHub - January 2026

Open WebUI: Self-Hosted LLM Interface

DGX Spark AU Pricing: $6,249-$7,999 at Major Retailers

Self-Hosting Cognee: Choosing LLM on Ollama

BAML vs Instructor: Structured LLM Outputs

Choosing the Right LLM for Cognee: Local Ollama Setup