Notes on the margins

Rost Glukhov. Personal site and technical blog

Dokumentationstools
Hardware
LLM-Hosting
LLM-Performance
RAG
Daten
Observability
Entwickler-Tools
KI-Werkzeuge
OpenClaw
Web-Infra
Programmierung
DevOps
Kochbuch
KI
Ollama
Spickzettel
Offline
Über

Benchmarks

LLM-Leistung im Jahr 2026: Benchmarks, Engpässe und Optimierung

A performance engineering hub for running LLMs efficiently: runtime behavior, bottlenecks, benchmarks, and the real constraints that shape throughput and latency.

Letzte Beiträge

RTX 5090 in Australien: Preis, Verfügbarkeit und Realität im März 2026
Remote-Zugriff auf Ollama über Tailscale oder WireGuard, ohne öffentliche Ports
Strukturiertes Logging in Go mit slog für Observability und Alerting
Ollama in Docker Compose mit GPU und persistenter Modell-Speicherung
Ollama hinter einem Reverse-Proxy mit Caddy oder Nginx für HTTPS-Streaming

Kategorien

AI
AI Devtools
Ai Systems
Architecture
Cheatsheet
Coding
Cookbook
Data Infrastructure
Dev
Developer Tools
DevOps
Documentation-Tools
Hardware
Howtos
LLM Hosting
LLM Performance
Observability
Offline
RAG
Research
Self-Hosting
Web Infrastructure

Tags

AI AI Coding Anaconda Android API Architecture AWS AWS Amplify Backup Bash Benchmarks Cheatsheet Claude Cloud Coding Community Conversion Cookbook Cpu CUDA Data Database DeepLearning Deployment Dev DevOps Devtools DGX Spark Digital Detox Docker Documentation Embeddings Filofax Flutter Food Garage GGUF Git Gitea GitHub Go Golang Gpu Grafana GraphQL Hardware Hosting Howtos Hugo Images Infrastructure JavaScript K8S Kubernetes LabelStudio Latex Linux Llama.cpp LLM LLM Performance Logging Machine Learning MacOS Mainroad Markdown MCP Melbourne Microservices Minio MMDetection Monitoring Node.js NVidia ObjectDetection Observability Offline Ollama Open Source Openai OpenClaw Opencode Pdf Performance Perplexica Photos PostgreSQL Printing Privacy Prometheus Python PyTorch RAG Reranking Rust S3 Security Self-Hosting SelfHosting SEO Serverless Sglang SQL Terminal Terraform Testing TypeScript Ubuntu Vector Database Vector Databases Vllm VS Code VSCode Web Hosting Windows

Soziale Netzwerke

root@@@glukhov.au

rost @ lemmy.world

rosgluk @ github

rosgluk @ bluesky

rosgluk @ Medium

rosgluk @ blogspot

rosgluk @ tumblr

Sprachen

EN English
RU Русский
DE Deutsch
ES Español
FR Français
IT Italiano
JA 日本語
KO 한국어
PL Polski
NL Nederlands
SV Svenska

Allgemeine Geschäftsbedingungen | Datenschutzrichtlinie | Kontaktieren Sie Rost Glukhov | Sponsorierte technische Beiträge

© 2026 Rost Glukhov.