Notes on the margins

Rost Glukhov. Personal site and technical blog

LLM Performance in 2026: Benchmarks, Bottlenecks, and Optimization

A performance engineering hub for running LLMs efficiently: runtime behavior, bottlenecks, benchmarks, and the real constraints that shape throughput and latency.

© 2026 Rost Glukhov. Generated with Hugo and Mainroad.