Hardware

In the midst of the modern world’s turmoil here I’m comparing tech specs of different cards suitable for AI tasks (Deep Learning, Object Detection and LLMs). They are all incredibly expensive though.

This guide explains how Ollama handles parallel requests (concurrency, queuing, and resource limits), and how to tune it using the OLLAMA_NUM_PARALLEL environment variable (and related knobs).

Installing Epson EcoTank ET-8500 Linux Driver

Installing ET-8500 on Windows is well documented in the manual. The ET-8500 Linux Driver installation is simple but not trivial.

Comparing prediction speed of several versions of LLMs: llama3 (Meta/Facebook), phi3 (Microsoft), gemma (Google), mistral(open source) on CPU and GPU.

Hardware

Comparing NVidia GPU suitability for AI

How Ollama Handles Parallel Requests

Installing Epson EcoTank ET-8500 Linux Driver

Large Language Models Speed Test