Question 1

How do I extract Markdown from HTML using Ollama?

Accepted Answer

Use a reader-style model such as ReaderLM-v2 (milkey/reader-lm-v2:latest). Run Ollama with a prompt that asks to extract the main content from the given HTML and convert it to Markdown; the post includes a bash script example.

Question 2

Which Ollama model converts HTML to Markdown?

Accepted Answer

ReaderLM-v2 (built on Qwen2.5-1.5B-Instruct) is trained for this task. Pull it with ollama pull milkey/reader-lm-v2, then run it with a prompt that includes your HTML and asks for Markdown output.
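
A minimal sketch of pulling the model only when it is missing. The helper names model_is_installed and ensure_model are hypothetical, and the check assumes that ollama list prints a header row followed by one model name per row in the first column.

```shell
# Model tag taken from the answer above.
MODEL="milkey/reader-lm-v2"

# Succeed if the model name appears in the first column of `ollama list`.
# Assumption: the first output line is a header and is skipped.
model_is_installed() {
  local model="$1"
  ollama list 2>/dev/null | awk 'NR > 1 { print $1 }' | grep -Eq "^${model}(:|$)"
}

# Pull the model only when it is not already available locally.
ensure_model() {
  local model="$1"
  if ! command -v ollama >/dev/null 2>&1; then
    echo "ollama is not installed or not on PATH" >&2
    return 1
  fi
  if model_is_installed "$model"; then
    echo "Model already present: $model"
  else
    ollama pull "$model"
  fi
}

# Example: ensure_model "$MODEL"
```

Skipping the pull when the model is already present keeps repeated runs of a conversion script from contacting the registry each time.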

Question 3

Is HTML-to-Markdown conversion with Ollama fast?

Accepted Answer

It depends on HTML size and your hardware. Large pages (e.g. 100k+ tokens) can be slow. In the post, a 121KB sample took about 1 second on a typical PC. For many small snippets it is fine; for bulk or very large pages, Python libraries (e.g. in our Convert HTML to Markdown in Python guide) may be faster.
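
Because speed varies with input size and hardware, it may be worth timing a conversion on your own machine. The sketch below is one rough way to do that; time_conversion is a hypothetical helper, it assumes date +%s is available, and it only reports whole seconds.

```shell
# Time a single conversion and print the elapsed whole seconds.
# The model output itself is discarded; only the duration is reported.
time_conversion() {
  local model="$1" prompt="$2"
  local start end
  start="$(date +%s)"
  ollama run "$model" "$prompt" > /dev/null
  end="$(date +%s)"
  echo "$((end - start))"
}

# Example: time_conversion "milkey/reader-lm-v2" "$PROMPT"
```

The first run usually includes model load time, so timing a second run as well gives a fairer picture of steady-state speed.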

Question 4

How do I call Ollama from the command line for HTML to Markdown?

Accepted Answer

Use ollama run milkey/reader-lm-v2 and pass a prompt that contains your HTML and instructs the model to extract main content and output Markdown. Redirect output to a file, e.g. ollama run "$MODEL" "$PROMPT" > response.md. The post has a full bash script.
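
The redirect can be wrapped in a small helper so that a failed run does not leave a truncated response.md behind. This is a sketch, not the script from the post: run_to_file and the .tmp suffix are illustrative choices, and the prompt is assumed to be built already.

```shell
MODEL="milkey/reader-lm-v2"

# Run the model with a ready-made prompt and write the reply to a file.
# Output goes to a temporary file first and is renamed only on success.
run_to_file() {
  local model="$1" prompt="$2" out_file="$3"
  local tmp_file="${out_file}.tmp"
  if ollama run "$model" "$prompt" > "$tmp_file"; then
    mv "$tmp_file" "$out_file"
  else
    rm -f "$tmp_file"
    echo "ollama run failed for model: $model" >&2
    return 1
  fi
}

# Example: run_to_file "$MODEL" "$PROMPT" response.md
```

Note that the whole prompt is passed as a single command-line argument, so very large HTML pages may run into the operating system's argument length limit.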

Question 5

What prompt should I use for HTML to Markdown with an LLM?

Accepted Answer

Ask the model to extract the main content from the given HTML and convert it to Markdown format. For example: "Extract the main content from the given HTML and convert it to Markdown format." followed by the HTML itself. The exact phrasing can vary; reader models are tuned for this task.
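
One way to assemble such a prompt in a shell script is sketched below. build_prompt is a hypothetical helper, and separating the instruction from the HTML with a blank line is an assumption rather than a requirement of the model.

```shell
INSTRUCTION="Extract the main content from the given HTML and convert it to Markdown format."

# Print the full prompt: the instruction, a blank line, then the HTML.
build_prompt() {
  local html="$1"
  printf '%s\n\n%s\n' "$INSTRUCTION" "$html"
}

# Example: read HTML from a file and build the prompt for `ollama run`.
# PROMPT="$(build_prompt "$(cat page.html)")"
```

Keeping the instruction in one variable makes it easy to try alternative phrasings without touching the rest of the script.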

Question 6

Are there alternatives to using an LLM for HTML to Markdown?

Accepted Answer

Yes. Dedicated Python libraries (e.g. html2text, markdownify, html2md) are usually faster and more deterministic. See our Convert HTML to Markdown in Python guide in the Documentation Tools section. LLMs are useful when you need semantic extraction or handling of messy or non-standard HTML.

Convert HTML content to Markdown using LLM and Ollama